AI Infrastructure Architect
Responsibilities:
- Design a unified AI infrastructure and serving platform for composite AI workloads such as LLM training and inference, RLHF, agents, and multimodal processing. The platform will integrate inference, orchestration, and state management, defining the technical evolution path for Serverless AI Agentic Serving.
- Design a heterogeneous execution framework across CPU/GPU/NPU for agent memory, tool invocation, and long-running multi-turn conversations and tasks.
- Build an efficient memory/KV-cache/vector-store/logging and state-management subsystem to support agent retrieval, planning, and persistent memory.
- Build a high-performance runtime/framework that defines the next-generation Serverless AI foundation through elastic scaling, cold-start optimization, batch processing, function-based inference, request orchestration, dynamic decoupled deployment, and other features, supporting multi-model, multi-tenant, and high-concurrency scenarios.

Key Requirements:
- Strong foundational knowledge of system architecture, computer architecture, operating systems, and runtime environments.
- Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling; familiarity with mainstream inference frameworks (vLLM, SGLang, Ray Serve, etc.) and an understanding of common optimization concepts such as continuous batching, KV-cache reuse, parallelism, and compression/quantization/distillation.
- Proficiency with profiling/tracing tools; experience analyzing and optimizing system-level bottlenecks in GPU utilization, memory/bandwidth, interconnect fabric, and network/storage paths.
- Proficiency in at least one system-level language (e.g., C/C++, Go, Rust) and one scripting language (e.g., Python).