3 of 3 Ray Jobs in the UK excluding London

Lead Site Reliability Engineer - Operations Excellence

Hiring Organisation: Jobleads-UK
Location: Glasgow, Scotland, United Kingdom

Experience building AI agents using frameworks such as LangChain, CrewAI, LangGraph, or similar orchestration platforms Experience operating or integrating serving platforms such as KServe, Ray Serve, NVIDIA Triton Inference Server, Text Generation Inference (TGI), alongside vLLM/llm‐d Familiarity with Amazon SageMaker JumpStart, SageMaker Endpoints, and Amazon Bedrock … monitoring (e.g., hallucination, toxicity, drift detection) and tracing via OpenTelemetry conventions Contributions to open‐source LLM serving or inference projects (e.g., vLLM, llm‐d, Ray, KServe, Triton) #J-18808-Ljbffr ...

Senior Lead Software Engineer - LLM Ops Platform Reliability

Hiring Organisation: Jobleads-UK
Location: Glasgow, Scotland, United Kingdom

building AI agents using orchestration frameworks such as LangChain, LangGraph, CrewAI, or similar platforms Experience operating or integrating model serving platforms such as KServe, Ray Serve, or NVIDIA Triton Inference Server alongside other large language model serving stacks Familiarity with Amazon SageMaker JumpStart, SageMaker Endpoints, and Amazon Bedrock for managed … detection, toxicity filtering, and drift detection using open telemetry conventions Contributions to open-source large language model serving or inference projects, (vLLM, llm-d, Ray, KServe, Triton) #J-18808-Ljbffr ...

Systems Research Engineer - Distributed Systems / C++

Hiring Organisation: European Tech Recruit
Location: Edinburgh, Scotland, United Kingdom

depth profiling and performance tuning of inference pipelines, focusing on KV cache management. Develop low-latency, fault-tolerant AI serving frameworks using vLLM, Ray Serve, and PyTorch Distributed. Research and prototype novel techniques for cache sharing, data locality, and resource orchestration. Translate innovative designs into publishable contributions at top-tier … systems, or related field. Strong knowledge of Distributed Systems, OS internals, and Machine Learning systems architecture. Hands-on experience with LLM serving frameworks (vLLM, Ray Serve, TensorRT-LLM, or TGI). Proficiency in C/C++ for systems development and Python for research prototyping. Solid grounding in distributed algorithms, load ...