3 of 3 Low Latency Jobs in Edinburgh

Senior AI Engineer

Hiring Organisation: Jobleads-UK
Location: City of Edinburgh, Scotland, United Kingdom

APIs, and enterprise data integrations Fine‐tune, optimize, and deploy large language and multi‐modal models with a focus on scalability, reliability, and low‐latency performance Build scalable MLOps pipelines and cloud‐native AI applications to support secure, production‐grade deployments Collaborate with clients and cross‐functional teams ...

Systems Research Engineer - Distributed Systems / C++

Hiring Organisation: European Tech Recruit
Location: Edinburgh, Scotland, United Kingdom

workloads across CPU, GPU, and NPU clusters. Conduct in-depth profiling and performance tuning of inference pipelines, focusing on KV cache management. Develop low-latency, fault-tolerant AI serving frameworks using vLLM, Ray Serve, and PyTorch Distributed. Research and prototype novel techniques for cache sharing, data locality ...

Senior AI Engineer: GenAI for Enterprise Finance (Hybrid)

Hiring Organisation: Jobleads-UK
Location: City of Edinburgh, Scotland, United Kingdom

full AI engineering lifecycle, from architecture to production deployment, working with multi-disciplinary teams across client projects. The role emphasizes scalable MLOps, low-latency AI, and hands-on development in a client-facing, agile environment within Capco’s Technology Delivery team. #J-18808-Ljbffr ...