and communication skills. NICE TO HAVE: Hands-on experience with LLMs and Natural Language Processing (NLP) , including fine-tuning or prompt engineering. Familiarity with distributed computing or parallel processing (Ray, Spark, etc.). Experience deploying models in production environments (Docker, cloud services). Exposure to data engineering or working alongside data pipeline teams. A genuine passion for AI innovation and More ❯
deploying or optimising LLMs. Hands-on expertise in C Rust/Go for systems programming, plus Python for model integration. Strong knowledge of distributed runtimes and scheduling frameworks (e.g. Ray, Dask, MPI, or custom equivalents). Experience with GPU cluster management (CUDA, NCCL, Triton Inference Server) and performance tuning across accelerators. Solid grasp of cloud-native orchestration (Docker, Kubernetes, Helm More ❯
deploying or optimising LLMs. Hands-on expertise in C Rust/Go for systems programming, plus Python for model integration. Strong knowledge of distributed runtimes and scheduling frameworks (e.g. Ray, Dask, MPI, or custom equivalents). Experience with GPU cluster management (CUDA, NCCL, Triton Inference Server) and performance tuning across accelerators. Solid grasp of cloud-native orchestration (Docker, Kubernetes, Helm More ❯
london, south east england, united kingdom Hybrid/Remote Options
Sprout
features which deliver AI capabilities to some of the biggest names in the insurance industry. We are developing a modern real-time ML platform using technologies like FastAPI, PyTorch, Ray, k8s (helm + flux), Terraform, Postgres, Flink on AWS, React & Typescript. We operate a fully Python stack except for frontend and infrastructure code. We are very big fans of Infrastructure More ❯