Permanent CUDA Jobs in London

3 of 3 Permanent CUDA Jobs in London

Senior MLOps Engineer London

London, United Kingdom
Hybrid / WFH Options
Hudl
or create insights, that's a plus. Deeper systems knowledge. Extraexperience with any of the following would be an asset: developing GPU kernels and/or ML compilers (e.g. CUDA, OpenCL, TensorRT Plugins, MLIR, TVM, etc); optimizing systems to meet strict utilization and latency requirements with tools such as Nvidia NSight; and/or you've worked with embedded More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Creative Technologist

london, south east england, united kingdom
Stability AI
Python, with the ability to implement and customize code from research repositories. Solid understanding of machine learning concepts (formal ML background preferred but not required). Experience with Linux, CUDA, and PyTorch in a research or production setting. Demonstrated ability to create customer-specific generative solutions and workflows. Experience managing fine-tuning processes to improve model quality and diversity. More ❯
Posted:

Senior Machine Learning Engineer, Scaling and Performance

London, United Kingdom
Hybrid / WFH Options
InstaDeep Ltd
optimise state-of-the-art algorithms and architectures, ensuring compute efficiency and performance. Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs. Required Skills Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.) Expertise … with machine learning frameworks (JAX, Tensorflow, PyTorch etc.) Passion for profiling, identifying bottlenecks, and delivering efficient solutions. Highly Desirable Track record of successfully scaling ML models. Experience writing custom CUDA kernels or XLA operations. Understanding of GPU/TPU architectures and their implications for efficient ML systems. Fundamentals of modern Deep Learning Actively following ML trends and a desire … to push boundaries. Example Projects: Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development. Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects. Adapt algorithms for large-scale distributed architectures across HPC clusters. Employ memory-efficient techniques within models for increased parameter counts and longer context lengths. What We Offer: Real More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
CUDA
London
10th Percentile
£68,375
25th Percentile
£73,438
Median
£78,750
75th Percentile
£82,188
90th Percentile
£86,125