26 to 34 of 34 CUDA Jobs

Principal ML Systems Engineer: Real-Time Inference

Hiring Organisation
Jobleads-UK
Location
United Kingdom
searching for engineers who thrive in ambiguity and possess strong problem-solving skills. The role involves working on high-performance systems using C++, CUDA, and distributed architectures, ensuring models run reliably in production. The base salary range is £140,000 – £200,000, with additional equity and benefits. We value ...

Member of Technical Staff

Hiring Organisation
Geometric
Location
London Area, United Kingdom
lead candidates: Hire and mentor a small team of exceptional engineers and researchers Qualifications You've written and shipped high-performance or SOTA CUDA kernels Deep understanding of mixed precision, quantisation (INT4, INT8, FP8, MXFP4, block-scaled formats), kernel fusion, distributed computing strategies (TP, PP, CP) You've made ...

Senior GPU Architect (Graphics Processors R&D for AI)

Hiring Organisation
IC Resources
Location
United Kingdom
Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience ...

Cloud Engineer

Hiring Organisation
WWT EMEA UK LIMITED
Location
Manchester, North West, United Kingdom
Employment Type
Contract
Contract Rate
£77 per hour
scientific workloads) Provision and optimize GPU/TPU-based infrastructure on GCP (A3/A4, TPU pods) Analyze performance across frameworks (PyTorch, TensorFlow, JAX, CUDA, ROCm) Identify system bottlenecks (compute, memory, network, I/O) Build automation tools for benchmarking and reporting … Collaborate with teams to align workloads with optimal architecture Required Skills Strong experience with GCP (Compute Engine, GKE, Storage, Networking) Hands-on with NVIDIA (CUDA/NCCL), AMD (ROCm), and TPUs (XLA/JAX/TF) Solid knowledge of HPC concepts (MPI, RDMA, InfiniBand, Slurm/Kubernetes) Experience with ...

HPC AI Cloud Engineer

Hiring Organisation
WWT EMEA UK LIMITED
Location
Manchester, North West, United Kingdom
Employment Type
Contract
Contract Rate
£77 per hour
scientific workloads) Provision and optimize GPU/TPU-based infrastructure on GCP (A3/A4, TPU pods) Analyze performance across frameworks (PyTorch, TensorFlow, JAX, CUDA, ROCm) Identify system bottlenecks (compute, memory, network, I/O) Build automation tools for benchmarking and reporting … Collaborate with teams to align workloads with optimal architecture Required Skills Strong experience with GCP (Compute Engine, GKE, Storage, Networking) Hands-on with NVIDIA (CUDA/NCCL), AMD (ROCm), and TPUs (XLA/JAX/TF) Solid knowledge of HPC concepts (MPI, RDMA, InfiniBand, Slurm/Kubernetes) Experience with ...

Machine Learning Performance Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
We tackle the most complex problems in quantitative finance, by bringing scientific clarity to financial complexity. From our London HQ, we unite world‐class researchers and engineers in an environment that values deep exploration and ...

Quantitative Developer, VP

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
level optimisation Develop pricing models using numerical techniques such as Monte Carlo methods and partial differential equation (PDE) solvers Work with technologies including C++, CUDA, Python, and adjoint algorithmic differentiation (AAD) Contribute to the technical direction of the group, mentor junior team members, and collaborate closely with quant teams … experience in a high‐performance computing or numerical software role (experience outside of finance will be considered) Strong programming skills in C++; experience with CUDA and Python preferred Excellent background in computational mathematics, numerical analysis, or a related quantitative discipline Demonstrated ability to design, implement, and optimise complex mathematical ...

Machine Learning Performance Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
about efficient large-scale training, low-latency inference in real-time systems and high-throughput inference in research. Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking and host- and GPU-level considerations. Zooming in, we also want … end. Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores and the memory hierarchy. Debugging and optimisation experience using tools like CUDA GDB, NSight Systems, NSight Compute-sight-systems and nsight-compute. Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN and cuBLAS. Intuition about the latency ...

ML Systems Performance Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Quant Blueprint LLC is seeking an engineer proficient in low-level systems programming and optimization to enhance our machine learning team. This role centers on optimizing model performance, both for training and real-time inference. ...