CUDA Jobs in Scotland

12 of 12 CUDA Jobs in Scotland

Senior Software Developer

City, Aberdeen, United Kingdom
Leidos
Jira, Confluence, Git/GitHub, Perforce Helix ALM. Installer Development: Familiarity with WiX (Visual Studio tool) for building software installers. Specialized Technologies: Knowledge of CUDA, GDAL, Kakadu (kdu), Qt5, OpenCV, SQLite, libAV, jpeg8, CMake, G . Join us in shaping cutting-edge software solutions for defense and sensor technologies.
Employment Type: Permanent
Salary: GBP Annual

Infrastructure Research Engineer (Edinburgh)

Edinburgh, UK
Project People
with current LLM architectures (e.g. Llama3, DeepSeek V3). Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM). Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL

Infrastructure Research Engineer

Edinburgh, Central Scotland, United Kingdom
Project People
with current LLM architectures (e.g. Llama3, DeepSeek V3). Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM). Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL

Infrastructure Research Engineer

Livingston, Central Scotland, United Kingdom
Project People
with current LLM architectures (e.g. Llama3, DeepSeek V3). Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM). Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL

Infrastructure Research Engineer

Broughton, Central Scotland, United Kingdom
Project People
with current LLM architectures (e.g. Llama3, DeepSeek V3). Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM). Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL

Infrastructure Research Engineer

Dunfermline, North East Scotland, United Kingdom
Project People
with current LLM architectures (e.g. Llama3, DeepSeek V3). Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM). Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL

Senior GPU Architect (Graphics Processors R&D for AI)

Glasgow, Central Scotland, United Kingdom
Hybrid / WFH Options
IC Resources
Graphics Hardware Processors (5 - 10+ years' experience). Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture. Confident knowledge of the ASIC digital design flow. Experience in R&D of the

Senior GPU Architect (Graphics Processors R&D for AI)

Edinburgh, Central Scotland, United Kingdom
Hybrid / WFH Options
IC Resources
Graphics Hardware Processors (5 - 10+ years' experience). Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture. Confident knowledge of the ASIC digital design flow. Experience in R&D of the

Senior GPU Architect (Graphics Processors R&D for AI)

Aberdeen, North East Scotland, United Kingdom
Hybrid / WFH Options
IC Resources
Graphics Hardware Processors (5 - 10+ years' experience). Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture. Confident knowledge of the ASIC digital design flow. Experience in R&D of the

Staff AI Engineer

Glasgow, Central Scotland, United Kingdom
Nscale
systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/ROCm kernels. Conduct performance testing and resolve bottlenecks across training, fine-tuning, and inference workflows. Document and build tooling to ensure successful use … multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation, e.g. CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning, and adapter-based techniques, and experience improving model performance. Experience with

Staff AI Engineer

Edinburgh, Central Scotland, United Kingdom
Nscale
systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/ROCm kernels. Conduct performance testing and resolve bottlenecks across training, fine-tuning, and inference workflows. Document and build tooling to ensure successful use … multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation, e.g. CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning, and adapter-based techniques, and experience improving model performance. Experience with

Staff AI Engineer

Aberdeen, North East Scotland, United Kingdom
Nscale
systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/ROCm kernels. Conduct performance testing and resolve bottlenecks across training, fine-tuning, and inference workflows. Document and build tooling to ensure successful use … multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation, e.g. CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning, and adapter-based techniques, and experience improving model performance. Experience with