robotics role Experience of computer vision or image processing Familiarity of Object Tracking and Prediction Experience with Protocol Buffers and messaging systems ( ROS) Containerisation ( docker) CI/CD experience CUDA, Triton Ability to code in Python Experience of cloud technologies ( AWS, GCP, Azure) Experience with observability platforms such as Grafana The Candidate Journey: Multi-Step and Two-Way No More ❯
ability to read, implement, and refine novel techniques from research literature Ability to write highly performant code, familiarity with parallel computing, profiling and optimization Proficiency with GPU programming, e.g. CUDA Experience delivering 3D tools for use by technical artists and animators Collaborative software development with git Additional Skills: Experience using and developing plugins for Maya and Houdini Previous successful More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
for generative tasks via an inference framework such as Ray or KServe (or similar) Production experience with running and tuning specialized hardware for Generative AI workloads, especially GPUs via CUDA Measured and articulate written and spoken communication skills. You work well with others and can craft concise and expressive thoughts into correspondence: emails, issues, investigations, documentation, onboarding materials, and More ❯
with LLM architectures and inference optimization techniques (e.g., batching, quantization). Experience deploying reliable, distributed, real-time model serving at scale. (Optional) Understanding of GPU architectures or experience with CUDA kernel programming. The cash compensation range for this role is $190,000 - $240,000. About Perplexity Since launching the world's first fully functional conversational answer engine over a More ❯
of systems and applications utilizing RDMA technologies. Experience with using communication libraries, such as MPI, NVIDIA Collective Communication Library (NCCL). Experience with GPU accelerator development frameworks, for example CUDA, OpenCL. Experience in developing and troubleshooting system level software. About Meta Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
global trading operations. Engineer core platform components: memory allocators, kernel bypass, custom RPC, and distributed compute frameworks. Optimise performance at the hardware/software boundary, including GPU acceleration and CUDA-based compute. Work on Linux kernel internals, networking stacks, and system-level debugging. Technical Requirements: 3+ years of experience in C++ (C++17/20) with strong knowledge of memory … management, concurrency, and performance tuning. Experience with GPU programming (CUDA), SIMD, and kernel-level development. Deep understanding of data structures, lock-free algorithms, and low-latency systems. Familiarity with Linux internals, system calls, and performance profiling tools. Background in platform engineering, distributed systems, or high-performance computing. Preferred Background: Participation in competitive programming contests (IOI, ICPC, Codeforces, etc.). More ❯
global trading operations. Engineer core platform components: memory allocators, kernel bypass, custom RPC, and distributed compute frameworks. Optimise performance at the hardware/software boundary, including GPU acceleration and CUDA-based compute. Work on Linux kernel internals, networking stacks, and system-level debugging. Technical Requirements: 3+ years of experience in C++ (C++17/20) with strong knowledge of memory … management, concurrency, and performance tuning. Experience with GPU programming (CUDA), SIMD, and kernel-level development. Deep understanding of data structures, lock-free algorithms, and low-latency systems. Familiarity with Linux internals, system calls, and performance profiling tools. Background in platform engineering, distributed systems, or high-performance computing. Preferred Background: Experience in high-frequency trading, market data systems, or real More ❯
using Docker and orchestrating them with Kubernetes for scalable model serving. Optimizing the performance of our Ultralytics YOLO11 models for various deployment targets, from high-performance cloud GPUs with CUDA to edge devices using frameworks like TensorRT and OpenVINO. Implementing robust systems for model monitoring and maintenance to track performance and detect data drift. Collaborating closely with our AI … experience with at least one major cloud provider ( GCP , Azure, AWS). Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible. Familiarity with GPU acceleration using CUDA and model optimization for inference. Knowledge of MLOps tools for experiment tracking, and model serving such as MLflow, Kubeflow, or Weights & Biases. Excellent problem-solving skills and the ability More ❯
London, England, United Kingdom Hybrid / WFH Options
Third Republic
quality and testing automation practices within the team. Effectively communicating product features to non-technical colleagues. Continuous learning and skill development. Qualifications: Experience with AWS services (EC2, Lambda), PyTorch, Cuda, TensorFlow, Sagemaker, and other machine learning technologies. Proficiency in programming languages such as C++ and Python. Familiarity with API Gateway, Step Functions, Terraform, Github Actions, DVC, Git, Ansible, Linux More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet () Experience with GPU communication and interconnect technologies (e.g. collective communication libraries More ❯
Contribute to hiring additional talent to our rapidly growing team The role will be exposed to a broad tech stack (e.g. ReactJS, Python, REST & GraphQL, OpenCV, PyTorch, GCP, AWS & CUDA, Kubernetes) and the cutting edge of computer vision and deep learning. Qualifications The right candidate will have a proven track record of relevant publications and previous experience managing applied More ❯
of shipping deep learning systems to production. Expert in deep learning (esp. sequential models, control, planning, or perception). Proficient in Python and other relevant languages (e.g. C++ and CUDA) and ML frameworks (esp. PyTorch), with a solid foundation in software engineering practices. Experience with real-time systems or robotics, ideally with simulation- or vehicle-in-the-loop components. More ❯
Social network you want to login/join with: P+S Personnel are pleased to be working on behalf of our clients, who are currently seeking a Data Engineer to join their team based in Norwich on a full-time, permanent More ❯
Stevenage Onsite (Ask about our 4-day compressed available) SC cleared Hertfordshire or South-West 12 Month contract (Subject to ext) UKEO Clearance Essential Experience C/C++ and CUDA programming GPU/CPU optimisation, Memory Management Technical report writing Object-Oriented Programming Desirable Network Programming Configuration control and model release processes Continuous Integration and Testing Proficiency in MATLAB More ❯
Location: Stevenage Onsite (Ask about our 4-day compressed available) SC clearedHertfordshire or South-West 12 Month contract (Subject to ext) UKEO Clearance Essential Experience C/C++ and CUDA programming GPU/CPU optimisation, Memory Management Technical report writing Object-Oriented Programming Desirable Network Programming Configuration control and model release processes Continuous Integration and Testing Proficiency in MATLAB More ❯
Software Engineer with graphics processing experience (GPU, CUDA) is re quired for a long term contract assignment based in Stevenage or Bristol - full time on site. Essential experience: C/C++ and CUDA programming Object-Oriented Programming GPU/CPU optimisation GPU/CPU Memory Management Technical report writing Desirable experience Network Programming Configuration control and model release More ❯
focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Responsibilities - Develop and optimise C/C++ and CUDA code to enhance the performance of the simulation engine - Manage the efficient utilisation of GPU and CPU resources through optimisation techniques - Implement robust memory management strategies to ensure optimal … development of technical reports and documentation - Collaborate with the team to integrate the simulation engine with various Matlab/Simulink models Required Skills and Qualifications C/C++ and CUDA programming Object-Oriented Programming GPU/CPU optimisation GPU/CPU Memory Management Technical report writing Location: Stevenage Clearance: Security Clearance - SC and UKEO (you MUST be a British More ❯
level position 5+ years' experience in software development. Good development skills in cloud visualization applications where knowledge is key. Computer Graphics WebGL OpenGL HTML5 MEAN stack. Java/C++ CUDA Augmented/Virtual Reality Game Engines Video streaming a plus. Please, get in touch to discuss and apply for this exciting role More ❯
using Docker and orchestrating them with Kubernetes for scalable model serving. Optimizing the performance of our Ultralytics YOLO11 models for various deployment targets, from high-performance cloud GPUs with CUDA to edge devices using frameworks like TensorRT and OpenVINO. Implementing robust systems for model monitoring and maintenance to track performance and detect data drift. Collaborating closely with our AI … experience with at least one major cloud provider ( GCP , Azure, AWS). Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible. Familiarity with GPU acceleration using CUDA and model optimization for inference. Knowledge of MLOps tools for experiment tracking, and model serving such as MLflow, Kubeflow, or Weights & Biases. Excellent problem-solving skills and the ability More ❯
using Docker and orchestrating them with Kubernetes for scalable model serving. Optimizing the performance of our Ultralytics YOLO11 models for various deployment targets, from high-performance cloud GPUs with CUDA to edge devices using frameworks like TensorRT and OpenVINO. Implementing robust systems for model monitoring and maintenance to track performance and detect data drift. Collaborating closely with our AI … experience with at least one major cloud provider ( GCP , Azure, AWS). Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible. Familiarity with GPU acceleration using CUDA and model optimization for inference. Knowledge of MLOps tools for experiment tracking, and model serving such as MLflow, Kubeflow, or Weights & Biases. Excellent problem-solving skills and the ability More ❯
Stevenage, Hertfordshire, South East, United Kingdom
Morson Talent
The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Key Skillset Essential: C/C++ and CUDA programming Object-Oriented Programming GPU/CPU optimisation GPU/CPU Memory Management Technical report writing Desirable: Network Programming Configuration control and model release processes Continuous Integration and Testing More ❯