and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories. Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations. Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance. Collaborate with GPU Library Teams: Work … closely with internal teams to analyze and improve training and inference performance on AMD GPUs. Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream. Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up … Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers. Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions. PREFERRED EXPERIENCE: GPU Kernel Development & Optimization: Proficient experienced in designing and optimizing More ❯
and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories. Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations. Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance. Collaborate with GPU Library Teams: Work … closely with internal teams to analyze and improve training and inference performance on AMD GPUs. Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream. Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up … Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers. Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions. PREFERRED EXPERIENCE: GPU Kernel Development & Optimization: Proficient experienced in designing and optimizing More ❯
cambridge, east anglia, United Kingdom Hybrid / WFH Options
Virtasant
Director of Engineering – Cloud Cost Optimisation Platform About the Role Virtasant is on a mission to revolutionise cloud cost optimisation. Our platform helps businesses reduce waste and maximise efficiency across multi-cloud environments. We’re looking for a Director of Engineering to lead the continued development of our … platform, ensuring both its technical excellence and operational efficiency. This role is a hybrid of technical leadership, strategic direction, and hands-on engineering, requiring someone who can balance short-term client demands with long-term product vision. You’ll lead a high-performing engineering team, drive AI-driven … automation, and can navigate the complexity of scaling both teams and technology, this is the role for you. What You’ll Do: Lead the engineering and operations teams responsible for building and maintaining Virtasant’s cloud cost optimisation platform. Drive the strategic direction of the platform - balancing immediate client More ❯
phone and cold/warm calling skills Exceptional customer service skills Strong listening and sales skills Ability to achieve sales targets About us Aircraft Performance Group Aircraft Performance Group, LLC (APG) is a flight operations performanceengineering firm, established in 1999, that specializes in Runway Analysis … maintain a current worldwide database of airport information (over 7000+ airports) and have experience in digitizing Approved Flight Manuals for use within our own performance programs, as well as using aircraft manufacturer provided programs. We have experience in providing data based upon FAR, EASA and CASA requirements. APG is More ❯
London. Our work environment rewards innovation, speed, and bold thinking. The role We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI … and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in microseconds, not just model accuracy. If you’ve worked in … HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk. Responsibilities Design and build high-performance systems for running AI/ML workloads across distributed compute clusters Optimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Job Overview: High-performance ML workloads on Arm CPUs require the co-development of algorithms and highly optimized CPU kernels. In CT-ML (Central Technology, Machine Learning), rapid kernel prototyping is crucial for exploring algorithms and assessing trade-offs between model accuracy and performance. Successful prototypes drive future CPU … architecture development and serve as deliverables to Central Engineering for final production. Responsibilities: This position is part of a dedicated team within the CT-ML group focused on analyzing ML workloads and rapidly prototyping highly optimized CPU kernels to enhance model performance and accuracy. Required Skills and Experience … Strong interest and passion for implementing high-performance kernel code in dynamic environments. 4+ years of experience in implementing high-performance CPU kernels with vector and matrix extensions. Experience measuring and understanding performance metrics. Experience in creating efficient kernel development frameworks, including tools and testing methodologies. Deep More ❯