C and C++ programming, ML framework internals, compiler construction, and optimisation techniques. Key Deliverables: Implement Runtime: Build a runtime that seamlessly integrates with PyTorch, JAX, and TensorFlow (PJRT) for both training and inference execution patterns. The runtime must support asynchronous execution and multiple devices. Implement Compiler: Build a compiler that … optimisation patterns across operation fusion, layout optimisation, tiling, and scheduling. Implement Debugger & Diagnostics: Support optional runtime assertions and compile-time dumps, tensorboard timelines, and JAX I/O callbacks. Implement Functional Simulator: Build a functional simulator that mocks our kernel-space driver, allowing the software team to lower operations ahead More ❯
Compiler Engineers to join the team building the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs that connect PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers.Your mission will be to create seamless support for a broad ecosystem of large AI models, and … impact technical and business milestones that drive the success of Flux. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimise Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximise throughput. Mentor & Encourage Standards: Lead code … C/C++ programming. Deep expertise in ML framework internals, compilers, low-level programming, and optimisation techniques. Deep expertise in optimising Tensorflow, PyTorch or JAX deep learning models. Deep expertise with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
Compiler Engineers to join the team building the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs that connect PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers.Your mission will be to create seamless support for a broad ecosystem of large AI models, and … impact technical and business milestones that drive the success of Flux. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimise Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximise throughput. Mentor & Encourage Standards: Lead code … C/C++ programming. Deep expertise in ML framework internals, compilers, low-level programming, and optimisation techniques. Deep expertise in optimising Tensorflow, PyTorch or JAX deep learning models. Deep expertise with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
Compiler Engineers to join the team building the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs that connect PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers.Your mission will be to create seamless support for a broad ecosystem of large AI models, and … impact technical and business milestones that drive the success of Flux. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimise Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximise throughput. Mentor & Encourage Standards: Lead code … C/C++ programming. Deep expertise in ML framework internals, compilers, low-level programming, and optimisation techniques. Deep expertise in optimising Tensorflow, PyTorch or JAX deep learning models. Deep expertise with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
to architect and build the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs. You will own integration with PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers. Your mission will be to create seamless support for a broad ecosystem of large AI models … ensure software and hardware are designed together to deliver maximum performance. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimize Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximize throughput. Mentor & Define Standards: Lead code … on C/C++ programming. Extensive experience in ML framework internals, compilers, low-level programming, and optimisation techniques. Extensive experience optimising Tensorflow, PyTorch or JAX deep learning models. Extensive experience with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
to architect and build the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs. You will own integration with PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers. Your mission will be to create seamless support for a broad ecosystem of large AI models … ensure software and hardware are designed together to deliver maximum performance. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimize Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximize throughput. Mentor & Define Standards: Lead code … on C/C++ programming. Extensive experience in ML framework internals, compilers, low-level programming, and optimisation techniques. Extensive experience optimising Tensorflow, PyTorch or JAX deep learning models. Extensive experience with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
to architect and build the ML backend (compiler, run-time, and debugger) for our next-generation OPTUs. You will own integration with PyTorch, Tensorflow, JAX, and MXNet down to our low-level kernel drivers. Your mission will be to create seamless support for a broad ecosystem of large AI models … ensure software and hardware are designed together to deliver maximum performance. Architect & Build: Design and implement our compiler, runtime, and debugger for PyTorch, TensorFlow, JAX, and MXNet on custom hardware. Optimize Performance: Apply advanced techniques (layout, fusion, scheduling, tiling) to eliminate bottlenecks and maximize throughput. Mentor & Define Standards: Lead code … on C/C++ programming. Extensive experience in ML framework internals, compilers, low-level programming, and optimisation techniques. Extensive experience optimising Tensorflow, PyTorch or JAX deep learning models. Extensive experience with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM. Practical experience applying machine learning in high-performance computing contexts. More ❯
optimisation patterns such as op fusion, layout optimisation, tiling, and scheduling. Debugger & Diagnostics Implement optional runtime assertions, compile-time dump mechanisms, TensorBoard timelines, and JAX I/O callback support. Functional Simulator Develop a simulator that mimics our kernel-space driver, enabling the software team to lower operations ahead of … experience in C/C++ software engineering Strong background in compilers , runtime systems , and low-level optimisations Deep familiarity with ML frameworks (e.g., PyTorch, JAX, TensorFlow) and their execution models Experience with high-performance computing or hardware-software co-design Strong problem-solving skills with a creative and pragmatic mindset More ❯
London, England, United Kingdom Hybrid / WFH Options
Alcides Fonseca
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team More ❯
At UnlikelyAI, we're looking for a Senior Applied Scientist to join our Applied Science team. This is a high-impact individual contributor position. You'll help drive the end-to-end lifecycle of projects: from identifying opportunities in literature More ❯
At UnlikelyAI, we're looking for a visionary Staff Applied Scientist to play a key leadership role in our Applied Science team and across the company. This is a high-impact individual contributor position with mentoring and coaching responsibilities - ideal More ❯
learning fundamentals.GPU compute programming models & optimization techniques.GPU compute framework development, maintenance, and optimization.Machine learning development using one or more ML frameworks (TensorFlow, PyTorch or JAX).Experience with adding computational graph support, runtime or device backend to Machine learning libraries (TensorFlow, PyTorch or JAX) support is a plus.Experience with high performance More ❯
Rd). Your Focus Design and implement robust experiments to evaluate specific model capabilities Build and maintain high-frequency evaluation pipelines using PyTorch or JAX Engineer benchmark datasets — collecting, filtering, and decontaminating data for meaningful evals Create evaluation protocols that measure the right capabilities and avoid metric gaming Research and … Proficient hands-on experience in machine learning, ideally with a focus on experimental design or evaluation Strong engineering skills in Python and PyTorch (or JAX) Deep understanding of training and evaluating large-scale deep learning models A scientific mindset — you know how to design a clean experiment and what makes More ❯
Rd). Your Focus Design and implement robust experiments to evaluate specific model capabilities Build and maintain high-frequency evaluation pipelines using PyTorch or JAX Engineer benchmark datasets — collecting, filtering, and decontaminating data for meaningful evals Create evaluation protocols that measure the right capabilities and avoid metric gaming Research and … Proficient hands-on experience in machine learning, ideally with a focus on experimental design or evaluation Strong engineering skills in Python and PyTorch (or JAX) Deep understanding of training and evaluating large-scale deep learning models A scientific mindset — you know how to design a clean experiment and what makes More ❯
Life at UiPath The people at UiPath believe in the transformative power of automation to change how the world works. We're committed to creating category-leading enterprise software that unleashes that power. To make that happen, we need people More ❯
Expertise and Requirements With expertise in production environments performing tasks like ML deployment using technologies such as PyTorch, TensorFlow, and JAX. Also, proficiency in frontend skills using React, TypeScript, or similar technologies. Experience in big data, real-time analytics, and More ❯