paced environment big problems ignite us: we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment … we back each other to deliver impact. Make Wayve the experience that defines your career. The role: We're looking for a curious and motivated Reinforcement Learning Intern to help advance the next generation of decision-making systems for autonomous driving. In this role, you'll work embedded in a research team to develop scalable RL algorithms that … enable vehicles to learn complex behaviors directly from experience, both in simulation and the real world. The ideal candidate has experience in some combination of reinforcement learning, imitation learning, offline RL, or world modelling, and is motivated to apply cutting-edge research ideas to real-world embodied AI challenges. We're particularly interested in temporal credit assignment …
London, South East England, United Kingdom Hybrid/Remote Options
Anthropic
of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You want to build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems. You care about making AI helpful, honest, and harmless, and are interested in the ways that … Testing the robustness of our safety techniques by training language models to subvert our safety techniques, and seeing how effective they are at subverting our interventions. Run multi-agent reinforcement learning experiments to test out techniques like AI Debate. Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks. Write scripts and prompts to efficiently produce … efforts. Pick up slack, even if it goes outside your job description. Care about the impacts of AI. Strong candidates may also: have experience authoring research papers in machine learning, NLP, or AI safety; have experience with LLMs; have experience with reinforcement learning; have experience with Kubernetes clusters and complex shared codebases. Candidates need not have …
Senior Quant Machine Learning Engineer sought by leading investment bank based in the city of London. **Inside IR35, 4 days a week on site** The role: To lead the design and deployment of ML-driven models across our trading and investment platforms. This is a high-impact, front-office role offering direct collaboration with traders, quant researchers, and technologists … and scalable model infrastructure for production deployment. Mentor junior quants and engineers; contribute to knowledge-sharing and model governance processes. Stay current with cutting-edge ML research (e.g., deep learning, generative models, reinforcement learning) and assess applicability to financial markets. Collaborate closely with cross-functional teams, including traders, data engineers, and software developers. What We're Looking … fund, or tech firm. Advanced degree (PhD or Master's) in Computer Science, Mathematics, Physics, Engineering, or related discipline. Strong expertise in modern ML techniques: time-series forecasting, deep learning, ensemble methods, NLP, or RL. Expert-level programming skills in Python and strong understanding of software engineering best practices. Experience deploying ML models to production in real-time or …
/orchestration tools such as Docker and Kubernetes. Proficiency in backend programming languages such as Go, Rust, Python, or C++. Familiarity with LLM inference pipelines, multi-agent architectures, or reinforcement learning environments is a strong plus. Knowledge of network optimization, data streaming, and caching architectures preferred. Excellent collaboration and communication skills. Ability to commit 20–30 hours per …
in the field, using your knowledge of state-of-the-art techniques to solve problems. LLMs will be your bread and butter, customized with advanced RAG techniques, finetuning and reinforcement learning. You'll work closely with other engineers to build fast, and you'll use Python and Kubernetes to deploy systems in production. What you'll do: Work with … LLMs understand and interact with the millions of lines of code that run Kraken, leveraging techniques at the cutting-edge of the technology like GraphRAG, agentic workflows, finetuning, and reinforcement learning. Use classic ML and NLP techniques to complement and improve LLM systems. Act as a center of excellence for the whole business in AI, as a floating …
prompt filtering etc. What You'll Have: Deep Understanding of GenAI - 2+ years of experience working with LLMs and using techniques such as tool calling, agentic workflows, finetuning and reinforcement learning. Data Product Development - 5+ years of experience building Python-based applications and/or data products, with hands-on work in data-intensive and machine learning systems.