6 of 6 Remote Reinforcement Learning Jobs

Machine Learning Engineer

Hiring Organisation: Experis UK
Location: London, UK

Title: Machine Learning Engineer
Location: London, UK (Hybrid – 2–3 days onsite per week)
Contract Type: Contract
Duration: 6–12 months (possibility of extension)
Start Date: ASAP

Overview: We are seeking an experienced Machine Learning Engineer to join our data science and AI engineering team on a contract … basis in London. The ideal candidate will be responsible for designing, developing, and deploying machine learning models and scalable data pipelines that support advanced analytics and intelligent automation initiatives. This role offers a hybrid work arrangement, combining flexibility with collaboration, and is ideal for a contractor who thrives ...

Machine Learning Engineer

Hiring Organisation: Higher - AI recruitment
Location: City of London, Greater London, UK

partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers. This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded … sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using ...

Software Engineer (Applied AI)

Hiring Organisation: Euphoric
Location: United Kingdom

iteration of our next-generation benefits platform features that leverage personalization, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript … against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural ...

Full Stack Engineer

Hiring Organisation: Higher - AI recruitment
Location: City of London, Greater London, UK

deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology ...

Data Scientist

Hiring Organisation: Randstad Technologies Recruitment
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: £450 - £480/day

preprocessing to ensure high-quality inputs for ML models. Model Development: Select and train appropriate architectures (BERT, GPT, etc.) using supervised, unsupervised, or reinforcement learning strategies. Prompt Engineering: Design, test, and iterate on complex prompts to elicit high-quality responses from LLMs while mitigating unintended behaviors. Evaluation & Optimization … establish automated monitoring systems to track drift and performance. Technical Requirements Core AI/ML: Strong experience in ML algorithms, LLM architectures, and deep learning frameworks. Generative AI: Proven expertise in Prompt Engineering and fine-tuning pre-trained models. Engineering: Proficiency in Python and experience designing data pipelines ...

AI Engineer

Hiring Organisation: Amber Labs
Location: City of London, Greater London, UK

deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. ...