london, south east england, united kingdom Hybrid/Remote Options
JPMorganChase
Description The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions for our most challenging business problems. This includes ensuring the quality, integrity, and … generate insights and drive decision-making. The CDAO is also responsible for developing and implementing solutions that support the firm's commercial goals by harnessing artificial intelligence and machine learning technologies to develop new products, improve productivity, and enhance risk management effectively and responsibly. As a Summer Associate within the MLCOE, you will apply sophisticated machine learning methods … to a diverse range of complex domains, including natural language processing, large language models, speech recognition and understanding, reinforcementlearning, and recommendation systems. You will collaborate closely with MLCOE mentors, business experts, and technologists, conducting independent research and deploying solutions into production. A strong passion for machine learning, solid expertise in deep learning with hands-on More ❯
paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment … we back each other to deliver impact. Make Wayve the experience that defines your career The role We're looking for a curious and motivated ReinforcementLearning Intern to help advance the next generation of decision-making systems for autonomous driving. In this role, you'll work embedded in a research team to develop scalable RL algorithms that … enable vehicles to learn complex behaviors directly from experience — both in simulation and the real world. The ideal candidate has experience in some combination of reinforcementlearning, imitation learning, offline RL, or world modelling, and is motivated to apply cutting-edge research ideas to real-world embodied AI challenges. We're particularly interested in temporal credit assignment More ❯
Bishopton, Renfrewshire, Scotland, United Kingdom Hybrid/Remote Options
DXC Technology
for domain-specific applications Implement advanced prompt engineering strategies Leverage Retrieval-Augmented Generation (RAG) for enhanced contextual performance Build intelligent agents using frameworks like LangChain, LlamaIndex, CrewAI, AutoGen Apply reinforcementlearning techniques including Q-learning , policy gradients , and RLlib Collaborate with cross-functional teams to integrate AI solutions into scalable products Ensure best practices in data engineering … GPT, LLaMA, Mistral, Claude) Strong background in fine-tuning and prompt engineering Hands-on experience with RAG pipelines Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen) Solid understanding of reinforcementlearning concepts and tools (Q-learning, policy gradients, RLlib) Azure AI Engineer Associate certification (or willingness to obtain) Bachelor's degree in a relevant field or equivalent … with industry knowledge and technology Why Join Us? Work on impactful AI projects with real-world applications Be part of a collaborative and forward-thinking team Access to continuous learning and development opportunities Flexible working arrangements and a supportive work culture Ready to shape the future of AI? Apply now and bring your expertise to a team that values More ❯
AI Engineer - Remote We are currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing to innovative projects with a … write and produce production-grade code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcementlearningMore ❯
/orchestration tools such as Docker and Kubernetes. Proficiency in backend programming languages such as Go, Rust, Python, or C++. Familiarity with LLM inference pipelines, multi-agent architectures, or reinforcementlearning environments is a strong plus. Knowledge of network optimization, data streaming, and caching architectures preferred. Excellent collaboration and communication skills. Ability to commit 20–30 hours per More ❯
in the field, using your knowledge of state-of-the-art techniques to solve problems. LLMs will be your bread and butter, customized with advanced RAG techniques, finetuning and reinforcement learning. You'll work closely with other engineers to build fast, and you'll use Python and Kubernetes to deploy systems in production. What you'll do Work with … LLMs understand and interact with the millions of lines of code that run Kraken, leveraging techniques at the cutting-edge of the technology like GraphRAG, agentic workflows, finetuning, and reinforcementlearning Use classic ML and NLP techniques to complement and improve LLM systems Act as a center of excellence for the whole business in AI, as a floating More ❯
stapleford, midlands, united kingdom Hybrid/Remote Options
Launch Potato
Computer Science or equivalent experience Nice to Have Experience in adtech, e-commerce, or content platforms Published papers or significant ML contributions Experience with multi-objective optimization Knowledge of reinforcementlearning for personalization Compensation: $130,000 - $250,000 per year Total Compensation Base salary is set according to market rates for the nearest major metro and varies based More ❯
for our customers. You will use and improve multimodal LLMs to achieve new functionality for our customers and optimize their deployments (cloud and edge). Some of our deep learning models are truly tiny - the memory footprint of our smallest computer vision model is just 1MB. You will train and design more accurate models, while also enabling new and … team on the same software project. Nice To Have Trained neural networks that moved into production. Industry experience with efficient inference deployments (cloud or edge). Experience with Deep Reinforcement Learning. Benefits What we offer Competitive salary. Generous equity stake in the company. Relocation assistance. Choose your own laptop. 25 days of paid vacation time in addition to bank More ❯