Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Speechmatics Limited
Speechmatics is a cutting-edge applied AI Research company that is breaking down cultural barriers by building diverse and inclusive speech technology. We are looking for an experienced Machine Learning Engineer who can help us advance our automatic speechrecognition (ASR) engines for Flow, our new Conversational AI companion and create interactive voice … across dozens of GPUs. It's all about finding bottlenecks across our 50+ languages. Our main research focuses arelarge-scale self-supervised learning, building state-of-the-art speech pipelines and breaking ground on our own emotive text-to-speech (TTS) models. What you'll be doing Working with a diverse group of engineers across Speechmatics. … and the impact on WER. Developing new, state-of-the-art AI features, from training models, all the way to shipping it to production. Advancing end-to-end speech models and researching new approaches to TTS. We aim to get you onboarded and started on projects in your first few days. In addition, having a very collaborative culture More ❯
Build the future of Conversational AI with ConnexAI As a Data Scientist on the Automatic SpeechRecognition (ASR) team, you will be at the heart of building the voice models that power real-time AI systems used by millions globally daily. You’ll design, train, and refine cutting-edge ASR models that turn speech into actionable insights, helping our AI understand and respond with accuracy and speed. This is your chance to experiment with the latest architectures, push the boundaries of speech technology, and see your models make an immediate impact on products that transform how people and businesses communicate. This is a pivotal moment for ConnexAI, as we expand our … boundaries of conversational intelligence. Join us, and be part of the team setting the industry standard. Core Responsibilities Use Python to develop SOTA streaming and non-streaming automatic speechrecognition models with K2 (preferably) or NeMo toolkits. Experiment with new modelling approaches, neural architectures, and training techniques to improve ASR accuracy and latency. Work with More ❯
warrington, cheshire, north west england, united kingdom
ConnexAI
Build the future of Conversational AI with ConnexAI As a Data Scientist on the Automatic SpeechRecognition (ASR) team, you will be at the heart of building the voice models that power real-time AI systems used by millions globally daily. You’ll design, train, and refine cutting-edge ASR models that turn speech into actionable insights, helping our AI understand and respond with accuracy and speed. This is your chance to experiment with the latest architectures, push the boundaries of speech technology, and see your models make an immediate impact on products that transform how people and businesses communicate. This is a pivotal moment for ConnexAI, as we expand our … boundaries of conversational intelligence. Join us, and be part of the team setting the industry standard. Core Responsibilities Use Python to develop SOTA streaming and non-streaming automatic speechrecognition models with K2 (preferably) or NeMo toolkits. Experiment with new modelling approaches, neural architectures, and training techniques to improve ASR accuracy and latency. Work with More ❯
bolton, greater manchester, north west england, united kingdom
ConnexAI
Build the future of Conversational AI with ConnexAI As a Data Scientist on the Automatic SpeechRecognition (ASR) team, you will be at the heart of building the voice models that power real-time AI systems used by millions globally daily. You’ll design, train, and refine cutting-edge ASR models that turn speech into actionable insights, helping our AI understand and respond with accuracy and speed. This is your chance to experiment with the latest architectures, push the boundaries of speech technology, and see your models make an immediate impact on products that transform how people and businesses communicate. This is a pivotal moment for ConnexAI, as we expand our … boundaries of conversational intelligence. Join us, and be part of the team setting the industry standard. Core Responsibilities Use Python to develop SOTA streaming and non-streaming automatic speechrecognition models with K2 (preferably) or NeMo toolkits. Experiment with new modelling approaches, neural architectures, and training techniques to improve ASR accuracy and latency. Work with More ❯
minded, and a little restless—people who want to break the mould and try things that haven’t been done before. One week you might be experimenting with speechrecognitionor generative sound; the next, building a service that scales to millions. You’ll own projects end to end, obsessing over details and making sure … just function but feels right. Responsibilities: Leverage the latest technologies to build transformative experiences across our portfolio of devices Ideate, build and iterate quickly on projects that span speechrecognition, sound and image generation, behavioural understanding and data collection at scale Develop and fine-tune models to power cloud and on-device AI functionalities Use More ❯
minded, and a little restless—people who want to break the mould and try things that haven’t been done before. One week you might be experimenting with speechrecognitionor generative sound; the next, building a service that scales to millions. You’ll own projects end to end, obsessing over details and making sure … just function but feels right. Responsibilities: Leverage the latest technologies to build transformative experiences across our portfolio of devices Ideate, build and iterate quickly on projects that span speechrecognition, sound and image generation, behavioural understanding and data collection at scale Develop and fine-tune models to power cloud and on-device AI functionalities Use More ❯
london (city of london), south east england, united kingdom
Nothing
minded, and a little restless—people who want to break the mould and try things that haven’t been done before. One week you might be experimenting with speechrecognitionor generative sound; the next, building a service that scales to millions. You’ll own projects end to end, obsessing over details and making sure … just function but feels right. Responsibilities: Leverage the latest technologies to build transformative experiences across our portfolio of devices Ideate, build and iterate quickly on projects that span speechrecognition, sound and image generation, behavioural understanding and data collection at scale Develop and fine-tune models to power cloud and on-device AI functionalities Use More ❯
team is looking for ambitious experienced machine learning engineers with a background in one or more of the following areas: artificial intelligence, natural language understanding, machine learning, automated speechrecognition, conversational systems and building large scale production systems. You have a once in a lifetime opportunity to contribute to building the very core of the … CS fundamentals, with the ability to write algorithms with ease Good understanding of machine learning fundamentals like regression, classification, ranking, neural networks, and sequence models for text/speech Experience with advanced machine learning techniques, including LLM and generative AI Proven familiarity and experience deploying and maintaining machine learning models (transformer based models in production is a plus More ❯
in the UK, we work at the intersection of artificial intelligence, linguistics, and human insight. Our global clients rely on us to deliver top-tier datasets for Automatic SpeechRecognition (ASR), Text-to-Speech (TTS), Direct Preference Optimisation (DPO), Sentiment Analysis, and more. We’re a trusted data partner for model development, and … AI researchers to refine annotation standards. Identify and flag edge cases or quality issues in datasets. Contribute to projects related to training Large Language Models (LLMs) and improving speech AI systems. What We're Looking For Exceptional attention to detail — you can spot inconsistencies and anomalies in text and audio. Strong linguistic or cultural knowledge (multilingual skills are More ❯