Data Engineer (Speech)
Manchester, UK
ConnexAI
Build the future of Conversational AI with ConnexAI. As a Speech Data Engineer, your work will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating and engineering the voice data that fuels our models, you'll help shape products that transform how … scale speech datasets, ensuring they are correctly versioned, catalogued, and easy to retrieve. Build automated pipelines using Python, Bash, AWS, Docker, and automation tools to clean and validate speech data, identifying duplicates, corrupted files, and inconsistencies. Coordinate with the annotation team to manage labelling workflows and ensure high-quality, consistent annotations across datasets. Set up and monitor automated model … evaluation pipelines, tracking metrics such as Word Error Rate (WER) and Character Error Rate (CER) to provide actionable feedback to engineering and data science teams. Prepare specialised datasets for experimental or production models, supporting the development of cutting-edge ASR and conversational AI systems. Ensure all datasets are securely backed up, recoverable, and well-documented for use across …
Employment Type: Part-time