AI/ML Engineer - Voice AI/LLMs, Speech-to-Text

AI/ML Engineer - Voice AI/LLMs, Speech-to-Text, TTS

My client is building the future of voice AI in telecoms and are looking for an exceptional AI/ML Engineer to join them as part of the founding team.

The company is backed by a number of industry leading Global enterprises who have joined forces to build cutting edge technology together.

This is your chance to shape and help scale a brand-new venture at the cutting edge of telecommunications and artificial intelligence. You'll be the technical owner of their AI stack, integrating and optimizing world-class models from providers like OpenAI, Google, Anthropic, Deepgram, ElevenLabs, and more.

  • Own the AI Pipeline: From STT - LLM - TTS, you'll build, test, and optimize the voice AI stack.
  • Integrate Leading Models: Leverage APIs from top providers across Speech-to-Text, Text-to-Speech, Voice Cloning, and Large Language Models.
  • Benchmark + Optimize: Curate golden datasets to evaluate model accuracy, latency, and cost for real-world telecom use cases.
  • Enhance Personalization: Develop AI features for emotional intelligence, cultural nuances, and regional accents.
  • Collaborate Cross-Functionally: Work side-by-side with architects, Back End engineers, telco experts, and product managers.

Candidates should have experience in some of the following areas:

  • Strong hands-on experience integrating AI/ML APIs into production systems
  • Strong Python skills + comfort with data science libraries (Pandas, NumPy, etc.)
  • Experience with voice-related AI (STT, TTS, or Voice AI)
  • Deep understanding of latency vs accuracy vs cost trade-offs
  • Familiar with prompt engineering and RAG techniques
  • Background in microservices, APIs, and cloud-based deployments
  • Experience with MLOps tools and monitoring platforms (eg, DataDog, Arize)

Any experience in the following areas is a nice to have:

  • Experience with Voice LMMs (like GPT-4o, Nova Sonic)
  • Previous work in telecommunications or Real Time speech systems
  • Familiarity with voice cloning, multilingual models, or PEFT techniques
  • Knowledge of privacy and security protocols for voice data
  • Master's degree in ML, AI, or a related field

This is a career defining role and an opportunity to create something truly special. It is based in London with offices near Barbican and 3 days a week on-site on average. Salaries are flexible to attract the best talent Globally.

Company
Ventula Consulting
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 130,000 Annual
Posted
Company
Ventula Consulting
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 130,000 Annual
Posted