Artificial Intelligence Engineer
- Hiring Organisation
- Cubiq Recruitment
- Location
- Preston, Lancashire, UK
- Employment Type
- Full-time
system. Flexible setup: Remote across the EU, with optional co-working in London or Barcelona. What you'll do Obsessive about latency, you think in milliseconds, optimise for concurrency, and understand the trade-offs between speed, cost, and model performance. Design, implement, and productionise multi-agent LLM systems that … reason, plan, and coordinate. Develop FastAPI-based microservices optimised for low latency and high reliability. Engineer and evaluate RAG pipelines: hybrid retrieval, re-ranking, grounding, and context validation. Integrate real-time voice interfaces (STT/TTS, WebRTC, LiveKit) into intelligent conversational flows. Instrument and evaluate system performance using ...