voice-streaming, RAG optimisation and all in one system. Flexible setup: Remote across the EU, with optional co-working in London or Barcelona. What you’ll do Obsessive about latency, you think in milliseconds, optimise for concurrency, and understand the trade-offs between speed, cost, and model performance. Design, implement, and productionise multi-agent LLM systems that reason, plan … and coordinate. Develop FastAPI-based microservices optimised for lowlatency and high reliability. Engineer and evaluate RAG pipelines : hybrid retrieval, re-ranking, grounding, and context validation. Integrate real-time voice interfaces (STT/TTS, WebRTC, LiveKit) into intelligent conversational flows. Instrument and evaluate system performance using observability and model-faithfulness metrics. What we’re looking for Proven ability More ❯
City of London, London, United Kingdom Hybrid/Remote Options
La Fosse
performing engineering teams in AI/ML & Computer Vision. Strong technical background with an emphasis on architecture and system design. Preference for candidates with experience building real-time or low-latency systems – not essential. Ability to lead without being hands-on – this role is more strategic and people-focused. Excellent communication and stakeholder management skills. If you're More ❯
performing engineering teams in AI/ML & Computer Vision. Strong technical background with an emphasis on architecture and system design. Preference for candidates with experience building real-time or low-latency systems – not essential. Ability to lead without being hands-on – this role is more strategic and people-focused. Excellent communication and stakeholder management skills. If you're More ❯
Proven experience as CTO, Technical Co Founder, or Lead Engineer in DeFi, trading, or high performance fintech. Deep understanding of Solana architecture, Rust, and smart contract design. Experience building lowlatency trading systems or DeFi protocols. Strong grasp of DeFi market structure, perpetuals, and liquidity mechanics. Demonstrated ability to hire, scale, and lead high performing technical teams. High More ❯
every day. As a trusted ally to major technology firms and Fortune 500 brands that build user-generated and GenAI products, ActiveFence empowers security, AI, and policy teams with low-latency Real-Time Guardrails and a continuous Red Teaming program that pressure-tests systems with adversarial prompts and emerging threat techniques. Powered by deep threat intelligence, unmatched harmful More ❯
is a rare opportunity to shape the future of embedded and agentic analytics at scale. Key Responsibilities Define & Drive the Analytics Platform Vision Build and scale a high-performance, low-latency analytics engine that supports embedded dashboards, cross-source joins, and both human and AI-native querying (e.g. Text-to-SQL). Leverage in-app instrumentation (session replays More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Stax - Deeptech Talent
is a rare opportunity to shape the future of embedded and agentic analytics at scale. Key Responsibilities Define & Drive the Analytics Platform Vision Build and scale a high-performance, low-latency analytics engine that supports embedded dashboards, cross-source joins, and both human and AI-native querying (e.g. Text-to-SQL). Leverage in-app instrumentation (session replays More ❯
frontend, backend, data, and cloud. Design clear interfaces and a mediated API layer between services and clients. Lead decisions on scalability, observability, reliability, cost, and security for high-volume, low-latency workloads. Write production code in TypeScript/Node and Angular, pairing with teams on complex features. Establish guardrails: coding standards, architecture principles, testing strategy, CI/CD More ❯
This pioneering deep tech company is developing the future of media technologyenabling scalable, low-latency delivery of rich, immersive content across a variety of platforms. With a focus on spatial and 3D data formats, their work spans everything from XR and interactive streaming to cloud-native media infrastructure. Combining advanced artificial intelligence with classical signal processing, the click More ❯
brilliant minds, cutting-edge tech, and systems that push the limits of speed and performance. Network Engineer – Prop Trading Firm (London) You will design, optimize, and support a global low-latency network environment. This role will be hands-on engineering with the chance to work with some of the fastest systems and newest technologies in the industry. Key More ❯
Delta Live Tables (DLT), and cost-efficient architecture. Role: Designing and developing streaming and batch data pipelines in Databricks (including DLT). Managing structured streaming in continuous mode, ensuring low-latency data delivery. Collaborating with architects, consultants, and business stakeholders to build a unified Azure-based lakehouse. Troubleshooting and optimising performance bottlenecks in high-volume streaming systems. Applying More ❯
Delta Live Tables (DLT), and cost-efficient architecture. Role: Designing and developing streaming and batch data pipelines in Databricks (including DLT). Managing structured streaming in continuous mode, ensuring low-latency data delivery. Collaborating with architects, consultants, and business stakeholders to build a unified Azure-based lakehouse. Troubleshooting and optimising performance bottlenecks in high-volume streaming systems. Applying More ❯
bi-directional communication On-device inference using Edge optimised frameworks Offline-aware & offline-first UX flows using prefetching, caching, and background sync Profile, optimise, and tune the app for lowlatency, efficient memory usage, and battery performance Work closely with AI & ML team members to build context-driven user experiences Help define and evolve a robust mobile architecture More ❯
bi-directional communication On-device inference using Edge optimised frameworks Offline-aware & offline-first UX flows using prefetching, caching, and background sync Profile, optimise, and tune the app for lowlatency, efficient memory usage, and battery performance Work closely with AI & ML team members to build context-driven user experiences Help define and evolve a robust mobile architecture More ❯
Greater Bristol Area, United Kingdom Hybrid/Remote Options
Affinity Partnership Group
high-impact, strategic leadership role at the heart of product innovation shaping end-to-end system performance across radio, baseband, MAC and networking layers to power ultra-secure, ultra-low-latency wireless networks in some of the most demanding environments globally. The Opportunity As Head of System Engineering, you will play a pivotal role in defining the system More ❯
control to ensure secure and consistent connectivity. Monitor and Troubleshoot Network Traffic : Analyse logs, monitor network traffic, troubleshoot performance or access issues, and optimize Zscaler deployments for reliability and low latency. Collaborate with Security Teams : Align Zscaler policies with compliance requirements, assist in audits, and support incident response processes. Administer Microsoft Entra ID : Manage user and group lifecycle, directory More ❯
s biggest players in over 120 countries to trade hundreds of billions of dollars with each other every day. Our trading applications are distributed, fault-tolerant, highly transactional and low-latency. Our team FXGO is a growing team of talented engineers who are passionate about our product. We take pride in building one of the premier FX trading platforms … real-time foreign exchange trading application. This system handles high volumes of pricing data and executes trades with minimal latency. They utilize geographically distributed data centers to reduce trade latency for a global client base, ensuring swift execution for traders worldwide. Continuous optimization of these systems is a primary focus. Who are you? An engineer who is motivated by More ❯