151 to 175 of 311 Low Latency Jobs

Staff Software Engineer - Machine Learning

Hiring Organisation
17918
Location
London, United Kingdom
LangGraph, LangChain, VectorDBs, RAG)Understanding of responsible AI practices, including guardrails, hallucination mitigation, and output quality management for AI systems Experience designing and scaling low-latency, customer-facing ML/AI architectures Proven experience setting a multi-team ML/AI technical vision and strategyStrong track record ...

Principal Machine Learning Scientist (Experiences)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Solve complex, non‐linear travel journeys by utilizing sequential recommenders, representation learning, and deep multi‐objective frameworks. System Architecture & Scalability: Oversee the deployment of lowlatency, high‐throughput retrieval and ranking pipelines (e.g., multi‐stage retrieval, vector search) capable of processing billions of travel data points (reviews, photos ...

Senior Trading Platform Engineer — Low-Latency Java & AI

Hiring Organisation
Jobleads-UK
Location
United Kingdom
Bullish, Inc. is looking for an experienced Java Developer to join their Trading Product Engineering team. In this role, you will design and implement efficient Java code that supports the core functionalities of the Bullish ...

Energy Trading - Principal Software Engineer London

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
proposition based on your experience of the sector and our work to date Navigate the nuances of physical and financial trading flows, data latency constraints, reliability and regulatory obligations Technical Expertise This is a hands‐on role requiring deep technical execution alongside strategic leadership. The ideal candidate brings proven … areas: Data Engineering: Proven experience with streaming platforms (e.g. Kafka), real‐time processing frameworks and modern data stack tools Architecture: Demonstrated ability to design lowlatency, high‐throughput data systems with microsecond‐level performance requirements Cloud Platforms: Hands‐on experience with AWS and or Azure, including managed services ...

Staff Machine Learning Engineer, ML Infrastructure

Hiring Organisation
SimpliSafe
Location
Cambridge, Massachusetts, United States
Employment Type
Permanent
Salary
USD Annual
design and evolution of cloud-side inference systems that process live video and events from SimpliSafe devices in real time. Drive throughput, latency, and cost improvements (batching strategies, GPU utilization, autoscaling, multi-model serving) for production CV models. Build the feedback loops between cloud inference, edge devices … networking) and with Kafka, containerized deployments, CI/CD, and infrastructure-as-code. Demonstrated experience designing and operating high-throughput, low-latency inference systems - GPU-aware scheduling, batching, autoscaling, multi-tenancy. Solid grounding in ML fundamentals : how models are trained, evaluated, versioned, deployed, monitored, and rolled back ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Wichita, Kansas, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Omaha, Nebraska, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Baltimore, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Boston, Massachusetts, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Portland, Oregon, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Denver, Colorado, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Madison, Wisconsin, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Tucson, Arizona, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Austin, Texas, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Hartford, Connecticut, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Richmond, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Miami, Florida, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Cincinnati, Ohio, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Louisville, Kentucky, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Seattle, Washington, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Indianapolis, Indiana, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Atlanta, Georgia, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Nashville, Tennessee, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Minneapolis, Minnesota, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...

Staff AI Platform Engineer

Hiring Organisation
Invoca
Location
Tulsa, Oklahoma, United States
Employment Type
Permanent
Salary
USD Annual
production scale and reliability: Design and improve the serving, routing, and control layers that sit between applications and model providers. You will help ensure low latency, high availability, cost efficiency, and strong production behavior.Drive observability and governance: Build and enforce the platform capabilities that make AI systems measurable … them away. You have built frameworks using them to solve various AI concerns for developers. Experience with model infrastructure and operational concerns: You understand latency, throughput, reliability, token/cost management, model-provider integration, and the realities of running AI systems in production. Experience with observability and evaluation ...