201 to 225 of 536 Observability Jobs

Senior Backend Developer (.NET | AI-First SaaS Platform)

Hiring Organisation
Keepnet
Location
United Kingdom
/CD (Azure), and post-release monitoring. Work on distributed and asynchronous systems using message queues, background workers, and event-driven workflows. Use observability signals (logging, metrics, tracing, tools like Sentry) to proactively detect, diagnose, and prevent production issues. Collaborate closely with frontend, product, and customer-facing teams to deliver ...

FX e Trading Senior Full Stack Engineer (React TypeScript + Java)

Hiring Organisation
Atrium Workforce Solutions Ltd
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£759 - £900 per day
with modular, reusable components; drive state management, data visualization, and UX for trading workflows. Optimize latency, throughput, and reliability across the stack; instrument observability (metrics, tracing, logging) and performance profiling. Establish engineering best practices: code standards, testing strategies (unit/integration/E2E), CI/CD, secure coding, and release ...

AI Solutions Manager

Hiring Organisation
Durlston Partners
Location
Greater London, England, United Kingdom
quantitative models, enterprise data pipelines, and real-time reasoning systems. Evolve team structures and capacity models to optimize delivery. Own safety, security, compliance, and observability of AI systems operating in production investment environments. Maintain deep awareness of emerging AI technologies, agentic patterns, and best practices. How Success Is Measured Strength ...

Lead Software Engineer

Hiring Organisation
We Are Dcoded Limited
Location
Manchester, North West, United Kingdom
Employment Type
Permanent
Salary
£95,000
Contributing to architectural decisions while remaining grounded in delivery Helping your squad plan effectively using a now/next/later mindset Championing reliability, observability, and supportability. You build it, you help run it Improving automation, CI/CD pipelines, and engineering standards incrementally Supporting and mentoring others through pairing ...

Lead Test Analyst

Hiring Organisation
Horwich Farrelly
Location
Salford, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
enable fast, reliable testing at scale. Coordinate cross-team dependencies, test data, and environment scheduling for parallel initiatives Champion shift-left/shift-right, observability and risk-based release decisions. Oversee UAT, regression and sign-off on release quality, with clear quality metrics and executive reporting; confident go/ ...

Senior Software Engineer (SatOS Team)

Hiring Organisation
Spire
Location
Glasgow, Scotland, United Kingdom
Contribute to the continuous improvement of our development processes and tools Perform ground-based testing and in-orbit verification of new software services Implement observability solutions for satellite-side services Work with our customers to translate their requirements into effective software solutions Key Skills: 5+ years experience in professional software ...

Technical Lead

Hiring Organisation
Perch Group
Location
North West, England, United Kingdom
development using Azure Data Factory Exposure to DataBricks, Synapse or Spark Experience working within event-driven architectures Understanding of DevOps, IaC (Terraform/Bicep), Observability The Application Timeline A first stage video call with the internal talent acquisition team (15 minutes) A second stage teams interview with the hiring manager ...

Senior Software Engineer

Hiring Organisation
Perch Group
Location
North West, England, United Kingdom
Factory Familiarity with Databricks , Synapse , or Spark Experience working within event-driven architectures Understanding of DevOps principles , Infrastructure as Code (Terraform/Bicep) , and observability best practices The Application Timeline A first stage phone call with the internal talent acquisition team (15 minutes) A second stage competency Teams call interview ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
New York, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Cambridge, Massachusetts, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Baltimore, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Charlottesville, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Washington, Washington DC, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Dover, Delaware, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

AI Engineer

Hiring Organisation
Champion Data
Location
London Area, United Kingdom
strong bias toward practical usability and maintainability. Applied AI Integration - Apply modern AI capabilities pragmatically to real engineering and operational problems, focusing on reliability, observability, and safe deployment rather than novelty alone. Innovation Handover - Work closely with core engineering teams to transition successful prototypes into production-ready solutions, including documentation ...

Lead AI Engineer (FM Hosting, LLM Inference)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Software Engineer (Dublin, Hybrid)

Hiring Organisation
G Treasury SS, LLC
Location
Dublin, Ireland
Employment Type
Permanent
Salary
EUR 125,000 - 150,000 Annual
flags, ensuring seamless integration and deployment Conduct rigorous unit, integration, and non-functional (performance, security) testing to guarantee our software is production-ready Leverage observability tools and logging to troubleshoot and resolve issues across development, test, and production environments Share your enthusiasm for tech trends, explore and learn new technologies ...

Site Reliability Engineer / SRE / Systems Engineer

Hiring Organisation
AWD Online
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
effective incident management across live environments. This Site Reliability Engineer/Systems Engineer role offers the chance to work with modern cloud technologies, containerisation, observability tools and automation practices, while influencing long-term reliability improvements across business-critical systems. APPLY TODAY Ready to make your next career move? Apply … live production issues through to resolution or handover System Monitoring and Availability: Maintaining high availability, performance and scalability of production platforms and services Observability Implementation: Managing logging, monitoring, alerting and metrics to proactively identify and resolve issues Reliability Improvements: Collaborating with development teams to translate operational insights into long-term ...

Senior Site Reliability Engineer - AI Platform

Hiring Organisation
N26 GmbH
Location
Berlin, Germany
Employment Type
Permanent
Salary
EUR Annual
team's strategy, roadmap, and architecture. Drive incident management and troubleshooting efforts, ensuring a stable and predictable AI development and deployment environment. Improve observability and monitoring, ensuring the AI Platform meets performance and compliance requirements. What you need to be successful Background and skills: Strong hands-on experience in designing … security best practices in cloud environments. Hands-on experience with CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, or similar). Familiarity with observability tools (DataDog, Prometheus, Grafana, OpenTelemetry). Nice to have: Experience in AI/ML production systems and the unique challenges of scaling AI workloads. Experience ...

Senior Site Reliability Engineer

Hiring Organisation
Alexander Ash Consulting
Location
Scotland, United Kingdom
drive SRE strategy, standards, and maturity across complex platforms. Design, build, and operate resilient, scalable, and secure infrastructure. Lead reliability engineering initiatives, including automation, observability, and incident management. Provide senior technical leadership during major incidents and drive long-term remediation. Use data, metrics, and SLOs to drive continuous improvement ...

Mobile Application Engineer

Hiring Organisation
eTeam Workforce Limited
Location
Burgess Hill, Sussex, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Back End for front ends for mobile applications using Kotlin or Java 17 and above, Spring and build automation with Maven or Gradle. Observability - Sentry, ELK, Dynatrace. Experienced technically leading an agile engineering team and contributing to agile ceremonies. Deep Knowledge of cloud and CI/CD technologies ...

Contract AI Software Engineer - SC Cleared

Hiring Organisation
Searchability NS&D
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£500 - £700 per day
place and transferable) Desirable Experience Experience building agentic workflows or orchestration layers for LLMs Exposure to MLOps, model monitoring, or AI system observability Previous experience working in government, defence, or highly regulated environments Working Arrangements Hybrid working - mixture of London office and remote Outside IR35 contract Apply ...

Senior ML Infrastructure Engineer

Hiring Organisation
Harnham
Location
England, United Kingdom
large, heterogeneous datasets • Scale public-facing data infrastructure supporting ML research • Optimise distributed AI workloads for latency, throughput, reliability, and GPU utilisation • Build observability tooling for data quality, pipeline health, and experiments • Support GPU infrastructure for large-scale model training • Translate research prototypes into robust, production systems • Scope and supervise ...

Scala Developer - Remote Contract - Immediate Start

Hiring Organisation
Stealth IT Consulting Limited
Location
United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
£380 per day £380 per day (Inside IR35)
environment (Scrum/Kanban). Participate in code reviews, architecture discussions, and pair programming sessions. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Assist in defining CI/CD pipelines and deployment processes (e.g., Jenkins, GitHub Actions, Concourse). Produce concise technical documentation ...

Lead Data Engineer | TechBio Platform | GCP, BigQuery, Terraform, DBT

Hiring Organisation
Cubiq Recruitment
Location
City of London, London, United Kingdom
someone who can come in and make sense of that environment quickly... design ingestion patterns for huge data drops, think carefully about cost and observability, and work closely with internal medical teams to translate real-world questions into production systems. Big egos will not fit in here, they ...