101 to 125 of 421 Permanent Observability Jobs

Senior Lead AI Engineer

Hiring Organisation
Capital One
Location
Richmond, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer

Hiring Organisation
Capital One
Location
Annapolis, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer

Hiring Organisation
Capital One
Location
Washington, Washington DC, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer

Hiring Organisation
Capital One
Location
Dover, Delaware, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Cambridge, Massachusetts, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
York, Pennsylvania, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Annapolis, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Washington, Washington DC, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Harrisonburg, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Dover, Delaware, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Site Reliability Engineer

Hiring Organisation
Profile 29
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
performant infrastructure that underpins critical public-sector services. Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready. … PostgreSQL, Elasticsearch, and MongoDB Configure and support identity and access management (IdAM) solutions such as Keycloak Monitor system health, performance, and capacity using modern observability stacks (Prometheus, Grafana, ELK, OpenTelemetry) Champion DevSecOps practices, embedding security and compliance into every stage of delivery Automate deployment, scaling, and recovery processes to improve ...

Senior Python Developer

Hiring Organisation
Maxwell Bond
Location
Nationwide, United Kingdom
Employment Type
Permanent
Salary
£70000 - £100000/annum
REST Framework Create modern frontends with TypeScript and Vue.js or React Work across AWS, GCP, and Azure Support production systems with Datadog and strong observability What we are looking for 5+ years commercial Python experience Strong JavaScript and TypeScript skills Experience with Vue.js or React NoSQL databases such as MongoDB ...

Cloud Infrastructure & DevOps Engineer (Azure)

Hiring Organisation
4Square Recruitment Ltd
Location
Southall, Middlesex, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £65,000 per annum
Azure) Driving automation through Infrastructure-as-Code (Terraform) Building and maintaining Azure DevOps YAML pipelines (CI/CD) Monitoring performance and availability using modern observability tools Improving backups, patching, disaster recovery and overall resilience Working closely with development teams to ensure systems are secure, scalable and supportable Supporting cloud governance ...

Platform Engineer

Hiring Organisation
C4S Search Ltd
Location
Gloucestershire, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £60,000 per annum
platform and contribute to new feature development. RESPONSIBILITIES: Own the health, performance, and reliability of core platforms and infrastructure Proactively monitor and improve observability, resilience, and availability Manage and optimise Azure, Windows infrastructure, and SQL Server environments Support and enhance CI/CD pipelines and deployment processes Collaborate with development ...

Sr. Project Manager/Program Manager - Digital Twin / AIOps (OSS)

Hiring Organisation
Stackstudio Digital Ltd
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent
Salary
£70,000
streaming/data pipelines (Kafka, Pub/Sub, Dataflow) Familiarity with cloud-native (GCP/Azure/AWS), Kubernetes, API-first integration, and observability stacks (metrics, logs, traces) Experience in ML/AI (feature engineering, MLOps, model monitoring), automation frameworks (RPA/BPM/runbooks), and security/compliance. Proven ...

Infrastructure Networking Engineer (GKE Specialist)

Hiring Organisation
Searchability NS&D
Location
England, United Kingdom
Evaluate and optimise Cluster Architecture and Tenancy configurations. Assess and improve Networking and Connectivity setups within the cloud environment. Review Security protocols, Operations, and Observability standards. Analyse Automation processes and CI/CD pipelines for efficiency. Audit Cost Management, Billing structures, and Testing methodologies. Key Skills: Google Kubernetes Engine ...

Java Software Engineer

Hiring Organisation
La Fosse
Location
England, UK
Azure/GCP) • Strong testing mindset (unit, integration, contract) and automation awareness • Understanding of OAuth2/OIDC, JWT and general security patterns • Familiarity with observability tools (OpenTelemetry, Prometheus, Grafana, ELK etc.) Nice to have • Previous work with healthcare or regulated environments • Experience in distributed systems or platform teams • Experience mentoring ...

Senior Software Engineer - Backend

Hiring Organisation
Fruition Group
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£70,000
decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall ...

Senior Backend Engineer

Hiring Organisation
Fruition Group
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£70,000
decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall ...

AWS Solutions Architect

Hiring Organisation
Henderson Scott
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£550 - £575 per day
decision-making. Desirable (Optional) Experience: We value architects who can adapt quickly to unfamiliar tools. Experience in any of the following is highly advantageous: Observability: Elasticsearch Stack, Dynatrace, Prometheus, or Grafana. Security & Identity: Hashicorp Vault, LDAP, Redhat SSO, OIDC, and Firewalling (Fortigate/AWS Network Firewall). Infrastructure/DevOps ...

Java Developer – Market Connectivity

Hiring Organisation
Solytics Partners
Location
City of London, London, United Kingdom
resolve connectivity, performance, and messaging issues in production. Conduct latency tuning, load testing, and system optimization. Collaborate with SRE/DevOps teams to improve observability, CI/CD, and deployment pipelines. Key Requirements: Strong expertise in Core Java, including concurrency, memory management, and GC tuning. Hands-on experience integrating ...

Machine Learning Engineer

Hiring Organisation
algo1
Location
City of London, London, United Kingdom
serving latency or pipeline robustness. Month 3: Own and deliver a major infrastructure component (e.g., feature store, training orchestration, or model registry); improve system observability with logging, metrics, and alerting. Month 6: Lead the end-to-end productionisation of our foundation model, meeting latency, throughput, and reliability SLAs; mentor teammates ...

Senior Engineer Data, AI & Analytics (m/w/d) - Hybrid

Hiring Organisation
Purpose Green
Location
Charlottenburg, Berlin, Germany
Employment Type
Permanent
Salary
EUR 50,000 - 75,000 Annual
Redshift, OpenSearch) • Hands-on experience deploying AI/LLM-based systems into production • Experience using dbt Cloud for transformation pipelines • Familiarity with tracing and observability (e.g., Langfuse, OpenTelemetry) • Experience preparing datasets and running supervised fine-tuning (SFT) of LLMs • Exposure to reverse ETL tools (e.g., Census, Hightouch) or building custom ...