726 to 750 of 1,761 Permanent Observability Jobs

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Fredericksburg, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Annapolis, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Salisbury, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Washington, Washington DC, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Dover, Delaware, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Charlottesville, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent ...

Principal Engineer

Hiring Organisation
Motive Group
Location
Slough, Berkshire, UK
Employment Type
Full-time
experience with Kubernetes and container orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working ...

Principal Engineer

Hiring Organisation
Motive Group
Location
City of London, London, United Kingdom
experience with Kubernetes and container orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working ...

Principal Engineer

Hiring Organisation
Motive Group
Location
London Area, United Kingdom
experience with Kubernetes and container orchestration. A strong grasp of Infrastructure-as-Code (Terraform) and configuration management tools (Ansible, Puppet, or similar). Strong observability experience using tools like Prometheus/Mimir, Loki, Tempo, Grafana, Alertmanager. Experience deploying and operating large-scale GPU clusters or HPC systems (Ideally). Working ...

Cloud DevOps Engineer

Hiring Organisation
Randstad Digital
Location
Birmingham, West Midlands, United Kingdom
Employment Type
Permanent
Salary
£50,000
Lambda, RDS) using CloudFormation and AWS CDK . Pipeline Excellence: Assist in CI/CD development ( GitLab, Jenkins, CodePipeline ). Reliability: Contribute to observability (metrics, logs) and incident response. Security: Implement security controls and secrets management. Code: Automate tasks primarily using TypeScript . Must-Haves: 2-4 years in DevOps ...

AWS Cloud Infra

Hiring Organisation
Randstad Technologies Recruitment
Location
Birmingham, West Midlands, West Midlands (County), United Kingdom
Employment Type
Permanent
Salary
£45000 - £50000/annum
Lambda, RDS) using CloudFormation and AWS CDK . Pipeline Excellence: Assist in CI/CD development ( GitLab, Jenkins, CodePipeline ). Reliability: Contribute to observability (metrics, logs) and incident response. Security: Implement security controls and secrets management. Code: Automate tasks primarily using TypeScript . Must-Haves: 2-4 years in DevOps ...

Senior Software Engineer

Hiring Organisation
ZEREN
Location
London Area, United Kingdom
scaling core data integrations with common SaaS platforms Designing data ingestion pipelines and analytics storage layers Productionising the agent execution platform - improving reliability, observability, and performance Improving developer experience , tooling, and internal platform abstractions Working closely with AI systems (LLMs), including prompt and system design to enable accurate, efficient answers ...

Senior Software Engineer

Hiring Organisation
ZEREN
Location
City of London, London, United Kingdom
scaling core data integrations with common SaaS platforms Designing data ingestion pipelines and analytics storage layers Productionising the agent execution platform - improving reliability, observability, and performance Improving developer experience , tooling, and internal platform abstractions Working closely with AI systems (LLMs), including prompt and system design to enable accurate, efficient answers ...

SRE Engineer Java JVM - Fintech

Hiring Organisation
Client Server
Location
East London, London, United Kingdom
Employment Type
Permanent, Work From Home
driving improvements across the platform. You'll have a broad scope including hands-on coding (Java) to build tools and automation, incident response, observability, performance optimisation and operational excellence, working with large scale Java JVM distribution systems and message brokers such as Kafka and ActiveMQ, with ...

SRE Engineer Java JVM - Fintech

Hiring Organisation
Client Server
Location
London, England, United Kingdom
driving improvements across the platform. You'll have a broad scope including hands-on coding (Java) to build tools and automation, incident response, observability, performance optimisation and operational excellence, working with large scale Java JVM distribution systems and message brokers such as Kafka and ActiveMQ, with ...

AI Augmented Software Engineer

Hiring Organisation
The Skills Network
Location
Selby, North Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£40,000
business objectives. The role will be responsible for the full software development life cycle for specific projects from discovery, coding, testing, deployment, DevOps, observability, and cybersecurity. Key Responsibilities: Proactively communicate progress on projects, blockers, and project timelines to Head of Engineering and Product & Delivery Lead Use cutting-edge AI coding ...

3rd Line Technology Specialist

Hiring Organisation
Irlam Associates
Location
Stockport, Cheshire, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £50,000 per annum, Inc benefits, OTE
Collaborate with managed service providers and internal teams to prioritise workloads effectively. AWS & Cloudflare Responsibilities Multi-account governance and secure AWS network design. Logging, observability, and security with CloudWatch, CloudTrail, GuardDuty, Security Hub, and Inspector, including CIS-aligned hardening. Resilience and DR planning: multi-AZ architecture, backup and restore strategies ...

Data Engineer

Hiring Organisation
Planet Pharma
Location
Slough, Berkshire, UK
Employment Type
Full-time
compute solutions for bioinformatics, ML, and data-intensive workloads. Drive automation using Terraform, Azure DevOps, and CI/CD pipelines. Embed security, compliance, and observability across all cloud components. Collaborate with cross-functional teams to align cloud architecture with data platform strategy. Mentor engineering teams and translate technical requirements into ...

Data Engineer

Hiring Organisation
Planet Pharma
Location
London Area, United Kingdom
compute solutions for bioinformatics, ML, and data-intensive workloads. Drive automation using Terraform, Azure DevOps, and CI/CD pipelines. Embed security, compliance, and observability across all cloud components. Collaborate with cross-functional teams to align cloud architecture with data platform strategy. Mentor engineering teams and translate technical requirements into ...

Data Engineer

Hiring Organisation
Planet Pharma
Location
City of London, London, United Kingdom
compute solutions for bioinformatics, ML, and data-intensive workloads. Drive automation using Terraform, Azure DevOps, and CI/CD pipelines. Embed security, compliance, and observability across all cloud components. Collaborate with cross-functional teams to align cloud architecture with data platform strategy. Mentor engineering teams and translate technical requirements into ...

Lead Software Engineer

Hiring Organisation
ByteHire
Location
London, United Kingdom
Employment Type
Permanent
Docker, and CI/CD pipelines . Work with distributed systems and messaging technologies (e.g. SQS, RabbitMQ ). Drive strong testing practices, monitoring, observability, and production readiness. Essential Technical Experience: Significant full-stack engineering experience, particularly with PHP (Symfony or Laravel) . Strong experience with JavaScript (Vue.js, React.js, and/ ...

Data Solutions Consultant, AI, Palantir

Hiring Organisation
Staffworx Limited
Location
United Kingdom
Employment Type
Permanent
Foundry artefacts. Scalability, Reliability & Operations Lead performance tuning for large-scale production deployments (eg parallelisation, partitioning, caching, compute configuration). Design monitoring, alerting and observability for pipelines, applications and integrations. Handle incident response and root cause analysis for platform and application issues. Define and enforce non-functional requirements (SLA/ ...

Software Engineer, STV Player

Hiring Organisation
STV Group plc
Location
Glasgow, Scotland, United Kingdom
applications, particularly those hosted in cloud environments such as AWS. Strong debugging and optimization skills, with experience of effectively using unit testing, monitoring and observability tools to build a solid product. A good working knowledge of PHP & associated frameworks (e.g. Laravel, Symfony, Slim). Demonstrable experience using Continuous Delivery tools ...

Software Engineer, STV Player

Hiring Organisation
STV Group plc
Location
Paisley, Renfrewshire, UK
Employment Type
Full-time
applications, particularly those hosted in cloud environments such as AWS. Strong debugging and optimization skills, with experience of effectively using unit testing, monitoring and observability tools to build a solid product. A good working knowledge of PHP & associated frameworks (e.g. Laravel, Symfony, Slim). Demonstrable experience using Continuous Delivery tools ...