301 to 325 of 1,845 Permanent Observability Jobs

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Chicago, Illinois, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Harrisonburg, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Goochland, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Salisbury, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Norfolk, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Petersburg, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Baltimore, Maryland, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Newport News, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
Capital One. The Cloud Operations Resilience Engineering (CORE) Technology division is responsible for enabling and evolving Capital One's foundational cloud infrastructure layer, including observability, connectivity, resilience and availability. What You'll Do: Lead a portfolio of diverse technology projects and a team of developers with deep experience in distributed ...

Senior Infrastructure Engineer

Hiring Organisation
Syniti
Location
England, United Kingdom
resource governance. · Implement secure workload identity, service endpoint controls, and resource policies. · Partner with SRE and platform teams to optimize Azure region builds. · Support observability, tagging standards, and drift detection in Azure environments. WHAT IT TAKES · Strong infrastructure automation and operations troubleshooting capability. · Solid understanding of Azure managed services ...

Senior Software Engineer

Hiring Organisation
TD Bank
Location
Dublin, Ireland
Employment Type
Permanent
Salary
EUR 70,000 - 90,000 Annual
pipeline orchestration tools (Azure Data Factor or similar). Experience implementing ETL/ELT pipelines using modern data lakehouse solutions. Experience with data quality, observability/monitoring tools, and best practices. Experience ensuring compliance with data governance and security policies. Who We Are TD is one of the world ...

Observability Developer/Engineer -

Hiring Organisation
Morela
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£40,000 - £75,000 per annum
Title: Observability Developer/Engineer Location: Hybrid (UK, with travel as required) Employment Type: Full-time This role is with Morela please respond to for further informaiton Do you want to be part of something special? Morela is proud to represent our exclusive client , a fast-growing start-up transforming … this company is redefining how enterprises monitor, manage, and optimise IT operations. This is your chance to join a team shaping the future of observability and operational intelligence from the ground up. We are seeking a skilled Observability Developer to design, build, and optimise observability solutions that help enterprise clients ...

Observability Developer/Engineer

Hiring Organisation
VIQU IT
Location
London, United Kingdom
Employment Type
Permanent
Salary
£40000 - £75000/annum
Title: Observability Developer/Engineer Location: Hybrid (UK, with travel as required) Employment Type: Full-time This role is with Morela please respond to (url removed) for further informaiton Do you want to be part of something special? Morela is proud to represent our exclusive client , a fast-growing start … this company is redefining how enterprises monitor, manage, and optimise IT operations. This is your chance to join a team shaping the future of observability and operational intelligence from the ground up. We are seeking a skilled Observability Developer to design, build, and optimise observability solutions that help enterprise clients ...

Senior Specialist Engineer (SRE)

Hiring Organisation
UK Health Security Agency
Location
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom
Employment Type
Permanent
Salary
£41983.00 - £52113.00 a year
identify bottlenecks with an engineering mindset. Ensure systems can handle current and future workloads through automation and capacity planning. Continuously improve services through observability, and identify ways to improve observability practices. Follow SRE principles. Guide and educate stakeholders to adopt implemented principles. Provide technical documentation for engineers. Providing training, where … production incidents, ensuring minimal downtime and quick restoration of services. Perform root cause analysis and postmortems, implementing lessons learned to prevent recurrence. Monitoring, Alerting & Observability Contribute to the design and implementation of effective monitoring and alerting systems using tools and dashboards. Improve observability of services, ensuring issues are identified ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
London Area, United Kingdom
Proficient in one or more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Systems/SRE Engineer

Hiring Organisation
Thurn Partners
Location
City of London, London, United Kingdom
Proficient in one or more programming languages such as Python, Go, Ruby, or Perl. Strong experience with Linux system administration. Hands-on experience with observability tools like Prometheus, Grafana, Thanos, and the ELK stack. Familiarity with Kubernetes, Docker, AWS, and GCP. ...

Site Reliability Engineer- eDV Cleared

Hiring Organisation
Searchability NS&D
Location
Gloucestershire, England, United Kingdom
Employment Type
Full-Time
Salary
£40,000 - £90,000 per annum, Negotiable
Experience as in a Site Reliability Engineering role SITE RELIABILITY ENGINEER ESSENTIAL SKILLS- Reliability, incident response/incident management experience - Experience with Monitoring and Observability tools such as Prometheus, Grafana and OpenSearch- Automation tools (Go, Bash)- Experience with Linux- Cloud infrastructure experience (AWS, Azure, GCP)- DevOps Mindset and ability ...

Site Reliability Engineer- eDV Cleared

Hiring Organisation
Searchability NS&D
Location
Manchester, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
£40,000 - £90,000 per annum, Negotiable
Experience as in a Site Reliability Engineering role SITE RELIABILITY ENGINEER ESSENTIAL SKILLS- Reliability, incident response/incident management experience - Experience with Monitoring and Observability tools such as Prometheus, Grafana and OpenSearch- Automation tools (Go, Bash)- Experience with Linux- Cloud infrastructure experience (AWS, Azure, GCP)- DevOps Mindset and ability ...

Site Reliability Engineer- eDV Cleared

Hiring Organisation
Searchability NS&D
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £95,000 per annum, Negotiable
Experience as in a Site Reliability Engineering role SITE RELIABILITY ENGINEER ESSENTIAL SKILLS- Reliability, incident response/incident management experience - Experience with Monitoring and Observability tools such as Prometheus, Grafana and OpenSearch- Automation tools (Go, Bash)- Experience with Linux- Cloud infrastructure experience (AWS, Azure, GCP)- DevOps Mindset and ability ...

Artificial Intelligence Engineer

Hiring Organisation
Omnis Partners
Location
London Area, United Kingdom
clear and practical terms Teaching frameworks such as LangGraph, LangChain, AutoGen and knowledge graph fundamentals Showing what “production-grade” really looks like - reliability, observability, safety, evaluation, failure handling During non-training weeks, working as a senior IC on internal AI engineering/ML projects 🎯 What They’re Looking ...

Artificial Intelligence Engineer

Hiring Organisation
Omnis Partners
Location
City of London, London, United Kingdom
clear and practical terms Teaching frameworks such as LangGraph, LangChain, AutoGen and knowledge graph fundamentals Showing what “production-grade” really looks like - reliability, observability, safety, evaluation, failure handling During non-training weeks, working as a senior IC on internal AI engineering/ML projects 🎯 What They’re Looking ...

Senior ML / MLOps Engineer (AWS, SageMaker, LLM, Data Pipelines)

Hiring Organisation
Morson Edge
Location
Greater London, England, United Kingdom
similar frameworks) AWS experience: SageMaker, Lambda, ECS, or serverless pipelines Comfortable working with complex datasets and robust data pipelines CI/CD, monitoring, and observability for ML pipelines Strong problem-solving, communication, and collaboration skills Nice-to-have RAG/LLM pipelines experience Terraform, serverless architectures Cloud data stores (DynamoDB ...

Platform Engineer

Hiring Organisation
Ncounter Technology Recruitment
Location
Swindon, UK
Employment Type
Full-time
responsible for creating stable, repeatable, and compliant environments that support large scale digital systems. This includes shaping secure-by-design cloud patterns, introducing strong observability, and driving automation to remove manual effort. The work suits someone who is comfortable operating across infrastructure, pipelines, and application delivery while staying close ...

Platform Engineer

Hiring Organisation
Ncounter Technology Recruitment
Location
Wiltshire, England, United Kingdom
responsible for creating stable, repeatable, and compliant environments that support large scale digital systems. This includes shaping secure-by-design cloud patterns, introducing strong observability, and driving automation to remove manual effort. The work suits someone who is comfortable operating across infrastructure, pipelines, and application delivery while staying close ...

Platform Engineer

Hiring Organisation
Ncounter
Location
Bath, Avon, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £100,000 per annum
responsible for creating stable, repeatable, and compliant environments that support large scale digital systems. This includes shaping secure-by-design cloud patterns, introducing strong observability, and driving automation to remove manual effort. The work suits someone who is comfortable operating across infrastructure, pipelines, and application delivery while staying close ...