Permanent Chaos Engineering Jobs in London

11 of 11 Permanent Chaos Engineering Jobs in London

Global IT Quality Engineer Senior Director & CoE Lead

London, United Kingdom
The Boston Consulting Group GmbH
of our DNA. To meet the needs of BCG's global, mobile, fast growing and increasingly diverse business, we are looking for a Global IT Senior Director for Quality Engineering role to lead and expand our central QA Center of Excellence (CoE) into an end-to-end QA Team. To execute this transformation, we need people who can translate … and expertise development for Quality Assurance and Performance Engineering. Among your responsibilities, you will: Lead End-to-End Quality Assurance: Lead the development and expansion of a centralized Quality Engineering (QE) Centre of Excellence (COE), ensuring that quality and performance standards are maintained across all platforms, products, including end-user environments. Implement best practices in quality metrics, reviews, and … end-to-end testing and manage structured QA cycles for security updates, patches, and system upgrades, ensuring comprehensive testing across third-party and custom-built applications. Establish Advanced Performance Engineering: Establish a robust performance engineering strategy, integrating advanced tools for application performance monitoring (APM), observability, and telemetry. Focus on early identification of performance bottlenecks and quality assurance measures More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries.In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems.The ideal candidate will be an … experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning.This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is real. The … Role: *Manage and optimise AWS and Kubernetes (EKS) infrastructure*Implement resilience strategies and conduct chaos engineering experiments*Monitor and maintain Kafka clusters for performance and reliability*Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering*Strong experience with AWS, EKS/Kubernetes, and Terraform*Familiar with Kafka and More ❯
Employment Type: Full-Time
Salary: £80,000 - £90,000 per annum, Inc benefits
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate will be … an experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning. This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is … real. The Role: Manage and optimise AWS and Kubernetes (EKS) infrastructure Implement resilience strategies and conduct chaos engineering experiments Monitor and maintain Kafka clusters for performance and reliability Respond to and resolve application-level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate will be … an experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning. This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is … real. The Role: *Manage and optimise AWS and Kubernetes (EKS) infrastructure *Implement resilience strategies and conduct chaos engineering experiments *Monitor and maintain Kafka clusters for performance and reliability *Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with More ❯
Employment Type: Permanent, Work From Home
Salary: £90,000
Posted:

Engineering Lead - Public Cloud Engineering Practices - SVP

London, United Kingdom
Hybrid / WFH Options
Citigroup Inc
About the Opportunity Are you a seasoned technology leader with a passion for building cutting-edge enterprise products and a hands-on approach to engineering? Join Citi's Cloud Technology Services (CTS) team and be part of our commitment to transform Citi technology leveraging game-changing Cloud capabilities to drive agility, efficiency, and innovation. We're providing our businesses … with a competitive edge by leveraging public cloud scale and enabling new infrastructure economics. As the Public Cloud Engineering Practices Lead , you will play a pivotal role in shaping and executing our public cloud strategy. You will be part of a team that continues to deliver big! From building cloud base High Performance Compute (HPC) platform to run huge … GenAI at scale, all the way to enabling payments solutions, this team is at the forefront of innovation. What You'll Do: Lead the Charge: Own the public cloud engineering practices strategy and its execution, enabling Citi's secure and enterprise-scale adoption of public cloud. You will provide technical authority for all engineering practices across all public More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Sr. Cloud Operations Delivery Manager (CODM), Enterprise Support - UKI

London, United Kingdom
Amazon
ability to make high-judgment technical decisions in complex environments - Experience leading cross-functional teams with a mix of technical, business, and operational roles PREFERRED QUALIFICATIONS - Experience with resilience engineering, chaos engineering, and observability practices in AWS - Understanding of enterprise IT operational capabilities - examples include Change, Incident Management, infrastructure management or applications management - Knowledge of the AWS More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer

London, United Kingdom
Hybrid / WFH Options
Elliptic Enterprises Ltd
Senior DevOps Engineer Department: Engineering Employment Type: Full Time Location: London, UK Description The impact you will have: You will have a transformative impact across Elliptic by evangelising DevOps, security, and reliability principles and fostering a culture of efficiency and autonomy. You will join a growing team of experienced and passionate engineers who are not afraid to fail and … enjoy tackling difficult problems head-on. Openness is one of our core values at Elliptic, and nowhere is this more evident than in our engineering teams. We strongly encourage engineers to challenge convention and find unique and innovative solutions to our customers' problems. Key Responsibilities What you will do: Provide senior DevOps expertise and leadership across Engineering at … all layers of the stack Evangelise DevOps, security and reliability engineering across the Engineering team-at-large Provision resilient infrastructure across multiple regions and AZs Build compliant, reliable and featureful developer platforms centered on container orchestration. Enable Continuous Delivery and Deployment capabilities using CICD pipelines and GitOps tooling Enable shifting left on security and testing, and facilitate progressive More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

SRE & Service Lead

London, United Kingdom
Sanderson Recruitment
Role: SRE & Service Lead - Digital Core Platforms Location: 2 days a week in London Salary: £160,000 + 20% Value Account + Bonus Are you a forward-thinking Engineering Leader with a deep understanding of software engineering, cloud infrastructure, and SRE principles? Do you have a sharp eye for automation, observability, and leading technical teams through digital transformation … could be the perfect opportunity to elevate your career at the forefront of banking innovation. This is a unique opportunity to join a major UK bank and lead strategic engineering efforts across three key areas: Retail Mortgages Bank of APIs - delivering on PSD2 and other regulatory initiatives Real-Time Core Banking - part of a long-term, cutting-edge modernisation … programme You'll be responsible for coordinating engineering teams, guiding technical strategy, and embedding best practices across one of the largest engineering domains in the bank (over 1000 staff - 75% engineers). This is a hands-on leadership role for someone who's passionate about driving resilience, automation, performance, and security at scale. Ideal Candidate: Deep software engineering More ❯
Employment Type: Permanent
Posted:

SRE & Service Lead

London, South East, England, United Kingdom
Sanderson
Role: SRE & Service Lead - Digital Core Platforms Location: 2 days a week in London Salary: £160,000 + 20% Value Account + Bonus Are you a forward-thinking Engineering Leader with a deep understanding of software engineering, cloud infrastructure, and SRE principles? Do you have a sharp eye for automation, observability, and leading technical teams through digital transformation … could be the perfect opportunity to elevate your career at the forefront of banking innovation. This is a unique opportunity to join a major UK bank and lead strategic engineering efforts across three key areas: Retail Mortgages Bank of APIs - delivering on PSD2 and other regulatory initiatives Real-Time Core Banking - part of a long-term, cutting-edge modernisation … programme You'll be responsible for coordinating engineering teams, guiding technical strategy, and embedding best practices across one of the largest engineering domains in the bank (over 1000 staff - 75% engineers). This is a hands-on leadership role for someone who's passionate about driving resilience, automation, performance, and security at scale. Ideal Candidate: Deep software engineering More ❯
Employment Type: Full-Time
Salary: £150,000 - £175,000 per annum, Negotiable, Inc benefits
Posted:

Technology Resilience Manager

London, United Kingdom
Innovation Group
in technology operations, who is looking to broaden their skillset. After developing your specialist skills you are now looking for opportunities to grow and learn more about wider resilience, chaos engineering and cloud services - we will support, provide guidance and mentor you. Nevertheless, we are open to other experiences as we are creating a new diverse and dynamic More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Global Processing Services
shape our SRE strategy, establish best practices, and set the standard for service reliability and performance. What You'll Do Define strategies for Application Performance Monitoring, Unit Cost, and Chaos Engineering. Continuously optimize production environments to enhance reliability and efficiency. Implement and apply MTTR, SLO, and SLI principles to ensure high service standards. Respond to incidents, analyze root causes … layers that drive our platform's success. What You Need Proven experience implementing SRE principles at scale, including deep knowledge of SLI/SLO/SLA differences. A product engineering background with strong coding skills in Python, C#, or similar. Experience with incident management frameworks and evolving them for efficiency. Expertise in cloud platforms (AWS preferred) and container orchestration More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Chaos Engineering
London
25th Percentile
£103,750
Median
£107,500
75th Percentile
£141,250
90th Percentile
£159,250