let's talk. About the Role We're looking for a Senior Site Reliability Engineer to join our SRE team. This is a hybrid role that blends deep platform engineering with application-level troubleshooting. You'll be responsible for the stability, performance, and resilience of our cloud-native infrastructure while also being on the front line when issues … strategies for microservices and core platforms Continuously monitor and improve system performance, cost-efficiency, and observability (LGTM stack/Datadog) Partner with security teams on compliance and vulnerability remediation Chaos Engineering & Resilience Design and execute Chaos Engineering experiments. Develop and track SLOs, SLIs, and error budgets for critical systems Conduct resilience reviews and game days to … to backend service disruptions Investigate issues across infrastructure, Kubernetes, logs, traces, and service code Resolve incidents and support root causes (Java and GoLang services) Contribute to postmortems and reliability engineering initiatives Who You Are Essential Experience 5+ years in an SRE, DevOps, or infrastructure role Deep hands-on experience with AWS, EKS/Kubernetes, and Terraform Working knowledge of …
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate will be an … experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning. This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is real. The … Role: *Manage and optimise AWS and Kubernetes (EKS) infrastructure *Implement resilience strategies and conduct chaos engineering experiments *Monitor and maintain Kafka clusters for performance and reliability *Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with Kafka and …
strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate will be … an experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning. This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is … real. The Role: Manage and optimise AWS and Kubernetes (EKS) infrastructure Implement resilience strategies and conduct chaos engineering experiments Monitor and maintain Kafka clusters for performance and reliability Respond to and resolve application-level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with …
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate will be … an experienced Site Reliability Engineer with a deep background in AWS, Kubernetes (EKS), Terraform, and monitoring/eventing tools. You'll have a strong grasp of application-level troubleshooting, chaos engineering, and performance tuning. This is a fantastic opportunity to work in a modern DevOps environment where innovation is encouraged, personal development is supported, and technical impact is … real. The Role: *Manage and optimise AWS and Kubernetes (EKS) infrastructure *Implement resilience strategies and conduct chaos engineering experiments *Monitor and maintain Kafka clusters for performance and reliability *Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with …
Employment Type: Permanent
Salary: £80,000 - £90,000/annum, 38 Days Holiday, Healthcare, Pension
About the Opportunity Are you a seasoned technology leader with a passion for building cutting-edge enterprise products and a hands-on approach to engineering? Join Citi's Cloud Technology Services (CTS) team and be part of our commitment to transform Citi technology, leveraging game-changing Cloud capabilities to drive agility, efficiency, and innovation. We're providing our businesses … with a competitive edge by leveraging public cloud scale and enabling new infrastructure economics. As the Public Cloud Engineering Practices Lead, you will play a pivotal role in shaping and executing our public cloud strategy. You will be part of a team that continues to deliver big! From building a cloud-based High Performance Compute (HPC) platform to run huge … GenAI at scale, all the way to enabling payments solutions, this team is at the forefront of innovation. What You'll Do: Lead the Charge: Own the public cloud engineering practices strategy and its execution, enabling Citi's secure and enterprise-scale adoption of public cloud. You will provide technical authority for all engineering practices across all public …
Senior DevOps Engineer Department: Engineering Employment Type: Full Time Location: London, UK Description The impact you will have: You will have a transformative impact across Elliptic by evangelising DevOps, security, and reliability principles and fostering a culture of efficiency and autonomy. You will join a growing team of experienced and passionate engineers who are not afraid to fail and … enjoy tackling difficult problems head-on. Openness is one of our core values at Elliptic, and nowhere is this more evident than in our engineering teams. We strongly encourage engineers to challenge convention and find unique and innovative solutions to our customers' problems. Key Responsibilities What you will do: Provide senior DevOps expertise and leadership across Engineering at … all layers of the stack Evangelise DevOps, security and reliability engineering across the Engineering team at large Provision resilient infrastructure across multiple regions and AZs Build compliant, reliable and featureful developer platforms centered on container orchestration Enable Continuous Delivery and Deployment capabilities using CI/CD pipelines and GitOps tooling Enable shifting left on security and testing, and facilitate progressive …