Site Reliability Engineering Jobs in England

301 to 325 of 1,615 Site Reliability Engineering Jobs in England

Senior Software Engineer, Observability

London, England, United Kingdom
Forter
Role Forter is seeking a Senior Software Engineer to join our Observability team . This role offers the opportunity to work at the intersection of software development and platform engineering, contributing to the tools, systems, and practices that improve visibility, reliability, and operational excellence across our engineering organisation. This position is ideally suited for experienced software engineers … for delivering scalable and user-friendly solutions to over 150 engineers working across more than 20 teams. We’re focused on enabling rapid incident detection and resolution, improving our reliability posture, and supporting a culture of continuous improvement. What you'll be doing: Design, build, and maintain observability tools and infrastructure that help our engineers provide actionable insights into … term impact, sustainable engineering practices, and continuous improvement. Preferred Qualifications Experience with OpenTelemetry or distributed tracing systems. Understanding of observability-driven development and service reliability principles (e.g. SRE, MTTR, SLIs/SLOs). Experience optimising observability systems for cost and performance at scale. Knowledge of microservices architectures and how to monitor and debug distributed systems. Contributions to open More ❯
Posted:

Senior Software Engineer, Fleet

Fleet, England, United Kingdom
Hayden AI Technologies, Inc
optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit, enhance street safety, and drive toward a sustainable future. Job Summary: The Cloud Fleet Engineering team at Hayden AI is at the forefront of managing and scaling our fleet of AI-powered edge devices deployed across global transportation agencies. As a senior engineer, you … and remediation efforts, learning and applying best practices. Design, build, and maintain scalable cloud services that support device observability, OTA updates, and fleet operations. Lead efforts to improve the reliability, security, and performance of multi-region AWS infrastructure using Infrastructure as Code (IaC) tools. Own CI/CD pipelines for cloud services, enabling efficient, secure, and automated delivery processes. … and operate scalable, secure APIs and microservices. Strong communication skills and a track record of leading cross-functional technical initiatives. Experience working in agile environments with modern DevOps and SRE practices. #J-18808-Ljbffr More ❯
Posted:

Senior/Lead Software Engineer

London, England, United Kingdom
GroupM
optimization suite that delivers amazing outcomes for brands and audiences. As a Senior Engineer you will work side by side with a Product Manager, a Design Lead, and your engineering team to design, build, deliver and operate purposeful solutions. The role is a hands-on development role, involving contributing to product strategy, crafting scalable and reliable architectures, developing features … heavily exploit automation, and run on AWS, GCP, k8s, Snowflake and more. You’ll play a leading role in significantly scaling this further. WHAT YOU WILL DO Lead an engineering team, overseeing and participating in all stages of development, delivery and operations, together with Product and Design leaders. Architect solutions that are simple, scalable, reliable, secure, maintainable, and make … a measurable impact. Develop and deliver new features, maintain our product, and drive growth to hit team KPIs. As a part of this, undertake hands-on development and engineering tasks with your team and other teams, to deliver production solutions. Employ and champion modern pragmatic engineering principles, practices, and tooling, including TDD/BDD/ATDD, XP, QA More ❯
Posted:

Python Infrastructure Engineer (SC CLEARED)

Hounslow, England, United Kingdom
JR United Kingdom
key role in maintaining and evolving the infrastructure underpinning critical applications and data pipelines. Key Responsibilities Codebase Management & Maintenance: Maintain and enhance the existing digitall-land-python codebase, ensuring reliability, scalability, and compliance with coding standards. Collaborate with developers and data engineers to support application lifecycle and deployment workflows. Infrastructure as Code (IaC): Design and implement Terraform scripts to … with CI/CD tools (e.g., GitHub Actions, Jenkins, CircleCI) and Git-based workflows. Hands-on experience with Airflow , particularly DAG management and operational reliability. Solid understanding of DevOps , SRE , or platform engineering principles. Exposure to secrets management tools (e.g., AWS Secrets Manager, HashiCorp Vault). Experience managing GitHub organization permissions and automations via APIs or Terraform providers. Background … in working with data engineering platforms or data lakes. Experience with containerization (Docker) and container orchestration (Kubernetes) is a plus. #J-18808-Ljbffr More ❯
Posted:

Senior Software Engineer London, England

London, United Kingdom
Group M Worldwide Inc
endlessly curious. Our team of thinkers, builders, creators, and problem solvers are over 1,000 strong, across 20 markets around the world. ABOUT CHOREOGRAPH OPTIMIZATION We're a Product & Engineering team within Choreograph that's focused on optimizing the impact that advertising has for audiences and the brands that we work with. We do this by optimizing media buying … optimization suite that delivers amazing outcomes for brands and audiences. As a Senior Engineer, you will work side by side with a Product Manager, a Design Lead, and your engineering team to design, build, deliver, and operate purposeful solutions. The role is a hands-on development role, involving contributing to product strategy, crafting scalable and reliable architectures, developing features … objects annually, which amounts to 20 billion ad impressions across the globe. You'll play a leading role in significantly scaling this further. WHAT YOU WILL DO Lead an engineering team, overseeing and participating in all stages of development, delivery, and operations, together with Product and Design leaders. Architect solutions that are simple, scalable, reliable, secure, maintainable, and make More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer

London, England, United Kingdom
Hybrid / WFH Options
BAE Systems Applied Intelligence
to complex challenges as part of a team who help keep the UK safe? Join BAE Systems as an experienced DevOps Engineer. As a key member of a Software Engineering team, you’ll be working with our National Security Customers to build systems that support their core mission capabilities. You’ll work as part of empowered, autonomous DevOps teams … our customer organisations. You will work in a small team given as much ownership and responsibility as you have the appetite for but be part of a much bigger Engineering community to give you the support you need to grow in your career. We fully embrace DevOps ways of working in our teams, and build a very broad range … an organisation who makes a huge impact to the security of the UK. About you You will have many of the following: Experience working in a similar DevOps/SRE/Infrastructure role An appreciation of Infrastructure as Code, and CI/CD tooling An understanding of live service and how to support critical business systems Scripting abilities with languages More ❯
Posted:

Senior Software Engineer

Cambridge, England, United Kingdom
Hybrid / WFH Options
Arm
Get AI-powered advice on this job and more exclusive features. Job Overview We are building a modern, cloud-native compute orchestration platform to support large-scale, compute-intensive engineering workloads. As a Senior Software Engineer, you will play a key role in designing and delivering a highly scalable, reliable, and observable system, with a particular focus on software … development and performance testing. Job Overview We are building a modern, cloud-native compute orchestration platform to support large-scale, compute-intensive engineering workloads. As a Senior Software Engineer, you will play a key role in designing and delivering a highly scalable, reliable, and observable system, with a particular focus on software development and performance testing. This role is … Design, implement, and maintain core components of the platform using cloud-native technologies. Lead efforts around performance benchmarking, load testing, and scalability validation. Define and enforce SLAs; work with SRE/DevOps to ensure high availability and observability. Tune platform performance under high-throughput workloads and lead capacity planning. Automate and execute stress/load tests using both synthetic and More ❯
Posted:

Principal Network Engineer - London Stock Exchange Group

London, England, United Kingdom
Jobs via eFinancialCareers
the Principal Network Engineer - London Stock Exchange Group role at Jobs via eFinancialCareers LSEG is looking for a Network Design Engineer to establish successful partnerships with our internal product, engineering, operational and business partners. This is a technical position for someone who can investigate, design, engineer, build and test hybrid on premise and cloud-optimized network designs. The role … involves elements of design, engineering, automation and testing to produce infrastructure designs that align with business and architectural requirements. As well as a confirmed technical background, the ideal candidate will be able to follow industry standard design, development and engineering frameworks incorporating planning, design, implementation, engineering, testing, operation and optimization. Be able to effectively participate in multi … organization. Ensuring network architecture supports the execution of critical business applications. Implements the balance between business and technology, provides consultancy into the project teams and defines the design and engineering governance function, signing off governance architecture/design against well-defined standards. DAY TO DAY RESPONSIBILITIES: This Network Design Engineer will be responsible for network design and engineering More ❯
Posted:

Senior Cloud Engineer

London, England, United Kingdom
GroupM
Summary And Impact WHO WE ARE LOOKING FOR As a member of the Global Technical Operations (TechOps), you will be a part of a team that focuses on operational reliability within a cloud-based infrastructure. You have hands-on cloud experience in architecting, building, deploying, managing databases, compute instances, and storage buckets. You have a passion for providing solutions … Trust. Lead Optimistically by Championing Growth and Development to Mobilize the Enterprise. What You’ll Need Recent hands-on experience (7+ years) as a Cloud Engineer/DevOps/SRE (GCP/AWS/Azure). Proven work experience in automating tasks within a cloud-based environment, using infrastructure coding tools (Terraform, Terragrunt). Experience with configuration management tools (Ansible … applications received, only those candidates selected for an interview will be contacted. Seniority level Seniority level Mid-Senior level Employment type Employment type Full-time Job function Job function Engineering and Information Technology Industries Market Research, Advertising Services, and Online Audio and Video Media Referrals increase your chances of interviewing at GroupM by 2x Sign in to set job More ❯
Posted:

Platform Specialist - Scheduler and Orchestrators

London, England, United Kingdom
Squarepoint Capital
set and experience. If our team feels you are better suited for another role, we will reach out about the alternate opportunity. Platform Specialist - Scheduler and Orchestrators Department: Application Engineering Position Overview: As a Platform Applications Specialist at Squarepoint, you will play a crucial technical role in delivering mission-critical platform infrastructure using the DevOps methodology. In this position … architecture and user experience to offer the most out of our platform. Required Qualifications: Degree in Engineering, Computer Science or similar discipline. 4+ years of experience in DevOps, SRE, or platform engineering roles. Experience with software development (Python, Git) Experience with system administration (Bash, Linux, Containerization) Deep knowledge of HPC (e.g. Slurm) or orchestration technologies (e.g. Kubernetes) Excellent … written and verbal communication skills. Ability to work well in a fast-paced environment. Nice to have: Experience with other orchestration technologies (Prefect, Airflow) Experience with advanced software engineering concepts. Experience with modern software development tooling, such as Gitlab, Artifactory or Docker. Experience with infrastructure automation and configuration management, such as Ansible and Terraform. Experience with Observability systems, such More ❯
Posted:

Lead Cloud Engineer

London, England, United Kingdom
Made Tech Limited
technology to improve our society. We believe putting people at the heart of designing, building and delivering public services leads to better outcomes for everyone. Based within the Cloud & Engineering Practice, our Cloud Engineers help public sector organisations to deliver digital services that are highly available, reliable, resilient and secure. They are software engineers who … tend to focus more on the successful deployment and operations of services than on functional development. They are adaptable and comfortable covering a range of disciplines such as DevOps, SRE and infrastructure architecture. About the role You will play a critical role in designing, implementing, and maintaining our clients’ cloud infrastructure, ensuring it is secure, scalable, and highly available. You … architecting cloud infrastructure, and taking it through to deployment and support Experience working within multidisciplinary teams with product, design, and technical disciplines all successfully collaborating An understanding of DevOps, SRE and Platform Engineering, and their places within a modern digital organisation Experience working in a technology consultancy environment Relevant cloud certifications such as AWS Cloud Practitioner, Developer, DevOps Engineer More ❯
Posted:

Senior Cloud Security Engineer FullTime London

London, England, United Kingdom
Trainline plc
evolving environment. As part of our mission to continuously improve and mature Trainline's security capabilities, we work in close collaboration with cross functional teams, including Cloud Engineering, SRE, Platform Engineering, and more, to integrate the latest technologies and best practices into our security strategy. You will play a critical role in safeguarding all digital channels that collectively … in our AWS environment. Ensure that security is seamlessly integrated into every layer of the cloud architecture, from network configuration to identity management. Container & Orchestration Security : Partner with Cloud Engineering, DevOps, and Platform teams while being a key stakeholder in the architecture and implement secure containerised environments using platforms like Docker, and ECS. Focus on vulnerability mitigation, compliance automation … practices. Regularly evaluate new security tools, frameworks, and technologies to enhance the organisation’s cloud security posture. Security Training & Mentorship : Provide ongoing cloud security training, guidance, and mentorship to Engineering and Platform teams, fostering a security-first culture within the organisation. Ensure that teams are well-equipped to identify, understand, and mitigate cloud security risks and align with established More ❯
Posted:

Senior Cloud Engineer

London, England, United Kingdom
Choreograph
WHO WE ARE LOOKING FOR As a member of the Global Technical Operations (TechOps), you will be part of a team focused on operational reliability within a cloud-based infrastructure. You should have hands-on cloud experience in architecting, building, deploying, and managing databases, compute instances, and storage buckets. A passion for automation and collaboration is essential. WHAT YOU … liaison between TechOps and other groups. Take on new responsibilities as needed. WHAT YOU’LL NEED 7+ years of recent hands-on experience as a Cloud Engineer, DevOps, or SRE (GCP, AWS, Azure). Experience automating tasks in cloud environments using infrastructure as code tools (Terraform, Terragrunt). Knowledge of configuration management tools (preferably Ansible). Experience with container orchestration More ❯
Posted:

Senior Developer - Kubernetes

London, England, United Kingdom
Different Technologies Pty Ltd
technical leadership to junior engineers. Design, develop, troubleshoot, and debug software for databases, applications, tools, and networks. Create and maintain Kubernetes clusters, including troubleshooting. Build infrastructure using Terraform. Support SRE and developer teams, ensuring security practices are followed. Maintain documentation and runbooks. Handle incident detection, response, and post-incident reviews. Required Experience: 3+ years in building mission-critical software applications … Kubernetes security and on-premises deployment. Knowledge of vSphere, Packer, Flux or ArgoCD, and Kubernetes installation tools like Kubespray. Experience with East-West clusters. As part of the software engineering team, you will help define and develop software, provide technical leadership, and implement changes to software architecture as needed. The role requires independent judgment and may involve project leadership More ❯
Posted:

Senior Cloud Engineer

London, England, United Kingdom
GroupM
WHO WE ARE LOOKING FOR As a member of the Global Technical Operations (TechOps), you will be part of a team focused on operational reliability within a cloud-based infrastructure. You have hands-on cloud experience in architecting, building, deploying, managing databases, compute instances, and storage buckets. You are passionate about providing solutions through automation and believe that success … a liaison between TechOps and other groups. Take on new responsibilities as needed. WHAT YOU'LL NEED Recent hands-on experience (7+ years) as a Cloud Engineer, DevOps, or SRE in GCP, AWS, or Azure. Proven experience automating tasks within cloud environments using infrastructure as code tools (Terraform, Terragrunt). Experience with configuration management tools (preferably Ansible). Experience with More ❯
Posted:

DevOps Engineer

Manchester, England, United Kingdom
Hybrid / WFH Options
MRJ Recruitment
strong DevOps culture, so you'll be a central figure in advocating for scalable infrastructure and robust platform engineering principles. This means close collaboration with development, QA, and SRE teams to build secure, cost-effective, and repeatable systems. You'll blend deep technical work with impactful leadership, needing a solid grasp of production operations, incident response, Infrastructure as Code More ❯
Posted:

MLOps Engineer

City of London, London, United Kingdom
Ultralytics
to ensuring that our state-of-the-art models are accessible, reliable, and performant for our global user base. 🛠️ Skills and Experience 5+ years of experience in a DevOps, SRE, or MLOps role. Strong proficiency in Python and extensive experience with ML frameworks like PyTorch. Proven experience building and managing CI/CD pipelines for machine learning systems. Deep expertise More ❯
Posted:

MLOps Engineer

London Area, United Kingdom
Ultralytics
to ensuring that our state-of-the-art models are accessible, reliable, and performant for our global user base. 🛠️ Skills and Experience 5+ years of experience in a DevOps, SRE, or MLOps role. Strong proficiency in Python and extensive experience with ML frameworks like PyTorch. Proven experience building and managing CI/CD pipelines for machine learning systems. Deep expertise More ❯
Posted:

Senior Software Engineer II

London, United Kingdom
LexisNexis Risk Solutions
cross-functional teams (Development, QA, Security) to ensure DevOps processes align with project goals and foster a culture of shared responsibility for delivery. Requirements : Experience: Strong experience in DevOps, SRE, or related roles managing cloud infrastructure and CI/CD pipelines in a production environment. AWS Expertise: Strong expertise in Amazon Web Services - hands-on experience designing and managing AWS More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

Slough, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
play a critical role in ensuring system reliability, scalability, and performance across both AWS and Azure environments. This is your opportunity to lead cloud-native transformation and embed SRE best practices into engineering at scale. What you’ll be doing as their Site Reliability Engineer: You’ll be the go-to expert for designing and maintaining … CI/CD pipelines to reduce toil and accelerate deployment frequency. Build observability into everything—own monitoring, alerting, and incident response to minimize MTTR and improve system health. Champion SRE culture and reliability-focused engineering—help shape sustainable engineering practices, SLAs, SLOs, and error budgets. Contribute across the stack with flexibility in tooling—experience with Python, Go … dental insurance 25 days annual leave + bank holidays R&D and personal training budgets And much more... This is an incredibly rare chance for a seasoned, high-performing SRE to leave your mark on high-impact transformation projects in a business that’s truly committed to doing things the right way. #J-18808-Ljbffr More ❯
Posted:

Site Reliability Engineer

Wakefield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
team. Things are moving fast here, and as we continue to grow; reliability, automation, and scalability have never been more important to us. You will be our first SRE so a strong background in implementing SRE best practices would be Ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This is … to build out infrastructure and tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re … looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
Posted:

Site Reliability Engineer

Stockport, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
team. Things are moving fast here, and as we continue to grow; reliability, automation, and scalability have never been more important to us. You will be our first SRE so a strong background in implementing SRE best practices would be Ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This is … to build out infrastructure and tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re … looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
Posted:

Site Reliability Engineer III - Corporate Oversight and Governance Technology

London, England, United Kingdom
J.P. MORGAN
Job Description Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Site Reliability Engineer at JPMorgan Chase within the Corporate Oversight and Governance (COG), Architecture & Engineering team, you work collaboratively with stakeholders to define non-functional requirements … observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt. Collaborate in the design, creation and advocacy of SRE products that can be used to scale the implementation of SRE best practices within COGT. Evolves and debug critical components of applications and platforms. Contributes to JPMorgan Chase’s site reliability community via internal forums, communities of practice, guilds, and conferences. Participates in architecting, designing and building highly distributed systems and SRE products, solving complex problems in code. Maintain and promote best practices in software engineering, leading by example. Required qualifications, capabilities, and skills. Demonstrable applied experience of SRE concepts, strategies, and culture. Knowledge and experience in More ❯
Posted:

Senior Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Xtremepush
Reliability Engineer role at Xtremepush Join to apply for the Senior Site Reliability Engineer role at Xtremepush About The Role We are seeking a Senior SRE with experience of working with scaled SaaS production infrastructure. The successful candidate will work as part of a team focused on site reliability, security, and scalability, as we … manage our rapid growth. About The Role We are seeking a Senior SRE with experience of working with scaled SaaS production infrastructure. The successful candidate will work as part of a team focused on site reliability, security, and scalability, as we manage our rapid growth. The ideal candidate will be a proactive and driven individual, who excels at … technologies include PHP, MySQL, Vue.js and AWS. Participating in an on-call roster is required as part of this role. Key Responsibilities Act as a senior member of the SRE team, supporting activities including the backlog and workload of the team, scoping requirements, peer review of code, providing feedback to the rest of the team. Represent the team in management More ❯
Posted:

Site Reliability Engineer

City of London, London, United Kingdom
Xcede
looking for a Site Reliability Engineer to join their highly skilled, innovative team. Essential skills: Strong proficiency in Python for infrastructure and automation Hands-on experience in SRE, DevOps or production engineering roles Deep understanding of monitoring, incident response workflows, and system architecture Productive approach to improving systems and reducing technical debt Strong collaboration and communication skills … working closely with developers, quants, and platform engineers Experience designing and delivering scalable, reliable production systems Proficiency with Linux/Unix systems Bachelor’s degree in CS, Engineering or a related field Familiarity with Kubernetes, Docker, or container orchestration technologies Experience with automation tools such as Terraform or Ansible Background in Go, Bash or other system-level languages Exposure … design and implement automation for operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting). You will also: apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs; lead incident response and implement improvements; collaborate closely with quant developers/ More ❯
Posted:
Site Reliability Engineering
England
10th Percentile
£57,500
25th Percentile
£64,509
Median
£85,000
75th Percentile
£115,000
90th Percentile
£136,875