London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description SiteReliabilityEngineer Southampton HQ - 2 Times a week in Office Cloud, SaaS, AWS, Please be advised Security Clearance is required for this position We are working alongside one of our longstanding clients in helping them recruit a SiteReliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and … automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer, or similar credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a … strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell Solid grasp of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure More ❯
London, England, United Kingdom Hybrid / WFH Options
Quaisr Limited
DevOps/SiteReliabilityEngineer, Junior/Mid/Senior (m/f/*) We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are You are a DevOps/SiteReliabilityEngineer with experience managing complex infrastructure and deploying scalable, reliable More ❯
the reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principals, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software … to ensure high availability and performance Collaborate with product engineering teams to design/build fit-for-purpose and observable software Required Skills and Experience: Proven experience in a SRE/DevOps/Platform Engineering role and having previously worked in a Software Engineering role in .Net and C# or Java or similar OO development language. Proficiency in C# or … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more. More ❯
London, England, United Kingdom Hybrid / WFH Options
Global Screening Services
observability, and responding to incidents as well as ensuring the platform is performant and reliable. You will be a key member of the team, liaising with product teams, embedding SRE principles and building the observability platform for the next stage of growth at GSS. You will have direct input into the direction of Technical Operations, solving problems, supporting developers and … collaborative, flexible, and innovative work culture where your ideas are valued. What You’ll Do Key responsibilities in this role will include (but not be limited to): Leveraging core SRE values - measuring (SLI/SLO/SLA), testing, and eliminating toil via automation with appropriate Disaster Recovery planning Refining KPIs to enable data-driven decision making for availability and reliability … within a production, SaaS company (preferably event-driven) Be a self-starter that relishes responsibility. Take strategic direction and own end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards More ❯
Senior SiteReliabilityEngineer - AWS Kubernetes Senior SiteReliabilityEngineer - AWS Kubernetes Get AI-powered advice on this job and more exclusive features. A truly unique opportunity to help launch a brand new team within a global financial services provider. This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage … Network and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. Requirements: Proven experience managing and optimizing a diverse infrastructure stack. Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation). Familiarity of service mesh technologies (Istio, Linkerd). Solid understanding of virtualization (VMware, Hyper-V … function Job function Information Technology Industries Computer and Network Security Referrals increase your chances of interviewing at Source Technology by 2x Sign in to set job alerts for “Senior SiteReliabilityEngineer” roles. London, England, United Kingdom 1 month ago London, England, United Kingdom 2 weeks ago London, England, United Kingdom 4 hours ago London, England, United More ❯
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure SiteReliabilityEngineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies More ❯
This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage, Network and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. Requirements: Proven experience managing and optimizing a diverse infrastructure stack. Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation). Familiarity More ❯
Full-time Job function Job function Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Freelancer.com by 2x Get notified about new DevOps Engineer jobs in London, England, United Kingdom . South Croydon, England, United Kingdom 1 month ago London, England, United Kingdom 3 weeks ago City Of London, England, United Kingdom More ❯
London, England, United Kingdom Hybrid / WFH Options
Future Talent Group
Senior SiteReliabilityEngineer – FinTech/Global Payments – London HQ/Remote First Senior SiteReliabilityEngineer – FinTech/Global Payments – London HQ/Remote First Get AI-powered advice on this job and more exclusive features. This range is provided by Future Talent Group. Your actual pay will be based on your skills … new industry standard within their market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability and availability. Monitor systems … capacity planning. Collaborate with dev teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Strong More ❯
and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliabilityEngineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent tools … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliabilityEngineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent tools … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliabilityEngineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent tools … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliabilityEngineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent tools … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description Senior SiteReliabilityEngineer – FinTech/Global Payments – London HQ/Remote First Salary - £85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech scale-up, setting a … new industry standard within their market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability and availability. Monitor systems … capacity planning. Collaborate with dev teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands More ❯
small and mid-market firms, rely on SS&C for expertise, scale, and technology. Job Description We are looking for a highly skilled SiteReliabilityEngineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the scalability, reliability, and performance of our systems, infrastructure, and applications. You will collaborate closely … and incident response systems to ensure high availability and performance. Develop tools and automation to eliminate manual operations and improve system efficiency. Collaborate with development teams to ensure that reliability and performance are considered from the outset. Conduct root cause analysis and postmortems to learn from system failures and prevent recurrence. Participate in on-call rotations and respond to … and problem-solving skills. Strong communication and collaboration abilities. Preferred Qualifications: Experience with large-scale distributed systems. Familiarity with SLAs, SLOs, and SLIs. Previous experience in a DevOps or SRE role in a production environment. We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives More ❯
applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. Our team is at globally located, focused on ensuring production stability, automations, reliability and observability. We are looking for solution-oriented, commercially minded, customer-focused individuals, used to working in an agile environment who want to be a part of building something … the firm. Your work will have a massive impact, both on us as a company, as well as our clients and our business partners around the world. As a SiteReliabilityEngineer III at JPMorgan Chase within the Corporate Technology - Capital Management , you will solve complex and broad business problems with simple and straightforward solutions. Through code … delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Preferred Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. About Us J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world’s most prominent corporations, governments More ❯
London, England, United Kingdom Hybrid / WFH Options
Stott and May
Join to apply for the Senior SiteReliabilityEngineer role at Stott and May 1 day ago Be among the first 25 applicants Join to apply for the Senior SiteReliabilityEngineer role at Stott and May Job Description Senior SiteReliabilityEngineer Start: ASAP Duration: 6-12 months Location: hybrid … London (Tuesdays, Thursdays WFH) Pay: negotiable, inside IR35 We're looking for an experienced DevOps Engineer to join our team on a contract basis, with a focus on AWS infrastructure, observability tooling, and CI/CD automation. This is a hands-on role supporting high-availability systems, rapid deployments, and production incident response. Key Responsibilities Manage and monitor AWS … Python, Bash, Go or SQL Work with Git-based workflows for infrastructure as code Troubleshoot Kubernetes workloads and containerised services Participate in an on-call rotation to ensure system reliability Your Profile Essential Solid hands-on AWS experience in a DevOps setting Background in incident, change, and problem management Strong with Prometheus, Grafana, Splunk, and PromQL Proficient in scripting More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
expertise on financial and tax planning, offering home finance and annuities propositions, and providing collective fund solutions to third party customers. Job Summary We are looking for DevOps/SRE Engineers with proven DevSecOps skills, to support our transformation and the future growth of the Canada Life UK business. We are on a journey of simplifying our IT estate, removing … learn and develop their own skills Essential skills needed Application support within a Financial Services context (SaaS, PaaS, Cloud, On-Prem hosted applications) Relevant experience as a DevOps or SREEngineer Highly proficient in scripting like Powershell, Python, Ruby, Bash, and programming like C# for automating tasks and creating tools. Strong experience in a cloud-based environment e.g. Azure More ❯
Do Deploy and maintain a resilient, secure, and efficient SaaS application platform to meet established SLAs. Automate, monitoring, management and incident response to achieve an auto-remediation system. Monitor site stability and performance and troubleshoot site issues. Participate in on-call rotation to ensure stability and uptime for our platforms. Scale infrastructure to meet rapidly increasing demand. Collaborate … technical challenges. Provide deployment and operations support for multi-tiered distributed software applications. Estimate engineering effort, plan implementation, and rollout system changes that meet requirements for functionality, performance, scalability, reliability, and adherence to development goals and principles. Collaborate in a fast paced environment with multiple teams (software development, release management, build and release, etc...). Collaborate in a fast … paced environment with multiple teams in a dynamic entrepreneurial organization Defining how the behavior of large scale systems can be achieved Measuring and achieving reliability through engineering and operations work Monitoring and alert development, documentation and management with the goal of creating an auto-remediation system Adapting security controls to product not typically native to GA releases Developing automation More ❯
London, England, United Kingdom Hybrid / WFH Options
Elwood Technologies Services Limited
infrastructure at scale, enabling financial institutions, neobanks, and corporations to access digital asset markets quickly and efficiently. Job Description We are seeking a SiteReliabilityEngineer (SRE) to join our globally distributed engineering team, with a key responsibility for weekend operations and system reliability . You’ll play a critical role in maintaining uptime, resolving incidents … platforms, which are built on AWS and GCP cloud environments. This is a highly visible role that blends deep technical ownership with cross-functional collaboration. In addition to core SRE responsibilities, you will support our Technical Account Managers and client-facing teams in resolving production issues that impact users, helping to ensure smooth and reliable client experiences. You’ll be … that could affect the user experience. Monitor alert channels, logs and infrastructure load for the entire stack. Setup automation for alerting. Required Experience 5+ years of experience in an SRE, DevOps, or infrastructure engineering role. Strong experience with AWS or GCP , including services like EC2, Lambda, S3, RDS, and GKE (for GCP). Experience with automation tools like Terraform . More ❯
SiteReliabilityEngineer (Hybrid – London) | RegTech Innovator | AWS, Terraform, Kubernetes Location: London (Hybrid – 2-3 days in office) Are you passionate about scalable infrastructure and modern DevOps practices? Want … to make a tangible impact in a fast-growing RegTech company that’s transforming how businesses navigate regulatory compliance? Join us as a SiteReliabilityEngineer (SRE) and help build and operate the infrastructure that powers cutting-edge compliance solutions used by global financial institutions. What You'll Do Maintain and improve our AWS-based infrastructure using … Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS , Terraform , and Kubernetes Experience with production support, incident management, and RCA practices Comfortable working in a fast-paced startup More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Explore Group
SiteReliabilityEngineer (Hybrid – London) | RegTech Innovator | AWS, Terraform, Kubernetes Location: London (Hybrid – 2-3 days in office) Are you passionate about scalable infrastructure and modern DevOps practices? Want … to make a tangible impact in a fast-growing RegTech company that’s transforming how businesses navigate regulatory compliance? Join us as a SiteReliabilityEngineer (SRE) and help build and operate the infrastructure that powers cutting-edge compliance solutions used by global financial institutions. What You'll Do Maintain and improve our AWS-based infrastructure using … Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS , Terraform , and Kubernetes Experience with production support, incident management, and RCA practices Comfortable working in a fast-paced startup More ❯
Social network you want to login/join with: We are looking for a highly skilled SiteReliabilityEngineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the scalability, reliability, and performance of our systems, infrastructure, and applications. You will collaborate closely with software engineers, system administrators, and DevOps … and incident response systems to ensure high availability and performance. Develop tools and automation to eliminate manual operations and improve system efficiency. Collaborate with development teams to ensure that reliability and performance are considered from the outset. Conduct root cause analysis and postmortems to learn from system failures and prevent recurrence. Participate in on-call rotations and respond to … and problem-solving skills. Strong communication and collaboration abilities. Preferred Qualifications: Experience with large-scale distributed systems. Familiarity with SLAs, SLOs, and SLIs. Previous experience in a DevOps or SRE role in a production environment. We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives More ❯
Senior SiteReliabilityEngineer Start: ASAP Duration: 6-12 months Location: hybrid, London (Tuesdays, Thursdays WFH) Pay: negotiable, inside IR35 We're looking for an experienced DevOps Engineer to join our team on a contract basis, with a focus on AWS infrastructure, observability tooling, and CI/CD automation. This is a hands-on role supporting … Python, Bash, Go or SQL - Work with Git-based workflows for infrastructure as code - Troubleshoot Kubernetes workloads and containerised services - Participate in an on-call rotation to ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting More ❯
South West London, London, England, United Kingdom
Oscar Technology
SiteReliabilityEngineer - AWS/Azure | Outside IR35 | £450-500/day … Month Contract | Paddington, London (Hybrid, 2 Days Onsite) We're working with a fast growing client undergoing rapid expansion, looking for an experienced SiteReliabilityEngineer (SRE) to join them on a 6-month contract (outside IR35) You'll be leading efforts acriss AWS and Azure Cloud environments, focusing on automation, observability, infrastructure as code and performance … Monitor, Grafana, ELK) Own incident response processes, ensuring high availability and rapid resolution Collaborate with stakeholders to communicate solutions and technical trade-offs clearly Ideal Experience: 3-5 years SRE or DevOps experience across AWS and Azure platforms Strong knowledge of Terraform , scripting (Python, Bash, PowerShell), and cloud architecture Comfortable with containerisation and orchestration ( Docker, Kubernetes ) Understanding of networking, DNS More ❯