SiteReliabilityEngineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site and remotely … supporting mission-critical systems where performance and reliability are paramount. The SiteReliabilityEngineer Role: We are seeking a SiteReliabilityEngineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliabilityEngineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and More ❯
SiteReliabilityEngineer £65,000 £95,000 DOE Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site and remotely … supporting mission-critical systems where performance and reliability are paramount. The SiteReliabilityEngineer Role: We are seeking a SiteReliabilityEngineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliabilityEngineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and More ❯
SiteReliabilityEngineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits)Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site and remotely … supporting mission-critical systems where performance and reliability are paramount. The SiteReliabilityEngineer Role: We are seeking a SiteReliabilityEngineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliabilityEngineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
SiteReliabilityEngineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site and remotely … supporting mission-critical systems where performance and reliability are paramount. The SiteReliabilityEngineer Role: We are seeking a SiteReliabilityEngineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliabilityEngineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and More ❯
Junior SiteReliabilityEngineer … We are currently working with a leading Financial Services company, who are looking for a Junior SiteReliabilityEngineer to join their ever-expanding platform/SRE team from their Shoreditch, London, Office where you will be expected to travel to the office 4 days a week. They are looking for you to have excellent cloud knowledge … ideally AWS as well as having experience of Powershell/Python. As the Junior SiteReliabilityEngineer, you will be a self-starter who has excellent stakeholder management experience who can show outcome based work. You will ideally have 2 years of commercial experience coming from an IT Operations/Cloud infrastructure background. Please note this is More ❯
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Addition
SiteReliabilityEngineer (Defence) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a SiteReliabilityEngineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: SiteReliabilityEngineer Location: Hybrid, 3 days per week in Farnborough Package: £60,000- £70,000 per annum Benefits Industry: Defence & Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. Driving automation initiatives to reduce manual … in security, maintainability, and scalability. Staying ahead of emerging technologies to keep the platform cutting-edge. Main Skills Needed: Applications must be eligible for Security Clearance. Proven experience in SiteReliability or Platform Engineering (5+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS is More ❯
SiteReliabilityEngineer (International Travel) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a SiteReliabilityEngineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: SiteReliabilityEngineer Location: 1-2 days per week in Farnborough International Travel (Project based) Package: £60,000- £70,000 per annum Benefits Industry: Defence & Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. … the platform cutting-edge. Main Skills Needed: Applications must be eligible for Security Clearance. Happy to travel internationally according to project requirements (All costs covered). Proven experience in SiteReliability or Platform Engineering (4+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS is More ❯
Cheltenham, Gloucestershire, South West, United Kingdom
Oscar Associates (UK) Limited
SiteReliabilityEngineer | Cheltenham | £600 per day (Outside IR35) About the Role: We're seeking an experienced SiteReliabilityEngineer with live eDV clearance to join an on-site team in Cheltenham. This contract role involves supporting and maintaining a managed cross-domain service, applying SRE practices to ensure reliability, security, and performance. Contract Details: Employment: Contract (Outside IR35) Rate: £600 per day Length: 6 months (long-term extensions very likely). Location: Cheltenham … days on-site) Clearance: Live eDV required Start Date: ASAP Key Responsibilities: Build and deploy code using Java, Maven, NPM, Terraform, and Ansible across OpenShift, RHEL/CentOS, and Docker. Monitor and optimise system performance with Influx and Grafana. Provide 2nd/3rd line support, incident response, and root cause analysis. Carry out BAU maintenance including patching, database housekeeping More ❯
Role Overview: We are seeking a highly skilled and motivated SiteReliabilityEngineer (SRE) to join our engineering team to support critical application deployments in a "follow-the-sun" environment. In this role, you will leverage your expertise in cloud provisioning, infrastructure as code, and container orchestration to ensure the reliability, scalability, and performance of our … and versioning. Containerization and Orchestration: Deploy, manage, and provide ongoing support for containerized applications using Kubernetes, including Amazon EKS (Elastic Kubernetes Service) and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance. Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability … and solutions, including RESTful APIs, ensuring seamless integration across platforms. Post-Mortem Analysis: Conduct comprehensive post-mortem analyses following incidents, identifying root causes and recommending improvements to enhance system reliability and performance. Mentorship: Mentor and guide junior engineers, fostering a culture of knowledge sharing and continuous improvement within the engineering team. Skills and Experience: Bachelor's degree in computer More ❯
Bath, Somerset, South West, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
SiteReliabilityEngineer Work From Home (WFH) + Quarterly Visits to Bath Full Time, Initial 12 Month Fixed Term Contract Salary DOE ( ?????? £45k - ? £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented SiteReliabilityEngineer to join their … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and More ❯
Join us as a Senior SiteReliabilityEngineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them To be successful as … a Senior SiteReliabilityEngineer - Oracle you should have experience with: Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning - SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). … skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis More ❯
Join us as a Senior SiteReliabilityEngineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. All potential applicants are encouraged to scroll through and read the complete job description before applying. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them To be successful as a Senior SiteReliabilityEngineer - Oracle you should have experience with: Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning – SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python … skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis More ❯
Are you a seasoned SitereliabilityEngineer looking for an exciting new challenge? Join this team and transition into maintaining and enhancing the reliability of one of the world's largest platforms. In this role, you will utilise your expertise in Golang coding to develop robust applications, ensuring the systems remain resilient, scalable, and efficient. If … presence and commitment to innovation, you will have the opportunity to work on projects that reach millions of users, making a real difference in the tech world. As a SiteReliabilityEngineer, you will be responsible for designing, developing, and maintaining systems and applications using Golang. You will monitor and optimise system performance with tools such as … Grafana, Prometheus, New Relic, and Splunk. Your role will involve identifying and resolving reliability issues, automating processes, and ensuring the seamless operation of the platform. If you have a passion for technology and a drive to ensure excellence, we would love to hear from you More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
and be part of our success story! Role Summary In the role of SiteReliabilityEngineer at Commify, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional … What essentials are we looking for? Proficiency with Microsoft Azure Strong expertise in Terraform, App Services, and Kubernetes Fluent in both written and spoken English A genuine passion for reliability in systems Experience in creating and modifying Terraform deployments Prior experience in an operations role, ideally as a SiteReliabilityEngineer Ability to work cross-functionally More ❯
Bath, Avon, England, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions Ltd
SiteReliability EngineerWork From Home (WFH) + Quarterly Visits to BathFull Time, Initial 12 Month Fixed Term ContractSalary DOE ( ̃ £45k - £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented SiteReliabilityEngineer to join their IT Operations team. This … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
Engineer to join our Tech Enable team. As a Lead Engineer for SiteReliability, you must demonstrate various skills to effectively lead and engage in SRE practices. The successful candidate will act as a point of escalation for critical issues, applying technical expertise to promptly address complex problems in collaboration with additional teams. What you’ll … do: Serve as the SRE Lead's backup, assuming leadership duties when necessary to maintain the continuity and efficiency of SRE operations. Provide day-to-day guidance, support, and informed decision-making for the team, maintaining stability and direction. Serve as a subject matter expert, shaping technical direction, leading initiatives, and mentoring colleagues to build team capability. Stay up to … date with emerging technologies and industry trends, sharing knowledge across company communities to embed SRE best practice. Drive continual improvement by automating manual processes and optimising monitoring systems to achieve full estate coverage. Lead initiatives to improve availability, performance, and scalability through proactive monitoring, capacity planning, and ongoing maintenance. Collaborate with development squads to embed monitoring, reliability, and scalability More ❯
FDM is a global business and technology consultancy seeking a Private Cloud SRE Manager to work for our client within the financial services sector. This is initially a 12-month contract with the potential to extend and become permanent. It will be a hybrid role based in Leeds or Manchester. Our client is looking for a passionate and experienced Engineer to join the SiteReliability Engineering (SRE) team to help run and evolve one of the group’s most critical platforms. You’ll be a key contributor to the stability, performance, and scalability of services, supporting the organisations digital transformation and long-term technology vision. You’ll work actively with container platforms, VMware infrastructure, and observability tooling … effort and improve operational efficiency Oversee and contribute to stability, scalability and performance of services, ensuring use of engineering best practices Requirements Minimum of 5 years’ experience as a SiteReliabilityEngineer within financial services Strong understanding of DevOps principles, including Infrastructure as Code and CI/CD Experience with container platforms and orchestration such as Docker More ❯
We unleash the potential of organisations through the science of board effectiveness, building better businesses and benefiting society. The Opportunity As a Senior SiteReliabilityEngineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs … be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce TOIL and for responding to incidents as part of our 24/7 SRE on-call team. The SRE team: Strives to provide the highest standards of Availability, Scalability, Performance and Security for our Software as a Service environments across multiple cloud vendors and … work Proactively monitors our platform and responds to incidents as part of a 24/7 rota Key responsibilities of the role We're looking for a great Senior SRE to be a hands on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve: Hands on work with technical More ❯
Job Description Would you like to be an Engineer that builds the Cloud, rather than just uses it? At AWS, our Engineers manage the behind-the-scenes software and tools that support the world's largest cloud computing infrastructure. We … offer an exciting opportunity to join a world-class network team in a dynamic environment that feels like a start-up. As a SiteReliabilityEngineer (SRE) , you will deploy, manage, troubleshoot, and innovate the tools, services, and components that enable our network engineers to automate and maintain network operations. Your internal customers are your network engineering More ❯
Job Title: Platform Engineer/SRE Work Location: Bromley/Chester, UK (Hybrid 3 days in a week) Job Description: We are seeking a Platform Engineer/SRE with a strong and diverse technical background. The ideal candidate will possess hands-on development experience along with SiteReliability Engineering (SRE) expertise. This role requires a proactive … platform stability issues, and develop resilient and reliable systems. Key Responsibilities: Provide hands-on technical leadership in platform engineering initiatives. Ensure platform stability and resilience by identifying and resolving reliability … issues. Collaborate with cross-functional teams to deliver scalable and robust system solutions. Key Skills Required: Strong development experience in Java (primary skill). SiteReliability Engineering ( SRE ) experience. Proficiency with Kafka , Mule , and Oracle Database . Ability to work at a managerial level while remaining hands-on with technical tasks. Nice to Have: Knowledge of payment systems More ❯
Chester, Cheshire West and Chester, Cheshire, United Kingdom Hybrid / WFH Options
Ascendion
Job Title: Platform Engineer/SRE Work Location: Bromley/Chester, UK (Hybrid – 3 days in a week) Job Description: We are seeking a Platform Engineer/SRE with a strong and diverse technical background. The ideal candidate will possess hands-on development experience along with SiteReliability Engineering (SRE) expertise. This role requires a proactive … platform stability issues, and develop resilient and reliable systems. Key Responsibilities: Provide hands-on technical leadership in platform engineering initiatives. Ensure platform stability and resilience by identifying and resolving reliability … issues. Collaborate with cross-functional teams to deliver scalable and robust system solutions. Key Skills Required: Strong development experience in Java (primary skill). SiteReliability Engineering ( SRE ) experience. Proficiency with Kafka , Mule , and Oracle Database . Ability to work at a managerial level while remaining hands-on with technical tasks. Nice to Have: Knowledge of Payments systems More ❯
Job Title: Platform Engineer/SRE Work Location: Bromley/Chester, UK (Hybrid – 3 days in a week) Job Description: We are seeking a Platform Engineer/SRE with a strong and diverse technical background. The ideal candidate will possess hands-on development experience along with SiteReliability Engineering (SRE) expertise. This role requires a proactive … platform stability issues, and develop resilient and reliable systems. Key Responsibilities: Provide hands-on technical leadership in platform engineering initiatives. Ensure platform stability and resilience by identifying and resolving reliability … issues. Collaborate with cross-functional teams to deliver scalable and robust system solutions. Key Skills Required: Strong development experience in Java (primary skill). SiteReliability Engineering ( SRE ) experience. Proficiency with Kafka , Mule , and Oracle Database . Ability to work at a managerial level while remaining hands-on with technical tasks. Nice to Have: Knowledge of payment systems More ❯
Splunk SiteReliabilityEngineer/Migration Specialist - Fixed Term Contract 6-12 Months Job Summary: The Splunk SRE/Migration Specialist is responsible for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong … models and recreate in Kibana. Incident Response Ensure the smooth functioning of Splunk platform across BT maintaining the Splunk’s infrastructure in Production & Non-Production environments. To support Splunk SRE & Application teams in investigating incidents following established procedures. Upgrades: Keep the Splunk components to the latest version applicable and carry out the necessary pre & post upgrade checks accordingly. Change Requests … security measures and ensure compliance with relevant standards and best practices. Skills and Qualifications: Hands-on experience with enterprise-level monitoring tools and applications, and familiarity with DevOps/SRE’s best practices. Proven experience with Splunk and Elasticsearch (ELK Stack). Familiarity with containerized environments (Docker, Kubernetes). Proficiency in Unix/Linux systems, Networking protocols, and possess strong More ❯
Overview SiteReliabilityEngineer, Region Services Job ID: AWS EMEA SARL (UK Branch) Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future generations? AWS builds … you'll be part of a world-class team in a dynamic environment that has the entrepreneurial feel of a start-up. This is an opportunity to operate and engineer systems on a massive scale, and to gain world class experience in cloud computing. You'll be surrounded by people who are passionate about cloud computing, believe that first … Build and operate distributed systems Design and build the tools and utilities that are part of the AWS fleet running our internal services Key job responsibilities The Systems Development engineer will be a key member of a new team pioneering automated build and deployment of Windows based services. The team is adopting a code-first and hands off CI More ❯
Overview Senior SiteReliabilityEngineer - Monitoring and Observability. Our team is dedicated to running and uplifting the current environment to the NextGen IT Monitoring and Observability stage. We run and maintain enterprise-wide log analytics, monitoring, and observability services, ensuring optimal performance and customer satisfaction. What role will you play? As a Monitoring and Observability EngineerMore ❯