SiteReliability Engineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site … and remotely, supporting mission-critical systems where performance and reliability are paramount. The SiteReliability Engineer Role: We are seeking a SiteReliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and evaluate More ❯
SiteReliability Engineer £65,000 £95,000 DOE Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site … and remotely, supporting mission-critical systems where performance and reliability are paramount. The SiteReliability Engineer Role: We are seeking a SiteReliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and evaluate More ❯
SiteReliability Engineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits)Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site … and remotely, supporting mission-critical systems where performance and reliability are paramount. The SiteReliability Engineer Role: We are seeking a SiteReliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and evaluate More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
SiteReliability Engineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site … and remotely, supporting mission-critical systems where performance and reliability are paramount. The SiteReliability Engineer Role: We are seeking a SiteReliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and evaluate More ❯
Chester, Cheshire West and Chester, Cheshire, United Kingdom Hybrid / WFH Options
Ascendion
Job Title: Platform Engineer/SRE Work Location: Bromley/Chester, UK (Hybrid – 3 days in a week) Job Description: We are seeking a Platform Engineer/SRE with a strong and diverse technical background. The ideal candidate will possess hands-on development experience along with SiteReliabilityEngineering (SRE) expertise. This role requires a proactive individual … who can lead by example, address platform stability issues, and develop resilient and reliable systems. Key Responsibilities: Provide hands-on technical leadership in platform engineering initiatives. Ensure platform stability and resilience by identifying and resolving reliability issues. … Collaborate with cross-functional teams to deliver scalable and robust system solutions. Key Skills Required: Strong development experience in Java (primary skill). SiteReliabilityEngineering ( SRE ) experience. Proficiency with Kafka , Mule , and Oracle Database . Ability to work at a managerial level while remaining hands-on with technical tasks. Nice to Have: Knowledge of Payments systems More ❯
Reliability Engineer III Apply locations Belfast - Millennium House time type Full time posted on Posted 3 Days Ago job requisition id 32913 CME Group is seeking a SRE III to help, build, operate and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME's Globex trading platform. Our systems deliver an … learn how we observe, monitor, automate, and improve Production service reliability and act as a mentor to junior colleagues. He/she will have a keen interest in SRE and enjoy the cut-and-thrust of operating Production systems. They will be a strong communicator, and may have previously worked in an SRE role, a software engineering role … ideas and reliability improvement suggestions to the Product backlog Support the migration of markets applications to Google Cloud Platform (GCP) Act as a mentor to L2 and L1 SRE colleagues What We're Looking for: Experience with Linux-based systems Experience with Cloud-based platform(s) - Google Cloud Platform, GCE, and/or GKE a bonus Understanding of application More ❯
Chester, Cheshire West and Chester, Cheshire, United Kingdom
Ascendion
Job Description: We are seeking a Platform Engineering Manager with a strong hands-on background in Java development and SiteReliabilityEngineering (SRE). The ideal candidate will have a broad technical skillset across Java, Spring, MuleSoft, Kafka, and Oracle DB, and must be capable of leading platform stability efforts while contributing directly to development. Experience … and implement improvements. Architect and develop resilient backend systems primarily using Java, Spring, Kafka, and Oracle. Implement best practices for observability, incident response, and operational excellence in line with SRE principles. Drive automation and self-healing mechanisms across platform components. Provide technical leadership and hands-on coding as needed. Monitor, troubleshoot, and resolve production issues, conducting root cause analysis and … platform engineering experience. Strong Java expertise with deep understanding of backend design patterns and frameworks (Spring Boot preferred). Proven experience in SiteReliabilityEngineering (SRE), including monitoring, alerting, and incident management. Hands-on experience with Kafka, MuleSoft, and Oracle DB. Familiarity with performance tuning, system design, and distributed computing concepts. Experience with CI/CD More ❯
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Addition
SiteReliability Engineer (Defence) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a SiteReliability Engineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: SiteReliability Engineer … Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. Driving automation initiatives to reduce manual processes. Managing repositories and version control for seamless collaboration. Partnering with development teams to align platform capabilities with requirements. Supporting long … in security, maintainability, and scalability. Staying ahead of emerging technologies to keep the platform cutting-edge. Main Skills Needed: Applications must be eligible for Security Clearance. Proven experience in SiteReliability or Platform Engineering (5+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS More ❯
SiteReliability Engineer (International Travel) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a SiteReliability Engineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: SiteReliability … Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. Driving automation initiatives to reduce manual processes. Managing repositories and version control for seamless collaboration. Partnering with development teams to align platform capabilities with requirements. Supporting long … the platform cutting-edge. Main Skills Needed: Applications must be eligible for Security Clearance. Happy to travel internationally according to project requirements (All costs covered). Proven experience in SiteReliability or Platform Engineering (4+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS More ❯
Junior SiteReliability … Engineer We are currently working with a leading Financial Services company, who are looking for a Junior SiteReliability Engineer to join their ever-expanding platform/SRE team from their Shoreditch, London, Office where you will be expected to travel to the office 4 days a week. They are looking for you to have excellent cloud knowledge … ideally AWS as well as having experience of Powershell/Python. As the Junior SiteReliability Engineer, you will be a self-starter who has excellent stakeholder management experience who can show outcome based work. You will ideally have 2 years of commercial experience coming from an IT Operations/Cloud infrastructure background. Please note this is an More ❯
FDM is a global business and technology consultancy seeking a Private Cloud SRE Manager to work for our client within the financial services sector. This is initially a 12-month contract with the potential to extend and become permanent. It will be a hybrid role based in Leeds or Manchester. Our client is looking for a passionate and experienced Engineer … to join the SiteReliabilityEngineering (SRE) team to help run and evolve one of the group’s most critical platforms. You’ll be a key contributor to the stability, performance, and scalability of services, supporting the organisations digital transformation and long-term technology vision. You’ll work actively with container platforms, VMware infrastructure, and observability tooling … ensuring their services are resilient and efficient. You’ll also lead and participate in post-mortems, drive automation, and continuously improve the platform through engineering-led solutions. This role also involves working in Agile environments, collaborating across multiple teams and disciplines to deliver high-quality outcomes at pace. Responsibilities Enhance and support a wide range of platform technologies, including More ❯
Join our team as a MongoDB SiteReliability Engineer, where you'll be at the forefront of designing and maintaining robust, high-performance systems that power critical financial services. In this dynamic and fast-paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure, and scalable. You'll work on automating operations, enhancing system … If you're motivated by solving, multi-layered problems and building systems that perform reliably amid shifting priorities, we encourage you to apply. To be successful as a MongoDB SiteReliability Engineer, you should have experience with: Working in SiteReliabilityEngineering, DevOps, and MongoDB administration in financial services. Using MongoDB features like replicaset, sharding More ❯
Lisburn, County Antrim, United Kingdom Hybrid / WFH Options
Camlin
industries, including power and rail, and also has interests in a number of R&D projects in a variety of scientific sectors. At Camlin we believe in high quality engineering and design, allowing us to develop market leading products and services. In short, we love creating value for our customers by solving … difficult problems. As of today, the Camlin operation spans over 20 countries across the globe. Job Overview We are seeking a dedicated and experienced SiteReliability Engineer (SRE) to join our dynamic team. The SRE will be responsible for ensuring the reliability, performance, and availability of our critical systems and services. This role requires a blend of … software engineering and operations skills to build and run large-scale, distributed, fault-tolerant systems. Key Responsibilities System Reliability and Performance Design, implement, and maintain scalable and reliable infrastructure. Monitor system performance, detect issues, and ensure maximum uptime. Develop and implement strategies for disaster recovery and data backup. Automation and Tooling Automate repetitive tasks to improve efficiency and More ❯
Cheltenham, Gloucestershire, South West, United Kingdom
Oscar Associates (UK) Limited
SiteReliability Engineer | Cheltenham | £600 per day (Outside IR35) About the Role: We're seeking an experienced Site … Reliability Engineer with live eDV clearance to join an on-site team in Cheltenham. This contract role involves supporting and maintaining a managed cross-domain service, applying SRE practices to ensure reliability, security, and performance. Contract Details: Employment: Contract (Outside IR35) Rate: £600 per day Length: 6 months (long-term extensions very likely). Location: Cheltenham … days on-site) Clearance: Live eDV required Start Date: ASAP Key Responsibilities: Build and deploy code using Java, Maven, NPM, Terraform, and Ansible across OpenShift, RHEL/CentOS, and Docker. Monitor and optimise system performance with Influx and Grafana. Provide 2nd/3rd line support, incident response, and root cause analysis. Carry out BAU maintenance including patching, database housekeeping More ❯
an essential role in supporting AWS public cloud infrastructure while championing automation through Infrastructure as Code solutions such as Terraform. Your day-to-day activities will involve collaborating with SRE and engineering teams to enhance system observability, proactively managing operational risks, maintaining high standards of security compliance, and ensuring robust disaster recovery capabilities. You will be responsible for documenting … Maintain the reliability and security of cloud environments by implementing robust monitoring tools and adhering to industry best practices.* Enhance observability and telemetry within cloud-hosted environments using SRE methodologies to deliver on Service Level Agreements (SLAs), Objectives (SLOs), and Indicators (SLIs).* Document and regularly review operational risks within the cloud environment, ensuring that identified issues are tracked … for all cloud-hosted services through effective backup strategies and disaster recovery processes, including planning and conducting quarterly DR tests.* Collaborate closely with SiteReliabilityEngineering (SRE) and engineering teams to ensure optimal management of the cloud environment.* Support asset management processes throughout their lifecycle, ensuring compliance with end-of-service (EOS) and end-of-life More ❯
SiteReliability Engineer - Barcelona Join a leading global travel technology company that's transforming the way businesses manage travel. You will be working with cutting-edge platforms combined with world-class travel inventory with powerful management tools, delivering freedom for … travellers and control for companies, saving time, money,and hassle for everyone. Our client are an award-winning scale-up company seeking a talented SiteReliability Engineer (SRE) to help ensure our systems are fast, reliable, and ready to scale. What is in t for you: Salary up to €85,000plus equity in the company. Generous holiday allowance … company events. Mental health support tools. Significant opportunities for growth and progression. The role: Design, build, and maintain scalable, secure cloud infrastructure in AWS. Monitor and manage system performance, reliability, and security. Implement and refine monitoring tools to ensure system health and availability. Partner with development teams to create resilient, secure applications. Participate in our on-call rotation, responding More ❯
SiteReliability Engineer Hybrid As a senior SRE you would be joining a growing HashiVault squad as part of the strategy to offer more services and a better user experience.. What we're looking for: Hands-on experience with Google Cloud Platform (GCP) Hands-on experience with Kubernetes Experience in implementing technological solutions, managing applications and services in … Security Engineer Hashicorp certifications such as Terraform Associate, Terraform Authoring and Operations Professional, Vault Associate, or Vault Operations Professional Experience migrating application workloads into GCP and/or GKE SiteReliability Engineer Due to the volume of applications received for positions, it will not be possible to respond to all applications and only applicants who are considered suitable More ❯
Overview SiteReliability Engineer - Global Network Services Transformation A leading financial technology organisation is embarking on an exciting journey to transform its Network Services Group , and they're now seeking a SiteReliability Engineer to join their growing team. This opportunity is perfect for someone who thrives at the intersection of software engineering and infrastructure … reliability . The successful candidate will design, develop, and maintain self-service automation tools that drive efficiency, reduce costs, and improve resilience across one of the world's most sophisticated network infrastructures. Working with colleagues across the US, UK, India, and Singapore , this engineer will play a pivotal role in advancing the company's automation-first approach, deploying microservices … JavaScript/jQuery/HTML5/CSS is highly desirable. Familiar with Terraform or similar IaC tools. Comfortable in Linux environments; confident using VSCode . Strong grounding in software engineering practices and DevOps culture. Excellent communicator with analytical problem-solving skills. Experience in networking or security automation is a distinct advantage. Personal Qualities Proactive, with a problem-solving mindset. More ❯
Bath, Avon, England, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions Ltd
SiteReliability EngineerWork From Home (WFH) + Quarterly Visits to BathFull Time, Initial 12 Month Fixed Term ContractSalary DOE ( ̃ £45k - £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented SiteReliability Engineer to join their IT Operations team. This role … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and More ❯
Bath, Somerset, South West, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
SiteReliability Engineer Work From Home (WFH) + Quarterly Visits to Bath Full Time, Initial 12 Month Fixed Term Contract Salary DOE ( ?????? £45k - ? £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented SiteReliability Engineer to join their IT Operations … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
Reliability Engineer to join our Tech Enable team. As a Lead Engineer for SiteReliability, you must demonstrate various skills to effectively lead and engage in SRE practices. The successful candidate will act as a point of escalation for critical issues, applying technical expertise to promptly address complex problems in collaboration with additional teams. What you’ll … do: Serve as the SRE Lead's backup, assuming leadership duties when necessary to maintain the continuity and efficiency of SRE operations. Provide day-to-day guidance, support, and informed decision-making for the team, maintaining stability and direction. Serve as a subject matter expert, shaping technical direction, leading initiatives, and mentoring colleagues to build team capability. Stay up to … date with emerging technologies and industry trends, sharing knowledge across company communities to embed SRE best practice. Drive continual improvement by automating manual processes and optimising monitoring systems to achieve full estate coverage. Lead initiatives to improve availability, performance, and scalability through proactive monitoring, capacity planning, and ongoing maintenance. Collaborate with development squads to embed monitoring, reliability, and scalability More ❯
Join us as a Senior SiteReliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them To be successful as … a Senior SiteReliability Engineer - Oracle you should have experience with: Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning - SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise … strategic thinking and digital and technology, as well as job-specific technical skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability More ❯
Join us as a Senior SiteReliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence.Considering making an application for this job Check all the details in this job description, and then click on Apply. This role will include: applying software engineering techniques, automation, and best practices in incident response … ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them To be successful as a Senior SiteReliability Engineer - Oracle you should have experience with: Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning – SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages … strategic thinking and digital and technology, as well as job-specific technical skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
us and be part of our success story! Role Summary In the role of SiteReliability Engineer at Commify, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional … Maintaining high levels of system performance through monitoring and performance tuning Implementing scalability and fault tolerance Automating processes and improving operational efficiencies Troubleshooting application and middleware challenges Collaborating with engineering teams to support high-throughput production environments Building and maintaining robust deployment pipelines What essentials are we looking for? Proficiency with Microsoft Azure Strong expertise in Terraform, App Services … and Kubernetes Fluent in both written and spoken English A genuine passion for reliability in systems Experience in creating and modifying Terraform deployments Prior experience in an operations role, ideally as a SiteReliability Engineer Ability to work cross-functionally, take ownership of tasks, and prioritize effectively Excellent communication and collaboration skills Experience with monitoring solutions (e.g. More ❯
Skipton, North Yorkshire, Yorkshire, United Kingdom
Skipton Building Society
background, and whatever your goals, we'll help you take the next step towards a better future. You will be working within our IT function as part our Cloud Engineering teams on our Azure Platform, ensuring it is fit for purpose, resilient, scalable and provides the capabilities needed to deliver value at pace. The teams consist of around … who you will support as overall lead technical authority and decision maker. The teams work very closely with the rest of the IT function and our software and data engineering teams who are building cutting edge solutions within our Azure Platform. Whats In It For You? Skipton values work/life balance and we are proud to support hybrid … teams adopt cutting-edge Azure methodologies and adhere to industry standards for security, compliance, and performance. Driving the adoption of DevOps and SiteReliabilityEngineering (SRE) principles to improve operational efficiency, resilience, and service reliability. Engaging with Senior Leadership, IT Teams, Security teams, Engineering teams and Data teams to shape cloud strategy ensuring that cloud More ❯