are We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and deploying scalable, reliable systems. You are passionate about automation, cloud technologies, and continuous improvement. Must have: Proven track record …
the reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software … to ensure high availability and performance Collaborate with product engineering teams to design/build fit-for-purpose and observable software Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role and having previously worked in a Software Engineering role in .Net and C# or Java or similar OO development language. Proficiency in C# or … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.
Bristol, Avon, England, United Kingdom Hybrid / WFH Options
Robert Walters
design, development, and operation of cloud infrastructure and applications on Google Cloud Platform. You will work collaboratively with engineering and infrastructure teams to implement site reliability engineering (SRE) principles, focusing on system reliability, observability, automation, and operational excellence. This role follows a hybrid working model, requiring attendance at the Bristol office for at least two days per … week or 40% of the working time. Key Responsibilities Promote and embed SRE best practices within engineering teams and microservices environments Partner with infrastructure and DevOps engineers to improve system resilience and performance Troubleshoot complex incidents and implement long-term solutions through code and automation Develop and improve automation pipelines to reduce manual operations and enhance system efficiency Contribute to … multiple strategic digital initiatives and collaborate across engineering domains Essential Skills and Experience Background in software engineering or telemetry, with current focus on SRE Extensive experience with public cloud platforms, particularly Google Cloud (or AWS/Azure) Proven ability to manage Kubernetes clusters in production environments Competence in scripting and development using languages such as Python, Java, Go, Bash, or …
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies …
and transferable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent tools … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your …
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
across customer platforms, internal systems, and exploring AI-driven solutions - making it an exciting space for anyone looking to shape the future of property technology in the UK. The Lead Site Reliability Engineer owns the operational reliability of the Connells Microsoft Azure public cloud platform. You are data-informed, customer-first and know from your own engineering … quality drives service reliability. Your engineering background helps you understand your customers, your platform and how automation is the only way to scale quality. You will be a champion of site reliability practices within and without your team. You form part of a ‘you build it-you … run it’ platform with an agile mindset. We want to hear from you if: You are a Site Reliability Engineer in a past life, or an SRE ready to step up and lead. You have recent Microsoft Azure experience, but we also recognise the transferability of cloud engineering and operations fundamentals from Amazon Web Services and Google …
Gloucester, Gloucestershire, South West Hybrid / WFH Options
CGI
Site Reliability Engineer (DV Security Clearance) Position Description CGI was recognised in the Sunday Times Best Places to Work List 2025 and has been named one of the 'World's Best Employers' by Forbes magazine. We offer a competitive salary, excellent pension, private healthcare, plus a share scheme (3.5% + 3.5% matching) which makes you a CGI … GitLab, ELK stack, Terraform, Grafana, SonarQube, OpenShift, Linux Required qualifications to be successful in this role Proven experience in Site Reliability Engineering or a similar DevOps/SRE role supporting cloud-based applications. Strong scripting and automation skills using Bash, Python, or Go. Experience with CI/CD pipelines and tools such as Jenkins, GitLab CI, and Ansible. … working on big data projects is highly advantageous. Qualifications: Degree in Computer Science, Engineering, or related technical field (or equivalent practical experience). Relevant certifications in AWS, DevOps, or SRE practices are a plus. #LI-JS2 Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you'll …
Senior Site Reliability Engineer Start: ASAP Duration: 6-12 months Location: hybrid, London (Tuesdays, Thursdays WFH) Pay: negotiable, inside IR35 We're looking for an experienced DevOps Engineer to join our team on a contract basis, with a focus on AWS infrastructure, observability tooling, and CI/CD automation. This is a hands-on role supporting … Python, Bash, Go or SQL - Work with Git-based workflows for infrastructure as code - Troubleshoot Kubernetes workloads and containerised services - Participate in an on-call rotation to ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting …
Hereford, Herefordshire, West Midlands, United Kingdom Hybrid / WFH Options
Twinstream Limited
Site Reliability Engineer Hybrid (Near Hereford) | £80,000 - £110,000 DOE + Clearance | DV Clearance Required Join a team that builds technology where it matters most. In 2019, TwinStream was founded by a group of engineers with deep experience solving complex, cross-domain problems within government organisations. Today, we continue that mission delivering cutting-edge solutions with … Reliability Engineer We're growing and so is demand for the secure, high-performance systems we deliver to government clients. As a Site Reliability Engineer (SRE), you'll play a key role in ensuring that these critical services are always available, resilient, and cost-effective. You'll work closely with development and support teams to evolve infrastructure, optimise … our work, candidates must be eligible for Developed Vetting (DV) clearance. All offers are subject to security screening. Ready to Engineer Systems That Matter? If you're a proactive SRE looking to work on challenging, high-impact projects in a flexible and supportive environment, we'd love to hear from you. Apply now and let's build the future together.
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
Site Reliability Engineer Hybrid – Bristol (with occasional travel to other sites & possible 24/7 callout when on rota) | £80,000 – £110,000 DOE Join a Team Built on Technical Excellence At TwinStream, we're not just technologists; we're mission-driven engineers solving some of the UK government's most complex cross-domain challenges. Founded by … and remotely, we continue to grow, driven by demand for our high-trust, high-performance services. Now, we're looking for a Site Reliability Engineer (SRE) to join our fast-growing team. Why this Site Reliability Engineer role? Our SREs are the quiet heroes behind the scenes, ensuring that our mission-critical systems … prem environments, shaping infrastructure, delivery pipelines, and monitoring systems while partnering closely with dev and support teams. This role is ideal for someone passionate about engineering excellence, automation, and reliability at scale. Why TwinStream? We believe the best engineering happens in environments that respect life outside of work. Here's what we offer: 8% Employer Pension Contribution Private Medical …
are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability, ensuring … and SOPs Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon …
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
organisations TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Bristol (occasional visits to other sites) with possible 24/7 call out when on rota Security Clearance: Eligible … for these services continues to grow in both scope and scale. We are seeking an experienced Site Reliability Engineer to help satisfy that demand. As an SRE you will be responsible for ensuring the availability, performance and cost effectiveness of these services. You will be working with multiple feature development teams and the BAU/Support team … to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks. Key Responsibilities of the Site Reliability Engineer: Collaborate with Software Engineers to improve reliability and performance in their subsystems Partner with System Administrators in automating toil and eliminating alerts …
Site Reliability Engineer/DevOps Engineer Location: Farringdon | Time type: Full time | Posted 9 days ago | Job requisition ID: R94904 Are you enthusiastic about designing and managing cloud platforms? Do … you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights … RDS, Azure VMs, Azure Functions). Maintaining and improving system documentation and operational procedures. Mentor team members and contribute to a culture of learning and inclusion. Continuously improving infrastructure reliability and reducing manual work (TOIL). Participating in incident response and root cause analysis. Why Join Us? Join our team and contribute to a culture of innovation, collaboration, and …
We are seeking a foundational member for the Cloud Infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities … ensure cost efficiency. Ensure the security and compliance of our systems, adhering to industry standards and regulations. Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement. Stay current with emerging technologies and industry trends to improve our site reliability practices. Is this you? Proven expertise in Site Reliability … Kubernetes) and orchestration tools. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) for maintaining system health and performance. Ability to lead and mentor junior engineers in reliability and system optimization best practices. Excellent communication skills for effective collaboration with cross-functional teams and stakeholders. Proactive in identifying and mitigating potential system failures and performance issues. Preferred …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
enable innovation across the business. To support that mission, we're growing our Data Engineering Platform team and investing deeply in modern, reliable infrastructure. We're seeking a DevOps engineer with hands-on expertise in containerisation, orchestration, cloud platforms, continuous-delivery pipelines, and cloud at scale. In this role, you'll partner with the team to develop new functionality … cloud deployments (AWS-first) using Terraform and platform tooling Improve security posture across IAM, secrets, and networking Help the team ship faster and safer by mentoring on DevOps and SRE practices We're solving for reliability, compliance, performance, and speed - at once. You'll be key to making it work. Required Skills: Knowledge of one or more programming languages … highly leveraged platform, enabling hundreds of engineers to use critical data systems with confidence. You'll have ownership, impact, and a seat at the table as we define how SRE and platform thinking shape our next-generation data infrastructure. If you're looking to scale not just systems but the capabilities of the engineers around you, this is your team.
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
to gemstone supplies. They have a presence in London, Hong Kong, Amsterdam and Mumbai, and now in New York in 2001. About the role: As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution and team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first-class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team you would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with …
Dundee, Angus, United Kingdom Hybrid / WFH Options
Ivanti
of Ivanti's SaaS product offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response procedures, application and infrastructure monitoring … and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This role requires that applicants reside in Scotland and be authorized to work in the United … a pivotal role in shaping the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing Researching, maintaining, and contributing to automation of SRE tools and processes Contributing to solutions toward reducing toil within SRE Participating in code review and analysis with SRE peers Composing and reviewing contributions to SRE documentation such as system …
passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability, ensuring … and SOPs Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure Other duties as needed About You 7+ years' experience in Site Reliability Engineer roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability …
Out in Science, Technology, Engineering, and Mathematics
enforcement systems. Bring your leadership, technical expertise, and high bar for quality to a team that's building the foundation for fast, reliable cloud services worldwide. As a Senior Site Reliability Engineer in the Axon IaC group, your responsibilities will include contributing to architectural decisions, tool selection and guiding best practices for our IaC provisioning pipelines. You … Axon to deliver new features efficiently. You are obsessed with achieving the high performance and reliability our customers demand and reducing toil. You will work closely with both SREs and SWEs, and your technical deliverables will join forces with partner teams in building our cloud infrastructure provisioning platform and CI/CD pipelines of the future. The ideal candidate … You'll Do Location: London, England Build robust, easy-to-use foundational platforms and tools that enable engineering teams to provision services rapidly, consistently, and securely. Exemplify cloud-native site reliability best practices. Write code that is performant, maintainable, clear, and concise. Employ strong problem-solving skills, with the ability to debug problems in cloud-native distributed systems.
Site Reliability Engineer with Python Our client is looking to bring on a site reliability engineer to help deploy, manage, troubleshoot, and enhance our complex cloud-based set of internal tools and externally managed services for a variety of users across our wide-ranging organization. You will have at least 7 to 10 years hands … on expertise working as a Site Reliability Engineer. You will work closely with IT, product, and engineering to extend and maintain this set of tools and services and to help debug and resolve problems. In addition, the ideal candidate will proactively look for system weaknesses and find ways to resolve them before they can cause production issues via …
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
organisations TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Cheltenham with possible 24/7 call out when on rota Security Clearance: Must have live DV Clearance About … troubleshooting, and disaster recovery best practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes. Understanding of agile software development principles and practices, with the ability to collaborate in a fast-paced, evolving environment. Knowledge of or understanding of: OpenShift Docker Compose/ …
PRINCIPAL SITE RELIABILITY ENGINEER Salary: £110,000 - £125,000 Location: London A leading global financial institution is seeking a Principal Site Reliability Engineer to provide essential support for their Foreign Exchange (FX) desk, focusing on trading and risk applications, including an advanced algorithmic ultra-low latency stack. This is a unique opportunity to play a pivotal role in ensuring the reliability, performance, and scalability of a real-time trading environment by applying best-in-class SRE principles. You will work directly with senior traders and developers on the trading floor, optimising workflows, troubleshooting complex issues, and driving ongoing improvements across both processes and technology. What you …