Dundee, Scotland, United Kingdom Hybrid / WFH Options
Ivanti
of Ivanti’s SaaS product offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response procedures, application and infrastructure monitoring … and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This role requires that applicants reside in Scotland and be authorized to work in the … a pivotal role in shaping the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing: Researching, maintaining, and contributing to automation of SRE tools and processes. Contributing to solutions toward reducing toil within SRE. Participating in code review and analysis with SRE peers. Composing and reviewing contributions to SRE documentation such as system More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
Concerto
Site Reliability Engineer at Concerto - property asset management system with CAFM. Posted 1 week ago. Site Reliability Engineer - Liverpool (Hybrid Working) As a Site Reliability Engineer at Concerto (part of Bellrock Group), you will play a pivotal role in ensuring the reliability, performance, and scalability of our Intelligent Assets Management SaaS platform. You will lead the improvement of … infrastructure, DevOps, and monitoring across our systems, empowering the engineering team to release features faster and more safely. Your hands-on experience and strategic thinking will help embed SRE principles throughout the team, improving customer experience, system health and developer productivity. You’ll work across internal environments and customer-facing systems, shaping operational excellence and reliability at every More ❯
Site Reliability/GitOps Engineer at Canonical. Posted 3 weeks ago. Canonical is a leading provider of open source software and … operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of global distributed collaboration, with … at Canonical supports and maintains all of Canonical's IT production services. The team is in charge of running services used by over 60 million Ubuntu users. As an SRE & GitOps engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds and in the public clouds. More ❯
Sr. Site Reliability Engineer at Wakapi. We are seeking a highly skilled Senior Site Reliability Engineer to join our Platform Engineering team. The ideal candidate will have a strong understanding of DevOps and Service Level Management (SLM) metrics, with experience in event-driven infrastructure projects using tools like … Terraform, New Relic, Kubernetes, AWS, and Kafka. As a Platform Engineering representative, you will collaborate with engineering teams to ensure our platform infrastructure tooling meets their needs and positively impacts Developer Experience. You will also assist in setting appropriate thresholds for alerts and automations related to their applications. Responsibilities: Design, implement, and maintain scalable and highly available systems … ensuring observability through metrics, tracing, log aggregation, and alerting. Help teams determine settings and thresholds for alerts and automations based on application performance requirements. Monitor, optimize, and ensure system reliability and performance using tools like New Relic and applying DORA metrics. Track uptime, response times, and resolution times to ensure compliance with SLAs, SLOs, and SLIs. Implement and promote More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4
Snr. Site Reliability Engineer (Remote position located in Leeds/Sheffield, United Kingdom) at KnowBe4, Sheffield, England, United Kingdom. Posted 4 days ago. About KnowBe4: KnowBe4, the provider … largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to manage the ongoing problem of social engineering by helping them train employees to make smarter security decisions, every day. More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Mindrift
Freelance Site Reliability Engineer (Security Automation & Penetration Testing). Posted 1 day ago. About The Company: At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. Our goal? Advance the field of artificial intelligence through collaborative Generative AI projects with domain experts. The … an exciting AI project. Enrich your portfolio with new experience. Make a tangible impact on the quality of AI. Seniority level: Entry level Employment type: Part-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting More ❯
Site Reliability Engineer/DevOps Engineer. Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division (https://www.lexisnexisip.com) provides international patent content and a suite of online and analytic tools that meet the evolving needs … and communities. Working here means joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively. About Role: We are a high-performing systems engineering team operating in a fast-paced enterprise environment, focused on modernising our infrastructure while upholding strict security and compliance standards. Our engineers work with Microsoft Hyper-V and a … RDS, Azure VMs, Azure Functions). Maintaining and improving system documentation and operational procedures. Mentoring team members and contributing to a culture of learning and inclusion. Continuously improving infrastructure reliability and reducing manual work (toil). Participating in incident response and root cause analysis. Why Join Us? Join our team and contribute to a culture of innovation, collaboration, and More ❯
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
organisations TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Cheltenham with possible 24/7 call out when on rota Security Clearance: Must have live DV Clearance About … troubleshooting, and disaster recovery best practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes. Understanding of agile software development principles and practices, with the ability to collaborate in a fast-paced, evolving environment. Knowledge of or understanding of: OpenShift Docker Compose/ More ❯
Head of Site Reliability Engineering (SRE), Slough. Client: O Partners. Location: Slough, United Kingdom. Job Category: Other. EU work permit required: Yes. Posted: 31.05.2025. Expiry Date: 15.07.2025. Job Description: Head of Site Reliability Engineering (SRE). Are you ready to lead a global SRE and Production Engineering function for a business-critical suite of platforms used by leading players in financial services? My client is hiring a Head of Production Engineering & SRE to drive the reliability, scalability, and performance of infrastructure that supports mission-critical, client-facing applications across global markets. … Experience required: 10+ years of experience in engineering, with 5+ years in a leadership role in SRE, DevOps, or Production Engineering. Proven track record managing reliable, scalable systems in a high-compliance environment (e.g., FinTech, HealthTech). Strong understanding of modern software development lifecycle, CI/CD, IaC, and cloud-native technologies. Expertise in Kubernetes, AWS (or Azure/ More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Resident Advisor Ltd
Senior SRE Engineer at Resident Advisor. Posted 4 days ago. Founded in 2001, Resident Advisor (RA) is one of the world's longest-running music media brands and a cornerstone of the dance, electronic and DJ ecosystem. … The site's audience of over 6 million monthly users is drawn in by a combination of news, editorial, club listings and ticketing, RA-branded events at venues and festivals worldwide, original films and a weekly mix series that has run for 18 years. We're looking for a Senior Site Reliability Engineer passionate about electronic music … MSSQL databases, ElasticSearch, Redis, and Kafka running on AWS EKS (Kubernetes), managed via Terraform with CI/CD pipelines and DataDog monitoring. Your responsibilities include improving infrastructure performance and reliability, driving modernization and cost optimization, developing shared components (e.g. auth systems, GraphQL gateways), enhancing developer experience, maintaining E2E testing systems, and creating internal tooling. This is an opportunity to More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Mindrift
SRE - Freelance Security & Automation Engineer (Pentesting Focus). Posted 2 days ago. About The Company: At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. Our goal? Advance the field of artificial intelligence through collaborative Generative AI projects with domain experts. The Mindrift platform allows … an exciting AI project. Enrich your portfolio with new experience. Make a tangible impact on the quality of AI. Seniority level: Entry level Employment type: Part-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting More ❯
Birmingham, England, United Kingdom Hybrid / WFH Options
Mindrift
SRE - Freelance Security & Automation Engineer (Pentesting Focus). Posted 6 days ago. About The Company: At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. Our goal? Advance the field of artificial intelligence through collaborative Generative AI projects with domain experts. The Mindrift platform allows … an exciting AI project. Enrich your portfolio with new experience. Make a tangible impact on the quality of AI. Seniority level: Entry level Employment type: Part-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Women's Engineering Society
You’ll contribute to the architecture and design of new and existing systems, establish best working practices, and deliver high-quality software products. With your knowledge of various software engineering methodologies, you’ll bring fresh ideas and approaches that have a real impact at the heart of our mission to keep the UK safe in the real world, and … role with plenty of opportunities to develop yourself and others. You might be reviewing pull requests, defining review, branching, and deployment strategies, or working with a range of software engineering frameworks. You operate at a deep technical level, leveraging your familiarity with languages such as JavaScript, Java, C++, Node, Python, Rust, Go, and .NET. Importantly, you’ll bring a … genuine excitement for discovering new software engineering techniques. You are part of a wider network of peers keen to share experiences, collaborate on projects, and learn from each other. With your experience, you set the standard, share innovative ways of working, and identify new priorities. You might lead and mentor a team or be the technical expert within a More ❯
Join us as an Infrastructure Engineer Recognised as a hands-on technical subject matter expert, you'll lead a team of Distributed and Network Analysts in a critical on-site 24/… fast paced environment Utilising your experience, you'll role model engineering practices in order to provide a high-quality service and deliver strategic outcomes You'll leverage SRE and DevOps principles to develop and maintain automation, reducing manual and repetitive tasks, and removing inefficiencies This is a chance to work with colleagues across the bank to share engineering … and manage the selection, creation, and maintenance of technologies required to meet the needs of our customers, strategic targets, and architecture outcomes. You'll also develop products using modern engineering practices and tools. We'll look to you to collaborate with product owners to develop product roadmaps and manage the lifecycle of the team's products. As well as More ❯
optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit, enhance street safety, and drive toward a sustainable future. Job Summary: The Cloud Fleet Engineering team at Hayden AI is at the forefront of managing and scaling our fleet of AI-powered edge devices deployed across global transportation agencies. As a senior engineer, you … and remediation efforts, learning and applying best practices. Design, build, and maintain scalable cloud services that support device observability, OTA updates, and fleet operations. Lead efforts to improve the reliability, security, and performance of multi-region AWS infrastructure using Infrastructure as Code (IaC) tools. Own CI/CD pipelines for cloud services, enabling efficient, secure, and automated delivery processes. … and operate scalable, secure APIs and microservices. Strong communication skills and a track record of leading cross-functional technical initiatives. Experience working in agile environments with modern DevOps and SRE practices. More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
Arm
Job Overview: We are building a modern, cloud-native compute orchestration platform to support large-scale, compute-intensive engineering workloads. As a Senior Software Engineer, you will play a key role in designing and delivering a highly scalable, reliable, and observable system, with a particular focus on software development and performance testing. This role is … Design, implement, and maintain core components of the platform using cloud-native technologies. Lead efforts around performance benchmarking, load testing, and scalability validation. Define and enforce SLAs; work with SRE/DevOps to ensure high availability and observability. Tune platform performance under high-throughput workloads and lead capacity planning. Automate and execute stress/load tests using both synthetic and More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
MRJ Recruitment
strong DevOps culture, so you'll be a central figure in advocating for scalable infrastructure and robust platform engineering principles. This means close collaboration with development, QA, and SRE teams to build secure, cost-effective, and repeatable systems. You'll blend deep technical work with impactful leadership, needing a solid grasp of production operations, incident response, Infrastructure as Code More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
play a critical role in ensuring system reliability, scalability, and performance across both AWS and Azure environments. This is your opportunity to lead cloud-native transformation and embed SRE best practices into engineering at scale. What you’ll be doing as their Site Reliability Engineer: You’ll be the go-to expert for designing and maintaining … CI/CD pipelines to reduce toil and accelerate deployment frequency. Build observability into everything: own monitoring, alerting, and incident response to minimize MTTR and improve system health. Champion SRE culture and reliability-focused engineering: help shape sustainable engineering practices, SLAs, SLOs, and error budgets. Contribute across the stack with flexibility in tooling: experience with Python, Go … dental insurance. 25 days annual leave + bank holidays. R&D and personal training budgets. And much more... This is an incredibly rare chance for a seasoned, high-performing SRE to leave your mark on high-impact transformation projects in a business that’s truly committed to doing things the right way. More ❯
Stockport, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
team. Things are moving fast here, and as we continue to grow, reliability, automation, and scalability have never been more important to us. You will be our first SRE, so a strong background in implementing SRE best practices would be ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This is … to build out infrastructure and tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re … looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles: SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
Wakefield, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
team. Things are moving fast here, and as we continue to grow, reliability, automation, and scalability have never been more important to us. You will be our first SRE, so a strong background in implementing SRE best practices would be ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This is … to build out infrastructure and tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re … looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles: SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
JOB TITLE: Site Reliability Engineer (SRE) SALARY: £86,964-£102,310 per annum LOCATION: Bristol HOURS: Full-time – 35 hours per week WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our Bristol offices. … Our Cloud Site Reliability Engineering team is looking for an experienced and passionate Engineer to join our Consumer Servicing and Engagement Platform. As an application-level SRE, you’ll be an active and leading member of a cloud-focused team of engineers – working on one of the Group’s flagship projects to run and maintain a set … Cloud Platform (GCP) that will enable the business to realise the next generation of services that form the Bank’s future vision. Accountabilities will include: Delivering against GCP and SRE Public Cloud technology roadmaps. Collaboratively working with other engineering teams to release and evolve enterprise-class solutions that are reliable and evergreen as cloud technologies continue to mature at More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
Bellrock Group
Site Reliability Engineer - Liverpool (Hybrid Working) As a Site Reliability Engineer at Concerto (part of Bellrock Group), you will play a pivotal role in ensuring the reliability, performance, and scalability of our Intelligent Assets Management SaaS platform. You will lead the improvement of … infrastructure, DevOps, and monitoring across our systems, empowering the engineering team to release features faster and more safely. Your hands-on experience and strategic thinking will help embed SRE principles throughout the team, improving customer experience, system health and developer productivity. You’ll work across internal environments and customer-facing systems, shaping operational excellence and reliability at every … scalable environments using technologies such as Terraform. Work closely with developers, QA, and DBAs to improve platform design and release workflows. Implement and promote best practices for operational readiness, reliability, and fault tolerance. Guide the platform team on tooling, automation, instrumentation, observability and best practice in Azure. Build a high-quality platform aligned to the Microsoft Cloud Adoption Framework More ❯
Ipswich, England, United Kingdom Hybrid / WFH Options
Devopshunt
brings together Design, Development, Test and Technical Services all under one roof. Collectively we work in an Agile/Scrum model, uniquely positioning us to exploit the best of SRE/DevOps practices. You will help us manage changes and deliveries for our platforms to support the ‘stand out services’ our company is so proud of. You will have opportunities … to contribute to the best practices used by our SRE team within Software Delivery. The team is diverse and adaptive, varying from beginners to experienced hybrid engineers. The activities covered are broad, exploiting a range of cloud environments such as AWS and GCP, truly embracing the hybrid skills of the future! What you’ll be doing: Be the delivery-focused … coach for the teams, using various tools and agile methodologies focusing on driving efficiency for our Site Reliability Engineering (SRE) teams. Play a crucial part in collecting requirements, setting up deliverables, reporting progress to executive stakeholders, managing complex dependencies, and contributing to the product delivery process. Directly impact our customers by owning and scheduling our critical projects More ❯
Swindon, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Site Reliability Engineer, Swindon, Wiltshire. Client: Harrington Starr. Location: Swindon, Wiltshire, United Kingdom. Job Category: Other. EU work permit required: Yes. Posted: 04.06.2025. Expiry Date: 19.07.2025. Job Description: Site Reliability Engineer – Fintech Up to … s leading financial institutions to streamline international payments and ensure compliance at scale - all through smart automation and modern cloud-native infrastructure. They’re looking to bring on a Site Reliability Engineer with deep experience in observability. If you’ve worked with tools like Prometheus in AWS, supported development teams with tracing and performance insights, and thrive … SLIs/SLOs, automating tasks, and reducing operational noise. Working with AWS (EKS, EC2, Lambda, RDS), Terraform, and CI/CD tools. What They’re Looking For: Experience in SRE or DevOps roles in a production environment. Strong knowledge of observability tools, especially Prometheus in AWS. Experience with tracing, metrics, and logs to support development teams. Skills in Python or More ❯
Site Reliability Engineer, Leeds, West Yorkshire. Client: Ranger Technical Resources. Location: Leeds, West Yorkshire, United Kingdom. Job Category: Other. EU work permit required: Yes. Posted: 31.05.2025. Expiry Date: 15.07.2025. Job Description: Position: Site Reliability Engineer #2494 Position Summary: Our partner, an innovative PaaS company … specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. The role involves building and maintaining highly available systems, supporting CI/CD pipelines, and collaborating with development, DevOps, and other teams to maintain high uptime, security, and user … millions of endpoints. Experience and Education: Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related field/experience. 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles. Deep understanding of AWS and its modules and services. Strong Linux administration and troubleshooting skills. Experience with implementing and managing More ❯