to minimize recurrence. What We're Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred. Experienced in production environments running in AWS. Comfortable with Infrastructure … GitHub Actions, Azure DevOps. Excellent troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues. Familiarity working in an Agile environment. True understanding of Site Reliability Engineering. Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS. Familiar with programming languages, such More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Couchbase
Site Reliability Engineer role at Couchbase. Posted 2 weeks ago. As industries race to embrace AI, traditional database solutions fall short of rising demands for versatility, performance, and affordability. Couchbase is leading the … to edge. Trusted by over 30% of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission. Role Overview: At Couchbase, Site Reliability Engineers are hybrid software and systems engineers. They are the glue holding things together, whether that’s infrastructure/platform, tooling support for our cloud business or … managing Observability posture for Couchbase. This role is on the Observability team, which is responsible for maintaining Reliability, Availability and Serviceability across all Couchbase cloud offerings. You will be working as a Software Engineer developing and maintaining the Couchbase monitoring stack, which includes the metrics pipeline, alerting, notifications and the like. You More ❯
Site Reliability Engineer (Equity only 0.5%), Manchester. Location: Manchester, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 3. Posted: 10.06.2025. Expiry Date: 25.07.2025. Job Description: Company Description: Luupli is a social media app that has … and the planet. Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking an experienced Senior SRE/DevOps Engineer to play a key role in our cloud migration initiative from AWS to Google Cloud Platform (GCP) for our high-traffic social media application. You will … complies with relevant security standards and policies, implementing practices like VPC configuration, IAM best practices, and Cloud Armor for protection. Required Qualifications: Experience: 5+ years of experience in DevOps, SRE, or cloud operations. Deep expertise in AWS services and hands-on experience with Google Cloud Platform services (Cloud SQL, GKE, Cloud Functions, Cloud Storage, Cloud CDN, etc.). Containerization & Orchestration More ❯
Site Reliability Engineer (Equity only 0.5%), Stockport. Location: Stockport, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 3. Posted: 10.06.2025. Expiry Date: 25.07.2025. Job Description: Company Description: Luupli is a social media app that has … and the planet. Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking an experienced Senior SRE/DevOps Engineer to play a key role in our cloud migration initiative from AWS to Google Cloud Platform (GCP) for our high-traffic social media application. You will … complies with relevant security standards and policies, implementing practices like VPC configuration, IAM best practices, and Cloud Armor for protection. Required Qualifications: Experience: 5+ years of experience in DevOps, SRE, or cloud operations. Deep expertise in AWS services and hands-on experience with Google Cloud Platform services (Cloud SQL, GKE, Cloud Functions, Cloud Storage, Cloud CDN, etc.). Containerization & Orchestration More ❯
Site Reliability Engineer (Equity only 0.5%), Bolton, Greater Manchester. Location: Bolton, Greater Manchester, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 3. Posted: 10.06.2025. Expiry Date: 25.07.2025. Job Description: Company Description: Luupli is a social … and the planet. Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking an experienced Senior SRE/DevOps Engineer to play a key role in our cloud migration initiative from AWS to Google Cloud Platform (GCP) for our high-traffic social media application. You will … complies with relevant security standards and policies, implementing practices like VPC configuration, IAM best practices, and Cloud Armor for protection. Required Qualifications: Experience: 5+ years of experience in DevOps, SRE, or cloud operations. Deep expertise in AWS services and hands-on experience with Google Cloud Platform services (Cloud SQL, GKE, Cloud Functions, Cloud Storage, Cloud CDN, etc.). Containerization & Orchestration More ❯
we're looking for a Site Reliability & Platform Engineer to help lead the way. You'll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code … enablement, to help development teams ship faster, safer, and more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms. Applying SRE principles like SLOs, observability, and incident management to drive service reliability. Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows. Enabling teams through platform tools, reusable Terraform modules … suits: This is a great opportunity for someone passionate about building robust infrastructure and enabling others to move faster and more securely. You might come from a cloud engineering, SRE, or DevOps background - what matters most is your curiosity, systems thinking, and drive to improve operational efficiency. At Sorted, we are committed to fostering an inclusive environment where people from More ❯
Site Reliability Engineer, Bolton, Greater Manchester. Client: Ranger Technical Resources. Location: Bolton, Greater Manchester, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 3. Posted: 31.05.2025. Expiry Date: 15.07.2025. Job Description: Site Reliability Engineer #2494. Position Summary: Our partner, an innovative PaaS … company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. You will build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company's products. Collaboration with development … of endpoints. Experience and Education: Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field/experience. 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles. Deep understanding of AWS and its various modules and services. Strong background in Linux administration and troubleshooting. Proven experience in implementing More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Mindrift
SRE - Freelance Security & Automation Engineer (Pentesting Focus). Posted 1 day ago. About The Company: At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. Our goal? Advance the field of artificial intelligence through collaborative Generative AI projects with domain experts. The Mindrift platform More ❯
Stockport, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
team. Things are moving fast here, and as we continue to grow, reliability, automation, and scalability have never been more important to us. You will be our first SRE, so a strong background in implementing SRE best practices would be ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This is … to build out infrastructure and tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re … looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Sectigo
Site Reliability Engineer, Sectigo, Manchester, England, United Kingdom. Posted 20 hours ago. Job Description: We are looking for a Site Reliability Engineer to join our growing global team at Sectigo. The Site Reliability Engineer will design and implement solutions to reduce toil and ensure reliability of our critical services at Sectigo. This is a full-time and remote position, with the ideal candidate located within 1-hour of vehicle commute … distance from Manchester, U.K. Here are the core functions, responsibilities, and expectations for this role: Ensure the reliability of our critical products and services by meeting or exceeding SRE objectives. Instantiate and maintain production infrastructure using Infrastructure as Code and Configuration Management tools. Build and maintain proper monitoring of our services by utilizing centralized logging and time series databases. More ❯
working to deliver mission critical solutions helping to keep the nation safe, secure and prosperous. Working on leading edge technology solutions including AI/DS, Cyber, Cloud, DevOps/SRE, Platform Engineering. We have secured long term work, across the full spectrum, on the latest framework with the client, which provides the springboard for our ongoing growth and development in … this domain, so join us on what will be an incredible growth journey. Role: As an SRE Engineer, within our National Security Business, you thrive on providing technical leadership, working closely and embedding with customers to maintain the operational integrity of mission critical services in a National Security environment. You’re an SRE Engineer that provides technical leadership … simple add-ons to multiple vendors; you’ll be working directly with customers to ensure successful delivery of software rollouts that enable the nation’s safety. As our next SRE Engineer, you’ll operate with customer availability as a priority, catching issues early and ensuring continued operational integrity of services that are strategically important to the nation. Whilst developing More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting … operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into … user demands and enhance overall service performance. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred skills and experience: Excellent knowledge of Site Reliability Engineering principles, including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Resident Advisor Ltd
Senior SRE Engineer role at Resident Advisor. Posted 4 days ago. Founded in 2001, Resident Advisor (RA) is one of the world's longest-running music media brands and a cornerstone of the dance, electronic and … DJ ecosystem. The site's audience of over 6 million monthly users is drawn in by a combination of news, editorial, club listings and ticketing, RA-branded events at venues and festivals worldwide, original films and a weekly mix series that has run for 18 years. We're looking for a Senior Site Reliability Engineer passionate … MSSQL databases, ElasticSearch, Redis, and Kafka running on AWS EKS (Kubernetes), managed via Terraform with CI/CD pipelines and DataDog monitoring. Your responsibilities include improving infrastructure performance and reliability, driving modernization and cost optimization, developing shared components (e.g. auth systems, GraphQL gateways), enhancing developer experience, maintaining E2E testing systems, and creating internal tooling. This is an opportunity to More ❯
etc. What you'll do: You will work with Development and Product Management to design and deliver new functionality. You will perform deep dives into both systemic and latent reliability issues; partner with software engineers across the organization to produce and roll out fixes. You will drive standardization efforts across multiple disciplines and services in conjunction with SREs throughout … JVM-based languages. You have a solid understanding of continuous integration, deployment and operations concepts. You have production experience of managing Windows Infrastructure running IIS workloads. Passion for resolving reliability issues and identifying strategies to mitigate them going forward. Automation mindset - if you can automate it, do it. Fluency in English. What you'll gain at Intapp: Our culture at More ❯
Job Description: Site Reliability Engineer. Exciting opportunity to join a growing technical leader, in a specialist technical capacity. Hybrid based position (2 days a week on site). Salary up to £60,000. Central Manchester based client. Based out of our revamped central Manchester office, you will join at an exciting time for our organisation, where we … with everything that happens. Utilize a wide range of technologies like Terraform, AWS/GCP, Splunk, New Relic, Grafana, Python, and Golang. We need you to have: experience in SRE/DevOps focused positions; an appreciation of the Software Delivery lifecycle; a finger on the pulse for the latest technologies and trends. To be considered: please apply by clicking online More ❯
A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering … expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. … user demands and enhance overall service performance. This role is eligible for inclusion in the Company's hybrid working from home policy. Preferred skills and experience: Excellent knowledge of Site Reliability Engineering principles, including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
bet365
A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems … directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best … user demands and enhance overall service performance. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred skills and experience: Excellent knowledge of Site Reliability Engineering principles, including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description: Who we are looking for: A Junior Site Reliability Engineer, who will improve system reliability, observability and performance through strong engineering and assist with incident resolution and operational excellence. Supported by our site reliability engineering team, you will work to integrate reliability and observability practices into the Software Development Life Cycle … and enhance overall performance. You will ensure the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including instrumentation with tools such as OpenTelemetry, improving logging practices, and developing features for maintainability. You will also assist in creating tools and automation for effective service management. This … toolsets. Working with IT Operations to provide and support the use of critical tooling that will enable increasing levels of value to the Business. Driving initiatives to enhance system reliability and observability, both within the team and across the department, fostering a culture of continuous improvement. “By applying to us you are agreeing to share your Personal Data in More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Site Reliability Engineer - Healthcare Technology. Location: UK | Hybrid | Full-time | Permanent. We're working with a leading healthtech company to find a Site Reliability Engineer to support and optimise the platforms … behind critical clinical systems. This is a hybrid role offering flexibility, technical challenge, and the chance to make a direct impact on healthcare delivery. You'll join a collaborative SRE team focused on maintaining cloud and on-premise environments, improving deployment pipelines, reducing manual work, and supporting project delivery. You'll work closely with internal teams across software development, support … and delivery. Key technologies include: Linux, and Windows Server. We're looking for enthusiastic people with experience in SRE or DevOps roles, particularly in environments using containerised and cloud-based applications. Strong communication skills and the ability to work across teams are essential. Applicants must have the right to live and work in the UK. If you are interested in More ❯
Site Reliability Engineer - Multi Cloud, Stockport. Client: iO Associates - UK/EU. Location: Stockport, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 2. Posted: 06.06.2025. Expiry Date: 21.07.2025. Job Description: Site Reliability Engineer - Healthcare Technology. UK | Hybrid | Full-time | Permanent. We … behind critical clinical systems. This is a hybrid role offering flexibility, technical challenge, and the chance to make a direct impact on healthcare delivery. You'll join a collaborative SRE team focused on maintaining cloud and on-premise environments, improving deployment pipelines, reducing manual work, and supporting project delivery. You'll work closely with internal teams across software development, support … and delivery. Key technologies include: Linux, and Windows Server. We're looking for enthusiastic people with experience in SRE or DevOps roles, particularly in environments using containerised and cloud-based applications. Strong communication skills and the ability to work across teams are essential. Applicants must have the right to live and work in the UK. If you are interested in More ❯
Location: Bolton, Greater Manchester, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 2. Posted: 06.06.2025. Expiry Date: 21.07.2025. Job Description: Site Reliability Engineer - Healthcare Technology. UK | Hybrid | Full-time | Permanent. We're working with a leading healthtech company to find a Site Reliability Engineer to … behind critical clinical systems. This is a hybrid role offering flexibility, technical challenge, and the chance to make a direct impact on healthcare delivery. You'll join a collaborative SRE team focused on maintaining cloud and on-premise environments, improving deployment pipelines, reducing manual work, and supporting project delivery. You'll work closely with internal teams across software development, support … and delivery. Key technologies include: Linux, and Windows Server. We're looking for enthusiastic people with experience in SRE or DevOps roles, particularly in environments using containerised and cloud-based applications. Strong communication skills and the ability to work across teams are essential. Applicants must have the right to live and work in the UK. If you are interested in More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Time to enhance your scope; broaden your horizon by delving into Site Reliability Engineering (SRE). You’ll take the skills you have picked up in software engineering and apply these to improve overall system and application performance and reliability. You’ll work on internal developer tooling, using modern programming languages such as Golang, Python or TypeScript - so More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month Contract, Manchester. Client: RP International. Location: Manchester, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 4. Posted: 31.05.2025. Expiry Date: 15.07.2025. Job Description: Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month … Contract. Our client, a multinational and respected consultancy, is hiring for a Site Reliability Engineer with expertise in GCP and DevOps tools for a new project in the Communication Sector. Duration: 6 Months + Extensions. Location: Remote (Ideally UK Based). This role has multiple headcounts. Technical Skills/Experience: GCP, GitLab, Terraform Scripts, HashiCorp, Env0, Okta, Security More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
RP International
Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month Contract. Our client, a multinational and respected consultancy, is hiring for a Site Reliability Engineer with expertise in GCP and DevOps tools for a new project in the Communication Sector. Duration: 6 Months + Extensions. Location: Remote (Ideally UK Based). Rate: £300-350 p/ More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
RP International
Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month Contract. Our client, a multinational and respected consultancy, is hiring for a Site Reliability Engineer with expertise in GCP and DevOps tools for a new project in the Communication Sector. Duration: 6 Months + Extensions. Location: Remote (Ideally UK Based). Rate: £300-350 p/ More ❯