Prestigious opportunity with a Global Investment Giant for a SiteReliabilityEngineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Working closely with cross-functional teams across the globe, including business stakeholders, product managers, and software engineers, you will … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including, incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our SiteReliabilityEngineering Manager you will:- Lead, coach, and develop a high-performing SRE team. … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive More ❯
warrington, cheshire, north west england, united kingdom
Hays
Prestigious opportunity with a Global Investment Giant for a SiteReliabilityEngineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Working closely with cross-functional teams across the globe, including business stakeholders, product managers, and software engineers, you will … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including, incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our SiteReliabilityEngineering Manager you will:- Lead, coach, and develop a high-performing SRE team. … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive More ❯
bolton, greater manchester, north west england, united kingdom
Hays
Prestigious opportunity with a Global Investment Giant for a SiteReliabilityEngineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Working closely with cross-functional teams across the globe, including business stakeholders, product managers, and software engineers, you will … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including, incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our SiteReliabilityEngineering Manager you will:- Lead, coach, and develop a high-performing SRE team. … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive More ❯
Prestigious opportunity with a Global Investment Giant for a SiteReliabilityEngineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Scroll down to find an indepth overview of this job, and what is expected of candidates Make an application … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including, incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our SiteReliabilityEngineering Manager you will:- Lead, coach, and develop a high-performing SRE team. … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
MI5
healthy work life balance and offer a range of working patterns, including full time, part time, and compressed hours. Hybrid working, which refers to a combination of working on site and from home, may be more limited due to the nature of the work. However, some homeworking may be available depending on business needs. We also support flexible start … to the architecture and design of both new and existing systems, establish and promote best practices, and deliver high quality software solutions. Drawing on your expertise in various software engineering methodologies, you'll introduce fresh ideas and innovative approaches that make a real impact at the core of our mission: keeping the UK safe, both in the real world … and online. You'll bring a genuine enthusiasm for discovering and applying new software engineering techniques. As part of a wider network of peers, you'll collaborate and learn from others. With your experience, you'll set the standard, introduce innovative ways of working, and identify new priorities. Whether leading and mentoring a team or acting as the technical More ❯
join a global financial services organisation. This is a true engineering role focused on building, scaling, and optimising Azure infrastructure - not just operational support. The Role As an SRE, you’ll design and engineer resilient cloud platforms, develop automation to reduce manual tasks, and ensure the reliability and scalability of critical systems. You’ll work closely with other … engineers to embed SRE principles across infrastructure and deployments, helping to drive cloud adoption and operational excellence. What You’ll Be Doing Engineering into Azure public cloud infrastructure – deployment, scaling, and optimisation. Building automation and tooling (Python preferred; PowerShell or Terraform also applicable). Supporting CI/CD pipelines and SRE tooling (Jenkins a nice-to-have, but not … an on-call rota (1 weekend in 12, with the option to volunteer for additional weekends for extra pay). Partnering with DevOps teams and helping engineers transition towards SRE best practice. What I'm Looking For Hands-on Azure expertise – infrastructure, deployment, and engineering. Strong automation skills – ideally Python, but PowerShell or Terraform also relevant. Background in DevOps with More ❯
warrington, cheshire, north west england, united kingdom
Iceberg
join a global financial services organisation. This is a true engineering role focused on building, scaling, and optimising Azure infrastructure - not just operational support. The Role As an SRE, you’ll design and engineer resilient cloud platforms, develop automation to reduce manual tasks, and ensure the reliability and scalability of critical systems. You’ll work closely with other … engineers to embed SRE principles across infrastructure and deployments, helping to drive cloud adoption and operational excellence. What You’ll Be Doing Engineering into Azure public cloud infrastructure – deployment, scaling, and optimisation. Building automation and tooling (Python preferred; PowerShell or Terraform also applicable). Supporting CI/CD pipelines and SRE tooling (Jenkins a nice-to-have, but not … an on-call rota (1 weekend in 12, with the option to volunteer for additional weekends for extra pay). Partnering with DevOps teams and helping engineers transition towards SRE best practice. What I'm Looking For Hands-on Azure expertise – infrastructure, deployment, and engineering. Strong automation skills – ideally Python, but PowerShell or Terraform also relevant. Background in DevOps with More ❯
bolton, greater manchester, north west england, united kingdom
Iceberg
join a global financial services organisation. This is a true engineering role focused on building, scaling, and optimising Azure infrastructure - not just operational support. The Role As an SRE, you’ll design and engineer resilient cloud platforms, develop automation to reduce manual tasks, and ensure the reliability and scalability of critical systems. You’ll work closely with other … engineers to embed SRE principles across infrastructure and deployments, helping to drive cloud adoption and operational excellence. What You’ll Be Doing Engineering into Azure public cloud infrastructure – deployment, scaling, and optimisation. Building automation and tooling (Python preferred; PowerShell or Terraform also applicable). Supporting CI/CD pipelines and SRE tooling (Jenkins a nice-to-have, but not … an on-call rota (1 weekend in 12, with the option to volunteer for additional weekends for extra pay). Partnering with DevOps teams and helping engineers transition towards SRE best practice. What I'm Looking For Hands-on Azure expertise – infrastructure, deployment, and engineering. Strong automation skills – ideally Python, but PowerShell or Terraform also relevant. Background in DevOps with More ❯
Position : SiteReliability Engineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime and … resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
bolton, greater manchester, north west england, united kingdom
Harrington Starr
Position : SiteReliability Engineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime and … resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
warrington, cheshire, north west england, united kingdom
Harrington Starr
Position : SiteReliability Engineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime and … resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Lorien
Junior SiteReliability Engineer Hybrid - Manchester x2 days a week Salary up to £45,000 + Bonus The Company: Lorien Global are supporting a growing business based in Manchester City Centre as they expand their Support Services team. With an exciting pipeline of work ahead, they're looking to hire an experienced Junior SiteReliability Engineer More ❯
Position : SiteReliability Engineer Manager Location : Financial Services Salary : Up to £72,000 + discretionary bonus Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. AWS certifications and software engineering experience More ❯
Liverpool, England, United Kingdom Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of sitereliabilityengineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
birkenhead, north west england, united kingdom Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of sitereliabilityengineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
warrington, cheshire, north west england, united kingdom Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of sitereliabilityengineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
old swan, north west england, united kingdom Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of sitereliabilityengineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
warrington, cheshire, north west england, united kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
bolton, greater manchester, north west england, united kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
We are hiring for SiteReliability Engineer (SRE) - Monitoring Focus, Senior Monitoring and Telemetry Specialist) Location : Knutsford - Hybrid - 2 to 3 days in Office Should have expertise in designing, implementing, and maintaining the telemetry and monitoring solutions that drive the health, performance, and reliability across diverse infrastructure Proven, hands-on experience in configuring, managing, and leveraging industry More ❯
looking for a DevOps Engineer to join our growing team. Day to Day You'll Be: Infrastructure & Operations: Participate in the design, implementation, and maintenance of our infrastructure, ensuring reliability, scalability, and security. Support, monitor, and enhance the live infrastructure and platform solutions, ensuring high availability and performance. Help plan and execute the integration of our current infrastructure into … analysis. Documentation & Best Practices: Ensure comprehensive documentation of infrastructure, systems, and processes to support onboarding, troubleshooting, and scalability. Promote and implement DevOps and SiteReliabilityEngineering (SRE) best practices across the organisation. Essential Skills & Experience: Technical Expertise: Strong Linux systems administration experience, including firewalls and hardening Expertise in Docker and container orchestration. Proficiency with Infrastructure as Code … Familiarity with GCP services such as Compute Engine, Kubernetes Engine (GKE), Cloud Storage, BigQuery, and IAM. Familiarity with configuration management and IT automation tools. Strong understanding of DevOps and SRE principles. Soft Skills: Self-motivated, highly organised, and capable of driving initiatives from concept to delivery. Excellent communication and stakeholder management skills. Desirable Skills: Experience with serverless infrastructure (e.g., AWS More ❯
Overview We are seeking a skilled MongoDB Engineer to join a dynamic infrastructure engineering team based in Knutsford. This role is focused on building and maintaining scalable, secure, and reliable infrastructure platforms that support critical applications and data systems. You will apply software engineering principles, automation, and incident response best practices to ensure operational excellence across technology platforms. … Key Responsibilities Design, develop, and maintain infrastructure solutions with a focus on performance, reliability, and scalability. Monitor system performance and proactively address incidents, vulnerabilities, and outages. Implement automation using scripting languages and configuration tools to streamline operations. Ensure secure configurations and protect infrastructure against cyber threats and unauthorized access. Collaborate with product managers, architects, and engineers to align infrastructure … automation. Familiarity with DevOps tools such as Git, JIRA, and CI/CD pipelines. Strong scripting skills in Python or Bash. Understanding of SiteReliabilityEngineering (SRE) practices and incident management. Knowledge of containerization and orchestration tools such as Kubernetes. Desirable Attributes Experience in financial services or regulated industries. Ability to work collaboratively across cross-functional teams. More ❯
Overview We are seeking a skilled MongoDB Engineer to join a dynamic infrastructure engineering team based in Knutsford. This role is focused on building and maintaining scalable, secure, and reliable infrastructure platforms that support critical applications and data systems. You will apply software engineering principles, automation, and incident response best practices to ensure operational excellence across technology platforms. … Key Responsibilities Design, develop, and maintain infrastructure solutions with a focus on performance, reliability, and scalability. Monitor system performance and proactively address incidents, vulnerabilities, and outages. Implement automation using scripting languages and configuration tools to streamline operations. Ensure secure configurations and protect infrastructure against cyber threats and unauthorized access. Collaborate with product managers, architects, and engineers to align infrastructure … automation. Familiarity with DevOps tools such as Git, JIRA, and CI/CD pipelines. Strong scripting skills in Python or Bash. Understanding of SiteReliabilityEngineering (SRE) practices and incident management. Knowledge of containerization and orchestration tools such as Kubernetes. Desirable Attributes Experience in financial services or regulated industries. Ability to work collaboratively across cross-functional teams. More ❯
report if all aspects of Production processing (Incidents, Problems, Service Level indicators etc.) using automated dashboards are performing at optimum levels according to business needs. Work closely with the Engineering/product team to ensure implementation of automation of the Production processes. Promote SiteReliabilityEngineering culture. Should be aware of Agile and DevOps principles. Communicate … procedures to ensure integrity of system. Should be open to learn new technologies based on project requirements. Preferably candidate who has worked in production support function. Requirements Bachelor of Engineering preferred. Good knowledge of ServiceNow tool and able to create dashboards/reports for management overview. Should have hands-on experience in creating reports in Microsoft excel. Capable of More ❯