Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a … permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £70,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a … permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software … ensure high availability and performance. Collaborate with product engineering teams to design/build fit-for-purpose and observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role and having previously worked with .NET, Azure and C# technologies. Proficiency in C# language – alongside knowledge of scripting languages like Bash, Python or PowerShell … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.
Prestigious opportunity with a Global Investment Giant for a Site Reliability Engineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Working closely with cross-functional teams across the globe, including business stakeholders, product managers, and software engineers, you will … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our Site Reliability Engineering Manager you will: Lead, coach, and develop a high-performing SRE team. … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 - £75,000 per annum + Excellent Benefits We're working with an innovative software company that's scaling its platform to support rapid customer growth and product expansion. They're looking for a Site Reliability Engineer (SRE) to join their platform team and … performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incident response, post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What We're Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations … engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the
Cambridge, Cambridgeshire, East Anglia, United Kingdom
RedTech Recruitment
are already renowned as having game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for someone to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge Salary: £32,000 - £60,000 + excellent benefits (£32,000 for a new Graduate) Requirements for Site Reliability … of a role involving lots of problem solving, identifying the root causes of issues. Good logical reasoning. Responsibilities for Site Reliability Engineer Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will be diagnosing issues within complex systems and identifying … emailing (if this email address has been removed by the job-board, full details for contact are available on our website). Keywords: Site Reliability Engineer/SRE/DevOps/Software Engineering/Software Development/Engineering/Physics/Astrophysics/Python/Computer science/Cloud/Mathematics/AWS/Azure/
Graduate DevOps Engineer/SRE All top graduates with tech-related degrees should read this. If you have a passion for building things, love constantly solving interesting challenges and also enjoy some coding, then we would encourage you to explore a career in DevOps & Site Reliability Engineering (if you're not already). The demand … for this skill set is high, the role is interesting and varied and it is quite rare to see entry-level DevOps or SRE positions advertised. If you're already an experienced DevOps Engineer or Site Reliability Engineer we also really want to hear from you, as we are excited to be able to offer this role working … days a week in office) Salary: £35,000 - £70,000 per annum + excellent benefits (£35,000 for a new Graduate, more depending on experience) Requirements for Graduate DevOps Engineer/SRE: This company hires some of the very brightest engineers and is looking for a 2.1 or 1st class honours degree from a leading international University in a STEM subject. Minimum
Knutsford, Cheshire, North West, United Kingdom Hybrid / WFH Options
Anson Mccade
on database performance, automation, and scalability. You'll play a key role in improving the availability and reliability of mission-critical platforms, combining traditional database expertise with modern SRE practices - automation, observability, and proactive problem solving. This is an excellent opportunity for someone with strong Microsoft SQL Server knowledge who enjoys using engineering techniques to enhance resilience, reduce … tools to reduce manual processes, increase efficiency, and improve reliability. Optimise system performance, address bottlenecks, and apply performance tuning best practices. Collaborate with development and platform teams to embed SRE principles and improve service reliability. Stay current with emerging technologies, contributing to continuous improvement and technical excellence. Skills and Experience Essential: Proven technical expertise with Microsoft SQL Server … enterprise scale. Strong experience with automation and configuration management tools, ideally Chef or Ansible. Skilled in scripting languages such as PowerShell for automation and migration tasks. Understanding of SRE concepts - SLIs, SLOs, incident response, and reliability metrics. Experience supporting and optimising large-scale, high-availability environments. Desirable: Exposure to database standardisation and automation at scale. Familiarity with observability
Head of Performance & Reliability Engineering Full-Time – Hybrid (3 days in Cambridgeshire) Up to £95,000 + Bonus This is an exceptional opportunity to join a major organisation at a pivotal stage in their digital transformation. As Head of Performance & Reliability Engineering you’ll shape strategy, lead performance testing and chaos engineering initiatives, and embed … reliability best practices across engineering, DevOps … and infrastructure teams. This is a senior, strategic leadership role focused on system excellence, observability, and continuous improvement. Ideal Candidate: Proven experience leading Performance Engineering, Reliability, or SRE functions. Deep expertise in performance testing methodologies (load, stress, spike, soak). Strong hands-on background with LoadRunner and Dynatrace (plus tools such as NeoLoad, k6, or JMeter). Skilled in chaos
Site Reliability Engineer Central London (3 days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package We are working with an exciting technology company that are looking to bring in a Site Reliability Engineer to help scale their cloud infrastructure and DevOps capability. They've built a high-performing … engineering team and are now investing further into the platform side of things as demand grows. Think modern, cloud-native architecture, and a real emphasis on automation, scalability, and developer enablement. You'll have the autonomy to make technical decisions and help shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services … days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package Click APPLY NOW to be considered for this position! AWS, SRE, Cloud, Kubernetes, EKS, Terraform, CI/CD, Automation etc.
Lead Site Reliability Engineer Are you ready to take your career to the next level in a role that’s critical to the reliability, scalability, and performance of cutting-edge systems? We’re on the lookout for a Lead Site Reliability Engineer to bring innovation, leadership, and technical excellence to our growing team. What You … and uphold best security practices. Contribute to quality systems through deviation management, CAPA follow-up, and root cause investigations. What We’re Looking For: 5+ years of experience in Site Reliability Engineering or a related field. Hands-on experience with Biosafety and GMP environments. Strong foundation in Lean Six Sigma principles. Proven problem-solving skills with a … knack for performance tuning. Effective communicator and team player. Formal education in an engineering-related discipline.
Senior Site Reliability Engineer At UnlikelyAI, we are building the future of AI: one that is reliable, accurate and transparent. Our neurosymbolic technology harnesses the power of LLMs and generative AI, and combines it with classical symbolic technology to produce hallucination-resistant artificial intelligence for high-trust applications. To support our rapidly increasing commercial momentum, we're looking … for an experienced and pragmatic site reliability engineer to join our exceptional team. This role is ideal for someone who has successfully scaled systems from prototype to production and enjoys working in cross-functional teams to champion cloud-native engineering. We are looking for someone with the experience and expertise to define, and own, our approach to building … for reliability and security as first-class citizens. This is a strategically important role for our technology team, as we rapidly approach entering full production in multiple projects. You'll work on a range of customer-facing and internal infrastructure projects, applying your engineering skills to solve complex reliability and scalability challenges. Your ability to build robust
do at CMC Markets, and staying true to that has been pivotal to our success. CMC Markets is seeking an experienced and proactive Site Reliability Engineering (SRE) Manager to establish and lead a new SRE function within the IT Production department. This is a key leadership role responsible for defining the SRE strategy, implementing best practices, and … resilience across the trading platforms. Ensure new systems are aligned with best practices. Drive improvements and alignment in observability and monitoring tools, improving MTTD and MTTR. Produce analysis on SRE function performance. Provide guidance, recommendations and hands-on support to teams, promoting SRE best practices. Develop and maintain a roadmap for continuous improvement of support and observability. Maintain personal/… role. Read and comply with CMC policies and procedures as they relate to your employment. Complete all mandatory compliance training. KEY SKILLS AND EXPERIENCE 2 years' experience in an SRE function or similar in a hybrid cloud/on-prem environment. 7 years' experience in IT operational roles working with highly reliable systems. Experience in modern development methodologies and languages. Proficiency
Security Cleared Site Reliability Engineer - Contract Outside IR35 - 3 months+ - Hybrid We are seeking a Lead Operations/Site Reliability Engineer to take ownership of day-to-day operations across a legacy technology estate. The role will focus on maintaining service stability, ensuring operational readiness, and leading the response to incidents and outages. The Lead Operations …/Site Reliability Engineer will play a pivotal role during the transition phase by embedding operational standards, improving monitoring and support processes, and enabling knowledge transfer into ongoing service delivery teams. Key Responsibilities: Lead daily operational support of legacy systems, ensuring availability, performance, and resilience. Manage incident, problem, and change activities in line with ITIL and enterprise service
Must hold UKIC DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your
Job Summary This role is to design, build, and scale enterprise cloud platforms with a strong focus on automation, reliability, and developer experience. As part of the Cloud Infrastructure & DevOps team, you will build multi-cloud infrastructure that powers hundreds of production services, including critical Salesforce DevOps pipelines. You’ll partner closely with development, security, and operations teams to … Drive infrastructure compliance, DevSecOps, and policy-as-code practices. What we expect of you: Minimum 5 years of experience in Platform Engineering, Site Reliability Engineering (SRE), or DevOps roles supporting cloud-native enterprise environments. Proficient in Microsoft Azure and AWS platforms with hands-on experience in Kubernetes (AKS/EKS), Helm charts, and service mesh technologies … or HashiCorp Terraform Associate are advantageous. Strong interpersonal skills including clear communication, collaboration across teams, adaptability in fast-paced environments, and a proactive mindset with a focus on reliability, performance, and developer enablement.
Site Reliability Engineer (PostgreSQL) £60,000 GBP Hybrid Working Location: Manchester, North West - United Kingdom Type: Permanent The PostgreSQL Site Reliability Engineer (SRE) plays a key role in ensuring the reliability, availability, and scalability of critical systems and platforms. This position applies advanced software engineering techniques, automation, and incident response best practices to … Stay up to date with emerging technologies and industry trends, contributing to internal technology communities and promoting a culture of continuous improvement and technical excellence. Role Overview The PostgreSQL SRE is responsible for monitoring and maintaining critical technology infrastructure, resolving complex technical issues, and minimising operational disruptions. Acting as a technical leader, this individual shapes the direction of database administration … Key Skills and Experience Proven experience as a Database Administrator (DBA) with strong expertise in PostgreSQL, and exposure to Oracle or MS SQL databases. Demonstrated success implementing and leading SRE practices across large-scale or complex environments. Hands-on experience with containers and Kubernetes for scalable infrastructure management. Proficiency with DevOps tools such as Git, JIRA, Ansible, and database CI