SiteEngineer III The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control … of all day-to-day maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
SiteEngineer III The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control More ❯
The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control of all day-to-day … maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead the planning and execution of … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a Site Reliability Engineer (SRE) to join their platform team and … company with a modern cloud-native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to More ❯
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a … Leading incident response, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). … Background in web applications and distributed systems. Desirable Skills: Knowledge of Microservices Architecture. Familiarity with Kanban. Experience with Puppet or Chef If you’re passionate about Site Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes More ❯
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote THIS IS AN AZURE FOCUSED ROLE, IF YOU APPLY AND DO NOT WORK EITHER SOLEY OR MAINLY ON AZURE YOU WILL NOT BE CONSIDERED. Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start Date … ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure … Leading incident response, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). More ❯
Azure Site reliability Engineer|6 month contract|Onsite 2/3 days per week|£650 per day InsideIR35 Opus RS are looking for a Senior Site Reliability Engineer with deep expertise in Azure cloud migration and a strong DevOps background to join our clients team. What We're Looking For Previous experience as a Site Reliability Engineer Strong skills in Terraform, GitHub, AKS, and networking (load balancing, Firewalls, routing). Proven track record in Agile delivery and DevOps practices. Extensive experience with Azure and cloud migration using frameworks like CAF and WAF. Ability to communicate effectively with technical and non-technical stakeholders. Familiarity with change control processes and performance monitoring. If you're More ❯
Role: Site Reliability Engineer Location: London or Manchester (2 days per week on-site + monthly team days rotating between locations) Duration: 6 months Rate: £675 per day (Inside IR35) Security Clearance: Must hold active SC clearance due to project timings Team Size: Core team of 10 within a wider programme of 60+ and expanding Experience Level … 4+ years About the Role We're looking for an experienced Site Reliability Engineer to support a major central government programme, ensuring the reliability, scalability, and performance of critical digital services. You'll help shape modern cloud infrastructure and enable engineering teams to deliver secure and resilient platforms. Essential Requirements Technical Skills AWS Services: Strong proficiency in AWS … and maintaining AWS-based infrastructure using Infrastructure as Code Designing and implementing CI/CD pipelines to support development and deployment Providing technical direction and guidance to engineers Embedding site reliability principles, including monitoring, alerting, and automation Supporting the scaling of a large and evolving central government programme More ❯
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion click apply for full job details More ❯
Role- Senior Site Reliability Engineer (SRE) Location - London (full onsite- 5 days every week) Perm up to 80K gross Minimum 12+ year profile are required PFB updated JD Core Competency, • Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin etc. • Efficiency in creating Dashboard for Infra/APM/E2E workflows. • Monitoring, logging, Alerting and Error budget , 99.99, % ) for More ❯
application, please feel free to note which pronouns you use (For example: she/her, he/him, they/them, etc). We are looking for an experienced engineer with strong Linux and system-level expertise who can operate autonomously in complex production environments. You must be able to independently troubleshoot incidents, lead and support post-incident service … recovery, and drive improvements to overall system stability, performance, and observability. We are looking for a hands-on Site Reliability Engineer (SRE) with a strong background in Linux infrastructure and third-party system operations. This role focuses on managing and optimizing large-scale environments (5,000+ hosts) running technologies like Kafka, Redis, and Kubernetes. The position does not More ❯
HQ or the wider global organisation, you'll be a part of collaborative, high-performing teams, creating cutting-edge software, platforms, and infrastructure. The Role Join us as a Site Reliability Engineer and help us build the future of data sovereignty! We're seeking an SRE passionate about creating high-performance, scalable, and reliable services for our production … implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the ACRA platform for high availability and fault tolerance. This includes ensuring resilience against Cloud Availability Zone outages and the ability to gracefully handle node failures. Guarantee 99.9% uptime More ❯
data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential to extend), starting ASAP. You ll be More ❯
data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They’re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential to extend), starting ASAP. You’ll be More ❯
With a strong culture of collaboration and technical excellence, the organisation continues to push the boundaries of low-latency infrastructure and reliable system design. The team is hiring a Site Reliability Engineer (London) to build, monitor, and optimise mission-critical trading systems. The role will focus on automation, system scalability, and incident response to maintain maximum uptime and … Solid experience with Linux Systems administration and troubleshooting. Hands-on experience with Kubernetes for container orchestration. Proficient in Python scripting for automation and system management. A mindset focused on site reliability and performance. Strong troubleshooting skills and a proactive approach to problem-solving. Benefits: Lucrative bonus scheme Salary: Up to £90,000 base salary More ❯
Security Engineer Static Night Shift (Heathrow Airport) Location: Heathrow Airport, West London Salary: £55,000 £65,000 base + Time & 1/3 Night Shift Premium + Overtime No On-Call We re seeking a confident and capable Installation Engineer with experience in electronic security systems to join a high-profile, night-shift-only project at Heathrow Airport. … role no servicing, no maintenance, and no call-out. Why This Role? Night shifts only (typically 20 00) Time and a third uplift on all night hours Static single-site no travel between locations No on-call rota and no weekend standby Long-term Heathrow-based project Strong team environment with a focus on quality and safety Role Responsibilities … Full installation of IP-based CCTV and access control systems Working to project plans and technical drawings Ensuring compliance with site protocols and health & safety standards Coordinating with supervisors and project managers on-site What You ll Need: Demonstrable installation experience with CCTV and access control systems ECS Card essential SSSTS essential Comfortable working night shifts on a More ❯