SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliabilityEngineer (SRE … on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
SiteReliabilityEngineer (SRE) - eDV Cleared Location: London (On-site) Salary: Up to £75,000 + Clearance Bonus + Company Bonus Clearance: eDV (Enhanced Developed Vetting) required Are you an experienced SiteReliabilityEngineer (SRE) with active eDV Clearance Do you want to work on mission-critical systems that directly support UK National … brightest minds in the industry, ensuring the reliability, scalability and performance of complex, high-assurance systems that protect the nation. The Role: As a key member of the SRE team, you'll design, build and maintain reliable infrastructure and automation solutions to keep vital services running smoothly. You'll drive continuous improvement across monitoring, deployment, and incident response for … performance bonus . Opportunity to work on high-impact, national security projects . Career development within one of the UK's most respected secure consultancies. If you're an SRE with eDV clearance looking to make a real impact in a secure and rewarding environment, we'd love to hear from you. Apply now or reach out directly to Dominic More ❯
Role- Senior SiteReliabilityEngineer (SRE) Location - London (full onsite- 5 days every week) Perm up to 80K gross Minimum 12+ year profile are required PFB updated JD Core Competency, • Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin etc. • Efficiency in creating Dashboard for Infra/APM/E2E workflows. • Monitoring, logging, Alerting and Error budget , 99.99 More ❯
application, please feel free to note which pronouns you use (For example: she/her, he/him, they/them, etc). We are looking for an experienced engineer with strong Linux and system-level expertise who can operate autonomously in complex production environments. You must be able to independently troubleshoot incidents, lead … and support post-incident service recovery, and drive improvements to overall system stability, performance, and observability. We are looking for a hands-on SiteReliabilityEngineer (SRE) with a strong background in Linux infrastructure and third-party system operations. This role focuses on managing and optimizing large-scale environments (5,000+ hosts) running technologies like Kafka, Redis … and Kubernetes. The position does not involve application development but requires deep operational expertise and solid troubleshooting skills. Qualifications 5+ years of experience in Linux system administration or SRE roles Proven experience managing large-scale infrastructure environments Strong troubleshooting and performance tuning skills at the infrastructure level Basic scripting/automation experience (Bash, Python) Familiarity with IaC tools (e.g., Ansible More ❯
Strong expertise in implementing SiteReliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed More ❯