Remote Distributed Systems Jobs in Cambridge

1 of 1 Remote Distributed Systems Jobs in Cambridge

Site Reliability Engineer

Cambridge, Cambridgeshire, United Kingdom
Hybrid / WFH Options
AI Tech Suite
… experience in Site Reliability Engineering or a similar role. 3+ years of experience in a technical leadership or management position. Deep understanding of Linux systems, containerization, virtualization, and networking technologies. Strong background in managing and monitoring large-scale distributed systems and bare-metal fleets. Expertise in infrastructure … our infrastructure. Collaborate with cross-functional teams to align SRE initiatives with broader organizational goals. Establish and maintain SLIs, SLOs, and SLAs for critical systems and services. Drive the adoption of best practices in automation, monitoring, and incident response. Software Engineer, Site Reliability Engineer. Fireworks AI offers a fast … or equivalent experience. Experience Requirements: 5+ years of experience building and managing infrastructure at scale, particularly on the edge. Proficiency in Python, Docker, Linux systems, and scripting (Bash, Python). Strong expertise with infrastructure automation tools (Terraform, Ansible). Experience managing observability and monitoring systems, particularly Prometheus. Deep …
Employment Type: Permanent
Salary: GBP Annual