Remote Site Reliability Engineer Jobs in Slough

2 of 2 Remote Site Reliability Engineer Jobs in Slough

Senior Site Reliability Engineer

Slough, Berkshire, UK
Hybrid/Remote Options
Prolific
a world where foundational AI technologies are increasingly commoditized, it's the quality and diversity of human-generated data that truly differentiates products and models. The role: As a Site Reliability Engineer, you will focus on ensuring that the Prolific platform is resilient, scalable and highly performant for our customers. You'll ensure stability and reliability … observability is at the right standard, and dive into incident remediation where needed in collaboration with service delivery and teams. You will work with cross-functional teams to embed SRE principles and upskill teams in key areas such as Kubernetes and observability. What you'll bring to the role: 5+ years with Google Cloud Platform, GKE, and the Kubernetes ecosystem with … teams in cloud architecture and Kubernetes. Improve observability and alerting systems across our application and infrastructure, ensuring proactive detection of system degradation. Collaborate with Engineering teams to foster an SRE culture, including contributing to defining SLOs, SLAs and error budgets. Design and implement automation strategies to ensure managed services remain up-to-date, secure, and performant. Lead and support …
Employment Type: Full-time

Senior Site Reliability Engineer

Slough, Berkshire, UK
Hybrid/Remote Options
TechNET IT Recruitment Ltd
Senior Linux SRE. Outside IR35, 12-month contract initially. Fully remote role across UK/Europe. Our client is a consumer-facing tech business looking for a Senior SRE with a strong background in Linux infrastructure and third-party system operations. You'll be responsible for running and optimising large-scale production environments (5,000+ hosts) built … on technologies such as Kafka, Redis, Kubernetes and MySQL. This is a hands-on, systems-level position focused on reliability, scalability, performance and troubleshooting. You'll work alongside experienced engineers, operating with a high degree of autonomy to keep critical systems healthy, resilient, and observable. Key Responsibilities: Manage, configure and maintain Linux systems (CentOS, Rocky, RHEL or similar distributions … reduce toil and streamline recurring tasks. Contribute to Infrastructure-as-Code practices using tools such as Ansible or Puppet. Required Experience & Skills: 5+ years' experience in Linux system administration, SRE, Infrastructure or Platform Engineering roles. Proven experience operating large-scale infrastructure (thousands of hosts/distributed systems). Strong troubleshooting and performance-tuning skills at the infrastructure and OS level. Solid …
Employment Type: Full-time