Senior Site Reliability Engineer
Slough, Berkshire, UK
Hybrid/Remote Options
Prolific
In a world where foundational AI technologies are increasingly commoditized, it's the quality and diversity of human-generated data that truly differentiates products and models.

The role
As a Site Reliability Engineer, you will focus on ensuring that the Prolific platform is resilient, scalable and highly performant for our customers. You'll ensure stability and reliability … observability is at the right standard, and dive into incident remediation where needed in collaboration with service delivery and teams. You will work with cross-functional teams to embed SRE principles and upskill teams in key areas such as Kubernetes and observability.

What you'll bring to the role
5+ years with Google Cloud Platform, GKE, and the Kubernetes ecosystem with … teams in cloud architecture and Kubernetes. Improve observability and alerting systems across our application and infrastructure, ensuring proactive detection of system degradation. Collaborate with Engineering teams to foster an SRE culture, including contributing to defining SLOs, SLAs and error budgets. Design and implement automation strategies to ensure managed services remain up-to-date, secure, and performant. Lead and support …
Employment Type: Full-time