Senior Site Reliability Engineer
Colchester, Essex, UK
Hybrid/Remote Options
TechNET IT Recruitment Ltd
with internal teams to improve observability, monitoring and alerting across services
Identify and implement operational improvements to existing monitoring, logging and incident response processes
Use scripting and automation (primarily Bash and Python) to reduce toil and streamline recurring tasks
Contribute to Infrastructure-as-Code practices using tools such as Ansible or Puppet

Required Experience & Skills
5+ years' experience in … MySQL operations, including replication concepts
Hands-on experience with Kafka and/or other distributed messaging systems
Experience with Kubernetes or similar container orchestration platforms
Practical scripting skills in Bash and/or Python for automation and tooling
Familiarity with IaC tools such as Ansible or Puppet
Good understanding of monitoring, alerting, logging and observability best practices
Excellent communication
Employment Type: Full-time