Site Reliability Engineer — AWS & Observability
- Hiring Organisation
- Jobleads-UK
- Location
- Greater London, England, United Kingdom
aspects of GitHub workflows and deployment pipelines Incident response – Lead incidents, run blameless post-mortems, and drive continuous improvement Enable developers – Mentor teams on SRE and observability practices, helping them quickly understand and resolve issues Leverage AI tooling – Use AI‐assisted development tools (e.g. GitHub Copilot) to accelerate infrastructure work … explore AI‐driven approaches to incident detection, root cause analysis, and remediation What We're Looking For Essential 3+ years in an SRE, Platform, or DevOps engineering role AWS services: CloudWatch, X-Ray, Lambda, API Gateway, S3, SQS, Aurora PostgreSQL, DynamoDB, CloudFront, VPC, IAM, Security Groups Python for scripting, tooling ...