Site Reliability Engineer
- Hiring Organisation
- EQUALS
- Location
- Greater London, England, United Kingdom
Manage Cloudflare (WAF, bot management, DNS, firewall rules) - Make cost-conscious infrastructure decisions - right-sizing instances, storage tiering, optimizing spend Monitoring & Observability - Own the Datadog APM setup: tracing, alerting, dashboards, log management - Maintain and tune alert channels integrated with Slack - Reduce alert fatigue by tuning thresholds, suppressing false positives … Experience with Redis at scale: clustering, read replicas, failover handling - Solid understanding of container orchestration and deployment strategies - Experience with monitoring and observability platforms (Datadog preferred) - Comfort with incident response: you've been paged at 2am and know how to stay calm, diagnose, and fix - Familiarity with CI/ ...