Observability Subject Matter Expert (SME) (Telford)
Smart Edge’s client is looking for an Observability Subject Matter Expert (SME) based in Telford, UK.
Experience Required:
- Observability Architecture & Strategy:
- Design and implement observability pipelines using AWS-native and third-party tools.
- Define telemetry standards (metrics, logs, traces) across microservices, APIs, and data pipelines.
- Establish SLIs/SLOs and integrate them into service health dashboards.
- AWS Workload Monitoring:
- Implement observability for AWS Connect (contact flows, agent metrics, call quality).
- Monitor AWS Data Services (Glue, Redshift, Athena, S3, Lake Formation) for performance, throughput, and data lineage.
- Integrate AWS Integration Services (API Gateway, EventBridge, Step Functions, Lambda) with distributed tracing and structured logging.
- Tooling & Automation:
- Deploy and manage observability tools: CloudWatch, X-Ray, OpenTelemetry, Prometheus, Grafana, Datadog, Splunk, ELK.
- Automate alerting, anomaly detection, and incident correlation using AI/ML-based tools.
- Integrate observability into CI/CD pipelines and Infrastructure-as-Code (IaC) workflows.
- Incident Management & RCA:
- Lead real-time diagnostics during major incidents using telemetry data.
- Conduct post-incident reviews with detailed root cause analysis and observability insights.
- Collaboration & Governance:
- Work closely with DevOps, Security, and Application teams to enforce observability standards.
- Ensure compliance with data governance, retention, and security policies for telemetry data.
- 7+ years in observability engineering.
- Deep expertise in AWS services, especially AWS Connect, Glue, Lambda, API Gateway, and S3, as well as infrastructure and networking.
- Strong hands-on experience with observability stacks such as Dynatrace, OpenTelemetry, Prometheus, Grafana, Datadog, Splunk, ELK, and CloudWatch/X-Ray.
- Proficient in scripting (Python, Bash) and IaC (Terraform, CloudFormation).
- Experience with monitoring enterprise platforms like Pega and Contact Center systems.
- Solid understanding of distributed systems, networking, and application performance tuning.
If this sounds like a role you would be interested in, or if you know someone in this field, connect with me or email me at sharshini.a@smartedgesolutions.co.uk
Alternatively, you can call me on Tel: +44(0)203 500 2108.
- Company
- Smartedge Solutions
- Location
- Telford, Shropshire, UK