Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
of AWS or other cloud providers (e.g. GCP, Azure). Strong understanding of key security technologies and protocols such as TLS, OAuth and SPIFFE. Observability, alerting, metrics collection and visualisation (e.g. Prometheus, Grafana, Elasticsearch, Dynatrace). "Nice To Have" Skills and Experience: We would be even more impressed if you …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
the edge. Proficiency in Python, Docker, Linux systems, and scripting (Bash, Python). Strong expertise with infrastructure automation tools (Terraform, Ansible). Experience managing observability and monitoring systems, particularly Prometheus. Deep understanding of networking concepts and protocols. Responsibilities: Design, build, and maintain scalable and resilient infrastructure on the edge. Develop … as-code solutions using Terraform, Ansible, and scripting languages (Python, Bash). Deploy and manage containerized applications using Docker and related technologies. Ensure system observability by building and optimizing monitoring systems, particularly using Prometheus. Troubleshoot and optimize Linux-based systems (e.g., Red Hat, CentOS, Ubuntu). xAI's Grok is … technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. Building easily parsable dashboards. Designing and overseeing our on-call rotations. Improving our deployment process to increase reliability. Luminance is …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Gearset Limited
infrastructure using Terraform. Strong knowledge of CI/CD pipeline configuration and operation. Familiarity with .NET/C# applications and development environments. Experience implementing observability solutions and using them to gain insights into application and infrastructure performance. Benefits and perks (the stuff you'd expect!) This is a full time …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Promote Project
Secondary Responsibilities Assist with support and bug triage. Assist with CI/CD pipeline as necessary. Assist with E2E tests as necessary. Improve application observability with logging and automated alerting. Explain technical concepts to non-technical stakeholders. Guide and mentor other engineers of all levels with their professional growth and …
Cambridge, Cambridgeshire, UK Hybrid / WFH Options
Trust In SODA
Programme Manager – AI Ops & Observability Rollout Location: Hybrid (40% in-office minimum) | Cambridge Type: 6-month Contract Rate: £670 - £710 per day Inside IR35 A major enterprise is seeking an experienced Programme Manager to lead the organisation-wide rollout of a new AI Ops and Observability Platform. This strategic … resilience, reducing downtime, and enabling proactive incident management. You’ll drive end-to-end delivery, from roadmap ownership to stakeholder alignment, while shaping how observability is embedded into tools, workflows, and culture. This is a high-impact role requiring coordination across engineering, IT, and business teams. Key Responsibilities: Lead planning … and execution of the observability platform rollout Manage roadmap, risks, and dependencies across functions Oversee change management, communications, and adoption strategies Engage stakeholders at all levels to ensure alignment and delivery Track and report KPIs to demonstrate business value What You Bring: Proven experience in large-scale programme delivery or …