11 of 11 Observability Jobs in Cambridge

Senior Backend Engineer

Hiring Organisation
Orbis Group
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
with low latency. Deploy and operate services on Kubernetes and Docker, leveraging AWS infrastructure such as EC2, S3, Lambda, and RDS. Implement monitoring and observability using tools like Grafana and Prometheus to track system performance. Collaborate with product, frontend, and analytics teams to deliver features that make a tangible impact ...

Platform Engineer

Hiring Organisation
SoCode Recruitment
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
removing single points of failure and enhancing autoscaling, high availability and managed service usage • Collaborate with SRE, Security and Engineering teams to strengthen observability, monitoring and alerting using Prometheus, Grafana and CloudWatch • Work closely with Security to embed best practice for IAM, secrets management, WAF and cloud posture management • Optimise … Kubernetes operations on AWS including cluster scaling, deployment automation and monitoring • Solid background in Linux administration, networking and cloud security principles • Familiarity with observability tools such as Prometheus, Grafana and Loki along with structured alerting practices • Experience with database migrations, high availability configurations, backups and disaster recovery • Strong scripting ...

Senior Software Engineer

Hiring Organisation
Aveni
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
with React. Solid understanding of cloud-native engineering on AWS. Experience with microservices, messaging patterns and distributed systems. A commitment to clean code, testing, observability and operational excellence. A proactive and motivated mindset: someone who wants to build, ship and iterate quickly. Interest in AI-powered products and a drive ...

Cloud Platform Engineer - AWS, Degree, Cloud, Linux - Cambridge

Hiring Organisation
Adecco
Location
Cambridge, Cambridgeshire, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £100,000 per annum
Kubernetes operations on AWS (EKS), including cluster scaling and deployment automation. Proficiency in Linux administration, networking fundamentals, and cloud security principles. Familiarity with observability stacks such as Prometheus, Grafana, and Loki, with structured alerting practices. Knowledge of database operations, including migrations, high availability, backups, and disaster recovery strategies. Skilled … platform resilience by improving autoscaling, high availability, and eliminating single points of failure. Work closely with SRE and Security teams to enhance monitoring and observability through Prometheus, Grafana, and CloudWatch. Embed security best practices into every layer of the platform, covering IAM, secrets management, WAF, and compliance. Drive cost efficiency ...

Senior Engineer - Developer Experience (DevEx)

Hiring Organisation
Complexio
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
runners, caching, artifact storage). Maintain stability, scalability, and cost-effectiveness of pipelines. Build and maintain systems for our monorepo. Ensure CI/CD observability, with metrics flowing into Datadog/Slack. Pipeline Instrumentation & Optimisation: Analyse pipelines for inefficiencies (e.g., flaky tests, redundant steps, lack of caching). Recommend … SDLC practices and developer productivity tooling. Hands-on experience with infrastructure automation (e.g., Docker, Kubernetes, IaC with Terraform, Ansible or Pulumi). Familiarity with observability & monitoring (Datadog, Prometheus, or similar). Experience managing or improving monorepo build systems. Strong ability to measure developer productivity gaps and define KPIs. Experience ...

Technical Lead

Hiring Organisation
Cambridge University Press and Assessment
Location
Cambridge, Cambridgeshire, United Kingdom
Employment Type
Permanent
Salary
GBP 51,400 - 68,800 Annual
designing scalable, resilient architectures using approaches like microservices, serverless, and containerisation, while communicating design rationale to diverse stakeholders for buy-in and feedback. Drive Observability: Implement robust observability frameworks to ensure system performance, reliability, and proactive issue resolution, fostering a team culture of shared accountability and continuous improvement. Prepare … with business impact. On the technical side, you'll have proficiency in areas like microservices, serverless, containerisation, and building web applications, with experience in observability, security standards, Infrastructure as Code, and CI/CD practices. We value transferable skills over specific tool expertise, as technical depth can be developed through ...

Principal Software Engineer (DevOps)

Hiring Organisation
Oracle
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
common attack/defense patterns. Advanced networking knowledge: TCP/IP, IPv4/IPv6, BGP, routing policy; DNS fundamentals. Demonstrated operational excellence and observability practices (metrics, tracing, alerting). Preferred qualifications: Expertise with anycast routing, global traffic steering, and multi-region service readiness. Experience with SDN, programmable data planes … incident learning. Drive automation at scale: CI/CD strategy, test frameworks, progressive delivery (canary/blue-green), and infrastructure-as-code. Establish robust observability (metrics, logs, traces) and capacity/scale models for high-throughput, highly available services. Lead threat modeling, architecture reviews, and audit readiness for Tier ...

Senior DevOps Engineer - Kubernetes

Hiring Organisation
AVM Consulting Inc
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
security tools to enforce safe, compliant workloads. Deployment Enablement: Enhance Helm charts, Kustomize workflows, and GitOps processes to support fast, safe, and reliable deployments. Observability: Own the integration and tuning of observability stacks (e.g., Prometheus, Grafana, Loki) for visibility into cluster and application health. Resilience & Recovery: Support fault-tolerant architectures ...

Senior Software Engineer

Hiring Organisation
Oracle
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
network security services and related attack/defense patterns. Solid networking knowledge: TCP/IP, IPv4/IPv6, BGP fundamentals; DNS/DHCP understanding. Observability experience (metrics, tracing, alerting) and operational excellence mindset. Preferred qualifications: Experience with anycast routing, traffic steering, and multi-region service readiness. Exposure to SDN, programmable … post-incident improvements. Build automation-first workflows: CI/CD pipelines, test frameworks, canary/blue-green releases, and infrastructure-as-code. Create robust observability (metrics, logs, traces) and capacity/scale modeling for high-throughput, high-availability systems. Partner with product, SRE, and network engineering to deliver roadmap features ...

Technical Architect

Hiring Organisation
Inara
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
security, and product teams to turn business requirements into robust architectural solutions and clear technical roadmaps. Drive standards around cloud governance, infrastructure-as-code, observability, cost optimisation, and high availability. Provide hands-on technical leadership (code reviews, technical oversight, solution design) while mentoring teams and supporting delivery. What … Excellent knowledge of Python, Terraform, and modern automation patterns. Experience designing scalable, secure, cloud-native platforms. A solid DevOps mindset: CI/CD, IaC, observability, reliability. Ability to influence technical direction while remaining hands-on. Strong communication skills and confidence working with engineering, product, and security stakeholders ...

Platform Engineer

Hiring Organisation
NJF Global Holdings Ltd
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
join the Platform Engineering team, which designs, builds, and operates the firm's global production trading infrastructure, from hardware and Linux systems to Kubernetes, observability, and build platforms. They work with a mix of open-source and in-house technologies to solve scaling, reliability, and performance problems in a highly … with a Platform Engineer who enjoys building highly scalable systems, adopting new technologies, and debugging complex production issues. Systems and platforms owned: in-house observability platform (ClickHouse, Redpanda, Rust); firm-wide build and distribution systems; Linux systems engineering for production trading; on-prem Kubernetes clusters; hardware automation and operational tooling ...