Remote Prometheus Jobs in Cambridge

14 of 14 Remote Prometheus Jobs in Cambridge

Senior DevOps Engineer

Cambridge, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Arm Limited
cloud-native orchestration patterns. Proven track record building dashboards and visualizations across platforms such as Grafana, Datadog, and AWS. Experience with instrumentation tools like Prometheus and managing time-series stores such as Graphite and VictoriaMetrics. Solid understanding of networking, security, and compliance in cloud environments. Excellent written and verbal communication …
Employment Type: Permanent
Salary: GBP Annual

Tools Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
CATCHES
skills and a track record of cross-team collaboration. Nice to have: Kubernetes expertise (GKE/AKS/EKS) and container-native observability stacks (Prometheus/Grafana). NoSQL experience (Firestore, Cosmos DB, DynamoDB, MongoDB). Experience with game-backend scales, real-time services or hybrid cloud/bare-metal …

Senior Software Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Beazley Security
and cloud environments. Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, CircleCI). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong problem-solving and analytical skills. Excellent communication and collaboration skills. Experience with version control systems (e.g., Git). Experience working …

Senior Backend Developer - VoD & Live Stream

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
developrec
message queues (Kafka, RabbitMQ). Strong DevOps experience, including: CI/CD pipelines (GitHub Actions). Containerization technologies (Docker, Kubernetes). Monitoring & logging tools (Prometheus, Grafana, ELK stack). Strong problem-solving and debugging skills. Excellent communication and collaboration abilities. Experience working in Agile development environments. Fluent written & spoken English. …

Senior DevOps Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Growing Start up
/CD pipelines with GitLab and GitHub Actions. Containerising with Docker and applying best practices for security and performance. Monitoring and alerting using Datadog, Prometheus, and Grafana. Debugging complex systems using tools like strace, dtrace, and beyond. Supporting a tech stack that includes Rust, Python, Go, C++, Java, and more …

Principal Solution Architect

Cambridge, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Arm Limited
GCP, Azure). Strong understanding of key security technologies and protocols such as TLS, OAuth and SPIFFE. Observability, alerting, metrics collection and visualisation (e.g. Prometheus, Grafana, Elasticsearch, Dynatrace). "Nice To Have" Skills and Experience: We would be even more impressed if you are passionate about the following: Cluster processes …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Durlston Partners
is a plus), infrastructure-as-code, and CI/CD tooling. Strong scripting and automation experience in Python and Bash. Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF). Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security. Curious, persistent, and comfortable experimenting at …

Senior Software Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Ocho
data engineering tools such as Airflow, Pandas, or Spark. Exposure to serverless architectures using AWS Lambda. Familiarity with monitoring and logging tools (e.g. CloudWatch, Prometheus). Previous experience working in regulated or high-availability environments. Location & Flexibility: This role can be fully remote, with optional visits to a UK-based office …

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLMs, Startup, UK Remote

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
WMtech
GenAI, LLMs, and multimodal systems. Architecture: Microservices, RESTful APIs, async programming. Infrastructure: Docker, Terraform, GitHub Actions, GCP (preferred). Datastores: MongoDB, Redis. Monitoring/Tooling: Prometheus, Grafana, Sentry. The role is remote with occasional travel. Ready to lead and build with purpose? If you're excited by the idea of applying …

Senior HPC Support Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Nscale
HPC container runtimes (e.g., Singularity, Apptainer). Exposure to provisioning and automation tools (e.g., Ansible, PXE, Terraform). Experience with monitoring tools such as Prometheus, Grafana, and DCGM. Understanding of GPU/accelerator toolchains like CUDA or ROCm. A proactive, customer-first mindset with strong communication skills. Ability to work …

Senior Backend Engineer (Go)

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Few&Far
and observability tools. Bonus points for: contributions to open-source projects; contributions to an AI product. ⚙️ Tech Stack: Golang, GCP, microservices, Kubernetes, Kafka, MongoDB, Prometheus. If scalability, security, databases and performance are your thing, and you are looking for high ownership and impact, this role is for you! Please apply with an up …

Site Reliability Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Harrington Starr
infrastructure. They’re looking to bring on a Site Reliability Engineer with deep experience in observability. If you’ve worked with tools like Prometheus in AWS, supported development teams with tracing and performance insights, and thrive in a high-scale, distributed environment - this could be a great next step. … What You’ll Be Doing: Managing and improving observability tools like Prometheus, Grafana, and CloudWatch. Helping product teams with tracing and monitoring to improve performance and reliability. Defining and improving SLIs/SLOs, automating tasks, and reducing operational noise. Working with AWS (EKS, EC2, Lambda, RDS), Terraform, and CI/… CD tools. What They’re Looking For: Experience in SRE or DevOps roles in a production environment. Strong knowledge of observability tools, especially Prometheus in AWS. Experience with tracing, metrics, and logs to support development teams. Skills in Python or Go, and a good understanding of AWS and Kubernetes. What …

Site Reliability Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Uniting Cloud
and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). … Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve worked with Azure, GCP, or …

Site Reliability Engineer

Cambridge, East Anglia, United Kingdom
Hybrid / WFH Options
Stealth iT Consulting
and Service Level Objectives (SLOs), ensuring reliability and performance. An understanding of Microservices & container orchestration. Strong Observability & Monitoring experience (preferably tools such as Dynatrace, Prometheus or OpenTelemetry). Experience delivering DevOps/SRE Best Practices and cost optimisation proposals. Experience in Multi-Cloud, Security & Governance for Cloud Engineering and Operations would … be desired. Key Responsibilities: Apply SRE principles effectively (experience with Dynatrace, Prometheus, and OpenTelemetry is a bonus). Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in collaboration with development teams. Develop dashboards and configure alerts to monitor system health in real time. Enhance Kubernetes …