|
14 of 14 Remote Prometheus Jobs in Cardiff
cardiff, United Kingdom Hybrid / WFH Options Spectrum IT Recruitment
CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems More ❯
cardiff, united kingdom Hybrid / WFH Options Inspirec
Orchestrate and manage containerized applications using Docker, supporting streamlined deployment and environment consistency across development and production. Implement comprehensive monitoring and alerting solutions with Prometheus, Grafana, and AlertManager to proactively identify and resolve system performance issues. Champion DevOps best practices in automation, security, and agile delivery to drive continuous improvement More ❯
cardiff, United Kingdom Hybrid / WFH Options Trireme
cloud-native deployment strategies. Hands-on with AWS, GCP, and Azure for compute, networking, and storage configurations. Familiarity with monitoring/logging tools (e.g., Prometheus, Grafana, ELK stack). Trading Systems & Finance: Solid understanding of trading infrastructure, latency optimization, execution systems, and market data feeds. Experience working in or with More ❯
cardiff, United Kingdom Hybrid / WFH Options Beazley Security
and cloud environments. Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, CircleCI). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong problem-solving and analytical skills. Excellent communication and collaboration skills. Experience with version control systems (e.g., Git). Experience working More ❯
cardiff, United Kingdom Hybrid / WFH Options LHH
or highly regulated sectors Familiarity with Apache Kafka, Spark, or Hadoop Experience with Docker and Kubernetes Use of monitoring/alerting tools such as Prometheus, Grafana, or ELK Understanding of machine learning algorithms and data science workflows Proven ability to deliver end-to-end data solutions Knowledge of Terraform, Ansible More ❯
cardiff, United Kingdom Hybrid / WFH Options Durlston Partners
is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks ( Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at More ❯
cardiff, United Kingdom Hybrid / WFH Options Ocho
data engineering tools such as Airflow, Pandas, or Spark Exposure to serverless architectures using AWS Lambda Familiarity with monitoring and logging tools (e.g. CloudWatch, Prometheus) Previous experience working in regulated or high-availability environments Location & Flexibility: This role can be fully remote, with optional visits to a UK-based office More ❯
cardiff, United Kingdom Hybrid / WFH Options WMtech
GenAI, LLMs, and multimodal systems Architecture: Microservices, RESTful APIs, async programming Infrastructure: Docker, Terraform, GitHub Actions, GCP (preferred) Datastores: MongoDB, Redis Monitoring/Tooling: Prometheus, Grafana, Sentry The role is remote with occasional travel Ready to lead and build with purpose? If you're excited by the idea of applying More ❯
cardiff, United Kingdom Hybrid / WFH Options Nscale
HPC container runtimes (e.g., Singularity, Apptainer). Exposure to provisioning and automation tools (e.g., Ansible, PXE, Terraform). Experience with monitoring tools such as Prometheus, Grafana, and DCGM. Understanding of GPU/accelerator toolchains like CUDA or ROCm. A proactive, customer-first mindset with strong communication skills. Ability to work More ❯
cardiff, United Kingdom Hybrid / WFH Options Few&Far
and observability tools Bonus Points For Contributions to open-source projects Contributions to an AI product ⚙️ Tech Stack: Golang, GCP, microservices, Kubernetes, Kafka, MongoDB, Prometheus If scalability, security, databases and performance is your thing, looking for high ownership and impact - this role is for you! Please apply with an up More ❯
cardiff, United Kingdom Hybrid / WFH Options Prism Digital
version, and manage infrastructure as code across multiple environments. GitHub Actions & OIDC – build and maintain automated CI/CD pipelines with secure authentication. Datadog, Prometheus or similar – implement logging, metrics, and alerting for robust observability – the interim CTO is keen to hear your recommendation(s) on tooling and implementation strategy. More ❯
cardiff, United Kingdom Hybrid / WFH Options Harrington Starr
infrastructure. They’re looking to bring on a Site Reliability Engineer with deep experience in observability . If you’ve worked with tools like Prometheus in AWS , supported development teams with tracing and performance insights , and thrive in a high-scale, distributed environment - this could be a great next step. … What You’ll Be Doing: Managing and improving observability tools like Prometheus, Grafana, and CloudWatch Helping product teams with tracing and monitoring to improve performance and reliability Defining and improving SLIs/SLOs , automating tasks, and reducing operational noise Working with AWS (EKS, EC2, Lambda, RDS), Terraform, and CI/… CD tools What They’re Looking For: Experience in SRE or DevOps roles in a production environment Strong knowledge of observability tools , especially Prometheus in AWS Experience with tracing , metrics, and logs to support development teams Skills in Python or Go , and a good understanding of AWS and Kubernetes What More ❯
cardiff, United Kingdom Hybrid / WFH Options Uniting Cloud
and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support your work. What we’re looking for: Strong experience with AWS, Terraform, Docker, and container orchestration (ECS/Fargate). … Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve worked with Azure, GCP, or More ❯
cardiff, United Kingdom Hybrid / WFH Options Stealth iT Consulting
and Service Level Objectives (SLOs), ensuring reliability and performance. An understanding of Microservices & container orchestration Strong Observability & Monitoring experience (preferably tools such as Dynatrace, Prometheus or OpenTelemetry) Experience delivering DevOps/SRE Best Practices and cost optimisation proposals Experience in Multi-Cloud, Security & Governance for Cloud Engineering and Operations would … be desired Key Responsibilities: Apply SRE principles effectively (experience with Dynatrace, Prometheus, and Open Telemetry is a bonus). Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in collaboration with development teams. Develop dashboards and configure alerts to monitor system health in real time. Enhance Kubernetes More ❯
|
Salary Guide Prometheus Cardiff - 10th Percentile
- £53,000
- 25th Percentile
- £53,750
- Median
- £57,500
- 75th Percentile
- £61,250
- 90th Percentile
- £62,000
|