City of London, London, United Kingdom Hybrid / WFH Options
CipherTek Recruitment
data points, either via APIs or other appropriate methods, to ensure real-time decision-making capabilities. Integrate with Instrumentation Platforms : Integrate the platform with Prometheus and Geneos for continuous monitoring, diagnostics, and system health checks. Desired technical Skills: Java Expertise: Extensive experience with Core Java , focusing on low-level performance More ❯
data points, either via APIs or other appropriate methods, to ensure real-time decision-making capabilities. Integrate with Instrumentation Platforms : Integrate the platform with Prometheus and Geneos for continuous monitoring, diagnostics, and system health checks. Desired technical Skills: Java Expertise: Extensive experience with Core Java , focusing on low-level performance More ❯
London, England, United Kingdom Hybrid / WFH Options
NTT DATA North America
with APIs on the IBM i platform at a high competency level alongside SQL PowerVC desirable (nice to have) Good knowledge and exposure to Prometheus, Grafana, Splunk, Elastic and at least working understanding of OTEL Ideally previous experience in a software engineering-type role or previous developer role (RPG/ More ❯
London, England, United Kingdom Hybrid / WFH Options
Grafana Labs
in videos, and many other programs. A big plus for us is if you already participate meaningfully in open-source communities such as Kubernetes, Prometheus, or the wider CNCF. We work in the R&D part of the organization, alongside product managers and engineers. The aim of the role is More ❯
with the ability to work effectively in a team. Technologies we use Golang AWS (Lambda, SQS, EventBridge, DynamoDB, RDS, CDK, OpenSearch) Github, Github Actions Prometheus, Grafana Event-driven architecture and domain-driven design How we reward our team Dynamic working environment with a diverse and driven team Huge opportunity for More ❯
London, England, United Kingdom Hybrid / WFH Options
Binance
Experience in large-scaled distributed environments Good command of Linux environment Clear, logical communicator in English Optional Familiar with tools such as Docker, Nginx, Prometheus, Grafana, etc. Experience with using time series databases Understanding of low-level programming languages such as C, C++, Rust, etc. Contributed to open source projects More ❯
us extra happy? Previous experience with billing or payment systems . Expertise in building and optimizing scalable, resilient distributed systems. Familiarity with Kubernetes and Prometheus for container orchestration and monitoring. Redis knowledge for caching and performance optimization. Experience with .NET Framework. Willingness to drive initiatives to upgrade to newer .NET More ❯
on building repeatable and cost-efficient infrastructure Experience building solutions for problems with no answers on Google Experience working with monitoring solutions in the Prometheus ecosystem; Grafana, Loki, Tempo, VictoriaMetrics Experience managing multi-cluster, multi-cloud Kubernetes deployments Familiarity with incident management Nice to have: Familiarity with Gitops, e.g. Flux More ❯
an absolute must; and a big plus for us is if you already participate meaningfully in any other open source communities such as Kubernetes, Prometheus, or the wider CNCF. We work in the R&D part of the organization, alongside product managers and engineers. The aim of the role is More ❯
Hanover, Maryland, United States Hybrid / WFH Options
Lockheed Martin
such as cloud providers (AWS, Azure, GCP), container registries (Docker Hub, Google Container Registry), and cloud-based logging and monitoring tools (e.g., ELK Stack, Prometheus) Monitoring and Logging: • Experience with monitoring tools such as Nagios, Prometheus, or Grafana • Knowledge of logging tools such as ELK Stack, Splunk, or Logstash CI More ❯
Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes - Automate infrastructure tasks with Python, Bash, Go … ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stott & May Professional Search Limited
Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes - Automate infrastructure tasks with Python, Bash, Go … ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or More ❯
London, England, United Kingdom Hybrid / WFH Options
Stott and May
Responsibilities Manage and monitor AWS infrastructure for performance and security Respond to production incidents, perform root cause analysis, and implement fixes Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes Automate infrastructure tasks with Python, Bash, Go … ensure system reliability Your Profile Essential Solid hands-on AWS experience in a DevOps setting Background in incident, change, and problem management Strong with Prometheus, Grafana, Splunk, and PromQL Proficient in scripting (Python, Go, Bash, SQL) Skilled in GitHub, CI/CD, and Kubernetes operations Desirable Experience with Terraform or More ❯
DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL …/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Wallet in Telegram
DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL …/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python More ❯
London, England, United Kingdom Hybrid / WFH Options
Wallet in Telegram
DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL …/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python More ❯
London, England, United Kingdom Hybrid / WFH Options
amber labs
delivery, and deployment of applications. Collaborate with the development team to optimise pipeline efficiency and ensure code quality. Implement monitoring solutions using AWS CloudWatch, Prometheus, Grafana, or similar tools to ensure visibility into application performance, health, and security. Troubleshoot production issues and provide resolution. Ensure the security of cloud infrastructure … using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Monitoring & Logging Tools: Experience with monitoring and logging tools such as AWS CloudWatch, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana). Benefits: Join a rapidly expanding start-up where personal growth is a part of our DNA. Benefit from More ❯
this platform is highly desirable. This includes understanding of OpenShift's architecture, its project and application concepts, and its command-line client. Proficiency with Prometheus and Terraform: Experience with Prometheus for monitoring and alerting purposes is desirable. Familiarity with Terraform for infrastructure as code (IaC) to provision and manage any More ❯
London, England, United Kingdom Hybrid / WFH Options
Government Digital and Data
this platform is highly desirable. This includes understanding of OpenShift's architecture, its project and application concepts, and its command-line client. Proficiency with Prometheus and Terraform: Experience with Prometheus for monitoring and alerting purposes is desirable. Familiarity with Terraform for infrastructure as code (IaC) to provision and manage any More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Lorien
exposure to new technologies and career growth. Key Requirements: Proven advanced support and troubleshooting skills. Essential: Strong experience with monitoring tools (Instana, Splunk, Solarwinds, Prometheus, Grafana). Windows & Linux troubleshooting. ITIL environment experience. Understanding of website hosting (DNS, HTTP/S, Certs, basic networking). Excellent communication skills. AWS knowledge More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Lorien
exposure to new technologies and career growth. Key Requirements: Proven advanced support and troubleshooting skills. Essential: Strong experience with monitoring tools (Instana, Splunk, Solarwinds, Prometheus, Grafana). Windows & Linux troubleshooting. ITIL environment experience. Understanding of website hosting (DNS, HTTP/S, Certs, basic networking). Excellent communication skills. AWS knowledge More ❯
Burnley, England, United Kingdom Hybrid / WFH Options
Lorien
exposure to new technologies and career growth. Key Requirements: Proven advanced support and troubleshooting skills. Essential: Strong experience with monitoring tools (Instana, Splunk, Solarwinds, Prometheus, Grafana). Windows & Linux troubleshooting. ITIL environment experience. Understanding of website hosting (DNS, HTTP/S, Certs, basic networking). Excellent communication skills. AWS knowledge More ❯
DevOps Engineer, Site Reliability Engineer, Platform Engineer or similar role. Ideally in an entreprise-grade Experience with APM stacks such as Datadog, New Relic, Prometheus or similar. Experience with handling telemetry, tracing and logging data, at scale, in multiple different environments. Familiarity with low-level telemetry daemons and aggregators such More ❯
London, England, United Kingdom Hybrid / WFH Options
Sporty Group
you’ll be facilitating the database side On call responsibilities on a rotating pattern Our stack: MySQL, MongoDB, AWS EC2, Cloudwatch, RDS, Redshift, Grafana, Prometheus, Terraform, Python, Shell etc. You should apply if you have 4+ years experience tackling and managing advanced MySQL and MongoDB problems Hands on experience with More ❯
City of Westminster, England, United Kingdom Hybrid / WFH Options
VIOOH
the service and its inner workings. Experience managing AWS or GCP. Experience in building or integrating Monitoring Tools (Datadog/Kibana/Grafana/Prometheus). Write software using either Java/Scala/Python. The following are nice to have, but not required - Apache Spark jobs and pipelines. Experience More ❯