etc.) * Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others * Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform * Familiarity with container and container orchestration such as More ❯
Belfast, City of Belfast, County Antrim, United Kingdom Hybrid / WFH Options
Cala Consulting
build automation tools, build orchestration and environment automation. e.g. Jenkins, GitHub, GitLab, CloudFormation, Others Experience in implementing tools for logging, monitoring and alerting. e.g. Prometheus, Splunk, CloudWatch, Nagios Experience in creating and automating virtual machines in public and private clouds An understanding or experience of high availability, business continuity and More ❯
Employment Type: Permanent
Salary: £40000 - £60000/annum pension, share options, health
should have experience in setting up monitoring, logging, and alerting for improved system observability. Tech Stack: GitHub, Docker, Kubernetes, Ansible, Terraform, Gitlab, Synk, Vault, Prometheus, Grafana, Elk, Splunk What's in it for you At Accenture in addition to a competitive basic salary, you will also have an extensive benefits More ❯
skills and a track record of cross-team collaboration. Nice to have: Kubernetes expertise (GKE/AKS/EKS) and container-native observability stacks (Prometheus/Grafana). NoSQL experience (Firestore, Cosmos DB, DynamoDB, MongoDB). Experience with game-backend scales, real-time services or hybrid cloud/bare-metal More ❯
/service once live, including: observability best practises, logging best practises, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in Python. Experience in data modelling and design patterns; in-depth knowledge of relational databases (PostgreSQL) and familiarity with data More ❯
on experience with Docker, Kubernetes, and container orchestration. Expertise with Databricks, including ML workflows and data pipeline management. Familiarity with tools like MLflow, DVC, Prometheus, and Grafana for versioning and monitoring. Experience implementing security and compliance standards for AI systems. Strong problem-solving and communication skills, with a collaborative mindset. More ❯
Code, particularly Terraform Proven experience with CI/CD pipelines and Azure DevOps Proficiency in containerization and Kubernetes orchestration Experience with monitoring tools like Prometheus and Grafana Advanced PowerShell scripting capabilities Microsoft certifications Strong knowledge of Windows Server infrastructure Experience supporting multi-language, European offices What Sets You Apart Passion More ❯
/service once live, including: observability best practices, logging best practices, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in Python. Experience in data modelling and design patterns; in-depth knowledge of relational databases (PostgreSQL) and familiarity with data More ❯
versions. It supports multiple storage backends and cloud environments. We use Git for source control, Jenkins and GitHub Actions for continuous integration, Grafana and Prometheus for metrics collection, Docker and Kubernetes for containerization and orchestration, and Slack for internal communication. However, our technology stack is never static-we continuously evaluate More ❯
Proficient in observability and monitoring tools, including configuring alerts, creating dashboards, and conducting root cause analysis. Some of the tools we use are: Grafana, Prometheus, Elastic, Splunk. Configuring incident management platforms such as PagerDuty. Hands-on experience with Infrastructure-as-Code (IaC) and automation to improve operational efficiency, using tools More ❯
Process automation Respond to change requests Skills & Experience Oracle DB Powershell SQL Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration More ❯
hardware and software products. Experience implementing automated testing frameworks in a hardware-in-the-loop (HITL) environment. Familiarity with monitoring and logging tools (e.g., Prometheus, ELK stack). Experience with Nix/NixOS. Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application More ❯
tools like Ansible Experience working within a mature continuous development process highly desirable Experience with cloud infrastructure provisioning desirable Knowledge of monitoring tools like Prometheus, Grafana, Elastic Search, Splunk Team player essential with great communication skills QRT is an equal opportunity employer. We welcome diversity as essential to our success. More ❯
tools such as Docker, Kubernetes, and CI/CD pipelines (Jenkins, Git, Helm, Terraform) Ability to work with monitoring/logging tools (e.g. CloudWatch, Prometheus, Grafana) Previous experience supporting production environments or investigating application-level issues Comfortable in Agile environments using Jira, Confluence, and similar tools Strong communication, documentation, and More ❯
london, south east england, united kingdom Hybrid / WFH Options
psd group
tools such as Docker, Kubernetes, and CI/CD pipelines (Jenkins, Git, Helm, Terraform) Ability to work with monitoring/logging tools (e.g. CloudWatch, Prometheus, Grafana) Previous experience supporting production environments or investigating application-level issues Comfortable in Agile environments using Jira, Confluence, and similar tools Strong communication, documentation, and More ❯
skills in containerisation, including building and optimising base images and managing container repositories. Proven track record in administering monitoring systems such as Zabbix and Prometheus/Thanos in a high-availability environment. Experience leading and mentoring DevOps engineers, providing technical leadership across teams. Finance experience or knowledge of Trading or More ❯
london, south east england, united kingdom Hybrid / WFH Options
Intec Select
skills in containerisation, including building and optimising base images and managing container repositories. Proven track record in administering monitoring systems such as Zabbix and Prometheus/Thanos in a high-availability environment. Experience leading and mentoring DevOps engineers, providing technical leadership across teams. Finance experience or knowledge of Trading or More ❯
Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. More ❯
Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
PSD Group
tools such as Docker, Kubernetes, and CI/CD pipelines (Jenkins, Git, Helm, Terraform) Ability to work with monitoring/logging tools (e.g. CloudWatch, Prometheus, Grafana) Previous experience supporting production environments or investigating application-level issues Comfortable in Agile environments using Jira, Confluence, and similar tools Strong communication, documentation, and More ❯
in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Searchability NS&D
in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV More ❯
City Of London, England, United Kingdom Hybrid / WFH Options
Harrington Starr
with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Harrington Starr
with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next More ❯