London, England, United Kingdom Hybrid / WFH Options
Magentus Group
or similar). Experience with scripting or programming languages (Python, Go, Bash, etc.). Understanding of networking, security principles, and best practices. Knowledge of observability tools such as Datadog, Prometheus, Grafana, etc. Desired Attributes Strong problem-solving skills with a proactive approach to improving systems and processes. Excellent communication and collaboration skills, able to work effectively with cross-functional teams. More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Magentus Group
or similar). Experience with scripting or programming languages (Python, Go, Bash, etc.). Understanding of networking, security principles, and best practices. Knowledge of observability tools such as Datadog, Prometheus, Grafana, etc. Desired Attributes Strong problem-solving skills with a proactive approach to improving systems and processes. Excellent communication and collaboration skills, able to work effectively with cross-functional teams. More ❯
implement efficient CI/CD pipelines Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills and the ability to work effectively More ❯
knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat More ❯
Techniques - Hands-on experience with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform More ❯
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
The Acorn Group
Techniques - Hands-on experience with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform More ❯
for product teams to iterate on their applications in AWS. Setting best practices and policies, especially around microservice architecture. Developing, maintaining and operating complex operational tooling (e.g. Kubernetes, Opensearch, Prometheus, Grafana, Github or equivalent alternative technologies). Assessing customer technical capabilities and upskilling for reduced friction and increased platform adoption. Enhancing operational reliability and scalability of existing products POC’ing More ❯
London, England, United Kingdom Hybrid / WFH Options
Xapo Bank
e.g., Jenkins, GitLab CI, GitHub Actions, ArgoCD). Deep technical knowledge across software development, systems architecture, cloud infrastructure (preferably AWS), containerization (Kubernetes), IaC (e.g., Terraform, Pulumi), and observability (e.g., Prometheus, Grafana, ELK Stack). Strategic & Pragmatic Thinking: Ability to translate high-level business and engineering strategy into a clear platform roadmap, making pragmatic decisions that balance innovation with operational stability More ❯
London, England, United Kingdom Hybrid / WFH Options
Appvia
for product teams to iterate on their applications in AWS Setting best practices and policies, especially around microservice architecture Developing, maintaining and operating complex operational tooling (e.g. Kubernetes, Opensearch, Prometheus, Grafana, Github or equivalent alternative technologies) Assessing customer technical capabilities and upskilling for reduced friction and increased platform adoption Enhancing operational reliability and scalability of existing products POC'ing new More ❯
Role: DevOps Engineer Location: Slough-Berkshire, UK Salary: £38000 - £40000 IT Global Consulting Limited is seeking an experienced and highly motivated DevOps Engineer responsible for developing, maintaining, and supporting the platform, acting as the internal focal point for an enterprise More ❯
and production incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes - Automate infrastructure tasks with Python, Bash, Go or SQL - Work with Git … an on-call rotation to ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with More ❯
London, England, United Kingdom Hybrid / WFH Options
Stott and May
and production incident response. Key Responsibilities Manage and monitor AWS infrastructure for performance and security Respond to production incidents, perform root cause analysis, and implement fixes Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes Automate infrastructure tasks with Python, Bash, Go or SQL Work with Git … an on-call rotation to ensure system reliability Your Profile Essential Solid hands-on AWS experience in a DevOps setting Background in incident, change, and problem management Strong with Prometheus, Grafana, Splunk, and PromQL Proficient in scripting (Python, Go, Bash, SQL) Skilled in GitHub, CI/CD, and Kubernetes operations Desirable Experience with Terraform or CloudFormation Advanced log analysis with More ❯
deployments using Docker and Kubernetes Manage cloud infrastructure on Azure, ensuring high availability, scalability, and security Set up and maintain monitoring, logging, and alerting solutions using tools such as Prometheus and Grafana Collaborate with development teams to optimize AI application deployment and operational performance Ensure compliance with security standards and best practices in cloud and infrastructure management Qualifications & Skills: Proven … tools Hands-on experience with containerisation and orchestration technologies such as Docker and Kubernetes In-depth knowledge of cloud platforms, especially Azure Experience with monitoring and logging tools like Prometheus and Grafana Familiarity with supporting AI/ML workloads is preferred Strong problem-solving skills and ability to work collaboratively in a fast-paced environment Multi-Year Project - Flexible Start More ❯
deployments using Docker and Kubernetes Manage cloud infrastructure on Azure, ensuring high availability, scalability, and security Set up and maintain monitoring, logging, and alerting solutions using tools such as Prometheus and Grafana Collaborate with development teams to optimize AI application deployment and operational performance Ensure compliance with security standards and best practices in cloud and infrastructure management Qualifications & Skills: Proven … tools Hands-on experience with containerisation and orchestration technologies such as Docker and Kubernetes In-depth knowledge of cloud platforms, especially Azure Experience with monitoring and logging tools like Prometheus and Grafana Familiarity with supporting AI/ML workloads is preferred Strong problem-solving skills and ability to work collaboratively in a fast-paced environment Multi-Year Project - Flexible Start More ❯
deployments using Docker and Kubernetes Manage cloud infrastructure on Azure, ensuring high availability, scalability, and security Set up and maintain monitoring, logging, and alerting solutions using tools such as Prometheus and Grafana Collaborate with development teams to optimize AI application deployment and operational performance Ensure compliance with security standards and best practices in cloud and infrastructure management Qualifications & Skills: Proven … tools Hands-on experience with containerisation and orchestration technologies such as Docker and Kubernetes In-depth knowledge of cloud platforms, especially Azure Experience with monitoring and logging tools like Prometheus and Grafana Familiarity with supporting AI/ML workloads is preferred Strong problem-solving skills and ability to work collaboratively in a fast-paced environment Multi-Year Project - Flexible Start More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Wallet in Telegram
dashboards Serve as a duty DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL Experience with load balancers (we … Nginx/Traefik, AWS ELB/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python English proficiency at an intermediate More ❯
dashboards Serve as a duty DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL Experience with load balancers (we … Nginx/Traefik, AWS ELB/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python English proficiency at an intermediate More ❯
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems More ❯
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems More ❯
London, England, United Kingdom Hybrid / WFH Options
Wallet in Telegram
dashboards Serve as a duty DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL Experience with load balancers (we … Nginx/Traefik, AWS ELB/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python English proficiency at an intermediate More ❯
tools; Maven, Gradle or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems More ❯
London, England, United Kingdom Hybrid / WFH Options
amber labs
GitLab to ensure continuous integration, delivery, and deployment of applications. Collaborate with the development team to optimise pipeline efficiency and ensure code quality. Implement monitoring solutions using AWS CloudWatch, Prometheus, Grafana, or similar tools to ensure visibility into application performance, health, and security. Troubleshoot production issues and provide resolution. Ensure the security of cloud infrastructure by implementing best practices like … PowerShell. Experience automating infrastructure tasks using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Monitoring & Logging Tools: Experience with monitoring and logging tools such as AWS CloudWatch, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana). Benefits: Join a rapidly expanding start-up where personal growth is a part of our DNA. Benefit from a flexible work environment focused More ❯
to name a few of the libraries we use extensively. We implement the systems that require the highest data throughput in Java. Within Data Engineering we use Dataiku, Snowflake, Prometheus, and ArcticDB heavily. We use Kafka for data pipelines, Apache Beam for ETL, Bitbucket for source control, Jenkins for continuous integration, Grafana + Prometheus for metrics collection, ELK for log More ❯
Brighton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
like Terraform, Vagrant, and scripting languages such as Shell, Python, and PowerShell. Manage containerized applications with Docker to ensure streamlined deployment and environment consistency. Implement monitoring and alerting with Prometheus, Grafana, and AlertManager to identify and resolve performance issues. Promote DevOps best practices in automation, security, and agile delivery to improve operational efficiency and team productivity. ESSENTIAL EXPERIENCE Hands-on More ❯