London, England, United Kingdom Hybrid / WFH Options
Cint
Kubernetes, Docker, Packer, Ansible and Jenkins. We support applications and services written in Golang, Python, Java, Scala and .Net. We monitor and alert on everything we deploy via Grafana, Prometheus, Graphite and ELK stacks. The team holds itself accountable to a high standard of build quality. We have recently completed the first major phase of a completely green-field infrastructure … GitHub Actions etc.) You have a grasp of “cloud native” and 12-Factor applications You have good knowledge of monitoring and alerting using one or more of: Graphite, Statsd, Prometheus, Grafana, PagerDuty You have expertise in at least one scripting or programming language (Python, Bash, Ruby, Node, Golang, Java) Bonus Points If You Have You have good knowledge of the More ❯
per day End date - 31st March 2026 Active SC clearance Onsite travel to Leeds/Newcastle/Manchester/Blackpool/Sheffield AWS Terraform Gitlab CI/CD Prometheus Grafana Splunk Gov experience More ❯
Skills: Experience working in Agile environments Strong understanding of Site Reliability Engineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Understanding More ❯
Configuration automation with Ansible Linux, preferably Redhat/Centos Good Network skills (Firewalls & Switches) AWS/Azure/GCP Containerisation technologies such as Kubernetes and Docker Grafana and or Prometheus Vmware Experience: Minimum 3 years. More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
london (city of london), south east england, united kingdom
Prism Digital
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site Reliability Engineering or DevOps transformation More ❯
years of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of More ❯
experience building and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking More ❯
Sheffield, South Yorkshire, England, United Kingdom
KBC Technologies UK LTD
networking, storage, and security configurations. Perform upgrades and patching of OpenShift clusters and associated components. Monitoring and Optimization: Implement monitoring solutions for KVM and OpenShift environments using tools like Prometheus, Grafana, or ELK Stack. Analyze system performance and recommend optimizations for resource utilization and cost efficiency. Collaboration and Documentation: Work closely with development, DevOps, and operations teams to ensure seamless More ❯
Sheffield, South Yorkshire, England, United Kingdom Hybrid / WFH Options
KBC Technologies UK LTD
. Monitor, troubleshoot, and optimize OpenShift workloads for high performance. Manage networking, storage, and security configurations. Perform upgrades and patching of OpenShift clusters & components. Monitoring & Optimization Implement monitoring tools (Prometheus, Grafana, ELK Stack). Analyze performance and recommend optimizations for efficiency. Collaboration & Documentation Work with DevOps, development, and operations teams for seamless integration. Document infrastructure designs, processes, and best practices. More ❯
and modern deployment practices Familiarity with infrastructure-as-code tools such as Terraform Strong understanding of security best practices in application and infrastructure design Exposure to observability tools (e.g. Prometheus, Grafana, structured logging) Confident debugging and resolving issues in complex distributed systems Product-oriented mindset with a collaborative approach to improving developer experience Bonus: experience with Kafka, gRPC, or contributing More ❯
performance. Manage OpenShift networking, storage, and security configurations. Perform upgrades and patching of OpenShift clusters and associated components. Implement monitoring solutions for KVM and OpenShift environments using tools like Prometheus, Grafana, or ELK Stack. Analyze system performance and recommend optimizations for resource utilization and cost efficiency. Work closely with development, DevOps, and operations teams to ensure seamless integration of infrastructure More ❯
handsworth, yorkshire and the humber, united kingdom
Wipro
performance. Manage OpenShift networking, storage, and security configurations. Perform upgrades and patching of OpenShift clusters and associated components. Implement monitoring solutions for KVM and OpenShift environments using tools like Prometheus, Grafana, or ELK Stack. Analyze system performance and recommend optimizations for resource utilization and cost efficiency. Work closely with development, DevOps, and operations teams to ensure seamless integration of infrastructure More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Huxley
Bicep or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Huxley Associates
Bicep or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days More ❯
to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management More ❯
RabbitMQ, Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source More ❯
Application Support – Oracle, SQL Server, PostgreSQL Scripting & Automation – Bash, PowerShell, Python Networking Protocols – TCP/IP, VPN, VLANs, Subnetting, Firewalls, Routing/Switching System Monitoring & Management – Nagios, Zabbix, Grafana, Prometheus, SolarWinds Rewards & Benefits TCS is consistently voted a Top Employer in the UK and globally. Our competitive salary packages feature pension, health care, life assurance, laptop, phone, access to extensive More ❯
the perfect environment for you. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash, or Go What Youll Be Doing Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating … Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations (EKS, Helm, or similar). Solid knowledge of monitoring, alerting, and logging (Grafana, Prometheus, ELK). Hands-on experience with Terraform and CI/CD tooling. Strong scripting or development background (Python, Go, or similar). Excellent troubleshooting skills and a proactive, problem-solving More ❯
on experience with Gatling and open-source performance tools. Strong knowledge of CI/CD tools (Jenkins, GitHub Actions, Gradle/Maven). Skilled in monitoring/logging with Prometheus and Grafana. Proficiency in scripting languages (Scala, Python, Shell). This role offers the chance to make an impact on a global platform, work with cutting-edge tech, and collaborate More ❯