automation. Strong knowledge of CI/CD tooling, IaC, and cloud-native technologies. Advanced scripting (Bash, Python) and automation experience. Skilled in monitoring and observability tools (e.g., Prometheus, Grafana, ELK). Strong problem-solving, communication, and leadership skills. Familiarity and Experience of CI/CD Tools: Jenkins, GitLab CI Infrastructure More ❯
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
Acorn Group
Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as More ❯
Develop a baseline monitoring and tooling concept for cloud to address the need for compliance infrastructure reporting within agile deliveries as part of our Observability strategy. Develop concepts and tools for chargeback and showback (Financial Instrumentation) in a multicloud context. Implement and mature a cloud forecasting and capacity management solution More ❯
CD Pipeline Development: Develop and maintain robust CI/CD pipelines for continuous integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration More ❯
london, south east england, united kingdom Hybrid / WFH Options
Amber Labs
tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for More ❯
CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Fruition Group
a small team of engineers Align DevOps capabilities with the wider business Champion DevEx, reliability, and security Embed operational excellence and incident response Promote observability and performance optimisation Lead DevOps Engineer Requirements Proven technical and some leader/mentoring experience Cloud-native expertise (any cloud provider is fine: GCP, AWS More ❯
tools, such as Terraform, CloudFormation, ARM, or Pulumi. Expertise in building secure applications and infrastructure, with strong knowledge of security practices. SRE skills, including observability and telemetry monitoring. Hands-on experience with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerisation using Docker, Kubernetes, OpenShift, and Helm. More ❯
london, south east england, united kingdom Hybrid / WFH Options
Aimhire
engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing). ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability More ❯
in healthcare IT, or familiarity with HL7, FHIR, or DICOM standards. Exposure to identity and access management (e.g. Keycloak, OAuth2, SSO). Familiarity with observability and logging tools (e.g. Prometheus, Grafana, OpenSearch, ELK stack). Experience with secrets management (e.g. Vault, AWS Secrets Manager). Experience working with or building More ❯
enhance internal DevOps culture, tooling, and CI/CD processes. Collaborate cross-functionally to continuously innovate and improve development workflows and system operations. Foster observability and reliability across live systems through best-in-class monitoring and automation. Day to Day: Collaborate with engineers and architects to define and implement cloud More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
london, south east england, united kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
london (west end), south east england, united kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Networks, ExpressRoute, VPNs, NSGs, and Azure Firewall for secure connectivity. Integrate hybrid cloud solutions using Azure Arc and hybrid connectivity strategies. Monitoring & Resilience: Implement observability using Azure Monitor, Log Analytics, App Insights, and Prometheus/Grafana . Design for high availability (HA), disaster recovery (DR), and business continuity (BCP) . More ❯
into the platform. Support scalable, resilient cloud environments with modern DevOps practices. Promote GitOps deployment strategies and mentor peers in DevOps best practice. Enhance observability using tools like Prometheus and Grafana. This role is ideal for someone looking to take the next step in a DevOps career while working with More ❯
models Additional skills that are a plus: Programming languages such as Scala, Rust, Go, Angular, React, Kotlin Database management with PostgreSQL Experience with ElasticSearch, observability tools like Grafana and Prometheus What this role can offer Opportunity to deepen understanding of AI and Data Science applications Mentorship and support from colleagues More ❯
interest in learning technology that facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Preferred Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. About More ❯