or Argo Workflows) for containerized microservices, ML model training, and inference workloads. Integrate automated testing, security scans, and policy checks into the release process. Observability & Reliability Engineering Implement comprehensive monitoring, logging, and tracing stacks (Prometheus/Grafana, Loki, ELK, OpenTelemetry). Define SLOs/SLA dashboards; lead incident response, root More ❯
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). More ❯
tools, such as Terraform, CloudFormation, ARM, or Pulumi. Expertise in building secure applications and infrastructure, with strong knowledge of security practices. SRE skills, including observability and telemetry monitoring. Hands-on experience with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerisation using Docker, Kubernetes, OpenShift, and Helm. More ❯
london, south east england, United Kingdom Hybrid / WFH Options
LHH
or CloudFormation. Implement CI/CD pipelines, enabling continuous integration and continuous deployment for mission-critical applications. Monitor system performance, availability, and security, implementing observability best practices. Work in an Agile environment, engaging with stakeholders to understand requirements and deliver iterative improvements. Your skills and experience Essential: Experience deploying and More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
london (west end), south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
environments (e.g. Docker), and IaC tools like Terraform and Ansible for infrastructure performance and cost efficiency. • Implement best practices in DevOps and DevSecOps, including observability, security, networking, API integration, and disaster recovery. • Mentor junior engineers and contribute technical leadership, ideally with experience in broadcast workflows, audio/video streaming, and More ❯
efficiency and cost-effectiveness Utilising automation tools for streamlining processes Influencing and developing relationships through confident, engaging, professional, high-impact interactions Utilising visibility/observability solutions. e.g., Platform Monitoring, User Experience Monitoring, Application Performance Monitoring, Application Resource Monitoring, Experience with Backups, High Availability and DR What you'll bring: Hands More ❯
enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
and cost-efficient cloud solutions. You will be provided with access to cutting-edge cloud technologies, including AWS serverless computing, Kubernetes orchestration, AI-driven observability, and security automation, keeping you at the forefront of innovation. Your responsibilities: Implement and manage highly available, scalable, and secure applications hosted on AWS Cloud More ❯
Ansible, and Docker. Develop, maintain, and improve CI/CD pipelines to streamline and standardise application delivery processes. Monitor infrastructure performance and availability using observability tools including CloudWatch, Grafana, InfluxDB, and ElasticSearch. Implement security best practices including IAM policies, WAF configuration, and key management to ensure platform compliance. Manage and More ❯
You Bring: 5+ years of Linux engineering experience in high-performance environments. Strong scripting skills (Python, Bash), solid Ansible know-how. Expertise in networking, observability (Prometheus, Grafana, ELK), and DevOps tooling (Terraform, Git, CI/CD). Familiarity with containers (Docker/Kubernetes) and cloud (AWS) is a plus. Bonus More ❯
Reigate, England, United Kingdom Hybrid / WFH Options
Client Server
such as Pulumi and Terraform You have experience of building and improving CI/CD pipelines for product teams You have experience with cloud observability (logging, tracing, metrics, monitoring and alerting) You have experience with Containerisation - Docker, Kubernetes You have strong scripting skills with PowerShell and/or C# .Net More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Parser
DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to More ❯
interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction More ❯
Bash, or Python Solid understanding of Linux systems, networking, routing, and firewall configurations A deep grasp of AWS operational best practices, particularly in monitoring, observability, and FinOps Expertise in Infrastructure as Code (IaC) tools such as CloudFormation, CDK, and Terraform Additionally, it would be advantageous to have experience with: AWS More ❯
london, south east england, united kingdom Hybrid / WFH Options
Annapurna
with CI/CD pipelines and container technologies like Docker and Kubernetes. Deep understanding of networking, distributed systems, and databases. Expertise in monitoring and observability tools such as DataDog, Prometheus, Grafana, ELK stack, or Splunk. Excellent communication skills and a meticulous approach to problem-solving. Desirable Experience: Familiarity with Azure. More ❯
Fleet, midlands, United Kingdom Hybrid / WFH Options
developrec
. Own key architecture and platform decisions to improve system performance, reliability, and scalability. Champion DevOps best practices: CI/CD, automation, IaC (Terraform), observability and security. Collaborate across teams, build strong engineering practices, and foster a culture of continuous improvement. Mentor and guide engineers, shaping both tech strategy and More ❯
fleet, south east england, United Kingdom Hybrid / WFH Options
developrec
. Own key architecture and platform decisions to improve system performance, reliability, and scalability. Champion DevOps best practices: CI/CD, automation, IaC (Terraform), observability and security. Collaborate across teams, build strong engineering practices, and foster a culture of continuous improvement. Mentor and guide engineers, shaping both tech strategy and More ❯
Fleet, south west england, United Kingdom Hybrid / WFH Options
developrec
. Own key architecture and platform decisions to improve system performance, reliability, and scalability. Champion DevOps best practices: CI/CD, automation, IaC (Terraform), observability and security. Collaborate across teams, build strong engineering practices, and foster a culture of continuous improvement. Mentor and guide engineers, shaping both tech strategy and More ❯
database deployments using Terraform, AWS CloudFormation, and AWS CDK. Integrate database changes into CI/CD pipelines using tools like Flyway or Liquibase. Define observability and monitoring strategies using CloudWatch, X-Ray, and Prometheus. Practice & Team Development Contribute to the development of modernisation frameworks, methodologies, and best practices. Help shape More ❯
Manage ETL, build pipelines, and scale data infrastructure to support data science and analytics initiatives. Design, implement and improve tools and services for orchestration, observability, data governance and data quality to high engineering standards Deploy and manage products using CI/CD best practices Work in partnership with Analytical stakeholders More ❯