London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
tools (Jenkins, GitHub Actions, GitLab CI). Knowledge of scripting languages (Python, Bash, PowerShell). Knowledge of containerization & orchestration (Docker, Kubernetes). Experience with monitoring/logging tools (Prometheus, Grafana, Splunk, ELK, CloudWatch). Professional level of English (spoken and written), enabling effective communication across international teams. Excellent problem-solving, analytical, and communication skills. Ability to work in a fast More ❯
re Looking For: Proven experience in DevOps roles within agile environments Strong scripting skills (Python, Bash, etc.) Hands-on experience with containerization (Docker, Kubernetes) Familiarity with observability tools (Prometheus, Grafana, ELK) Excellent problem-solving and communication skills ✅ Bonus Points: Experience with security and compliance in cloud environments Knowledge of serverless architecture Previous work in fintech, e-commerce, or SaaS environments More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Experis UK
re Looking For: Proven experience in DevOps roles within agile environments Strong scripting skills (Python, Bash, etc.) Hands-on experience with containerization (Docker, Kubernetes) Familiarity with observability tools (Prometheus, Grafana, ELK) Excellent problem-solving and communication skills ✅ Bonus Points: Experience with security and compliance in cloud environments Knowledge of serverless architecture Previous work in fintech, e-commerce, or SaaS environments More ❯
and AWS integration. Kafka – experience with production clusters, scaling, tuning, troubleshooting, and event-driven systems. MongoDB – strong admin experience including replication, sharding, tuning, and backups. Monitoring/Observability – Prometheus, Grafana, ELK, Datadog, with strong alerting/SLO design. AWS – expertise across EC2, VPC, S3, RDS, IAM, ALB/NLB, and cost optimisation. Linux – advanced administration, performance debugging, and security hardening. More ❯
as AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills to have More ❯
South West London, London, United Kingdom Hybrid / WFH Options
InterQuest Group (UK) Limited
DevOps) AND Cloud platforms (AWS, Azure, or GCP) Solid understanding of networking concepts (TCP/IP, DNS, routing, VPNs, firewalls). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK). Scripting skills including Python and React. Expereince of working within an Agile environment including using Jira and ideally SAFe scaled Agile expereince. Ability to build automated pipelines and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Interquest
DevOps) AND Cloud platforms (AWS, Azure, or GCP) Solid understanding of networking concepts (TCP/IP, DNS, routing, VPNs, firewalls). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK). Scripting skills including Python and React. Expereince of working within an Agile environment including using Jira and ideally SAFe scaled Agile expereince. Ability to build automated pipelines and More ❯
Ruby, etc.). Strong expertise in designing systems for observability, including effective monitoring, detailed logging, comprehensive performance testing strategies, and hands-on experience with modern observability tools such as Grafana, Prometheus, or CloudWatch to implement and manage monitoring solutions. Hands-on experience with core AWS, or other cloud providers like GCP or Azure, to architect scalable and resilient infrastructure. Extensive More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
processes to ensure systems are robust, secure and observable. You'll be working with a modern tech stack using Java, Spring Boot, CI/CD, Kubernetes, AWS, EKS and Grafana/Splunk. About you: You have advanced backend software engineering experience with Java, Spring Boot, REST, Postgres, Redis You have experience of running production workloads on Kubernetes (Amazon EKS preferred More ❯
with PostgreSQL or similar databases, including writing queries for validation and verifying data integrity. Experience testing applications running in Kubernetes environments. Familiarity with using monitoring and observability tools like Grafana to support test analysis and validation. Experience troubleshooting and supporting customers with product features, including investigating issues and providing technical guidance. Bias for action and problem solving - eagerness to take More ❯
modern deployment practices Familiarity with infrastructure-as-code tools such as Terraform Strong understanding of security best practices in application and infrastructure design Exposure to observability tools (e.g. Prometheus, Grafana, structured logging) Confident debugging and resolving issues in complex distributed systems Product-oriented mindset with a collaborative approach to improving developer experience Bonus: experience with Kafka, gRPC, or contributing to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Huxley
or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days depending More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Huxley Associates
or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days depending More ❯
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
GitHub Actions, or AWS CodePipeline Support and train technical staff in upskilling necessary for ongoing operations Monitor and ensure system reliability, availability, and performance using tools likeCloudWatch, Prometheus, Icinga2, Grafana, and Datadog Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes Desirable skills Travis CI Monitoring – Grafana, Icinga Prometheus Rabbit MQ/AMQP Working knowledge of security More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
ll Bring Strong experience with GCP , Terraform , and Infrastructure-as-Code Deep knowledge of cloud networking, security automation, and compliance standards Proficiency in CI/CD pipelines , monitoring tools (Grafana, Datadog), and scripting A collaborative mindset with excellent communication and mentoring skills Why Join? Shape a next-gen AI infrastructure with autonomy and purpose Hybrid working with regular meetups in More ❯
Collaborate with cross-functional teams to shape and refine foundational capabilities. Own your work from concept to deployment and beyond-digging into production issues using tools like Honeycomb, Datadog, Grafana, and Rollbar to ensure system health. Write clear, maintainable, and well-documented Go code, with observability and long-term maintainability built in. Participate in architectural decisions and technical strategy development. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
indexing, and capacity planning for mission-critical systems Develop secure backup, recovery, and disaster recovery procedures Explore multi-tenant and sharded architectures to support growth Implement monitoring strategies using Grafana, Datadog, and CI/CD integrations Champion database best practices, mentor teams, and standardize tooling and automation What You’ll Bring Extensive experience managing cloud-hosted PostgreSQL at scale Proficiency More ❯
messaging and Protobuf for consistent data contracts across components. Working with a Vue-based frontend and integrating it with backend services. Managing local SQLite databases and integrating them with Grafana dashboards and interactive Vue pages. Building CI/CD pipelines to support development and deployment workflows. Collaborating on authentication and RBAC strategies (e.g., Windows Auth, OAuth, OIDC). Writing clean More ❯
London, St James's, United Kingdom Hybrid / WFH Options
Stock in the Channel
with throttling and versioning. Developing durable workflows. Writing efficient and scalable SQL queries , stored procedures, and scripts. Integrating external systems with custom data synchronisation logic. Utilising Open Telemetry and Grafana for logs, metrics, tracing, and alerting across backend services. Contributing to technical design discussions, code reviews, and deployments. What We’re Looking For: Strong experience in C#/.NET backend More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Get Staffed Online Recruitment Limited
with throttling and versioning. Developing durable workflows. Writing efficient and scalable SQL queries , stored procedures, and scripts. Integrating external systems with custom data synchronisation logic. Utilising Open Telemetry and Grafana for logs, metrics, tracing, and alerting across backend services. Contributing to technical design discussions, code reviews, and deployments. What They’re Looking For: Strong experience in C#/.NET backend More ❯