related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise in Terraform, CloudFormation More ❯
related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise in Terraform, CloudFormation More ❯
london (city of london), south east england, united kingdom
Damia Group
related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise in Terraform, CloudFormation More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Robert Half
on experience building and maintaining CI/CD pipelines (Azure DevOps, GitHub Actions, Jenkins, or similar). Strong understanding of monitoring, logging, and observability tools (e.g., AppInsights, ELK, Prometheus, Grafana). Solid knowledge of test-driven development and experience embedding TDD in automated delivery workflows. Experience working directly within software development teams to support agile delivery. Familiarity with API-driven More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Computer Futures
Architecting and maintaining AWS cloud environments Managing Kubernetes clusters (plus Docker & Helm) Building CI/CD pipelines and automated deployment tools Driving observability with tools like CloudWatch, ELK, and Grafana Mentoring junior engineers and shaping DevOps best practices Ensuring security, compliance, and disaster recovery readiness What You Bring You're a tech-savvy problem solver with a passion for DevOps More ❯
pipelines Familiarity with regulated workflows: ISO27001, SOC2, GDPR aren't just abbreviations, and don't fill you with dread Observability skills: Well familiar with Open Telemetry, Prometheus, Loki and Grafana CI/CD pipeline skills: You know what it takes to build templates and guardrails to allow the most junior developers to confidently push code, safely knowing that the computer More ❯
of their peers. Nice to have Building CI/CD pipelines. Knowledge of deployment, rollout, rollback strategies. Knowledge of observability practices (logging, metrics, tracing) and monitoring tools (e.g. Prometheus, Grafana). Understanding of cloud security best practices, including IAM policies and secret management. Time Off & Work-Life Balance 25 Days Annual Leave + bank holidays - plus the option to buy More ❯
various methods such as unit, integration, contract and E2E testing. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Data Dog, Sentry, etc. You have a strong desire and are an advocate for performant applications. Proactive in solving problems simply and effectively, with an eye for pragmatic solutions. More ❯
help shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Configuration Management Ansible Monitoring and Observability Grafana, Prometheus Kubernetes (building and managing production clusters) Terraform (IaC provisioning) GitHub Actions (CI/CD pipelines) What They’re Looking For Experience in AWS cloud infrastructure (ideally in a regulated More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
up Experience Strong cloud skills (AWS, GCP, Azure) and containerisation (Docker, Kubernetes) Experience in automating deployments and orchestrating cloud environments Nice to have: Python (Jupyter, PyTorch), monitoring tools (Prometheus, Grafana), cloud databases (RDS, Aurora, Spanner), CI/CD tools (CircleCI), and data visualisation experience. This is a unique opportunity to join a visionary team redefining AI in 3D , with the More ❯
Employment Type: Full-Time
Salary: £140,000 - £160,000 per annum, Inc benefits
Docker, Kubernetes, Terraform). Skilled in cloud platforms (AWS, GCP, or Azure). Bonus points for: Familiarity with ML frameworks (PyTorch, Jupyter). Knowledge of cloud monitoring tools (Prometheus, Grafana). Experience with cloud-based databases (RDS, Aurora, Redshift, Spanner, etc.). Exposure to CI/CD pipelines (e.g., CircleCI). What's on offer: The chance to shape the More ❯
Operate, Elasticsearch, etc.) Backend Development: Java 21, Spring Boot 3.x, Kafka Frontend Development: Angular 15+, React 18+, REST APIs CI/CD Tools: Jenkins, Docker, Kubernetes Monitoring & Alerts: Prometheus, Grafana Data Persistence: PostgreSQL, MongoDB, Redis Why Join Us? Be part of a forward-thinking organisation that values diversity, inclusion, and sustainability. Work in a collaborative environment where your ideas and More ❯
Operate, Elasticsearch, etc.) Backend Development: Java 21, Spring Boot 3.x, Kafka Frontend Development: Angular 15+, React 18+, REST APIs CI/CD Tools: Jenkins, Docker, Kubernetes Monitoring & Alerts: Prometheus, Grafana Data Persistence: PostgreSQL, MongoDB, Redis Why Join Us? Be part of a forward-thinking organisation that values diversity, inclusion, and sustainability. Work in a collaborative environment where your ideas and More ❯
GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness in More ❯
/JavaScript, C# Libraries and Frameworks: React, Next.js, Node.js, .NET Testing: Vitest, Playwright, Pact, K6 Datastores: PostgreSQL, CosmosDB, Redis Infrastructure and DevOps: GitHub Actions, Azure, Kubernetes, Docker, Terraform Monitoring: Grafana, Azure App Insights While familiarity with our full technology stack is desirable, it is by no means required. Required Skills & Experience Proven experience in solution architecture, technical leadership, or senior More ❯
AWS Lambda, Azure Functions); Experience with CI/CD pipelines and automation tools (e.g., GitHub Actions,Azure DevOps); Ability to switch between multiple languages and paradigms effectively; Familiarity with Grafana is a plus, as monitoring will become a growing focus in the project. You'll be: Self-motivated, proactive and continually looking for ways to improve and develop yourself; A More ❯
cloud architecture IoT 'smart' edge devices (using nVidia AI chips) Linux-based embedded OS on our Edge devices Continuous Integration and Delivery using Jenkins, SonarQube Terraform for infrastructure management Grafana, Elasticsearch, Kibana & New Relic for metrics, logs and monitoring In the company we also use: VueJS, MySQL, Spring Boot, Apache Camel, AWS Redshift, AWS SageMaker, Pentaho, Balena, Serverless functions Winnow More ❯
automation workflows. Containerization : Docker and containerized application deployment. Cloud : AWS experience supporting ML workloads. CI/CD & Automation : ArgoCD, GitHub Actions, Infrastructure-as-Code (Terraform). Monitoring & Observability : Prometheus, Grafana, cloud-native stacks. ML Lifecycle: Production experience with experimentation, training, deployment, versioning, and monitoring. Reliability & Support : On-call participation, incident response, and system optimization. The ideal candidate will be a More ❯
of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. More ❯
bash/python) and configuration management (Ansible, Salt) experience building tooling and drive automation with an IAC mindset Experience with open source monitoring and metrics collections (Nagios, TICK, Prometheus, Grafana, etc.) Comfortable operating in Linux based environments Production operations: incident response, triaging and troubleshooting Excellent verbal and written communication skills preferred as the team interfaces directly with senior stakeholders, external More ❯
management experience Tech Stack AWS, Terragrunt, Terraform, EKS, Helm, ArgoCD, Docker, Gitlab SaaS, Gitlab CI, Victoria Metrics (Prometheus compatible), Vault, Clickhouse, PostgreSQL, MariaDB/MySQL, MongoDB, KeyCloak, ELK logging, Grafana Legacy (migrating from): Kubernetes (Bare Metal), Ceph, Jenkins, Gitlab On-Premises AI Usage Disclaimer At FXC Intelligence, we are enthusiastic about the use of AI tools and value candidates with More ❯
cloud networks, such as AWS, GCP, Azure Familiarity with configuration management tools, such as Ansible or Salt is desirable to support zero-touch network management Familiarity with Python, Prometheus, Grafana, ELK, GitHub is desirable Understanding of basic power consumption and cooling issues in a data center environment Knowledge of fiber optics technology and cabling standards ranging from 1 to More ❯
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing and More ❯
with PostgreSQL or similar databases, including writing queries for validation and verifying data integrity. Experience testing applications running in Kubernetes environments. Familiarity with using monitoring and observability tools like Grafana to support test analysis and validation. Experience troubleshooting and supporting customers with product features, including investigating issues and providing technical guidance. Bias for action and problem solving - eagerness to take More ❯
Good client facing skills and problem solving aptitude DevOps knowledge of SQL Oracle DB Postgres ActiveMQ Zabbix Ambari Hadoop Jira Confluence BitBucket ActiviBPM Oracle SOA Azure SQLServer IIS AWS Grafana Oracle BPM Jenkins Puppet CI and other cloud technologies. All profiles will be reviewed against the required skills and experience. Due to the high number of applications we will only More ❯