access control (802.1x, RADIUS), or zero-trust security concepts. Exposure to infrastructure-as-code (Terraform, Ansible) and version control systems (Git). Experience with monitoring and observability tools (LogicMonitor, Grafana, Prometheus). Knowledge of hybrid cloud networking, including AWS Direct Connect or GCP Interconnect. Relevant certifications such as CCNP, AWS Advanced Networking Specialty, or Google Cloud Network Engineer. More ❯
of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An understanding of observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with some AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing More ❯
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level More ❯
Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level More ❯
and reduce manual work. Some other highly valued skills may include: Using Percona, ClusterControl, CI/CD tools, and automation platforms like Ansible or Chef. Monitoring systems with Prometheus, Grafana, ELK stack, and running containers with Kubernetes. Building APIs with FastAPI and supporting scalable, high-performance systems. You may be assessed on the key critical skills relevant for success in More ❯
London, England, United Kingdom Hybrid / WFH Options
Cint
Terraform, Kubernetes, Docker, Packer, Ansible and Jenkins. We support applications and services written in Golang, Python, Java, Scala and .Net. We monitor and alert on everything we deploy via Grafana, Prometheus, Graphite and ELK stacks. The team holds itself accountable to a high standard of build quality. We have recently completed the first major phase of a completely green-field … Actions etc.) You have a grasp of “cloud native” and 12-Factor applications You have good knowledge of monitoring and alerting using one or more of: Graphite, Statsd, Prometheus, Grafana, PagerDuty You have expertise in at least one scripting or programming language (Python, Bash, Ruby, Node, Golang, Java) Bonus Points If You Have You have good knowledge of the network More ❯
infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
london (city of london), south east england, united kingdom
Hunter Bond
infrastructure automation and configuration management tools (Chef, Puppet, or Ansible) Exposure to distributed storage systems and related protocols Experience with observability and monitoring tools (Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Strong written and verbal communication skills Demonstrated ability to learn quickly and adapt to evolving technologies Ability to work effectively in a fast-paced, collaborative environment jhayne@hunterbond.com More ❯
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside More ❯
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside More ❯
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside More ❯
london (city of london), south east england, united kingdom
Staffworx
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside More ❯
West London, London, United Kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
west london, south east england, united kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
london, south east england, united kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
South East London, London, United Kingdom Hybrid / WFH Options
Stepstone UK
architecture - Principles: TDD, Agile, Pair Programming - CI/CD: Git, Docker, Bamboo - Cloud: AWS, Lambda, ECS - Databases/Storage: Postgres , Dynamo, DocumentDb, OpenSearch/Elastic, Redis - Monitoring: Cloudwatch, Kibana, Grafana, DataDog Qualifications Experience of working with the following tech; C# .NET (8+), Terraform, Typescript, AWS, CI/CD Dedication to high quality, testable and maintainable code and super passionate about More ❯
deliver under pressure across multiple priorities. Requirements Essentials - Java 17 version preferred, Springboot, Microservices, AWS, Maven, Gradle, JPA, JMS, Junit, Bamboo, Stash, IntelliJ Good to have - ArgoCD, Kubernetes, Docker, Grafana, Splunk Nice to have - SonarQube Ability to work in small teams and strong communication skills Comm skills are very important. As Macquarie has small teams, developer who can work independently More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Understanding Recruitment
engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global trading systems More ❯