London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
up Experience Strong cloud skills (AWS, GCP, Azure) and containerisation (Docker, Kubernetes) Experience in automating deployments and orchestrating cloud environments Nice to have: Python (Jupyter, PyTorch), monitoring tools (Prometheus, Grafana), cloud databases (RDS, Aurora, Spanner), CI/CD tools (CircleCI), and data visualisation experience. This is a unique opportunity to join a visionary team redefining AI in 3D , with the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and scanning into More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and scanning into More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and scanning into More ❯
Telford, Shropshire, West Midlands, United Kingdom Hybrid / WFH Options
Capgemini UK Plc
communication skills Proven team engagement skills Proven coaching skills Quality Driven Adaptable/ability to context switch Stakeholder Management Optional Skills: Kubernetes Micro-services GraalVM Helm Mockito AWS Kibana Grafana Open API gRPC/Protobuf WCAG OAuth2/OpenID Sustainable Software Engineering Cucumber/Gherkin Selenium Agile Scrum Agile Practices Applicant must have SC Clearance or at least be eligible More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
processes to ensure systems are robust, secure and observable. You'll be working with a modern tech stack using Java, Spring Boot, CI/CD, Kubernetes, AWS, EKS and Grafana/Splunk. About you: You have advanced backend software engineering experience with Java, Spring Boot, REST, Postgres, Redis You have experience of running production workloads on Kubernetes (Amazon EKS preferred More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
processes to ensure systems are robust, secure and observable. You'll be working with a modern tech stack using Java, Spring Boot, CI/CD, Kubernetes, AWS, EKS and Grafana/Splunk. About you: You have advanced backend software engineering experience with Java, Spring Boot, REST, Postgres, Redis You have experience of running production workloads on Kubernetes (Amazon EKS preferred More ❯
East London, London, United Kingdom Hybrid / WFH Options
Client Server
processes to ensure systems are robust, secure and observable. You'll be working with a modern tech stack using Java, Spring Boot, CI/CD, Kubernetes, AWS, EKS and Grafana/Splunk. About you: You have advanced backend software engineering experience with Java, Spring Boot, REST, Postgres, Redis You have experience of running production workloads on Kubernetes (Amazon EKS preferred More ❯
woburn, massachusetts, united states Hybrid / WFH Options
Knox Systems
/no-code platform operations*. *Key ResponsibilitiesIncident Management & System Troubleshooting* * Perform advanced troubleshooting for infrastructure, OS, and application issues. * Analyze system logs, metrics, and telemetry from monitoring platforms (Grafana, Datadog, Wiz, CloudWatch). * Coordinate with Platform/DevOps Engineers on root cause analysis and long-term remediation. * Ensure timely resolution of escalated incidents in accordance with SLAs. *Cloud Administration More ❯
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing and More ❯
Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level More ❯
London, England, United Kingdom Hybrid / WFH Options
Cint
Terraform, Kubernetes, Docker, Packer, Ansible and Jenkins. We support applications and services written in Golang, Python, Java, Scala and .Net. We monitor and alert on everything we deploy via Grafana, Prometheus, Graphite and ELK stacks. The team holds itself accountable to a high standard of build quality. We have recently completed the first major phase of a completely green-field … Actions etc.) You have a grasp of “cloud native” and 12-Factor applications You have good knowledge of monitoring and alerting using one or more of: Graphite, Statsd, Prometheus, Grafana, PagerDuty You have expertise in at least one scripting or programming language (Python, Bash, Ruby, Node, Golang, Java) Bonus Points If You Have You have good knowledge of the network More ❯
Glasgow, City of Glasgow, United Kingdom Hybrid / WFH Options
Lorien
What We're Looking For Proven experience in SQL performance tuning and query optimisation Familiarity with performance testing tools (e.g., JMeter, k6) Experience with observability platforms (e.g., Azure Monitor, Grafana) Strong problem-solving skills and a collaborative mindset Ability to develop and interpret MI for decision-making Bonus Skills Experience in financial services or data-heavy enterprise environments Knowledge of More ❯
Glasgow, Lanarkshire, Scotland, United Kingdom Hybrid / WFH Options
Lorien
What We're Looking For Proven experience in SQL performance tuning and query optimisation Familiarity with performance testing tools (e.g., JMeter, k6) Experience with observability platforms (e.g., Azure Monitor, Grafana) Strong problem-solving skills and a collaborative mindset Ability to develop and interpret MI for decision-making Bonus Skills Experience in financial services or data-heavy enterprise environments Knowledge of More ❯
West London, London, United Kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
london, south east england, united kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
west london, south east england, united kingdom Hybrid / WFH Options
Staffworx Limited
End development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated More ❯
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
Database & Application Support – Oracle, SQL Server, PostgreSQL Scripting & Automation – Bash, PowerShell, Python Networking Protocols – TCP/IP, VPN, VLANs, Subnetting, Firewalls, Routing/Switching System Monitoring & Management – Nagios, Zabbix, Grafana, Prometheus, SolarWinds Rewards & Benefits TCS is consistently voted a Top Employer in the UK and globally. Our competitive salary packages feature pension, health care, life assurance, laptop, phone, access to More ❯
South East London, London, United Kingdom Hybrid / WFH Options
Stepstone UK
architecture - Principles: TDD, Agile, Pair Programming - CI/CD: Git, Docker, Bamboo - Cloud: AWS, Lambda, ECS - Databases/Storage: Postgres , Dynamo, DocumentDb, OpenSearch/Elastic, Redis - Monitoring: Cloudwatch, Kibana, Grafana, DataDog Qualifications Experience of working with the following tech; C# .NET (8+), Terraform, Typescript, AWS, CI/CD Dedication to high quality, testable and maintainable code and super passionate about More ❯
deliver under pressure across multiple priorities. Requirements Essentials - Java 17 version preferred, Springboot, Microservices, AWS, Maven, Gradle, JPA, JMS, Junit, Bamboo, Stash, IntelliJ Good to have - ArgoCD, Kubernetes, Docker, Grafana, Splunk Nice to have - SonarQube Ability to work in small teams and strong communication skills Comm skills are very important. As Macquarie has small teams, developer who can work independently More ❯
engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting global trading systems More ❯