Leatherhead, Surrey, England, United Kingdom Hybrid / WFH Options
Avanti
Track record of leading teams or projects, this could be as a Tech Lead/Principal Engineer or Engineering Manager Exposure to incident management, monitoring and resilience tools (Prometheus, Grafana, ELK etc) Awareness of Security – dependency scanning, vulnerability management Strong communication skills, able to collaborate with stakeholder, present updates and represent engineering in wider discussions This role will have an More ❯
pipelines GitLab/GitHub Actions to enable rapid and reliable software delivery Manage and scale containerized environments using Kubernetes (RH OpenShift) Maintain integration with monitoring and logging tools (Prometheus, Grafana, ELK stack) Work with a software development team to troubleshoot, build and deploy to environmental issues. Integrate security best practice into DevOps lifecycle including vulnerability scanning, secrets management and code More ❯
pipelines GitLab/GitHub Actions to enable rapid and reliable software delivery Manage and scale containerized environments using Kubernetes (RH OpenShift) Maintain integration with monitoring and logging tools (Prometheus, Grafana, ELK stack) Work with a software development team to troubleshoot, build and deploy to environmental issues. Integrate security best practice into DevOps lifecycle including vulnerability scanning, secrets management and code More ❯
pipelines GitLab/GitHub Actions to enable rapid and reliable software delivery Manage and scale containerized environments using Kubernetes (RH OpenShift) Maintain integration with monitoring and logging tools (Prometheus, Grafana, ELK stack) Work with a software development team to troubleshoot, build and deploy to environmental issues. Integrate security best practice into DevOps lifecycle including vulnerability scanning, secrets management and code More ❯
pipelines GitLab/GitHub Actions to enable rapid and reliable software delivery Manage and scale containerized environments using Kubernetes (RH OpenShift) Maintain integration with monitoring and logging tools (Prometheus, Grafana, ELK stack) Work with a software development team to troubleshoot, build and deploy to environmental issues. Integrate security best practice into DevOps lifecycle including vulnerability scanning, secrets management and code More ❯
pipelines GitLab/GitHub Actions to enable rapid and reliable software delivery Manage and scale containerized environments using Kubernetes (RH OpenShift) Maintain integration with monitoring and logging tools (Prometheus, Grafana, ELK stack) Work with a software development team to troubleshoot, build and deploy to environmental issues. Integrate security best practice into DevOps lifecycle including vulnerability scanning, secrets management and code More ❯
Ansible Linux Administration Redhat family OS, including RHEL, Alma and some legacy CentOS Core internet applications protocols DHCP/DNS Monitoring Systems Icinga2/Elastic Stack/InfluxDB/Grafana Application and network security best practices SSH/Iptables/TLS AWS (EC2/VPS/RDS/EKS/S3) Terraform Databases PostgreSQL/MySQL CI/CD and More ❯
Ansible Linux Administration – Redhat family OS, including RHEL, Alma and some legacy CentOS Core internet applications protocols – DHCP/DNS Monitoring Systems – Icinga2/Elastic Stack/InfluxDB/Grafana Application and network security best practices – SSH/Iptables/TLS AWS (EC2/VPS/RDS/EKS/S3) Terraform Databases – PostgreSQL/MySQL CI/CD and More ❯
highly valued skills may include: Consumer-Driven Contract Testing experience with tools such as Pact, Spring Cloud Contract. Experience in Cell-Based Architecture. Observability Engineering: Tools & Practices OpenTelemetry, Prometheus, Grafana, distributed tracing, structured logging, service level indicator's (SLI) service level objective (SLO). You may be assessed on the key critical skills relevant for success in role, such as More ❯
Glasgow, City of Glasgow, United Kingdom Hybrid / WFH Options
Lorien
What We're Looking For Proven experience in SQL performance tuning and query optimisation Familiarity with performance testing tools (e.g., JMeter, k6) Experience with observability platforms (e.g., Azure Monitor, Grafana) Strong problem-solving skills and a collaborative mindset Ability to develop and interpret MI for decision-making Bonus Skills Experience in financial services or data-heavy enterprise environments Knowledge of More ❯
Glasgow, Lanarkshire, Scotland, United Kingdom Hybrid / WFH Options
Lorien
What We're Looking For Proven experience in SQL performance tuning and query optimisation Familiarity with performance testing tools (e.g., JMeter, k6) Experience with observability platforms (e.g., Azure Monitor, Grafana) Strong problem-solving skills and a collaborative mindset Ability to develop and interpret MI for decision-making Bonus Skills Experience in financial services or data-heavy enterprise environments Knowledge of More ❯
Experience working in Agile environments Strong understanding of Site Reliability Engineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Understanding of More ❯
CI/CD pipelines Hands-on experience with infrastructure-as-code (e.g. Terraform) Deep understanding of security best practices in cloud and application delivery Exposure to observability tooling (Prometheus, Grafana, structured logging, etc.) Confident debugging and resolving issues in complex distributed systems Background in B2B SaaS web applications, with familiarity in Node a plus Able to operate autonomously within small More ❯
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom
Randstad Digital
university. Excellent communication skills for engaging business stakeholders, end-users, and technologists. ITIL certification (or equivalent ITIL framework experience). Technical expertise in: Databases & design: SQL Server Monitoring tools: Grafana, Prometheus, Victoria Metrics Scheduling tools: Control-M Operating systems: Windows, Linux Containerisation & cloud: Kubernetes, Azure Collaboration tools: JIRA, Git, Bitbucket This is a fantastic opportunity to work on impactful projects More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Huxley
or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days depending More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Huxley Associates
or ARM templates Hands-on experience with CI/CD pipelines (e.g., Bitbucket, Azure DevOps) API Gateway, Azure API Management (APIM), Azure Application Gateway Monitoring tools such as Prometheus, Grafana, and Azure Monitor Understanding of secure multi-region deployments and network segmentation Remote Working Expected to be in the office 1 to 2 days a week. With additional days depending More ❯
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM/incident management processes More ❯
to streamline monitoring, alerting, and recovery workflows. Knowledge of FIX, market data, and order routing protocols in a trading environment. Exposure to observability platforms such as ITRS Geneos, Prometheus, Grafana, or custom telemetry stacks. Comfortable working across Linux systems, hybrid infrastructure, and global production environments. Excellent communication and reporting skills, with ability to translate technical data into actionable business insights. More ❯
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? 25 days holiday + bank holidays Up to 5% employer pension contribution Educational More ❯
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
to streamline monitoring, alerting, and recovery workflows. Knowledge of FIX, market data, and order routing protocols in a trading environment. Exposure to observability platforms such as ITRS Geneos, Prometheus, Grafana, or custom telemetry stacks. Comfortable working across Linux systems, hybrid infrastructure, and global production environments. Excellent communication and reporting skills, with ability to translate technical data into actionable business insights. More ❯
Sheffield, South Yorkshire, England, United Kingdom
KBC Technologies UK LTD
storage, and security configurations. Perform upgrades and patching of OpenShift clusters and associated components. Monitoring and Optimization: Implement monitoring solutions for KVM and OpenShift environments using tools like Prometheus, Grafana, or ELK Stack. Analyze system performance and recommend optimizations for resource utilization and cost efficiency. Collaboration and Documentation: Work closely with development, DevOps, and operations teams to ensure seamless integration More ❯