DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira More ❯
estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI … present monitoring solutions and technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application More ❯
Front Office Infrastructure & Application Support Specialist Business Area:Global Markets - Front Office Production Support(Global Banking & Markets/Corporate & Institutional Banking)Job Type:Mission:As part of the Front Office Support team, you will provide technical and functional support to More ❯
Farnborough, Hampshire, United Kingdom Hybrid / WFH Options
Searchability
configuration and orchestration best practices Develop scalable, secure infrastructure using Terraform and Ansible Evangelise GitOps and support deployment automation Monitor and improve platform performance using tools like Prometheus and Grafana Provide technical oversight and guidance to cross-functional teams Stay ahead of emerging tech trends to enhance platform capabilities WHAT I'M LOOKING FOR 5+ years' experience in Platform, DevOps … Terraform , Ansible , and CI/CD tooling (e.g., Jenkins, GitLab CI/CD) Solid understanding of Git and version control best practices Experience with monitoring tools like Prometheus and Grafana Comfortable in fast-paced, agile environments Excellent communication and problem-solving skills Active SC or DV clearance required NICE TO HAVE Experience with cloud platforms (AWS, Azure, GCP) GitOps mindset More ❯
environment. Experience working with distributed systems over wide-area networks, with an understanding of networking, latency, and failover strategies. Solid experience with system monitoring and alerting tools (e.g., Prometheus, Grafana). Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux/Unix systems and scripting (Bash, Python, etc. More ❯
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
.NET framework knowledge (C#,ASP.NET Core etc..) Expertise in Windows Server administration Database administration (SQL Server primarily) Ability to instrument and consume monitoring and observability tools (Application Insights, Prometheus, Grafana) Experience using PowerShell, Azure CLI, and Bash for automation tasks Previous experience with Azure DevOps, Jenkins, GitHub Actions, or similar tools Containerisation and orchestration (Docker, Kubernetes) Knowledge of security best More ❯
with AWS. Automation experience (e.g., Terraform, Ansible, CI/CD tools). Strong understanding of infrastructure and cloud architecture. Experience supporting production environments. Familiarity with observability tools (e.g., Prometheus, Grafana, CloudWatch). Excellent problem-solving and communication skills. Desirable Experience working in a fast-paced or agile development environment. Familiarity with container technologies (e.g., Docker, Kubernetes). Previous experience in More ❯
Systems Puppet/Ansible Source Control - Git Linux Administration CentOS ideally, but any Linux/Unix Virtualization VmWare/Veeam Monitoring Systems Icinga2/Elastic Stack/InfluxDB/Grafana Core internet applications protocols DHCP/DNS Scripting Python/Perl/Bash Application and network security best practices SSH/Iptables/TLS Hardware management Server racking/Storage More ❯
Systems – Puppet/Ansible Source Control - Git Linux Administration – CentOS ideally, but any Linux/Unix Virtualization – VmWare/Veeam Monitoring Systems – Icinga2/Elastic Stack/InfluxDB/Grafana Core internet applications protocols – DHCP/DNS Scripting – Python/Perl/Bash Application and network security best practices – SSH/Iptables/TLS Hardware management – Server racking/Storage More ❯
Linux/Unix systems administration combined with strong SQL skills and proficiency in scripting languages such as Python or Java.* Demonstrated experience with monitoring and observability tools including Prometheus, Grafana, Splunk, Geneos, OpenTelemetry or Corvil is highly desirable.* Familiarity with cloud platforms as well as containerisation technologies like Kubernetes or Docker alongside CI/CD pipeline management is important for More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
MediaKind group
other Scrum ceremonies to ensure smooth project execution. Deployment Tools: Implement and manage deployment processes using Docker, Helm, Kubernetes, and VMs. Operational Platforms: Monitor and optimize operational environments using Grafana and Elastic Search. Cloud Deployment: Leverage tools such as Ansible, Terraform, Cloud API, OpenStack, OpenShift, and public cloud services for cloud deployment. Verification Tools: Use Jenkins and Azure pipelines for … the technologies below. Education: Bachelor's degree in Computer Science, Software Engineering, or a related field. Deployment Experience: Familiarity with Docker, Helm, Kubernetes, and VMs. Operational Knowledge: Experience with Grafana and Elastic Search. Cloud Tools: Understanding of Ansible, Terraform, Cloud API, OpenStack, OpenShift, and public cloud environments. Verification Tools: Experience with Jenkins and Azure pipelines. Configuration Management: Proficiency in Git More ❯
infrastructure as code.? Good understanding of networking and network protocols. Desirable: Experience with scripting or programming in Python.? Experience working with Amazon Web Services (AWS).? Experience working with Grafana (LGTM) and Prometheus.? Experience working with highly available and distributed infrastructure. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
BOSS Professional Services LTD
the customer base and product offering. For the SRE Engineer role we are seeking: Technology stack: Kubernetes, MySQL, PostgreSQL, PHP, Python, Docker, AWS Lambda, AWS, Redis, ELK, monitoring: Prometheus, Grafana or Loki You have previous experience of working within SRE capacity or experience in DevOps and interest in moving into that field. Be responsible for the production environment. Improve the More ❯
Ansible Linux Administration redhat family OS, including RHEL, Alma and some legacy CentOS Core internet applications protocols DHCP/DNS Monitoring Systems Icinga2/Elastic Stack/InfluxDB/Grafana Application and network security best practices SSH/Iptables/TLS AWS (EC2/VPS/RDS/EKS/S3) Terraform Databases PostgreSQL/MySQL CI/CD and More ❯
Ansible Linux Administration – redhat family OS, including RHEL, Alma and some legacy CentOS Core internet applications protocols – DHCP/DNS Monitoring Systems – Icinga2/Elastic Stack/InfluxDB/Grafana Application and network security best practices – SSH/Iptables/TLS AWS (EC2/VPS/RDS/EKS/S3) Terraform Databases – PostgreSQL/MySQL CI/CD and More ❯
on git-based commercial source control or similar (e.g., AzureDevOps, github including Actions, gitlab, bitbucket etc). Good to have Ideally, developing/configuring and publishing dashboards (ideally via Grafana or PowerBI). Ideally, Infrastructure as a code with Cloud formation/ARM templates, Terraform and Ansibl. Ideally, Linux Server Administration including container technology & ecosystem (docker, Kubernetes, Prometheus) linked to More ❯
variety of CI/CD tools and technologies (e.g., Git, Gitlab, Jenkins, GCP, AWS) Knowledge of containerisation and microservice architecture Ability to develop dashboard UIs for publishing performance (e.g., Grafana, Apache Superset, etc.) Exposure to safety certification standards and processes We provide: Competitive salary, benchmarked against the market and reviewed annually Company share programme Hybrid and/or flexible work More ❯
CD pipelines (Jenkins, GitLab CI/CD or similar) * Configuring Kubernetes clusters for secure, scalable deployments * Building automation across infrastructure provisioning and testing * Implementing monitoring and alerting (e.g., Prometheus, Grafana) * Managing repositories and version control (Git) * Driving SRE practices around performance, resilience, and supportability * Working closely with dev teams to integrate platform tooling into workflows * Supporting infrastructure security, maintainability and …/SRE/DevOps roles * Strong Kubernetes experience (config and deployment) * Deep CI/CD experience - Jenkins, GitLab CI/CD or similar * Skilled with infra observability tooling (Prometheus, Grafana, etc.) * Confident with Git and repo management workflows * Strong automation mindset - reducing manual intervention wherever possible * Cloud experience (AWS, Azure or GCP) * Must be a sole UK national and eligible More ❯
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Randstad Technologies
or private cloud platforms Proficient in Infrastructure as Code - Ansible, Terraform Skilled in CI/CD tools Solid scripting skills - PowerShell, Python, or equivalent Experience with monitoring tools - Prometheus, Grafana, Kibana Please note: Active SC Clearance is essential Hybrid working - Farnborough-based Day Rate: £450-£550/day Duration: 6 months | Inside IR35 If this seems of interest to you More ❯
support highly available telephony solutions using AudioCodes and Oracle SBCs Develop scripts, tools, and APIs to improve SIP routing, call flows, and automation Integrate telephony with monitoring platforms like Grafana and ThousandEyes Collaborate with carriers to support SIP infrastructure and hybrid voice networks Contribute to hybrid cloud telephony solutions across UCaaS and CCaaS platforms Participate in Agile sprints and support More ❯
requirements ). Preferred Qualifications : Certifications in GCP Familiarity with Azure DevOps Pipelines is a plus. Experience with multi-cloud and hybrid cloud environments. Experience with Elastic (or OpenSearch) and Grafana Knowledge of ServiceNOWfor change management and incident management. Familiarity with observability tools and practices for 24x7x365 monitoring and alerting. Identity and Access Management experience is a plus for this role More ❯
Bracknell, Berkshire, United Kingdom Hybrid / WFH Options
Techex
Experience of public cloud platform architecture/design CCNP or higher/equivalent non-cisco qualification (Routing & Switching or Data-Centre/SDN) Experience with either Influx, Redis, Kafka, Grafana, Kibana Our Values and Benefits We have secured Great Place to work accreditation for the past two years and we seek out individuals who enjoy developing their professional skills, are More ❯
Watford, Hertfordshire, South East, United Kingdom
La Fosse
Hertfordshire office. In the role you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/Grafana or equivalent), and building dashboards that clearly communicate metrics to business stakeholders You'll drive system automation and integration, crafting scripts and work flows-primarily in Python—to on-board More ❯