london, south east england, united kingdom Hybrid / WFH Options
La Fosse
IaC, DevOps, and cloud infrastructure Experience working with or managing teams using graph databases, search technologies, and data pipelines Familiarity with IaC, GitHub Copilot, and modern observability tooling (e.g., Grafana) A strong ability to run skills gaps analysis, utilising delivery data, and an understanding how to use them to improve team effectiveness Experience guiding teams through change, process improvement and More ❯
slough, south east england, united kingdom Hybrid / WFH Options
La Fosse
IaC, DevOps, and cloud infrastructure Experience working with or managing teams using graph databases, search technologies, and data pipelines Familiarity with IaC, GitHub Copilot, and modern observability tooling (e.g., Grafana) A strong ability to run skills gaps analysis, utilising delivery data, and an understanding how to use them to improve team effectiveness Experience guiding teams through change, process improvement and More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
La Fosse
IaC, DevOps, and cloud infrastructure Experience working with or managing teams using graph databases, search technologies, and data pipelines Familiarity with IaC, GitHub Copilot, and modern observability tooling (e.g., Grafana) A strong ability to run skills gaps analysis, utilising delivery data, and an understanding how to use them to improve team effectiveness Experience guiding teams through change, process improvement and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Devonshire Hayes Recruitment Specialists Limited
Relevant certifications (CCNA, CCNP, CompTIA Network+/Security+, Azure Administrator, etc.). Experience with infrastructure automation or scripting (PowerShell, Python). Knowledge of monitoring and observability tools (SolarWinds, PRTG, Grafana, etc.). Experience with ITIL practices and ServiceNow or equivalent ticketing systems. More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Devonshire Hayes
Relevant certifications (CCNA, CCNP, CompTIA Network+/Security+, Azure Administrator, etc.). Experience with infrastructure automation or scripting (PowerShell, Python). Knowledge of monitoring and observability tools (SolarWinds, PRTG, Grafana, etc.). Experience with ITIL practices and ServiceNow or equivalent ticketing systems. Personal Attributes Technically curious, proactive, and solutions oriented. Confident engaging with technical and business stakeholders alike. Comfortable balancing More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Devonshire Hayes
Relevant certifications (CCNA, CCNP, CompTIA Network+/Security+, Azure Administrator, etc.). Experience with infrastructure automation or scripting (PowerShell, Python). Knowledge of monitoring and observability tools (SolarWinds, PRTG, Grafana, etc.). Experience with ITIL practices and ServiceNow or equivalent ticketing systems. Personal Attributes Technically curious, proactive, and solutions oriented. Confident engaging with technical and business stakeholders alike. Comfortable balancing More ❯
london, south east england, united kingdom Hybrid / WFH Options
M-XR
data models (MongoDB, PostgreSQL) Implement asset storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience with production applications Track More ❯
slough, south east england, united kingdom Hybrid / WFH Options
M-XR
data models (MongoDB, PostgreSQL) Implement asset storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience with production applications Track More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
M-XR
data models (MongoDB, PostgreSQL) Implement asset storage, retrieval, and management systems (AWS S3) Build job queue management for async ML workflows (SNS, SQS) Setup application monitoring and logging (CloudWatch, Grafana, Prometheus) Implement CI/CD for application deployment (Bitbucket Pipelines) Create API documentation and developer tools What we are looking for 5+ years backend development experience with production applications Track More ❯
availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis … and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and future demand. … production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and More ❯
availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis … and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and future demand. … production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and More ❯
london (city of london), south east england, united kingdom
BGC Group
availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis … and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and integration problems. Perform capacity planning , scaling, and tuning of Solace infrastructure to meet current and future demand. … production support , preferably in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. Familiarity with Linux/Unix systems and More ❯
and maintain Infrastructure as Code (IaC) using Terraform and Ansible. Design highly reliable, scalable, and secure infrastructure supporting performance-critical workloads. Build proactive monitoring, observability, and alerting with Prometheus, Grafana, Azure Monitor, DataDog, and Dynatrace. Troubleshoot complex system issues spanning applications, networks, and infrastructure. Define platform SLAs, SLOs, and governance standards for self-service use. Collaborate closely with Salesforce DevOps … Ansible, along with scripting in PowerShell, Python, or Bash Experience implementing GitOps workflows and managing platform SLAs, SLOs, and governance standards Familiarity with observability and monitoring tools including Prometheus, Grafana, Azure Monitor, DataDog, or Dynatrace Preferred experience supporting Salesforce DevOps pipelines and working with Java, .NET, or Node.js application environments Exposure to AI/ML platforms, real-time data pipelines More ❯
and maintain Infrastructure as Code (IaC) using Terraform and Ansible. Design highly reliable, scalable, and secure infrastructure supporting performance-critical workloads. Build proactive monitoring, observability, and alerting with Prometheus, Grafana, Azure Monitor, DataDog, and Dynatrace. Troubleshoot complex system issues spanning applications, networks, and infrastructure. Define platform SLAs, SLOs, and governance standards for self-service use. Collaborate closely with Salesforce DevOps … Ansible, along with scripting in PowerShell, Python, or Bash Experience implementing GitOps workflows and managing platform SLAs, SLOs, and governance standards Familiarity with observability and monitoring tools including Prometheus, Grafana, Azure Monitor, DataDog, or Dynatrace Preferred experience supporting Salesforce DevOps pipelines and working with Java, .NET, or Node.js application environments Exposure to AI/ML platforms, real-time data pipelines More ❯
london (city of london), south east england, united kingdom
Hays
mandatory Strong understanding of monitoring, observability, and telemetry (metrics, logs, traces) Ability to translate technical concepts into actionable business requirements Hands-on experience with tools such as Datadog, BigPanda, Grafana would be desirable Excellent stakeholder management skills - including product and engineering teams. If you think this role is a good fit - apply now! Hays Specialist Recruitment Limited acts as an More ❯
mandatory Strong understanding of monitoring, observability, and telemetry (metrics, logs, traces) Ability to translate technical concepts into actionable business requirements Hands-on experience with tools such as Datadog, BigPanda, Grafana would be desirable Excellent stakeholder management skills - including product and engineering teams. If you think this role is a good fit - apply now! Hays Specialist Recruitment Limited acts as an More ❯
new technologies • Networking; good understanding of basic networks Desirable: • Experience of designing bespoke infrastructure solutions • Understanding of 5G/Telecoms background • Understanding of satellite communications networks • NMS Zabbix/Grafana • Firewalls : Fortinet/PFSense/Juniper: install/configuration/maintenance. Understanding of VPNs #LI-FP1 Together, as owners, let’s turn meaningful insights into action. Life at CGI is More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Get Staffed Online Recruitment Limited
with throttling and versioning. Developing durable workflows. Writing efficient and scalable SQL queries , stored procedures, and scripts. Integrating external systems with custom data synchronisation logic. Utilising Open Telemetry and Grafana for logs, metrics, tracing, and alerting across backend services. Contributing to technical design discussions, code reviews, and deployments. What They’re Looking For: Strong experience in C#/.NET backend More ❯
Developer | Node.js TypeScript React | IoT Platform Tech Stack Essential: Node.js, TypeScript, React Nice to Have: Grafana About the Role We're looking for a Senior Software Developer to play a key role in shaping and expanding our innovative IoT Back Office platform. You'll work alongside the Lead Developer and wider product team to drive technical direction, design new features More ❯
at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting Kubernetes and AKS … Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). Background in web applications and distributed systems. Desirable Skills: Knowledge of Microservices Architecture. Familiarity with Kanban. Experience with Puppet or Chef If you’re passionate about Site Reliability More ❯
Senior Performance Tester - Non-functional - 3 month contract - paying £550 Inside IR35 - Hybrid (2 days a week) We’re looking for an experienced Senior Performance Tester to join a high-performing team working on a global streaming platform that powers More ❯
Senior Performance Tester - Non-functional - 3 month contract - paying £550 Inside IR35 - Hybrid (2 days a week) We're looking for an experienced Senior Performance Tester to join a high-performing team working on a global streaming platform that powers More ❯
Senior Performance Tester - Non-functional - 3 month contract - paying £550 Inside IR35 - Hybrid (2 days a week) We’re looking for an experienced Senior Performance Tester to join a high-performing team working on a global streaming platform that powers More ❯
london (city of london), south east england, united kingdom
Arrows
Senior Performance Tester - Non-functional - 3 month contract - paying £550 Inside IR35 - Hybrid (2 days a week) We’re looking for an experienced Senior Performance Tester to join a high-performing team working on a global streaming platform that powers More ❯