in DevOps, cloud infrastructure, and automation. Strong knowledge of CI/CD tooling, IaC, and cloud-native technologies. Advanced scripting (Bash, Python) and automation experience. Skilled in monitoring and observability tools (e.g., Prometheus, Grafana, ELK). Strong problem-solving, communication, and leadership skills. Familiarity and Experience of CI/CD Tools: Jenkins, GitLab CI Infrastructure as Code: Terraform, Ansible, Helm More ❯
automatically deploy updates and fixes into the production environment. Maintenance, troubleshooting: Perform routine application maintenance to ensure the production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL More ❯
NW10, Middlesex, Greater London, United Kingdom Hybrid / WFH Options
ITH Pharma
automatically deploy updates and fixes into the production environment. Maintenance, troubleshooting: Perform routine application maintenance to ensure the production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL More ❯
london (city of london), south east england, united kingdom
rmg digital
Jira, Team City Expert level knowledge of DevOps tools like Bitbucket/GitHub, Sonar Cube, CAST, Team City/Jenkins/Azure DevOps Expert level knowledge of telemetry and observability platforms like ELK stack, Grafana, Kibana, Azure Application Insights, AWS Cloud Watch etc., Scripting languages preferably python, PowerShell Database technologies preferably MS SQL Server, Postgres SQL Infrastructure as code – AWS More ❯
Jira, Team City Expert level knowledge of DevOps tools like Bitbucket/GitHub, Sonar Cube, CAST, Team City/Jenkins/Azure DevOps Expert level knowledge of telemetry and observability platforms like ELK stack, Grafana, Kibana, Azure Application Insights, AWS Cloud Watch etc., Scripting languages preferably python, PowerShell Database technologies preferably MS SQL Server, Postgres SQL Infrastructure as code – AWS More ❯
Jira, Team City Expert level knowledge of DevOps tools like Bitbucket/GitHub, Sonar Cube, CAST, Team City/Jenkins/Azure DevOps Expert level knowledge of telemetry and observability platforms like ELK stack, Grafana, Kibana, Azure Application Insights, AWS Cloud Watch etc., Scripting languages preferably python, PowerShell Database technologies preferably MS SQL Server, Postgres SQL Infrastructure as code – AWS More ❯
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Sopra Steria
scale secure cloud service. Domain orchestration. Developing workflows and tooling to automate processes and operations. Ensuring routine tasks are consistent, repeatable and scalable. Provisioning, managing and optimising infrastructure. Maintaining observability of the platform. Responding to alerts and incidents to ensure the availability of systems, interoperability and applications. Completing root cause analysis. Continually improving systems and processes to improve the efficiency More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
in the team Contribute to solution architecture and strategic technical direction Build, integrate, and maintain REST APIs and backend services Champion best practices in software quality, CI/CD, observability, and DevOps Collaborate with cross-functional teams including Product, QA, and DevOps Optionally take on people management responsibilities for engineers Stay updated with emerging backend and cloud technologies Key Skills More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Sanderson
with IAM engineering experience across authentication, authorisation, single sign-on, multi-factor authentication, identity lifecycle management, OAuth2.0, OpenID Connect, SAML and policy management Knowledge of Site Reliability Engineering, automation, observability, incident management, resilience, disaster recovery, high availability, documentation IAM engineering experience, authentication, authorisation, single sign-on, multi-factor authentication, user lifecycle management, hands on CI/CD approaches and technologies More ❯
Employment Type: Full-Time
Salary: £100,000 - £135,000 per annum, Inc benefits
dashboards, widgets, and live data visualisation Collaborate with QA and DevOps to enhance automated testing and deployment pipelines Lead efforts in securing, scaling, and monitoring the frontend environment Use observability tools (Prometheus, Grafana, Loki) to monitor UI health and performance Drive UI architectural decisions, performance benchmarking, and best practice implementation Skills and Experience Required Degree in Computer Science, Engineering, or More ❯
automation, scalability, and high reliability. A strong working knowledge of Microsoft Azure is essential. The role involves daily coding, technical leadership across orchestration, CI/CD pipelines, cloud services, observability, and security-working alongside site reliability, onboarding, architecture, and delivery functions. You're expected to scale impact through others by upskilling team members, hiring where needed, and championing platform engineering More ❯
models. Desirable: Master's degree or DAMA certification (CDMP).Proven expertise with Workday product and data integrations. Experience across different database architectures (Lakehouse, Relational, Document Store). Familiarity with observability tools, BI platforms (Power BI), or Agile methods. Knowledge of advanced analytics, data science, or NLP techniques. Soft Skills Strong stakeholder management and communication skills, able to bridge technical and More ❯
mindset, from commit to production Collaborate directly with end-users and internal teams to understand needs and deliver value Operate across multi-cloud environments (AWS, GCP, Azure) Drive system observability and reliability with tools like Datadog Help shape our engineering culture by mentoring, sharing knowledge, and encouraging best practices Push boundaries, challenge assumptions, and ensure delivery of meaningful solutions Tech More ❯
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
day-to-day and strategic decision making.You will be a hands-on and customer focused engineering servant-leader. You will be comfortable moving across orchestration, automation, pipelines, cloud services, observability and security domains (even if you are not an expert in them all). A non-negotiable is experience and familiarity with Microsoft Azure.You will play your part in operating More ❯
influencing at all levels. A mindset focused on long-term sustainability and strategic technical thinking. Bonus Points For Fintech or regulated environment experience, particularly investment platforms. Familiarity with modern observability stacks and incident response processes. Experience with security-first architecture and data protection best practices. Why Join? Well-Backed & Ambitious: Backed by a globally recognised financial group with significant investment More ❯
london (city of london), south east england, united kingdom
La Fosse
influencing at all levels. A mindset focused on long-term sustainability and strategic technical thinking. Bonus Points For Fintech or regulated environment experience, particularly investment platforms. Familiarity with modern observability stacks and incident response processes. Experience with security-first architecture and data protection best practices. Why Join? Well-Backed & Ambitious: Backed by a globally recognised financial group with significant investment More ❯
influencing at all levels. A mindset focused on long-term sustainability and strategic technical thinking. Bonus Points For Fintech or regulated environment experience, particularly investment platforms. Familiarity with modern observability stacks and incident response processes. Experience with security-first architecture and data protection best practices. Why Join? Well-Backed & Ambitious: Backed by a globally recognised financial group with significant investment More ❯
Sunbury-On-Thames, London, United Kingdom Hybrid / WFH Options
BP Energy
real-world problems, particularly in scientific and R&D contexts. Key Responsibilities Design, build, and maintain robust ML pipelines and services using modern engineering practices (CI/CD, testing, observability). Collaborate with cross-functional teams to translate business problems into ML solutions. Develop and deploy models for tasks such as classification, regression, recommendation, and natural language processing. Optimise model More ❯
using GCP-native tools and technologies.* Develop capabilities which allow Platform Engineering teams to operate with a DevOps ethos.* Collaborate with development teams to optimize application performance, reliability, and observability on GCP.* Implement and enforce Service Level Objectives (SLOs) and Error Budgets to ensure a balance between reliability and feature development.* Develop and maintain a comprehensive monitoring and alerting platform More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
INTEC SELECT LIMITED
with infrastructure teams to ensure system reliability and operational efficiency Integrate monitoring and logging solutions (e.g., Prometheus, Grafana, ELK) Define strategies for disaster recovery, scaling, and infrastructure resilience Improve observability by enhancing visibility into performance and error metrics Skills and Experience Required 10+ years of backend development experience, including 5+ years in an architectural or engineering leadership role Proven experience More ❯
South East London, London, United Kingdom Hybrid / WFH Options
Stepstone UK
Familiarity with deploying and scaling ML models in the cloud, particularly with AWS and SageMaker Understanding of DevOps processes and tools: CI/CD, Docker, Terraform, and monitoring/observability Bonus: experience with vector databases, semantic search, or event-driven systems like Kafka Additional Information Were a community here that cares as much about your life outside work as how More ❯
and CI/CD workflows (GitLab CI). Write clean, production-grade code in Python (Scala is a bonus). Build infrastructure using Terraform, AWS CloudFormation, or SAM. Drive observability across the platform using Datadog or CloudWatch. Actively mentor Data Engineers and Associates, and lead technical discussions and design sessions. Key requirements: Must-Have: Strong experience with AWS services: Glue More ❯