Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
functional requirements into working software alongside your team Collaborate with the team to analyze, debug, and resolve defects Demonstrate a commitment to monitoring and observability Manage technical debt effectively by avoiding its creation and removing it when possible Communicate clearly, translating technical and non-technical requirements as needed Understand timelines More ❯
delivering software features into production, ideally in a B2B SaaS or data-rich environment. Dedicated to driving best practise within the SDLC, including quality, observability, CI/CD, SOLID and Design Patterns. Strong background in software engineering with hands-on experience in developing, evaluating, and deploying complex systems. Proficiency with More ❯
and collaborate with Product, Data, and Artist Relations to translate business goals into resilient software. 5% Champion DevEx, proposing improvements to CI/CD, observability, and performance. You'll be successful here if you have 7+ years professional experience (at least 3 in a senior/lead capacity) delivering production More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Gotobeat
and collaborate with Product, Data, and Artist Relations to translate business goals into resilient software. 5% Champion DevEx, proposing improvements to CI/CD, observability, and performance. You'll be successful here if you have 7+ years professional experience (at least 3 in a senior/lead capacity) delivering production More ❯
Accreditation Council for Graduate Medical Education
collaboration such as GitHub, ArgoCD, or similar. Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases. Experience using observability tools such as APM, logging, and metrics to assist with debugging issues. Experience using Infrastructure as Code tools for provisioning infrastructure such as Terraform, Cloudformation More ❯
dynamics, competition, and peer group activities Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE, etc.), and articulate a path toward a target operating model (people, process, and tools) REQUIRED SKILLS Strong leadership skills are essential for More ❯
dynamics, competition, and peer group activities Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE, etc.), and articulate a path toward a target operating model (people, process, and tools) REQUIRED SKILLS Strong leadership skills are essential for More ❯
such as Dagster , Airflow , or Prefect Knowledge of CI/CD best practices and tools (e.g. GitHub Actions, Jenkins, CodePipeline) Exposure to monitoring and observability tools for ML systems (e.g. Prometheus, Grafana, DataDog, WhyLabs, Evidently, etc.) Experience in building parallelised or distributed model inference pipelines Nice-to-Have Skills Familiarity More ❯
paced environment where you can make a real difference, we want to hear from you! What You'll Do: Develop and implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. More ❯
SAP Sovereign Cloud Automation & Tooling team and contribute to the Quality Engineering efforts for the Operations Control Plane (OCP) -a foundational capability responsible for observability, compliance, and policy enforcement in SAP's regulated cloud environments. You will support the test automation strategy for a WebApp architecture powered by Node.js and More ❯
enough knowledge to be able to fault find and identify the root cause. Core Responsibilities involved: Incident management Application design and development Site reliability (observability, alerting, high-availability, self-healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm More ❯
dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are More ❯
dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are More ❯
legal compliance of data in data engineering. Optimise data pipelines and queries for better performance and cost-efficiency. Integrate data pipelines with monitoring and observability to proactively detect and resolve issues before they impact business operations. Design and build data models for lake house storage and analytics. Maintain CI/ More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
such as Jenkins, GitLab CI/CD, or CircleCI. Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture. Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, Cloudwatch). Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems. Experience of Incident … advantage if you also have: Hands-on experience of working with large Kubernetes Cluster. Certification will be an added plus. Working experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with More ❯
of title, we are committed to achieving ambitious goals and we have fun celebrating our wins. We are looking for a self-motivated Senior Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company that is leading the way in enterprise planning. We look for … people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Senior Observability Engineer, you will be designing and improving our approach to collecting and analyzing Observability telemetry (Logs, Metrics and Traces) and visualizing it in Grafana Cloud. … You will implement best observability practices to enable engineers across the business to track service performance and interaction in a scalable, performant, and cost-effective manner. What you'll be doing: In this role, working a minimum of 2 days a week in our York Office, you will be: Work More ❯
pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Kubernetes Helm Kops Ingress/Egress Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat Skills working in a secure environment and ability to adhere to security principles Experience in support organisation More ❯
infrastructure using Terraform to manage Azure resources. Improve CI/CD pipelines in Azure DevOps for efficient testing and deployment. Set up monitoring and observability tools (Azure Monitor, App Insights) to maintain system health. Lead load testing, performance tuning, and incident response for optimal system performance. Collaborate with developers, data More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
InterQuest Group (UK) Limited
they work on, from ideation through to development, testing and deployment, so you should expect to champion and mentor on best practice like TDD, Observability and IaC. Skills: C#, .NET Core, APIs AWS, Docker, Kubernetes, Terraform CI/CD, TDD, SOLID The money is good too - up to £90k plus More ❯
Work closely with analysts, data scientists, and business stakeholders to align data systems with evolving needs Promote engineering best practices around version control, testing, observability, and documentation Guide improvements to data quality, reliability, and governance through policy and tooling Stay current with emerging technologies and make informed recommendations to modernize More ❯
Work closely with analysts, data scientists, and business stakeholders to align data systems with evolving needs Promote engineering best practices around version control, testing, observability, and documentation Guide improvements to data quality, reliability, and governance through policy and tooling Stay current with emerging technologies and make informed recommendations to modernize More ❯
pipelines, DevOps practices, and Infrastructure as Code (Terraform preferred). Strong programming experience in JavaScript, TypeScript, or Python for test automation. Experience working with observability tools (e.g., monitoring logs, distributed tracing, and alerting mechanisms). Ability to define and track key quality metrics to measure and continuously improve the testing More ❯
pipelines, DevOps practices, and Infrastructure as Code (Terraform preferred). Strong programming experience in JavaScript, TypeScript, or Python for test automation. Experience working with observability tools (e.g., monitoring logs, distributed tracing, and alerting mechanisms). Ability to define and track key quality metrics to measure and continuously improve the testing More ❯