Bristol, England, United Kingdom Hybrid / WFH Options
Unify Talent - IT, Digital & Tech Recruitment
improving AWS architecture (incl. ECS, EC2, S3, Lambda, CloudWatch) – Own and implement Infrastructure as Code (Terraform preferred) – Improve and maintain CI/CD pipelines to optimise deployment workflows – Develop backend services with observability, traceability, and scalability in mind – Design efficient, secure and reliable infrastructure patterns for our cloud-native platform – Help define best practices across DevOps, cloud-native architecture, and developer environments – Contribute to containerisation More ❯
tools (Pulumi/Terraform) Collaborate with analytics, ML, product, and engineering teams to scale data solutions Ensure data quality, lineage, and governance through testing and monitoring Work with cost observability tools to ensure efficient platform usage Essential Proven experience working with cloud-based data platforms (Snowflake, Redshift, BigQuery, etc.) Proficiency in Python and SQL for automation and analytics Experience with More ❯
control systems (Git) and CI/CD pipelines. Understanding of database systems and data pipeline architectures. Ability to troubleshoot and optimize complex production systems under load. Experience with monitoring, observability, and performance profiling tools. Strong communication and collaboration skills for working with ML researchers and data scientists. Nice to have: Python experience for interfacing with ML frameworks Experience with real More ❯
Charlton, England, United Kingdom Hybrid / WFH Options
Compare the Market
prototypes into high-quality production systems Platform & Engineering Standards • Contribute to the design and evolution of our internal ML platform and tooling • Champion best practices in CI/CD, observability, reproducibility, and infrastructure-as-code for ML • Ensure all deployed systems meet requirements for resilience, testing, security, and performance • Influence and contribute to shared frameworks, libraries, and deployment pipelines Strategy More ❯
London, England, United Kingdom Hybrid / WFH Options
BGL Group
prototypes into high-quality production systems Platform & Engineering Standards • Contribute to the design and evolution of our internal ML platform and tooling • Champion best practices in CI/CD, observability, reproducibility, and infrastructure-as-code for ML • Ensure all deployed systems meet requirements for resilience, testing, security, and performance • Influence and contribute to shared frameworks, libraries, and deployment pipelines Strategy More ❯
Desired Skills & Experience: Proven experience delivering software features into production, ideally in a B2B SaaS or data-rich environment. Dedicated to driving best practice within the SDLC, including quality, observability, CI/CD, SOLID and Design Patterns. Strong background in software engineering with hands-on experience in developing, evaluating, and deploying complex systems. Proficiency with relevant programming languages and frameworks More ❯
London, England, United Kingdom Hybrid / WFH Options
Automata
scheduling solver engine Deployment on AWS to IoT Greengrass, ECS, Lambda and EKS, with infrastructure managed via Terraform and pipelines via CircleCI Kubernetes and Docker for running our applications Observability platform via Datadog Feature flags to enable us to experiment and control when we release new features to customers In this role, you will be responsible for: Shipping and building More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
William Hill
doing Translate functional and non-functional requirements into working software alongside your team Collaborate with the team to analyze, debug, and resolve defects Demonstrate a commitment to monitoring and observability Manage technical debt effectively by avoiding its creation and removing it when possible Communicate clearly, translating technical and non-technical requirements as needed Understand timelines and dependencies, coordinating with other More ❯
London, England, United Kingdom Hybrid / WFH Options
Howden Group Holdings
and collaborate effectively with stakeholders across data, governance, and infrastructure. Experience in defining quality strategies, driving process improvements, and implementing automation in large-scale data platforms. Knowledge of data observability and anomaly detection tools (preferred). Industry experience in financial services, insurance, or regulated environments (not essential, but preferred). This role is an excellent opportunity for a Lead Quality More ❯
configuration and deployments Contribute to the implementation and operations of Software Defined Networks Help to deploy and manage Spine and Leaf networks Ensure high availability of our network through observability, failover, and redundancy Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies Help with deploying and maintaining network monitoring and management tools You More ❯
Uppingham, England, United Kingdom Hybrid / WFH Options
MMT
chaos into elegant, self-healing systems that deploy flawlessly Knowledge of cloud security frameworks and compliance requirements Understanding of cost optimization strategies and cloud financial management Familiarity with monitoring, observability, and incident response best practices Communication & Business Skills Excellent presentation skills with experience speaking to technical and executive audiences Strong written communication abilities, especially for proposals and technical documentation Natural More ❯
Experience with build and deployment tooling (Bazel, GitHub Actions, Pants, Spinnaker) Experience with automation tools (Terraform, CloudFormation) Experience with relational database systems (PostgreSQL, MySQL) Exposure to microservices architecture and observability (Kubernetes, Docker, service meshes, distributed systems, tracing) Exposure to event streaming platforms (such as Kafka and Kinesis) More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
SST). 10% Mentor teammates and collaborate with Product, Data, and Artist Relations to translate business goals into resilient software. 5% Champion DevEx, proposing improvements to CI/CD, observability, and performance. You'll be successful here if you have 7+ years professional experience (at least 3 in a senior/lead capacity) delivering production full-stack applications. Deep expertise More ❯
experiences. ThousandEyes is deeply integrated across Cisco's extensive technology portfolio, supporting customers in scaling deployments while offering AI-powered assurance insights within Cisco’s Networking, Security, Collaboration, and Observability portfolios. About The Role We are seeking a skilled Senior Site Reliability Engineer (SRE) in Production Engineering with a strong background in SaaS and operations. You will design and manage More ❯
of the following a bonus: Java experience Python experience Ruby experience Big data technologies: Spark, Trino, Kafka Financial Markets experience SQL: Postgres, Oracle Cloud-native deployments: AWS, Docker, Kubernetes Observability: Splunk, Prometheus, Grafana For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at https://drw.com/privacy More ❯
London, England, United Kingdom Hybrid / WFH Options
Thought Machine
critical systems. Experience with hybrid cloud technologies including OpenShift, Google Anthos, AWS EKS Anywhere, AWS Outposts A strong background in Go, Python or Java Experience with Postgres Experience with observability tools, e.g. Prometheus, Grafana Highly competitive salary Pension plan (match up to 5%) Life insurance - three times annual salary Competitive maternity (six months fully paid) and paternity leave (four weeks More ❯
London, England, United Kingdom Hybrid / WFH Options
MMT
chaos into elegant, self-healing systems that deploy flawlessly Knowledge of cloud security frameworks and compliance requirements Understanding of cost optimization strategies and cloud financial management Familiarity with monitoring, observability, and incident response best practices Communication & Business Skills Excellent presentation skills with experience speaking to technical and executive audiences Strong written communication abilities, especially for proposals and technical documentation Natural More ❯
London, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
income products (bonds, swaps), related market events, market data feeds (Reuters/Bloomberg), and an understanding of the trading lifecycle (order routing, execution, and settlement) Experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos) Strong written and verbal communication skills How we’ll support you Training and development to help you excel in your career Coaching and support from More ❯
Dundee, Scotland, United Kingdom Hybrid / WFH Options
NCR Atleos
DevOps Engineer - Observability (Remote Scotland) More ❯
that automate their processes. Contribute to the development of our AI agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
Code principles Design an agile release engineering strategy that delivers value incrementally and continuously Support a highly-available live production system, respond to alerts, diagnose problems using logs and observability tooling, triage and resolve incidents More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
that automate their processes. Contribute to the development of our AI agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
Brentwood, England, United Kingdom Hybrid / WFH Options
Sky
ATSC, Bluetooth & HDMI input, Deep Sleep mode, content in VOD and Linear channel and native apps. Strong DevOps Practices experience where you will provide Full Lifecycle Ownership, Comprehensive Solutioning, Observability and Oversight, Process Optimisation, Collaboration and Integration, and Operationalisation. Strong BDD and Test Automation and Infrastructure experience. Good understanding of Business-Driven Development Strategy. Fully familiar with automation tools and More ❯
user value Designing an agile release engineering strategy that delivers value incrementally and continuously Supporting a highly-available live production system, responding to alerts, diagnosing problems using logs and observability tooling, triaging and resolving incidents Working with lean and agile practices (e.g. Kanban, Scrum, etc.) Collaborating in cross-functional teams Coaching and line managing software engineers It’d be amazing More ❯
code, networking and databases with enough knowledge to be able to fault-find and identify the root cause. Core Responsibilities involved: Incident management Application design and development Site reliability (observability, alerting, high-availability, self-healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack TypeScript/React More ❯