software development and architecture. Experience influencing technical decisions across the different stakeholder levels of the business including non-technical audiences. Ability to foster a culture around data-driven reliability, observability, monitoring, and automation. Due to the global nature of the team, a degree of flexible working will be required to accommodate different time zones. We are an equal opportunities employer. More ❯
DevOps: you build it, you run it. Tech Stack M&S uses a variety of technologies including; Java, Spring, SpringBOOT, Micronaut React, Next.js, Typescript, Angular Azure Cloud, Kubernetes, Dynatrace (observability) SQL Server, MongoDB Ignite, Redis Everyone's Welcome We are ambitious about the future of retail. We're disrupting, innovating and leading the industry into a more conscientious, inspiring digital More ❯
London, England, United Kingdom Hybrid / WFH Options
Pleo
we mostly use Kotlin, with few services in TypeScript) Cloud environments/distributed systems/microservices (AWS, Google Cloud, Kubernetes) Relational databases (e.g. PostgreSQL) Testing frameworks (Cypress, JUnit, Testcontainers) Observability and monitoring (Datadog) DevOps culture and tools (GitHub Actions, Gradle, Terraform) Your colleagues would say you Work in English with ease (it's our company language) Never stop being curious More ❯
SDLC, spanning QA automation and CI/CD pipeline optimisation Work on cost-effective platform scalability of a multi-client system Work on all aspects of platform resilience, covering observability and recoverability to maintain SLAs targets Participate in the entire SDLC, helping to design changes, review code, build tests, and coordinate deployments Collaborate closely with colleagues in Product and Design More ❯
delivery and support functions in Cloud Operations Good working technical knowledge (Certificates are very welcome) in different cloud technologies and Azure and AWS Cloud Platforms Experience managing monitoring, alerting, observability, and dashboarding platforms (such as AWS Monitor, Prometheus, Grafana, and Elasticsearch) Good understanding of NOC and DevOps practices Experience and in-depth knowledge of databases and data handling. Solid understanding More ❯
Experience using managed languages such as Python, Go, C#, Java, or similar. Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases. Experience using observability tools such as APM, logging, and metrics to assist with debugging issues. Experience designing tooling to simplify the operational management of SaaS/PaaS systems. Familiarity with building flexible and More ❯
product company You have experience in at least one of following disciplines; supporting cloud platforms at scale (AWS), working with infrastructure as code, container orchestration environments (preferably Kuberbetes) and observability platforms You are proficient in at least one programming or scripting language You are experienced in continuous integration and deployment tools Desirable Criteria: You have a security-first approach with More ❯
and CI/CD workflows (GitLab CI). Write clean, production-grade code in Python (Scala is a bonus). Build infrastructure using Terraform, AWS CloudFormation, or SAM. Drive observability across the platform using Datadog or CloudWatch. Actively mentor Data Engineers and Associates, and lead technical discussions and design sessions. Key requirements: Must-Have: Strong experience with AWS services: Glue More ❯
London, England, United Kingdom Hybrid / WFH Options
Morae Services India Private Limited
product planning, roadmap discussions, and strategic prioritization. Operational Excellence Own key engineering KPIs including system uptime, velocity, tech debt reduction, and deployment frequency. Drive cloud infrastructure cost-efficiency, system observability, and DevSecOps maturity. Lead incident management and escalation processes with customer sensitivity and transparency. Qualifications: 10+ years in software engineering, including 5+ years in engineering leadership roles. Proven experience building More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
RVU Co UK
Experience of building and designing cost optimised Cloud platforms (preferably Azure) from the ground up, following well architected principles Solid understanding of platform and reliability engineering approaches (SRE), including observability, performance optimisation, capturing analytics and security best practices Experience implementing Service Level Objectives and using them to drive error budgets, risk management and alerting Knowledge and experience with operating containers More ❯
London, England, United Kingdom Hybrid / WFH Options
Capgemini Invent
industries. Delivery Excellence: Extensive experience in delivery assurance for analytics and AI programmes. Desired Skills: Experience in data ingestion, integration, governance, and solution design. Familiarity with data quality frameworks, observability tools, and automation. WHAT YOU’LL LOVE ABOUT WORKING HERE? As a Senior Manager, you will have the opportunity to work at the forefront of AI and analytics innovation, contributing More ❯
London, England, United Kingdom Hybrid / WFH Options
Canonical
and storage, to the application layer Design, build and maintain solutions that will be deployed on public and private clouds and local workstations Master distributed systems concepts such as observability, identity, tracing Work with both Kubernetes and machine-oriented open source applications Collaborate proactively with a distributed team of engineers, designers and product managers Debug issues and interact in public More ❯
DevOps: you build it, you run it. Tech Stack M&S uses a variety of technologies including; Java, Spring, SpringBOOT, Micronaut React, Next.js, Typescript, Angular Azure Cloud, Kubernetes, Dynatrace (observability) SQL Server, MongoDB Ignite, Redis Everyone’s Welcome We are ambitious about the future of retail. We’re disrupting, innovating and leading the industry into a more conscientious, inspiring digital More ❯
DevOps: you build it, you run it. Tech Stack M&S uses a variety of technologies including; Java, Spring, SpringBOOT, Micronaut React, Next.js, Typescript, Angular Azure Cloud, Kubernetes, Dynatrace (observability) SQL Server, MongoDB Ignite, Redis Everyone’s Welcome We are ambitious about the future of retail. We’re disrupting, innovating and leading the industry into a more conscientious, inspiring digital More ❯
Experience using managed languages such as Python, Go, C#, Java, or similar. Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases. Experience using observability tools such as APM, logging, and metrics to assist with debugging issues. Experience designing tooling to simplify the operational management of SaaS/PaaS systems. Familiarity with building flexible and More ❯
our global customers to innovate with confidence. Operating as part of the broader Infrastructure organization, the Cloud Security team partners closely with key engineering groups including Networking, Compute, and Observability to embed security deeply across Miro's cloud environment. The team also maintains strong alignment with our peers in the Security organization-such as Application Security and Detection & Response-ensuring More ❯
London, England, United Kingdom Hybrid / WFH Options
GSR
environment while integrating and automating our physical server inventory using Infrastructure as Code (IaC). You will work across all layers of infrastructure, including: Networking & Exchange Connectivity Microservice Orchestration & Observability Disaster Recovery & Security Optimization Your mission is to improve latency, scalability, and reliability, ensuring GSR remains a best-in-class market maker. We value engineers who drive automation, reduce friction More ❯
London, England, United Kingdom Hybrid / WFH Options
Valarian Technologies Limited
you thrive in a fast-paced environment where you can make a real difference, we want to hear from you! What You’ll Do: Develop and implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the Acra platform for More ❯
Technical Expertise: 5+ years of professional experience in backend development (Go Lang). Deep knowledge of Go Lang, with hands-on experience building scalable services. Experience with working with observability stack (logging, metrics, tracing). Expertise in building RESTful APIs following company standards. Understanding of Domain-Driven Design and Modularization concepts. Asynchronous processing with approaches like co-routines, messages queuing More ❯
Eastbourne, England, United Kingdom Hybrid / WFH Options
AxisOps
and architecture through to production and operations. Our strength lies in software delivery, supported by deep expertise in platform engineering, built on an understanding of private cloud-native infrastructure, observability, and DevSecOps. Our culture We value sharp thinking, clear communication, and teams that look out for each other. At AxisOps, our core values are: Ingenuity – solving hard problems with elegant More ❯
London, England, United Kingdom Hybrid / WFH Options
Risk Ledger
processing and integrating large datasets into a product. Experience being a key person in the designing of a non-trivial solution and working with others to implement. Worked with observability solutions (Kibana, Grafana, Sentry). Experience using further technologies we use (Terraform, AWS RDS, AWS ECS or EKS, AWS EventBridge). Salary range £90,000-£110,000 GBP The perks More ❯
London, England, United Kingdom Hybrid / WFH Options
Docebo
our teams work together effectively, this role requires you to be located in UK. Responsibilities: Manage the Docebo Incident and Escalation process; Monitor metrics and develop ways to improve observability; Diagnose & troubleshoot service incidents & outages, with the capability to do urgent code fixes when needed on the various application services; Day by day operations to maintain and evolve the Docebo More ❯
London, England, United Kingdom Hybrid / WFH Options
JPMorganChase
discoveries. Build an understanding of product and technology for owned domain areas. Actively participate in scrum ceremonies including daily stand-ups, sprint planning and retrospectives. Consider Accessibility (WCAG), Security, Observability & Performance as part of all owned applications/deliverables. Required qualifications, capabilities and skills: Demonstrated success in developing and sustaining customer-focused web applications and single-page applications (SPAs) within More ❯
London, England, United Kingdom Hybrid / WFH Options
Ravelin
as Data Engineering and Product, to build a more effective and cohesive ML ecosystem. Deep expertise in data science and engineering best practices (version control, CI/CD, testing, observability) and a history of applying them to build robust, scalable machine learning systems. Exceptional analytical and problem-solving skills, with a demonstrated ability to define and solve highly ambiguous, complex More ❯
Technical Leadership & DevOps Culture Lead by example across delivery teams, offering hands-on technical support and ensuring engineering excellence. Promote a DevOps-first culture by championing continuous delivery, automation, observability, and operational readiness in everything we build. Help teams strike the right balance between shipping value quickly and building with long-term sustainability in mind. Work hand-in-hand with More ❯