platforms. Be part of a mission led company delivering smart, sustainable solutions for thousands of users across the UK. Work in a forward thinking engineering culture that embraces automation, observability and modern dev practices. Tech Stack You'll Work With Core: Java (essential), JavaScript or TypeScript (bonus) Performance Testing: Custom frameworks, traffic analysis, monitoring tools Testing Tools: Playwright, Cypress or … backend testing , APIs, CICD, and data driven testing A collaborative mindset: someone who enjoys mentoring, problem solving and working closely with devs and stakeholders Nice to Have Familiarity with observability tools, logging, and analysing system behaviour in production Experience with cloud environments (AWS preferred) and containerised apps (Docker or Kubernetes) Exposure to JavaScriptTypeScript and frontend automation tools Working Model Based More ❯
tools to manage a large-scale, multi-vendor network with an emphasis on automation, telemetry, and model-driven infrastructure as code. Automate the full network lifecycle-including provisioning, configuration, observability, testing, troubleshooting, and capacity planning. Collaborate with architecture and design teams and the CTO office to implement new technologies that ensure scalability, efficiency, and operational resilience. Develop tools and platforms … that enhance the observability, reliability, and performance of the production network. Enhance existing monitoring and observability frameworks, integrating intelligent alerting and self-remediation capabilities to reduce manual intervention and improve incident response. Define and measure service-level objectives (SLOs) to track infrastructure performance and reliability. Write software utilizing orchestration systems to automate tasks and interact with other systems. Provide mentorship More ❯
tools to manage a large-scale, multi-vendor network with an emphasis on automation, telemetry, and model-driven infrastructure as code. Automate the full network lifecycle—including provisioning, configuration, observability, testing, troubleshooting, and capacity planning. Collaborate with architecture and design teams and the CTO office to implement new technologies that ensure scalability, efficiency, and operational resilience. Develop tools and platforms … that enhance the observability, reliability, and performance of the production network. Enhance existing monitoring and observability frameworks, integrating intelligent alerting and self-remediation capabilities to reduce manual intervention and improve incident response. Define and measure service-level objectives (SLOs) to track infrastructure performance and reliability. Write software utilizing orchestration systems to automate tasks and interact with other systems. Provide mentorship More ❯
Wandsworth, Greater London, UK Hybrid / WFH Options
Intec Select
with organisational goals. Ensure all services are secure by design, working closely with the information security team to proactively manage risks. Drive service improvement and operational resilience through automation, observability, and DevOps best practices. Experience Required: Proven experience in leading platform/infrastructure and DevOps teams in a hands-on capacity. Strong technical foundation in both traditional infrastructure and modern … CI/CD, GitOps, IaC (e.g., Terraform, ARM), and automation scripting (e.g., PowerShell, Bash, Python). Cloud experience (ideally Azure) and hybrid infrastructure environments. Familiarity with monitoring, alerting, and observability platforms. Package: Up to 25% Bonus Remote Working Head of Platform & Infrastructure Engineering – Financial Services- London (Hybrid/Remote) - £100,000 - £120,000 + 25% Bonus + 15% Pension + More ❯
london, south east england, united kingdom Hybrid / WFH Options
Intec Select
with organisational goals. Ensure all services are secure by design, working closely with the information security team to proactively manage risks. Drive service improvement and operational resilience through automation, observability, and DevOps best practices. Experience Required: Proven experience in leading platform/infrastructure and DevOps teams in a hands-on capacity. Strong technical foundation in both traditional infrastructure and modern … CI/CD, GitOps, IaC (e.g., Terraform, ARM), and automation scripting (e.g., PowerShell, Bash, Python). Cloud experience (ideally Azure) and hybrid infrastructure environments. Familiarity with monitoring, alerting, and observability platforms. Package: Up to 25% Bonus Remote Working Head of Platform & Infrastructure Engineering – Financial Services- London (Hybrid/Remote) - £100,000 - £120,000 + 25% Bonus + 15% Pension + More ❯
exceptional payment experiences for our customers. You will own and optimize Collinson’s internal payment systems while managing key external partnerships with PSPs, Acquirers, payment orchestration, fraud prevention, and observability providers. In addition, you will oversee payment risk and fraud management, ensuring regulatory compliance and enhancing payment security. Key Responsibilities Payments Strategy & Execution • Define and execute a comprehensive payments strategy … Collaborate with orchestration platforms to streamline global payment routing, retries, and conversion optimization. • Integrate with fraud prevention providers, implementing real-time risk assessment and fraud mitigation tools. • Work with observability partners to ensure real-time monitoring, reporting, and payment analytics for proactive issue resolution. Payment Risk & Fraud Management • Oversee payment security, fraud prevention, and risk mitigation strategies across all payment More ❯
exceptional payment experiences for our customers. You will own and optimize Collinson’s internal payment systems while managing key external partnerships with PSPs, Acquirers, payment orchestration, fraud prevention, and observability providers. In addition, you will oversee payment risk and fraud management, ensuring regulatory compliance and enhancing payment security. Key Responsibilities Payments Strategy & Execution • Define and execute a comprehensive payments strategy … Collaborate with orchestration platforms to streamline global payment routing, retries, and conversion optimization. • Integrate with fraud prevention providers, implementing real-time risk assessment and fraud mitigation tools. • Work with observability partners to ensure real-time monitoring, reporting, and payment analytics for proactive issue resolution. Payment Risk & Fraud Management • Oversee payment security, fraud prevention, and risk mitigation strategies across all payment More ❯
Collaborate with People/HR and engineering leadership on career pathing, training, and coaching for engineering staff. Technology Enablement: Evaluate and deploy tools - especially AI - that support engineering productivity, observability, and collaboration. Work closely with DevOps, QA, and SRE teams to align infrastructure and operational excellence with engineering needs. Own key vendor relationships, evaluation of partnerships and represent technology on … scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous long-term incentive plan (LTIP) tez token More ❯
Engineer, you will contribute to the evolution of the strategic management of our GCP infrastructure, and of DevOps practices like incident management, SLOs and error budgets. You will champion observability as a way to improve mean time to recover and use DORA metrics to help the Product & Engineering team to get better at creating amazing products, and help other teams … to support the company's scaling needs, laying the foundation for performance, security, and maintenance. Build tooling and automation that promote team autonomy while ensuring operational excellence. Advance our observability platform to support long-term insights, meaningful alerting and improved ease of use for the engineering teams. Build visibility into infra costs to raise awareness across engineering and empower teams More ❯
and frameworks that accelerate development velocity across Samsara's web and mobile applications. Ensure high reliability, performance, and security across the stack by implementing best practices in testing, monitoring, observability, and CI/CD pipelines. Oversee the development lifecycle from planning to deployment, following Agile methodologies to ensure timely and efficient delivery. Champion best practices for API design, authentication, service … for efficient data fetching and API design, with a working knowledge of integrating GraphQL in mobile and web environments. Performance & Security: Deep understanding of web performance optimizations, caching strategies, observability, and security best practices. An ideal candidate also has: Leadership Experience: A proven ability to scale and manage diverse engineering teams and foster a high-performance culture. Collaboration & Communication: Strong More ❯
with business objectives. Technical Governance: Lead and optimise architecture review boards to ensure compliance, performance, and scalability. Strategic Leadership: Define and drive architectural strategies for security, API design, SDLC, observability, and cloud platforms. Innovation & Mentorship: Develop high-performing engineering teams, mentoring technical leads, architects, and developers. Why Join Us? At Leighton, we don’t just build technology—we create impact. More ❯
with business objectives. Technical Governance: Lead and optimise architecture review boards to ensure compliance, performance, and scalability. Strategic Leadership: Define and drive architectural strategies for security, API design, SDLC, observability, and cloud platforms. Innovation & Mentorship: Develop high-performing engineering teams, mentoring technical leads, architects, and developers. Why Join Us? At Leighton, we don't just build technology-we create impact. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Pontoon
support) What You Bring: Strong Java (streams, lambdas, concurrency) and front-end skills with React.js Deep knowledge of multithreaded, distributed systems and asynchronous architecture Experience with JVM tuning and observability tools (Prometheus, Elastic, etc.) TDD, CI/CD, and agile delivery experience Ability to deliver from design to deployment Bonus Points: Experience in Front Office, Risk, or Pricing within investment More ❯
Kubernetes) at scale. Experience working with a cloud provider (AWS, Azure, or GCE), or sysadmin/SRE experience in data centers. Expertise in designing, building, and operating high-scale observability or infrastructure systems. Working knowledge of networking fundamentals; experience with CNIs or cloud networking infrastructure is preferred. What We Require 4+ years of professional software development experience on core infrastructure More ❯
etc.) Comfort with basic computer administration including software installation, system configuration, and networking. Comfort with git and automated build pipelines (Jenkins, GitLab CI/CD, etc.) Preferred Passion for observability (Elastic, APM, Grafana, etc.) Experience integrating software with a Large Language Model (LLM) Experience with retrieval-augmented generation (RAG) Production-grade software development experience with Python Service containerization and deployment More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
and attitude on automating common repetitive tasks A suitable sense of ownership and responsibility in driving tasks to timely full completion "Nice To Have" Skills and Experience: AIOps and Observability Meaningful experience in a distributed team Working in a sophisticated, multi-geography, engineering services environment! Providing technical support and mentoring to othe Accommodations at Arm At Arm, we want to More ❯
with Engineering Managers and Product Management to support the goals and objectives on your team. You will have a focus on end-to-end responsibility for the development, quality, observability, and testing of the software you build. Everyone is welcome. We have a culture of creativity. We approach our work passionately, improve constantly and celebrate our wins at every turn. More ❯
platform services that arecrucial for accelerating the productivity of all engineeringteams. As an Engineering Manager, you will lead and expand twocore components: the Feature Flags Service and the EngineeringInternal Observability Platform (E360). You'll ensure that thesesystems serve as the foundation for seamless and efficientengineering workflows, allowing us to deliver top-tier AI-drivenproducts. What You'll Do at More ❯
software development and architecture. Experience influencing technical decisions across the different stakeholder levels of the business including non-technical audiences. Ability to foster a culture around data-driven reliability, observability, monitoring, and automation. Due to the global nature of the team, a degree of flexible working will be required to accommodate different time zones. We are an equal opportunities employer. More ❯
full lifecycle Develop APIs and tools that expose predictors as scalable services to other teams Contribute to software engineering best practices across the ML stack (testing, CI/CD, observability) Partner with platform engineers and product stakeholders to ensure technical alignment and delivery What we're looking for 6+ years of experience in software engineering with strong focus on machine More ❯
full lifecycle Develop APIs and tools that expose predictors as scalable services to other teams Contribute to software engineering best practices across the ML stack (testing, CI/CD, observability) Partner with platform engineers and product stakeholders to ensure technical alignment and delivery What we're looking for 6+ years of experience in software engineering with strong focus on machine More ❯
automating our physical server inventory using Infrastructure as Code (IaC). You will work across all layers of infrastructure, including: Networking & Exchange Connectivity Linux Systems & Kubernetes Administration Microservice Orchestration & Observability Disaster Recovery & Security Optimization Your mission is to improve latency, scalability, and reliability, ensuring GSR remains a best-in-class market maker. We value engineers who drive automation, reduce friction More ❯
Better Placed Ltd - A Sunday Times Top 10 Employer!
production-grade AI-powered products . Strong collaborator, with a track record of working closely with AI research, product, and infrastructure teams . Bonus Points: Exposure to MLOps , AI observability , or LLM deployment at scale . Experience with data engineering for large-scale pipelines . Prior background in enterprise SaaS or developer tools . Why Join: AI-native mission: Shape More ❯
london, south east england, united kingdom Hybrid / WFH Options
Better Placed Ltd - A Sunday Times Top 10 Employer!
production-grade AI-powered products . Strong collaborator, with a track record of working closely with AI research, product, and infrastructure teams . Bonus Points: Exposure to MLOps , AI observability , or LLM deployment at scale . Experience with data engineering for large-scale pipelines . Prior background in enterprise SaaS or developer tools . Why Join: AI-native mission: Shape More ❯
QUALIFICATIONS - 4+ years of experience in a technical support or support engineering role. - Experience working with AWS (e.g. EC2, EBS, S3, Route53)Experience working with SQL. - Experience working with observability platforms (e.g. Grafana, Kibana) for monitoring, troubleshooting, and diagnostic. - Experience working with monitoring and alerting systems (e.g. CloudWatch, Prometheus). Acknowledgement of country: In the spirit of reconciliation Amazon acknowledges More ❯