City of London, London, United Kingdom Hybrid/Remote Options
Identify Solutions
the past year and aggressive expansion across the UK, US, and EU, the company is scaling at pace. Data is the backbone: from APIs and pipelines to governance and observability, their data platform directly powers customer-facing products and AI-driven insights. They’re now hiring a Senior Data Engineer to own and shape this platform, building scalable, production-grade … systems that become the foundation for global brands. Why join? ✨ Greenfield impact – inherit a live but early platform, define best practice across structure, testing, observability, and governance. ✨ Direct product impact – your APIs, pipelines, and orchestration power the platform that 1,000+ brands rely on every day. ✨ AI at the core – work on infrastructure that enables machine learning and intelligent decision … doing: API strategy & development – own and scale FastAPI endpoints that deliver real-time access to platform data. Data pipeline development – build ingestion and replication pipelines with best-in-class observability, latency, and resilience. Platform technical vision – influence architecture and orchestration, shaping how the business handles data at scale. Data quality & governance – embed testing, freshness, lineage, and monitoring to ensure reliability More ❯
root cause analysis, lessons learnt and post actions Champion stability and resilience across the trading platforms Ensure new systems are aligned with best practices Drive improvements and alignment in observability and monitoring tools, improving MTTD and MTTR Produce analysis on SRE function performance Provide guidance, recommendations and hands-on support to teams, promoting SRE best practices Develop and maintain a … roadmap for continuous improvement of support and observability Maintain personal/professional development to meet the changing demands of the role, including all relevant regulatory and legislative training When dealing with all customers, clients, or colleagues, ensure that we provide a clear, fair and consistent high-quality service that presents a professional and positive image of CMC Markets Take all … cloud/on prem environment 7 years experience in IT operational roles working with highly reliable systems Experience in modern development methodologies and languages Proficiency in implementing and managing observability tools Knowledge of automation tools and methodologies Strong understanding and application of ITIL processes Excellent oral, written and presentation skills Strong stakeholder management and influencing skills Proactive attitude towards learning More ❯
Peterborough, Cambridgeshire, England, United Kingdom Hybrid/Remote Options
Noir
Performance & Reliability Director - Software House - Peterborough/Hybrid (Key skills: Performance Engineering, Reliability Engineering, SRE, Load Testing, Observability, Chaos Testing, Cloud Platforms, Microservices, Leadership, CI/CD, APM Tools) Are you a technology leader passionate about driving performance, scalability, and reliability across complex software platforms? Do you thrive in high-growth environments where innovation, engineering excellence, and resilience are core … lifecycle. You'll oversee system profiling, capacity planning, and test strategies - ensuring every release meets the highest standards for speed, scalability, and reliability. You'll drive the adoption of observability and monitoring frameworks, leveraging platforms like Datadog and Dynatrace to build a proactive performance culture. You'll champion continuous improvement, implement chaos testing programmes, and ensure teams deliver fault-tolerant More ❯
We’re building and owning the software that powers every Nando’s restaurant — from tills and payment devices to printers, sensors, and more. You’ll help automate configuration, improve observability, and create tools that keep our restaurants running smoothly and reliably. Your work will lay the foundation for the future of Nando’s in-restaurant technology, including next-generation tills … ll: Build and maintain automation for tills, PEDs, Stripe terminals, printers, tablets, and sensors (primarily Windows-based) Develop APIs and web UIs (Next.js) for configuration, monitoring, and alerting Implement observability — including logging, metrics, tracing, and automated remediation Integrate with Meraki APIs and other infrastructure systems to surface network and sensor insights Run proofs of concept for new in-restaurant technology … Experience building modern UIs with Next.js or React Knowledge of networked devices , hardware APIs , and device provisioning Familiarity with Windows device management and Android/iOS tablets Experience with observability tools — metrics, structured logging, alerting, and dashboards Security-first mindset with knowledge of PCI-DSS principles and secrets management Clear, collaborative communication style, with the ability to work effectively across More ❯
We’re building and owning the software that powers every Nando’s restaurant — from tills and payment devices to printers, sensors, and more. You’ll help automate configuration, improve observability, and create tools that keep our restaurants running smoothly and reliably. Your work will lay the foundation for the future of Nando’s in-restaurant technology, including next-generation tills … ll: Build and maintain automation for tills, PEDs, Stripe terminals, printers, tablets, and sensors (primarily Windows-based) Develop APIs and web UIs (Next.js) for configuration, monitoring, and alerting Implement observability — including logging, metrics, tracing, and automated remediation Integrate with Meraki APIs and other infrastructure systems to surface network and sensor insights Run proofs of concept for new in-restaurant technology … Experience building modern UIs with Next.js or React Knowledge of networked devices , hardware APIs , and device provisioning Familiarity with Windows device management and Android/iOS tablets Experience with observability tools — metrics, structured logging, alerting, and dashboards Security-first mindset with knowledge of PCI-DSS principles and secrets management Clear, collaborative communication style, with the ability to work effectively across More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Bloc Recruitment
engineering teams Guide system design for complex, cloud-native, containerised environments (Kubernetes, Terraform, Helm, Flux) Own the evolution of our data model and processing platforms Embed best practices in observability , security , and testing across the organisation Anticipate dependencies and unblock teams before issues arise Partner with product and business leaders to align technology with strategic goals Mentor engineers and technical … Expertise in modern backend architectures (Node.js, Python, Go) and fluency with frontend ecosystems (React, TypeScript) Deep experience with cloud-native infrastructure (Terraform, Kubernetes, Helm, Flux) Strong grasp of security, observability, and operational excellence Proven ability to influence and guide without formal authority Track record of mentoring and developing high-performing engineering teams Why You'll Love It Here Join a More ❯
North West, England, United Kingdom Hybrid/Remote Options
Uniting Ambition
flows, operational tooling. Work hands-on where required — providing technical leadership, code review, and architectural oversight. Partner with stakeholders across engineering, product, data and business leadership. Embed performance, reliability, observability, and resilience into system design from day one. Help shape our evolving use of AI/ML for optimisation, automation, and predictive insights. What We’re Looking For Proven experience … Hybrid working — choose Manchester or Staffordshire as your primary hub. Major influence over the architecture and future of a high-scale engineering platform . Opportunity to introduce modern patterns, observability, DevOps, automation, and AI-driven capabilities. High autonomy, strong engineering culture, supportive leadership team. If you’re driven by architecting for scale , shaping technical strategy, and leading teams toward world More ❯
patterns where appropriate Ensure APIs are well-documented using OpenAPI/Swagger standards Build and maintain a developer portal for internal and external API consumers Quality & Operations Implement comprehensive observability including logging, monitoring, and alerting Design for reliability, fault tolerance, and graceful degradation Optimize API performance, scalability, and cost efficiency Write clean, maintainable code with thorough testing and documentation Configure … and modern security patterns Testing mindset - you write unit tests and understand integration testing API documentation experience using OpenAPI/Swagger and maintaining developer portals Production systems mindset covering observability, reliability, and operational excellence Architectural thinking - ability to design systems for scale, security, and evolution Keywords RESTful APIs C# .Net Azure AI LLM ML Machine Learning SaaS Scale Up OAuth More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Tenth Revolution Group
and deploy dbt/dataform models with clean, reusable SQL transformations. Orchestrate and monitor data pipelines to ensure quality, reliability, and timely delivery. Integrate and maintain data quality checks, observability, cataloguing, and lineage. Drive strong CI/CD practices (e.g., GitHub Actions) for seamless testing, review, and deployment. Assist analysts with insight-driven Tableau dashboards in collaboration with business stakeholders. … data modelling and transformation. Advanced SQL skills and deep understanding of modern data architecture principles, especially Medallion architecture. Familiarity with orchestration tools such as Airflow. Experience implementing data quality, observability, and metadata/lineage tooling. Strong Git proficiency and experience using GitHub with CI/CD workflows. Tableau exposure Familiarity with Python for scripting and automation tasks. If you’re More ❯
and deploy dbt/dataform models with clean, reusable SQL transformations. Orchestrate and monitor data pipelines to ensure quality, reliability, and timely delivery. Integrate and maintain data quality checks, observability, cataloguing, and lineage. Drive strong CI/CD practices (e.g., GitHub Actions) for seamless testing, review, and deployment. Assist analysts with insight-driven Tableau dashboards in collaboration with business stakeholders. … data modelling and transformation. Advanced SQL skills and deep understanding of modern data architecture principles, especially Medallion architecture. Familiarity with orchestration tools such as Airflow. Experience implementing data quality, observability, and metadata/lineage tooling. Strong Git proficiency and experience using GitHub with CI/CD workflows. Tableau exposure Familiarity with Python for scripting and automation tasks. If you’re More ❯
components. Backend: Node.js/TypeScript, serverless on GCP, NestJS in new services. Data: Firestore as primary store, object storage for images. Tooling: NX monorepo, CI/CD, automated testing, observability with centralized logs/metrics/traces. A day in the life and how you'll make an impact: Own system architecture for customer-facing apps and internal platforms end … to-end: frontend, backend, data, and cloud. Design clear interfaces and a mediated API layer between services and clients. Lead decisions on scalability, observability, reliability, cost, and security for high-volume, low-latency workloads. Write production code in TypeScript/Node and Angular, pairing with teams on complex features. Establish guardrails: coding standards, architecture principles, testing strategy, CI/CD More ❯
Senior Software Engineer - Risk Factor Discovery Location London Business Area Engineering and CTO Description & Requirements The Risk Factor Discovery team empowers clients to access, customize, and dynamically override pricing and market data inputs to fit their methodologies or regulatory requirements. More ❯
of data science concepts, AI/ML models, automation workflows, and agentic orchestration to enhance business processes. (LEAD) Experience designing and deploying Agentic AI solutions leveraging orchestration, pipelines, and observability on Microsoft Azure AI Foundry. (LEAD) Experience applying Generative AI, NLP, and prompting techniques. Strong understanding of AI governance, observability, and compliance frameworks. Excellent communication and presentation skills. More ❯
of data science concepts, AI/ML models, automation workflows, and agentic orchestration to enhance business processes. (LEAD) Experience designing and deploying Agentic AI solutions leveraging orchestration, pipelines, and observability on Microsoft Azure AI Foundry. (LEAD) Experience applying Generative AI, NLP, and prompting techniques. Strong understanding of AI governance, observability, and compliance frameworks. Excellent communication and presentation skills. More ❯
of data science concepts, AI/ML models, automation workflows, and agentic orchestration to enhance business processes. (LEAD) Experience designing and deploying Agentic AI solutions leveraging orchestration, pipelines, and observability on Microsoft Azure AI Foundry. (LEAD) Experience applying Generative AI, NLP, and prompting techniques. Strong understanding of AI governance, observability, and compliance frameworks. Excellent communication and presentation skills. More ❯
a Great Place to Work, ITRS is an Enterprise SaaS provider with industry-leading solutions. Our mission is to make society's critical technology work via automated & holistic IT observability solutions that safeguard critical applications and enable innovation. With our prestigious customer base includes 90% of the world's top investment banks. We are backed by leading private equity investors … sit in the Platform Team and form part of a wider global Engineering Team. The Core Platform layer is a collection of distributed services which ingest, transform and materialise observability data to make it available to several similarly distributed visualisation, integration, analytics and other domain specific applications to provide solutions to a range of observability problems. We are looking to … tuned garbage collection, fixed CPU hotspots and analysed heap dumps to find memory leaks on JVMs. You have likely instrumented custom metrics in your application and used monitoring/observability tools to understand how they behave in different environments and to identify the root cause of issues. Health Insurance and Dental Health Cover for you and your dependants Employee Assistance More ❯
City Of London, England, United Kingdom Hybrid/Remote Options
Gravitas Recruitment Group (Global) Ltd
for critical pipelines, datasets, and reporting assets. Lead FinOps forums to improve cost transparency and efficiency. Oversee CI/CD pipelines and deployments for safe, compliant delivery. Champion monitoring, observability, and automation to reduce manual intervention. Develop operational runbooks, escalation protocols, and incident playbooks. Collaborate with data engineering and analytics teams to align operational strategy with business goals. Skills & Experience … Hands-on experience with Azure Synapse, Databricks, ADF, Power BI . Familiarity with CI/CD and automation. Strong FinOps mindset and cost management experience. Knowledge of monitoring and observability frameworks. Calm under pressure with strong problem-solving skills. Why This Role? This is a high-visibility position at the heart of a major data ecosystem. You’ll influence the More ❯
based workflows , ensuring scalability and performance. Run technical design reviews , guide architecture decisions, and support engineers in navigating trade-offs around performance, cost, and reliability. Champion operational excellence — strong observability, testing discipline, incident response, and SLO ownership. Collaborate with Product & Design to define technical requirements, prioritise roadmaps, and drive measurable outcomes. Tech Environment You’ll Influence Primary language: Python AWS … of 4 consistently delivers high-quality software and scalable data pipelines with predictable velocity. Clear improvements in system reliability, throughput, and cost efficiency. Strong engineering discipline across design, testing, observability, and incident management. Improved technical foundations and reduced operational toil. Clear, thoughtful communication and alignment across engineering, product, and design. More ❯
based workflows , ensuring scalability and performance. Run technical design reviews , guide architecture decisions, and support engineers in navigating trade-offs around performance, cost, and reliability. Champion operational excellence — strong observability, testing discipline, incident response, and SLO ownership. Collaborate with Product & Design to define technical requirements, prioritise roadmaps, and drive measurable outcomes. Tech Environment You’ll Influence Primary language: Python AWS … of 4 consistently delivers high-quality software and scalable data pipelines with predictable velocity. Clear improvements in system reliability, throughput, and cost efficiency. Strong engineering discipline across design, testing, observability, and incident management. Improved technical foundations and reduced operational toil. Clear, thoughtful communication and alignment across engineering, product, and design. More ❯
Telford, Shropshire, West Midlands, United Kingdom
Sanderson Government and Defence
insight, and proactive incident management. Key Responsibilities Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events … improvement initiatives and tooling exploitation to enhance operational efficiency efficiency within immature teams Required Skills and Experience Strong understanding and expereince in SRE principals and methodologies Strong understanding of Observability within a complex tech stack Hands-on experience with monitoring tools such as Splunk, Splunk ITSI, Dynatrace, AppDynamics, and synthetic monitoring platforms. Strong understanding and experience with implementing and using More ❯
Quantitative Developer | Two Streams: Execution Infrastructure and Low-Latency Crypto | NYC or London | $170k to $250k base + performance bonus Albert Bow is partnering with a top-tier systematic trading group to hire across 23 seats split between two streams. More ❯
Quantitative Developer | Two Streams: Execution Infrastructure and Low-Latency Crypto | NYC or London | $170k to $250k base + performance bonus Albert Bow is partnering with a top-tier systematic trading group to hire across 23 seats split between two streams. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Doczen
Join a fast, founder-led team 🚀 Building an inbox-native operating fabric for the commodities sector—where AI transforms everyday communications into reliable, scalable workflows. This is a hands-on role shaping how LLMs orchestrate mission-critical commodity operations, with More ❯
Join a fast, founder-led team 🚀 Building an inbox-native operating fabric for the commodities sector—where AI transforms everyday communications into reliable, scalable workflows. This is a hands-on role shaping how LLMs orchestrate mission-critical commodity operations, with More ❯
Full Stack Engineer (React Native, Node.js, AWS) Location: London, UK - Full Time, On-site. A well-funded fintech start-up is hiring for a Full-Stack Engineer to help craft both the experience and infrastructure that power their mission. Backed More ❯