DevOps Lead/Architect — Contract — London — Hybrid (3 days onsite) Inside IR35 | 6 months | FS sector We are looking for a DevOps Lead/Architect to drive observability, automation, and GitOps best practices within a global financial services environment. What you'll be doing Architect and scale observability platforms using Datadog + Geneos Lead infrastructure automation using Terraform/IaC More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Queen Square Recruitment
DevOps Lead/Architect — Contract — London — Hybrid (3 days onsite) Inside IR35 | 6 months | FS sector We are looking for a DevOps Lead/Architect to drive observability, automation, and GitOps best practices within a global financial services environment. What you'll be doing Architect and scale observability platforms using Datadog + Geneos Lead infrastructure automation using Terraform/IaC More ❯
hosted on AWS. Architect and optimise systems: Define service boundaries, data ownership, and failure-recovery patterns for scalable, high-availability systems. Raise engineering quality: Champion best practices for testing, observability, and security. Review critical PRs and guide technical decisions across the team. Operate and improve production systems: Monitor performance, reliability, and cost efficiency. Lead incident response and drive continuous improvement. … Django) Cloud: AWS (Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems … API design, and event-driven architectures. Deep knowledge of system observability, logging, and performance optimisation. Familiarity with modern security and data-privacy best practices. Excellent communicator who can document and articulate technical trade-offs clearly. Behaviours & Attributes Ownership: Takes full responsibility for systems from design to operation. Pragmatism: Balances long-term architecture with delivery velocity. Influence: Raises standards and mentors More ❯
hosted on AWS. Architect and optimise systems: Define service boundaries, data ownership, and failure-recovery patterns for scalable, high-availability systems. Raise engineering quality: Champion best practices for testing, observability, and security. Review critical PRs and guide technical decisions across the team. Operate and improve production systems: Monitor performance, reliability, and cost efficiency. Lead incident response and drive continuous improvement. … Django) Cloud: AWS (Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems … API design, and event-driven architectures. Deep knowledge of system observability, logging, and performance optimisation. Familiarity with modern security and data-privacy best practices. Excellent communicator who can document and articulate technical trade-offs clearly. Behaviours & Attributes Ownership: Takes full responsibility for systems from design to operation. Pragmatism: Balances long-term architecture with delivery velocity. Influence: Raises standards and mentors More ❯
healing architecture with GKE (Kubernetes) at its core Supporting key platforms: Airflow, BigQuery, PostgreSQL clusters Enhancing developer experience through GitLab CI/CD, Coder remote environments, and a modern observability stack (Prometheus, Grafana, Mimir) Driving automation and reliability across infrastructure and pipelines What we’re looking for 2–4 years’ experience in a Cloud, Platform, or DevOps role Solid hands … and optimise — with a pragmatic, problem-solving mindset Great communication skills and a collaborative, customer-focused approach Familiarity with CI/CD (GitLab), and an interest in data or observability tools is a plus A STEM degree (2:1 or higher) or equivalent hands-on experience Why join Work with a modern, cloud-native stack at scale Be part of More ❯
healing architecture with GKE (Kubernetes) at its core Supporting key platforms: Airflow, BigQuery, PostgreSQL clusters Enhancing developer experience through GitLab CI/CD, Coder remote environments, and a modern observability stack (Prometheus, Grafana, Mimir) Driving automation and reliability across infrastructure and pipelines What we’re looking for 2–4 years’ experience in a Cloud, Platform, or DevOps role Solid hands … and optimise — with a pragmatic, problem-solving mindset Great communication skills and a collaborative, customer-focused approach Familiarity with CI/CD (GitLab), and an interest in data or observability tools is a plus A STEM degree (2:1 or higher) or equivalent hands-on experience Why join Work with a modern, cloud-native stack at scale Be part of More ❯
Manage and optimise key platforms such as Airflow , BigQuery , and PostgreSQL clusters. Developer Experience: Enhance internal developer productivity through Coder remote dev environments, GitLab CI/CD pipelines, and observability tooling. Collaboration: Partner closely with Data Engineering, Trading Technology, and Platform teams to deliver robust, scalable cloud solutions. Required Skills and Experience Experience: 2-4 years in a Cloud, Platform … and continuous integration concepts. Mindset: Pragmatic, customer-focused, and driven by efficiency and automation. Education: Minimum 2:1 degree in a STEM subject or equivalent experience. Desirable: Exposure to observability tooling (Grafana, Prometheus, Mimir). Interest in data platforms or AI-enabled development workflows. Learn More For more information, contact George Harris at Harrington Starr for a confidential conversation, or More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harrington Starr
Manage and optimise key platforms such as Airflow , BigQuery , and PostgreSQL clusters. Developer Experience: Enhance internal developer productivity through Coder remote dev environments, GitLab CI/CD pipelines, and observability tooling. Collaboration: Partner closely with Data Engineering, Trading Technology, and Platform teams to deliver robust, scalable cloud solutions. Required Skills and Experience Experience: 2-4 years in a Cloud, Platform … and continuous integration concepts. Mindset: Pragmatic, customer-focused, and driven by efficiency and automation. Education: Minimum 2:1 degree in a STEM subject or equivalent experience. Desirable: Exposure to observability tooling (Grafana, Prometheus, Mimir). Interest in data platforms or AI-enabled development workflows. Learn More For more information, contact George Harris at Harrington Starr for a confidential conversation, or More ❯
service meshes, and container registries. - Implement GitHub Actions/Argo CD pipelines for automated, zero-touch deployments. - Lead security hardening efforts using GuardDuty, CloudWatch, IAM best practices. - Set up observability stacks for proactive monitoring and performance tuning. - Own backup, disaster recovery for services that youʼve created. Cross-Functional & Process - Collaborate closely with other engineers, product managers and CTO. - Mentor engineers … experience (Sagemaker, Kubeflow, ZenML). - Experience building RESTful services around AI pipelines. - ISO 27001, NIST SSDF, OWASP SAMM, or GDPR compliance literacy. - Experience with AWS Karpenter, Prometheus, or similar observability stacks. Soft Skills Research-driven mindset, eager to experiment and iterate. Able to bridge the gap between cutting-edge AI research and practical deployment. Strong communicator with the ability to More ❯
service meshes, and container registries. - Implement GitHub Actions/Argo CD pipelines for automated, zero-touch deployments. - Lead security hardening efforts using GuardDuty, CloudWatch, IAM best practices. - Set up observability stacks for proactive monitoring and performance tuning. - Own backup, disaster recovery for services that youʼve created. Cross-Functional & Process - Collaborate closely with other engineers, product managers and CTO. - Mentor engineers … experience (Sagemaker, Kubeflow, ZenML). - Experience building RESTful services around AI pipelines. - ISO 27001, NIST SSDF, OWASP SAMM, or GDPR compliance literacy. - Experience with AWS Karpenter, Prometheus, or similar observability stacks. Soft Skills Research-driven mindset, eager to experiment and iterate. Able to bridge the gap between cutting-edge AI research and practical deployment. Strong communicator with the ability to More ❯
and postmortem processes, driving root cause analysis and long-term fixes. Automation & Tooling Champion automation to reduce toil and improve system reliability. Oversee the development and maintenance of internal observability, tools and platforms. Collaborate with engineering and DevOps teams to embed reliability into the software development lifecycle. Collaboration & Strategy Partner with product, engineering, DevOps and Customer Support teams to align … on priorities and roadmaps. Contribute to the strategic direction of infrastructure and reliability initiatives. Advocate for best practices in observability, CI/CD, and infrastructure as code. What You Will Bring: Proven experience managing or leading SRE, DevOps, or infrastructure teams. Strong background in systems engineering, cloud platforms (AWS, Azure), and container orchestration (Kubernetes) Excellent leadership, communication, and problem-solving More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Areti Group | B Corp™
Own CI/CD pipelines and Docker -based runtime on AWS ; Infrastructure-as-Code via CDK/Terraform (CDKTF) . Apply secure-by-design and TDD ; instrument apps for observability and performance . Collaborate with product, platform, and security teams to meet operational and compliance requirements. The toolkit you’ll use Frontend: TypeScript, React.js, Vite, Material-UI, HTML5, CSS Backend … Docker , CI/CD . Building and consuming RESTful APIs ; JSON schemas; integration testing. Comfortable in AWS and modern Infrastructure-as-Code approaches. Strong engineering fundamentals: code reviews, testing, observability, performance tuning . Security Clearance: Active SC or DV (must be current). Nice-to-haves Military background (RAF/Army/Navy) or delivery in defence, aerospace, or government More ❯
Own CI/CD pipelines and Docker -based runtime on AWS ; Infrastructure-as-Code via CDK/Terraform (CDKTF) . Apply secure-by-design and TDD ; instrument apps for observability and performance . Collaborate with product, platform, and security teams to meet operational and compliance requirements. The toolkit you’ll use Frontend: TypeScript, React.js, Vite, Material-UI, HTML5, CSS Backend … Docker , CI/CD . Building and consuming RESTful APIs ; JSON schemas; integration testing. Comfortable in AWS and modern Infrastructure-as-Code approaches. Strong engineering fundamentals: code reviews, testing, observability, performance tuning . Security Clearance: Active SC or DV (must be current). Nice-to-haves Military background (RAF/Army/Navy) or delivery in defence, aerospace, or government More ❯
with UK retailers and marketplaces. In this role, you'll ensure our systems are reliable, scalable, and secure. You'll help automate deployments, evolve our cloud infrastructure, and improve observability and developer experience — making it easier for product teams to deliver quality software quickly and safely. Why Zopa Manchester? We're building a new tech hub right in the heart … platform and developer experience teams Ensuring our container platforms (including Kubernetes) are reliable, secure, and up to date Designing scalable, self-service tools to reduce operational toil Supporting infrastructure observability through metrics, tracing, and alerting Working closely with product teams to foster a culture of reliability engineering About You Experience in a Platform/Site Reliability Engineering or similar role More ❯
code across the stack. Participating in architectural discussions and helping shape engineering best practices. Troubleshooting and resolving production issues across services and systems. Contributing to CI/CD pipelines, observability, and automation alongside platform engineers. Your Skills & Experience: Must-haves to be successful in this role: Strong experience writing backend services in Go. Proficiency in React and modern JavaScript/… and code styles. Nobody can do everything, but here are a few related things we’re interested in: Experience working lower in the stack, e.g., databases, infrastructure, Kubernetes, or observability tooling. Exposure to CI/CD tooling Interest in natural language processing, AI, or distributed systems. Here’s our promise to you: We are going to work with you – to More ❯
code across the stack. Participating in architectural discussions and helping shape engineering best practices. Troubleshooting and resolving production issues across services and systems. Contributing to CI/CD pipelines, observability, and automation alongside platform engineers. Your Skills & Experience: Must-haves to be successful in this role: Strong experience writing backend services in Go. Proficiency in React and modern JavaScript/… and code styles. Nobody can do everything, but here are a few related things we’re interested in: Experience working lower in the stack, e.g., databases, infrastructure, Kubernetes, or observability tooling. Exposure to CI/CD tooling Interest in natural language processing, AI, or distributed systems. Here’s our promise to you: We are going to work with you – to More ❯
Experience: Proven delivery of enterprise OpenTelemetry environments, including production-scale collector deployment and config management. Hands-on experience with metrics, logs, traces, attribute design, and routing logic. Familiarity with observability backends (Dynatrace, Splunk, Prometheus, Tempo, Grafana). Strong collaboration skills with developers and infra teams in large, governed organisations. More ❯
Help to improve the resilience, automation, and observability of production systems that power a mission-critical quant trading platform for a systematic hedge fund. This isn’t your typical ops role - they're looking for Engineers who can write code to eliminate toil, improve reliability and automate release, monitoring and recovery processes. You'll build and maintain automated tools in More ❯
Help to improve the resilience, automation, and observability of production systems that power a mission-critical quant trading platform for a systematic hedge fund. This isn’t your typical ops role - they're looking for Engineers who can write code to eliminate toil, improve reliability and automate release, monitoring and recovery processes. You'll build and maintain automated tools in More ❯
and JavaScript/TypeScript (React.JS) being essential to drive their frontend and backend systems. You will be designing and delivering scalable, high-performance solutions from product requirements, ensuring robust observability through metrics and monitoring. You’ll work on event-driven architectures using CQRS, apply SOLID principles, and leverage Docker to build high-availability, high-throughput platforms. Experience with AWS services More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Sharpe Search
and JavaScript/TypeScript (React.JS) being essential to drive their frontend and backend systems. You will be designing and delivering scalable, high-performance solutions from product requirements, ensuring robust observability through metrics and monitoring. You’ll work on event-driven architectures using CQRS, apply SOLID principles, and leverage Docker to build high-availability, high-throughput platforms. Experience with AWS services More ❯
NUnit). Expertise in RESTful and GraphQL APIs, Git, and SOLID principles. Strategic thinking, strong communication, and a love for collaboration. Bonus: Experience with Azure, DevOps, Entity Framework, and observability practices. Why You'll Love It Here: Developer-led culture with hack days, and open access to leadership. Transparent progression and tailored development plans. Great perks: profit share, training budget More ❯
NUnit). Expertise in RESTful and GraphQL APIs, Git, and SOLID principles. Strategic thinking, strong communication, and a love for collaboration. Bonus: Experience with Azure, DevOps, Entity Framework, and observability practices. Why You'll Love It Here: Developer-led culture with hack days, and open access to leadership. Transparent progression and tailored development plans. Great perks: profit share, training budget More ❯
Employment Type: Permanent
Salary: £70000 - £80000/annum Pension, 25 days holiday, Profit Sha
complex data ecosystem Design flexible data ingestion and transformation pipelines for financial market data and trading systems Build and maintain AI/ML infrastructure, including model serving, evaluation, and observability frameworks Collaborate directly with clients to ensure the platform meets real-world enterprise requirements Contribute to both strategic technical direction and hands-on implementation as part of a small, high More ❯