SQL Server (2016–2022), including performance tuning, high availability, and architecture. Strong scripting skills (e.g., PowerShell) and experience with automation/configuration tools like Ansible or Chef. Familiarity with observability tools, monitoring frameworks, and incident management practices. A mindset focused on eliminating TOIL, improving developer experience, and scaling operations through code. Excellent communication and collaboration skills. Bonus Points Experience with More ❯
Manchester, England, United Kingdom Hybrid/Remote Options
Suits Me
grow technically and professionally Owning delivery of critical platform services that power Suits Me's financial products Overseeing the full development lifecycle, from architectural design and planning to deployment, observability, and continuous improvement Establishing engineering standards, including code quality, CI/CD, testing, and documentation practices Driving adoption of best practices around API design, data modelling, and event-driven architectures More ❯
Manchester, England, United Kingdom Hybrid/Remote Options
Awaze
environment. Partner with Product to balance innovation with reliability, ensuring our core platforms can scale to support millions of bookings. Champion engineering best practices such as CI/CD, observability, automated testing, and platform reliability. Create an environment where teams can experiment, learn, and deliver value quickly and safely. Play a key role in shaping how we attract, develop, and More ❯
establish team governance frameworks, and ensure adoption of deployment best practices across the team Infrastructure as Code: Infrastructure as Code with Terraform, ARM Templates or AWS CloudFormation/CDK Observability Strategy: DataDog, Application Insights or Amazon CloudWatch implementation with comprehensive observability strategy and cloud governance Cloud Strategy: Strategic SAAS using Azure (Functions, Queue, Blob Storage, VMs, etc) or AWS (Lambda … testing strategy, establish quality governance for the team, and measure team adoption with continuous improvement initiatives Application Security: Confident vulnerability management, thread modelling and tracking Production Support: Knowledge of observability and production support practices Essential Experience Requirements 6+ years in software engineering, minimum 1 year of formal people management experience or 2+ years of technical leadership with mentoring responsibilities Demonstrable More ❯
manchester, north west england, united kingdom Hybrid/Remote Options
On the Beach
ll Be Doing Day To Day Technical Leadership : Serve as a technical authority for the Platforms Team, guiding the design, implementation, and optimization of our AWS, GitOps, Monitoring and Observability, Developer Portal and Kubernetes (EKS) platforms. Architecture and Design : Architect and design scalable, reliable, and secure Platform solutions, ensuring they meet current and future needs. Platform Management : Lead the management … with a modern programming/scripting languages such as C#, Go, Python, TypeScript, or Java. Advanced scripting skills in languages such as Python, Bash, or similar. Expertise with monitoring, observability and Log analytics tooling such as New Relic Prometheus, Grafana, ELK stack. Any experience implementing AI Ops and/or AI Coding tooling to enhance Developer Experience and enable efficient More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
Continuous Deployment and Integration Extensive experience in technical delivery and agile methodologies A strong advocate for test-driven development and a mentor to others in this practice Experience with Observability as a practice (logging, App Insights, metrics and auditing). An expert of DevOps (CI/CD) & Cloud Services (Azure preferred) Expert of several Design Patterns. A master of good More ❯
Continuous Deployment and Integration Extensive experience in technical delivery and agile methodologies A strong advocate for test-driven development and a mentor to others in this practice Experience with Observability as a practice (logging, App Insights, metrics and auditing). An expert of DevOps (CI/CD) & Cloud Services (Azure preferred) Expert of several Design Patterns. A master of good More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
world delivery. Partner with Product and Design to align tech decisions with business impact, bringing empathy, pragmatism, and curiosity to every discussion. Build a culture of continuous improvement, from observability and resilience to team rituals and engineering craft. Coach engineers to grow, empowering them to make technical decisions, own outcomes, and share accountability. What are we looking for? We’re More ❯
services/message buses and other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of security/performance requirements More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
and help shape a culture of technical excellence. What You Bring; Strong coding skills in Python, PowerShell, or Go Deep understanding of systems engineering and cloud infrastructure Experience with observability, CI/CD, and automation frameworks Ability to assess risk, plan resources, and influence long-term technical direction Comfortable advising senior stakeholders and shaping cross-functional initiatives Leadership & Communication: Skilled More ❯
with UK retailers and marketplaces. In this role, you'll ensure our systems are reliable, scalable, and secure. You'll help automate deployments, evolve our cloud infrastructure, and improve observability and developer experience — making it easier for product teams to deliver quality software quickly and safely. Why Zopa Manchester? We're building a new tech hub right in the heart … platform and developer experience teams Ensuring our container platforms (including Kubernetes) are reliable, secure, and up to date Designing scalable, self-service tools to reduce operational toil Supporting infrastructure observability through metrics, tracing, and alerting Working closely with product teams to foster a culture of reliability engineering About You Experience in a Platform/Site Reliability Engineering or similar role More ❯
NUnit). Expertise in RESTful and GraphQL APIs, Git, and SOLID principles. Strategic thinking, strong communication, and a love for collaboration. Bonus: Experience with Azure, DevOps, Entity Framework, and observability practices. Why You'll Love It Here: Developer-led culture with hack days, and open access to leadership. Transparent progression and tailored development plans. Great perks: profit share, training budget More ❯
NUnit). Expertise in RESTful and GraphQL APIs, Git, and SOLID principles. Strategic thinking, strong communication, and a love for collaboration. Bonus: Experience with Azure, DevOps, Entity Framework, and observability practices. Why You'll Love It Here: Developer-led culture with hack days, and open access to leadership. Transparent progression and tailored development plans. Great perks: profit share, training budget More ❯
Employment Type: Permanent
Salary: £70000 - £80000/annum Pension, 25 days holiday, Profit Sha
Ringway, Altrincham, Cheshire, England, United Kingdom
The Hut Group
potential technical risks and develop strategies to mitigate them, ensuring that the application is secure, robust and reliable Champion performance optimisation across the frontend stack while ensuring accessibility and observability are baked into all solutions Deeply committed to crafting intuitive, impactful, and optimised user experiences that turn complex workflows into seamless, engaging journeys Share your knowledge within a democratic team More ❯
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
As part of their continued investment in reliability and platform performance, they are now seeking an experienced Site Reliability Engineer to strengthen their engineering function and help evolve their observability and automation capabilities. THE BENEFITS Hybrid working model (office and remote) Opportunity to define and lead SRE strategy within a collaborative culture Exposure to modern cloud-native and containerised environments … and performance of complex online platforms supporting high-volume transactions. Working closely with operations and product teams, you'll monitor production systems, develop automation to improve uptime, and refine observability to provide real-time insight into platform health. You'll also play a key role in performance testing, system tuning and incident management to ensure smooth operation during critical events. … SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure More ❯
Wigan, Greater Manchester, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
As part of their continued investment in reliability and platform performance, they are now seeking an experienced Site Reliability Engineer to strengthen their engineering function and help evolve their observability and automation capabilities. THE BENEFITS Hybrid working model (office and remote) Opportunity to define and lead SRE strategy within a collaborative culture Exposure to modern cloud-native and containerised environments … and performance of complex online platforms supporting high-volume transactions. Working closely with operations and product teams, you'll monitor production systems, develop automation to improve uptime, and refine observability to provide real-time insight into platform health. You'll also play a key role in performance testing, system tuning and incident management to ensure smooth operation during critical events. … SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure More ❯
modular, cloud-native platform (Azure/AWS – your call) Driving a culture shift across engineering – CI/CD, SRE, DevEx, clean code Setting engineering standards: quality gates, testing practices, observability, automation Working with Product, Delivery, and Exec teams to align on priorities and timelines What we’re looking for: Senior engineering leader (Head of/Director level) with strong architectural … design, modern API practices (REST, gRPC) TypeScript across services and frontend (frameworks are flexible) Infrastructure as Code (Terraform) CI/CD baked in, GitOps model preferred Emphasis on testing, observability, and secure-by-design Why this role? You’ll join a profitable, well-backed SaaS business with real scale — and a brief to modernise how engineering operates from the ground More ❯
Manchester Area, United Kingdom Hybrid/Remote Options
Lynx Recruitment
Director of Next Generation Engineering Salary: Up to £150,000 + bonus + benefits Location: Manchester - Hybrid working About the Role We're working with a leading AI and technology innovation consultancy that helps organisations design and deliver intelligent, data More ❯
for background tasks and failure retries Rate-limit handling and backoff strategies Cloud deployment experience with strong security hygiene, configuration management, and secrets handling Exposure to monitoring, logging, and observability tools The ideal candidate values reliability, secure coding practices, graceful failure, and clean software architecture suitable for a globally distributed gaming audience. More ❯
secure, compliant, and high-availability environments for our AI workloads. You’ll lead a small cross-functional team, manage deployment pipelines, automate processes, and enforce defence-grade reliability and observability standards. Key Responsibilities Lead and manage a small team (developers + DevOps engineers) delivering secure, scalable, and high-availability AI systems. Own AWS-based infrastructure : Lambda, EC2, S3, CloudFormation/… Python , JSON , and serverless architectures . Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4–8 years in software development and/or DevOps , including 2+ More ❯
years of commercial experience working primarily in an AWS Cloud environment using approaches/tooling like ours (see technical skills), delivering scalable, performant, reliable solutions. Strong data reliability/observability, data governance and information security credentials. What technical skills are required? ETL/ELT & Data Transformation: Amazon Redshift (query tuning, distribution/sort keys, workload management) Data modelling (normalisation, dimensional … data processing, automation, testing) SQL (advanced query writing and optimization) Data Engineering Best Practices: CI/CD for data pipelines (Git, GitHub Actions, etc.) Data quality checks, monitoring, and observability Infrastructure as Code (Terraform etc.) Other Tools & Ecosystem: Experience with logging/monitoring Exposure to data governance, cataloguing, and lineage tools Ability to work with a range of structured, semi More ❯
Manchester Area, United Kingdom Hybrid/Remote Options
ECOM
Great opportunity for Senior Python Engineers to work remotely for a UK based AI scale-up. You'd join a large engineering department and would work within a cross functional product-based team responsible for building cloud-native, event-driven More ❯