Manchester, North West, United Kingdom Hybrid / WFH Options
Daniel James Resourcing Ltd
to work closely with architects, developers, and product teams Desirable: AWS certifications Experience in environments where performance, scale, and uptime matter Familiarity with monitoring, observability, and cost optimisation tools This is a rare chance to be part of something at the ground floor, delivering real engineering impact from day one More ❯
other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of More ❯
to implement and maintain reliable and scalable systems while adhering to industry best practices and security standards. Responsibilities and Impact: Design, implement, and maintain observability solutions to track system health and performance. Analyze observability data to identify and troubleshoot potential issues proactively. Develop and implement alerts and notifications for critical … or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running in AWS Comfortable with Infrastructure as Code, Terraform is preferred Comfortable with CI/CD pipelines such More ❯
lead R&D or impact analysis initiatives, where hands on rapid prototyping may be required. Skills: TypeScript, Node, React AWS, Lambdas Docker, Kubernetes, Terraform Observability CI/CD The money is good too – up to £80k plus benefits including 10% annual bonus, pension, private healthcare and flexible working. If you More ❯
Infrastructure as Code (Terraform), and DevSecOps practices. You will be instrumental in designing and implementing Azure cloud infrastructure with a strong emphasis on governance, observability, and compliance. your Responsibilities: Design and document secure, scalable Azure cloud architectures aligned with enterprise and regulatory standards Lead implementation using Infrastructure as Code (Terraform More ❯
Infrastructure as Code (Terraform), and DevSecOps practices. You will be instrumental in designing and implementing Azure cloud infrastructure with a strong emphasis on governance, observability, and compliance. your Responsibilities: Design and document secure, scalable Azure cloud architectures aligned with enterprise and regulatory standards Lead implementation using Infrastructure as Code (Terraform More ❯
lead R&D or impact analysis initiatives, where hands on rapid prototyping may be required. Skills: TypeScript, Node, React AWS, Lambdas Docker, Kubernetes, Terraform Observability CI/CD The money is good too - up to £80k plus benefits including 10% annual bonus, pension, private healthcare and flexible working. If you More ❯
stakeholders to define solutions Mentor junior developers and promote best practices Drive improvements in development processes and tools Investigate and resolve production issues Ensure observability through logging, metrics, and diagnostics What You Bring 5+ years of backend development experience Expertise in C#, .NET (preferably .NET 6+), and SQL Server Deep More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Apollo Solutions
Vault with various internal banking systems. Establish coding standards, enforce clean code practices, and drive team-wide adoption of CI/CD, automated testing, observability, and secure development. Build scalable, reusable APIs with reliability and performance at the core. Work closely with DevOps, SREs, and platform teams to ensure resilient More ❯
other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of More ❯
other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of More ❯
Strong software engineering and SRE background. Expertise in Java and Kubernetes in production environments. Proven success leading large-scale service migrations. Solid experience in observability, incident management, and infrastructure automation. This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy More ❯
other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of More ❯
manchester, north west england, United Kingdom Hybrid / WFH Options
On the Beach
with Kubernetes, Terraform, and CI/CD pipelines for scalable and automated deployments. Understanding of GraphQL and modern API-driven architectures. Strong focus on observability and monitoring, ensuring system reliability and performance. Knowledge of internet scale real-time data processing, supplier integrations, and transactional systems. A pragmatic approach to balancing More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Manchester Digital
they work on, from ideation through to development, testing, and deployment, so you should expect to champion and mentor on best practices like TDD, Observability, and IaC. Skills: CI/CD, TDD, SOLID The salary is competitive - up to £90k plus benefits including hybrid working (1-2 days per month More ❯
workflows, and refine the design system and web components. Manage CI/CD, IaC, and incident response while optimising the developer platform (Backstage) and observability across AWS Serverless. Contribute to team meetings, planning, and retrospectives, ensuring smooth workflows and efficiency. Lead technical discussions, drive key projects, and keep stakeholders informed. More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
InterQuest Group (UK) Limited
to development, testing and deployment, so you should expect to maintain high standards in code quality through TDD and be comfortable with practices like Observability and IaC. As a Senior Software Engineer, you'll also be given the opportunity to shape the overall approach to software engineering and mentor less More ❯
GitLab) Experience with modern CI/CD tools and techniques Desirable skills: Experience in BDD and creating tests using Gherkin syntax Experience working with observability and monitoring tools such as Firebase or similar Experience of non-functional testing in a mobile environment Equal Opportunities Statement At AND Digital we embrace More ❯
customer-focused organisation which provides operational excellence whilst identifying new areas of growth as part of our day to day objectives. The CoE Lead - Observability & Tools at JD Sports Fashion Plc is a critical, hands-on technical role focused on designing, building, and maintaining the company's Observability platform. This … focus on implementation and adoption. What You'll Be Doing: We are looking for an experienced CoE Lead to design, build, and maintain our Observability platform. The CoE Lead will work closely with DevOps, Engineering, Service Reliability, and Service Delivery teams to continuously improve our Observability capabilities. This role is … technical, hands-on position with a 75% focus on framework design and 25% on implementation and adoption. You will contribute to pipeline design, enabling observability from the first deployment in test environments and providing early insights for Engineering, Service Reliability, Service Delivery, and DevOps teams. The role involves building frameworks More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and … more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through platform tools, reusable Terraform modules … DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices More ❯
Vitals for optimal performance. Integrate third-party software into the platform, including tag management using Google Tag Manager (GTM) . Improve and maintain platform observability tools and systems. Manage and enhance automated CI/CD pipelines for efficient and reliable deployments. Ensure sites are accessible to all users, meeting WCAG More ❯
manchester, north west england, United Kingdom Hybrid / WFH Options
On the Beach
Write clean, maintainable code and participate actively in code reviews. Operational Support: Support production systems, identify root causes of issues, and contribute to improving observability and automation. Knowledge Sharing: Document systems and help onboard and guide new team members. Skills and Experience Cloud Experience: Solid hands-on experience with AWS More ❯
reporting and security leads to ensure data platforms are meeting product needs to service client expectations. Guide teams to ensure a high degree of observability of data platform reliability and performance, working alongside the Head of Platform to enhance visibility of these metrics throughout the business. Drive innovation in related More ❯
the heart of a high-impact engineering transformation. You’ll combine core software engineering skills with Site Reliability Engineering (SRE) principles to deliver automation, observability, and resilience across our systems. What You’ll Do 🚢 Lead migration efforts for services running in Kubernetes, ensuring smooth rollouts with zero-downtime strategies. 🧠 Design … environments (AWS + Kubernetes). 🔧 Automate processes to reduce toil and accelerate delivery using Infrastructure-as-Code and CI/CD best practices. 📊 Implement observability through enhanced logging, metrics, and alerts to maintain service health throughout migration. 🔍 Troubleshoot complex systems and lead incident response, root cause analysis, and iterative improvements. More ❯
manchester, north west england, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles … including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and Pager Duty. Excellent knowledge of programming languages including Python, Golang and JavaScript. Knowledge and More ❯