as well as accurate health monitoring across all our clients, both new and old. The person in this role will join the Site Reliability Engineering (SRE) team. The main role of the SRE team is to facilitate the scalability of Dayshape and allow us to meet the demands of an increasing client base. What you'll … do Lead initiatives to enhance Dayshape's ability to scale our cloud platform Maintain and improve our cloud estate in Azure Improve SRE and other teams' working lives through automation of manual tasks Lead in making the deployment of Dayshape more scalable Increase our knowledge sharing of SRE across the organisation Improve the observability of Dayshape through reporting and tool … Infrastructure as Code". We are looking for someone who has a genuine enthusiasm for IaC and experience of Delivered IaC frameworks that can be contributed to by other engineering teams. An understanding of the patterns and practices required to deliver reliable, high-quality results, such as unit and integration testing, CI/CD etc. Writing and debugging Python
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
functional teams to enhance cloud infrastructure and improve performance Driving automation and Infrastructure as Code (IaC) using tools such as Terraform, GitHub, and Ansible Implementing best practices in security, reliability, and scalability across cloud environments Working with Agile methodologies to optimise cloud platform performance and streamline development processes Providing guidance and mentorship to teams, helping to establish best practices … working What You’ll Need: 4+ years of experience in Platform Engineering, Cloud Engineering, or DevOps Hands-on experience in DevSecOps, Site Reliability Engineering (SRE), or Agile Delivery Strong cloud expertise in AWS, Azure, or GCP Proficiency in Infrastructure as Code (IaC) using Terraform, GitHub, and Ansible A solid understanding of cloud security, networking, and … projects, collaborate with industry experts, and be part of a company that values innovation and continuous learning. If you are looking for a challenging and rewarding role in cloud engineering, contact Anna-Jane Murphy at Anson McCade to learn more. AMC/AJM/CPEKJ1
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … impact. If you're ready to shape the future, let's talk. About the Role We're looking for a Senior Site Reliability Engineer to join our SRE team. This is a hybrid role that blends deep platform engineering with application-level troubleshooting. You'll be responsible for the stability, performance, and resilience of our cloud … code Resolve incidents and support root causes (Java and GoLang services) Contribute to postmortems and reliability engineering initiatives Who You Are Essential Experience 5+ years in an SRE, DevOps, or infrastructure role Deep hands-on experience with AWS, EKS/Kubernetes, and Terraform Working knowledge of Kafka tuning, monitoring, and operational troubleshooting Strong familiarity to be able to
Site Reliability Engineer Locations: IND-BLR-Divyasree Technopolis Time type: Full time Posted: Yesterday Job requisition ID: R About LSEG: The London Stock Exchange Group (LSEG) is a global financial markets infrastructure and data provider headquartered in London, UK. Established in 2007, though its core institution, the … on SQL Server and SSIS today, we're actively exploring cloud-native platforms; your voice will help guide that transition. Collaborative Environment: Work multi-functionally with guides in data engineering, DevOps, and analytics in a culture that values curiosity, accountability, and continuous improvement. Tech that Matters: You'll support systems that drive real-time business decisions, impact thousands of
Core, BCG X, and CT worldwide. This role is also accountable for embedding security within DevSecOps practices, enforcing automation at scale, and applying Site Reliability Engineering (SRE) principles across all security services. The role requires strong partnership with ISRM, with a focus on balancing and prioritizing security requirements, automation opportunities, user experience needs, and broader business outcomes. … that support modern work scenarios, remote access, zero-trust networking, and AI/ML workloads. Leverage automation frameworks and IaC to improve scalability and reduce manual intervention. Operational Security, SRE & Assurance: Ensure security platforms are resilient, continuously monitored, and designed for 24x7 support and incident response readiness. Embed security telemetry and observability to enable proactive threat detection and automated response. … Apply SRE principles to improve reliability, performance, and maintainability of security services. Lead platform health, patching automation, and vulnerability remediation workflows. Define service level objectives (SLOs) and key performance indicators (KPIs) for all security services. Compliance, Governance & Risk Management: Ensure alignment with global compliance requirements such as ISO 27001, NIST, SOC 2, GDPR, and others. Partner with governance, legal
strong background in DevOps design and transformation, cloud-native engineering, and modern DevOps tooling. The ideal candidate will also bring expertise in Site Reliability Engineering (SRE) principles and practices, with a focus on building scalable, reliable, and resilient systems. Key Responsibilities: Architect and implement scalable, secure, and high-performance DevOps solutions. Lead DevOps transformation initiatives across … enterprise environments. Design and implement cloud-native solutions on Azure, AWS, or GCP. Apply SRE principles to ensure system reliability, availability, and performance. Build and maintain CI/CD pipelines and infrastructure as code (IaC). Evaluate and integrate modern DevOps tools and practices. Collaborate with cross-functional teams to align DevOps and SRE strategies with business goals. Mentor … and lead DevOps teams, fostering a culture of innovation and continuous improvement. Leverage AI and machine learning to optimize DevOps and SRE processes. Ensure compliance, security, and operational excellence in all DevOps practices. Required Qualifications: 15+ years of experience in IT, with a strong focus on DevOps and cloud architecture. Proven experience in DevOps design and transformation across multiple projects.
Site Reliability Engineer - Outside IR35 - Edinburgh with Remote - 6 months Initial Contract - Immediate Start My client are currently on their journey to move from on-premise project work to a Cloud-based offering (AWS) and require a seasoned Site Reliability Engineer to provide hands-on technical skills and to support and improve the current digital
Job Title: Cloud Engineer/SRE - Golang & GitHub Location: Remote - UK, London Salary/Rate: Up to £690 a day Inside IR35 Start Date: August 2025 Job Type: 12 Month Contract Company Introduction: We are seeking a highly skilled Cloud Engineer/SRE with Development experience in Go and GitHub to join our client in the Global Analytical Risk sector. … We are seeking a highly skilled and motivated Cloud Engineer/SRE to join our newly formed Enterprise GitHub Operations & Tooling team. This is a foundational role where you will be instrumental in designing, building, and managing the core services and tooling that underpin our extensive use of GitHub Enterprise. You will be responsible for developing code and solutions that … managing) GitHub Actions (designing complex workflows, custom actions) GitHub Enterprise, Organization and Repository settings. Operations/Infrastructure Background: Proven experience in an operations, site reliability engineering (SRE), or infrastructure engineering role, with a strong appreciation for automation and stability. Modern SDLC Practices: Familiarity with: Dependency management. Security remediation processes and secure coding practices. Testing frameworks and
Senior Site Reliability Engineer Central London (Hybrid) Up to £100k + Car Allowance & Bonus TRIA are working with a leading hospitality client to hire a Senior SRE, where they are investing heavily in the performance, stability, and reliability of its digital platforms. This is a hands-on leadership role - you won't just guide others, you'll … Improving alerting, monitoring, and system-level metrics Driving better SLOs, SLIs, and overall uptime What you'll bring: Experience in high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front The stack includes Kubernetes, Terraform, AWS … Python, and modern CI/CD tools, and it's evolving. If you understand what a good SRE practice looks like, and want to leave systems in a better place than you found them, please apply to be considered and learn more.
we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability … and SOPs Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web
plan. CI/CD Pipeline Management: Manage and optimise CI/CD pipelines using tools like GitHub Actions, Travis, and other automation frameworks. Site Reliability Engineering (SRE): Perform SRE duties to ensure system availability, performance, and scalability. Application Support: Work closely with application teams to support application deployment and performance monitoring. Cloud Administration: Administer and optimise … orchestration tools. Proficiency in monitoring tools such as DataDog, Splunk, or New Relic. Strong understanding of CI/CD pipelines and automation tools. Experience with incident management and SRE best practices. Excellent problem-solving skills and the ability to work collaboratively across teams. We are looking to find individuals keen to join our scaling team - our tech has real
are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability … and SOPs Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure Other duties as needed About You 7+ years' experience in Site Reliability Engineer roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools
Operations Site Reliability Engineer Locations: United Kingdom-Bristol-Almondsbury-Hempton Court Time type: Full time Posted: 30+ Days Ago Job requisition ID: R022662 Please Note: 1. If you are a first time user, please create your candidate login account before you apply for a job. (Click Sign … Provide feedback and coaching to upstream teams (both internal and vendors) to reduce escalations and to continually improve overall experience for customers. Professional Experience Required A degree in Systems Engineering, Computer Science or related fields with related experience preferred 5+ years of experience administering Linux systems Strong hands-on experience of variants of Linux distros 2+ years Operational experience … salary Generous bonus scheme Equity package Competitive company pension Employee stock purchase plan (ESPP) Private Medical Insurance (Individual or family) Life Assurance scheme (up to 4x salary) Ample on-site parking. This role will need to participate in weekend and holiday on-call support as and when required. Broadcom is proud to be an equal opportunity employer. We will
Join us as a Senior Site Reliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them. To be successful as … a Senior Site Reliability Engineer - Oracle you should have experience with: Oracle Enterprise Manager (OEM), Oracle Internet Directory (OID), Oracle database Performance Tuning - SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise … strategic thinking and digital and technology, as well as job-specific technical skills. This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability
City Of Westminster, London, United Kingdom Hybrid / WFH Options
Track24 Limited
or New Relic to gain monitoring and performance insights. Incident Management: Establish and oversee monitoring and incident management processes to ensure system reliability. Site Reliability Engineering (SRE): Perform SRE duties to ensure system availability, performance, and scalability. Application Support: Work closely with application teams to support application deployment and performance monitoring. We use AWS internally, however are
everyone can do their best work. Whether you're building on our platform, supporting our customers, or shaping our story: You can just ship things. About the Role: As SRE Manager, you will lead the creation and operation of a 24/7 Site Reliability Engineering function for Vercel. Your primary goal is to act as the … If you're located beyond that distance, the role is fully remote. For location-specific details, please connect with our recruiting team. What You Will Do: Build & nurture the SRE team at Vercel, holding a high bar for technical work and teamwork. Build rapport with each member of the team and support them as they level up their skills. Define … directly with executive leadership to communicate risks and opportunities and influence cross-engineering prioritization. Partner more specifically with CDN and Compute engineering teams to define and manage SRE-driven project initiatives that improve the robustness and operational efficiency of the company's most critical serving systems. About You: At least 5 years experience in an SRE role, or
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
to gemstone supplies. They have a presence in London, Hong Kong, Amsterdam, and as well in Mumbai and now in New York in 2001. About the role: As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution along with team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first-class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team you would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with
Salford, Manchester, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
drive continuous improvement as we transition to cloud-native technologies. You'll challenge the status quo and push boundaries by working closely with the DevOps COE and the wider engineering community. Join us as an innovator as we enter the next phase of our transformation journey. We're looking for passionate and curious technology specialists with innovative minds who … and compliance principles into architecture and development, ensuring alignment with regulatory and risk frameworks. DevOps & Quality Engineering - Practical experience with DevOps or Site Reliability Engineering (SRE), including automation, CI/CD, and quality assurance practices. Leadership & Mentorship - Leads cross-functional teams, drives delivery, coaches others, and fosters a culture of continuous improvement and development. Business Acumen … future trends, drives change initiatives, and shapes technology roadmaps to deliver long-term value. It would be great if you had any of the following Infrastructure as Code & Cloud Engineering - Hands-on experience with tools like Terraform, Chef, Puppet, and Ansible, combined with exposure to cloud platforms such as GCP, AWS, Azure, or ICP/OCP. CI/CD
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
drive continuous improvement as we transition to cloud-native technologies. You'll challenge the status quo and push boundaries by working closely with the DevOps COE and the wider engineering community. Join us as an innovator as we enter the next phase of our transformation journey. We're looking for passionate and curious technology specialists with innovative minds who … and compliance principles into architecture and development, ensuring alignment with regulatory and risk frameworks. DevOps & Quality Engineering - Practical experience with DevOps or Site Reliability Engineering (SRE), including automation, CI/CD, and quality assurance practices. Leadership & Mentorship - Leads cross-functional teams, drives delivery, coaches others, and fosters a culture of continuous improvement and development. Business Acumen … future trends, drives change initiatives, and shapes technology roadmaps to deliver long-term value. It would be great if you had any of the following Infrastructure as Code & Cloud Engineering - Hands-on experience with tools like Terraform, Chef, Puppet, and Ansible, combined with exposure to cloud platforms such as GCP, AWS, Azure, or ICP/OCP. CI/CD
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Lorien
Junior Site Reliability Engineer Hybrid - Manchester x2 days a week Salary up to £45,000 + Bonus The Company: Lorien Global are supporting a growing business based in Manchester City Centre as they expand their Support Services team. With an exciting pipeline of work ahead, they're looking to hire an experienced Junior Site Reliability Engineer
Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Ordnance Survey Limited
hear from you. Essential Criteria Good knowledge of Azure Cloud hosting technologies Experience with PostgreSQL databases (including PostGIS spatial extension) Good understanding of Site Reliability Engineering (SRE) and software engineering best practices Experience investigating the root cause of failures to understand why they have occurred and propose/enact solutions, and work with external suppliers if