26 to 50 of 199 Site Reliability Engineering Jobs in England

Senior Staff Site Reliability Engineer, AViD, YouTube Ads

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

identify connections, adopt different perspectives and quickly respond to changing circumstances in a strategic way. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services—both … internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating ...

Site Reliability Engineer, iCloud

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. Apple Services Site Reliability Engineering (SRE) teams are responsible for the systems and services that directly support those customers and their experiences. We are looking for an SRE with experience … BIG. Operating at our scale, across multiple geographies and servicing hundreds of millions of users presents unique challenges. As a Software Developer in SRE at Apple, you'll need to solve these problems using data, teamwork, and your own expertise. ASE Products Site Reliability teams are responsible ...

Site Reliability & Network Systems Administrator

Hiring Organisation: Franklin Bates Limited
Location: Leamington Spa, Warwickshire, West Midlands, United Kingdom
Employment Type: Permanent
Salary: £55,000

Site Reliability & Network Systems Administrator Location: Leamington Spa - hybrid Salary: c£50k Join one of the industry's most innovative web platform organisations. We are working exclusively with an award-winning, market-leading SaaS platform business whose technology supports thousands of users every day. Following continued growth … particularly interested in people with experience of: Microsoft Azure setup, implementation and administration Azure networking and virtual infrastructure Site Reliability Engineering (SRE) principles Infrastructure monitoring and alerting platforms Web monitoring and uptime management tools DNS, TCP/IP, VPNs, firewalls and enterprise networking Microsoft Entra ID (Azure ...

Site Reliability Engineer (SRE) - Cloud & Automation

Hiring Organisation: Spencer Rose Ltd
Location: London, United Kingdom
Employment Type: Permanent
Salary: GBP 60,000 - 70,000 Annual

Site Reliability Engineer (SRE) - Cloud & Automation London, Docklands (hybrid) £60,000 - £70,000 per annum + annual discretionary bonus On behalf of a leading financial services organisation, I'm looking for a highly capable Site Reliability Engineer (SRE) to drive the adoption of SRE methodologies across … days per week in their Canary Wharf office, therefore you must be within a reasonable commute of London. Responsibilities: Lead the implementation of SRE practices across the organisation, working closely with infrastructure teams to optimise deployment processes and embed automation and operational excellence. Enhance observability and reliability , defining ...

SRE Managing Consultant

Hiring Organisation: Akkodis
Location: City of London, London, United Kingdom
Employment Type: Permanent
Salary: £90000 - £100000/annum

SRE Managing Consultant Cloud Operating Model & Reliability Transformation Security Clearance: SC eligible (UK residency required) Shape the Future of Cloud Reliability Are you passionate about building resilient, scalable cloud platforms that truly support the business? Do you thrive at the intersection of engineering excellence, operating models … senior stakeholder advisory? We're looking for a Managing Consultant in Site Reliability Engineering (SRE) to help organisations shift from reactive operations to measurable, product-aligned reliability - embedding SRE as a core engineering discipline across cloud and hybrid environments. You'll work with senior leaders ...

Software Engineer, GPU Infrastructure- ChatGPT Engineering

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

About the Team ChatGPT Engineering builds and operates the compute platform powering one of the world's largest AI products. Every ChatGPT conversation relies on massive GPU clusters serving inference workloads with high reliability, efficiency, and performance. As our GPU fleet continues to grow, we're investing … production infrastructure, preferably GPU clusters or other compute-intensive distributed systems. Have a background in Production Engineering, Site Reliability Engineering (SRE), Infrastructure Engineering, or Platform Engineering. Have built software that automates operational workflows rather than relying on manual processes. Have experience with Kubernetes, Linux systems ...

Lead Product Manager AIOPs

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

responsible for S&P Global's enterprise AIOps platform and strategy, driving the modernization of IT Operations and Site Reliability Engineering (SRE) through intelligent observability, event intelligence, automation, and AI-driven insights.DTS Platform & Tools – Service Enablement: We serve as thought leaders in AIOps, partnering across IT Operations … SRE, engineering, infrastructure, service management, and application teams to solve enterprise operational challenges. Our mission is to improve reliability, reduce operational complexity, optimize technology investments, and enable more proactive and resilient technology operations by applying AI.**Responsibilities and Impact:*** Own and execute the AIOps product roadmap, aligning priorities ...

Lead Product Manager AIOPs

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

responsible for S&P Global's enterprise AIOps platform and strategy, driving the modernization of IT Operations and Site Reliability Engineering (SRE) through intelligent observability, event intelligence, automation, and AI-driven insights. DTS Platform & Tools – Service Enablement: We serve as thought leaders in AIOps, partnering across … Operations, SRE, engineering, infrastructure, service management, and application teams to solve enterprise operational challenges. Our mission is to improve reliability, reduce operational complexity, optimize technology investments, and enable more proactive and resilient technology operations by applying AI. Responsibilities and Impact: Own and execute the AIOps product roadmap, aligning ...

Senior Site Reliability Engineer

Hiring Organisation: VIQU IT
Location: United Kingdom, Morley, West Yorkshire
Employment Type: Permanent
Salary: £65000 - £75000/annum

play a key role in driving best practice across cloud infrastructure, automation, reliability and operational excellence while helping to develop the wider SRE function. Key Responsibilities Design, build and maintain scalable, secure AWS infrastructure using AWS CDK. Drive improvements across platform reliability, automation and infrastructure as code. Lead … observability and incident management across cloud environments. Collaborate with engineering teams to improve performance, resilience and operational efficiency. Mentor and coach engineers, promoting SRE and DevOps best practices. Contribute to technical direction, architecture discussions and continuous improvement initiatives. Key Requirements Proven experience in a Senior Site Reliability ...

Senior Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Senior Site Reliability Engineer (SRE) - GCP/Kubernetes We are seeking an experienced and highly motivated Senior Site Reliability Engineer (SRE) to join our small, agile engineering team. This role offers the unique opportunity to drive the reliability, scalability, and performance of our core … Kubernetes application deployment. Monitoring & Observability: Implement and manage robust monitoring, alerting, and logging solutions to ensure clear system visibility and proactive issue identification. Reliability & Performance: Define, measure, and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Participate in on-call rotation (if applicable) and lead post ...

Senior Site Reliability Engineering Manager

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Role Overview Sr. Manager, Site Reliability Engineering (London) is an experienced leader responsible for overseeing a globally distributed team of SRE technologists with diverse skills in software development, systems, network, application, and/or database management. This role ensures seamless, continuous coverage of Cboe's real‐time … features; monitor development activities, change‐management tickets, evaluate impact; approve and execute daily change tickets; organize testing prior to deployment; work with software engineering to resolve systemic issues; ensure compliance obligations are met. Incident Response & Escalation Management: Serve as senior escalation point for production incidents across European ...

Site Reliability Engineer

Hiring Organisation: Randstad Digital
Location: London, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £60,000

Site Reliability Engineer (SRE) - 100% Remote Location: Fully Remote Duration: Permanent Are you passionate about building unbreakable systems and automating away the noise? We are looking for a dedicated Site Reliability Engineer (SRE) to join our remote team. Your primary mission will be to design, implement … complex challenges in the Azure ecosystem and sharing your knowledge with others, we want you on our team! What You Will Do As an SRE, you will be accountable for the delivery and support of production and non-production systems within the Azure ecosystem. Your day-to-day responsibilities will ...

Site Reliability Engineer

Hiring Organisation: Randstad Digital
Location: London, United Kingdom
Employment Type: Permanent
Salary: GBP 60,000 Annual

Principal Platform Engineer

Hiring Organisation: SF Partners Admin
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Permanent, Work From Home

observability strategies using Prometheus, Grafana, OpenTelemetry and modern APM tooling. Improve reliability through SLOs, incident reviews and Site Reliability Engineering (SRE) practises. Embed DevSecOps, supply-chain security and secure CI/CD practises. Drive cloud-native modernisation and legacy transformation. Support FinOps and cloud optimisation across … Internal Developer Platforms or self-service engineering. Observability tools including Prometheus, Grafana, OpenTelemetry, ELK, Datadog, Dynatrace or New Relic. DevSecOps and supply-chain security. SRE practises, SLOs, SLIs and incident management. Platform governance, cloud standards or landing zones. FinOps and cost optimisation. We are particularly interested in candidates who have ...

MongoDB SRE: Build Resilient, Automated Systems

Hiring Organisation: Jobleads-UK
Location: Knutsford, England, United Kingdom

Join our team as aMongoDB Site Reliability Engineer, where you'll be at the forefront of designing and maintaining robust, high-performance systems that power critical financial services. In this dynamic and fast-paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure … solving, multi-layered problems and building systems that perform reliably amid shifting priorities, we encourage you to apply. To be successful as a MongoDB Site Reliability Engineer, you should have experience with: Working in Site Reliability Engineering, DevOps, and MongoDB administration in financial services. Using ...

SRE - Site Reliability Engineer - Observability & Performance

Hiring Organisation: Sanderson Recruitment
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Contract
Contract Rate: £550 - £600 per day

SRE - Observability and Performance Up to £600 per day outside IR35 6 month initial contract Bristol - Largely remote I'm currently working with a client who is looking for an SRE to implement and enhance observability across Java applications, middleware and Linux infrastructure using Grafana. The role is focused … offering the opportunity to develop cloud-ready observability, automation and operational capabilities as the platform evolves. Essential Skills: Strong hands-on experience in DevOps, SRE, Platform Engineering or Systems Engineering environments. Expertise in Grafana, observability, monitoring, alerting and telemetry implementation. Strong Java application troubleshooting skills, including JVM performance ...

Site Reliability Engineer - SRE

Hiring Organisation: Sanderson Recruitment
Location: City of London, London, United Kingdom
Employment Type: Permanent

Role: Site Reliability Engineer Location: London (Hybrid) Salary: £80,000 - £105,000 As our Site Reliability Engineer, you'll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You'll define error budgets that … Very strong engineering skills in Java, JavaScript or Python Open Telemetry experience Must have Core Java/Python Must have experience as an SRE knowledge of Python Data Structures Strong knowledge of deploy and release services, automation and troubleshooting Experience of utilising tools and technology across the software development ...

Site Reliability Engineer - SRE

Hiring Organisation: Sanderson Recruitment
Location: City, London, United Kingdom
Employment Type: Permanent
Salary: GBP 100,000 Annual

Director, Head of Technology Resilience and Production Operations

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

heighten customer satisfaction by protecting revenue, safeguarding MUFG’s reputation, and ensuring regulatory confidence. The role holder will work with the Head of Digital Engineering Services and Solutions Department Head to deliver transformation through a reliable, robust, sustainable, scalable, and efficient operating model by leveraging best practices. The role … planning of Technology Resilience related projects and programmes measuring the effectiveness of these services delivered. It is a leadership position integral to the Digital Engineering Solutions and Services Leadership team, maintaining compliance and regulatory obligations. Key Responsibilities Plan & execute a comprehensive Technology Resilience and Production Operations strategy to scale ...

Site Reliability Engineer I - Connex

Hiring Organisation: Mastercard
Location: Harrogate, North Yorkshire, Yorkshire, United Kingdom
Employment Type: Permanent

networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer I - ConnexAbout Mastercard Mastercard is a global technology company in the payments industry, dedicated to enabling an inclusive digital economy that … household bills Almost all state benefit payments Joining Vocalink means contributing to systems that millions rely on daily. ________________________________________ The Opportunity We are seeking a Site Reliability Engineer I to support mission-critical applications within the UKs national payment infrastructure (FPS and CTS). This role is ideal ...

Operations Engineer

Hiring Organisation: Ascent Resourcing Limited
Location: Birmingham, West Midlands, England, United Kingdom
Employment Type: Full-Time
Salary: £55,000 - £60,000 per annum

applications, integrations, enterprise platforms, customer-facing technologies, and supporting infrastructure are effectively monitored, maintained, and supported. The focus is on proactively identifying issues, improving reliability, minimising service disruption, and maximising platform availability. The Operations Engineer will work collaboratively with Engineering, Platform, Integration, Product, Cybersecurity, and Enterprise IT teams … DevOps practices, and infrastructure automation. Experience supporting Salesforce, APIs, integrations, digital platforms, or enterprise SaaS solutions. Understanding of Site Reliability Engineering (SRE), operational engineering principles, or reliability-focused practices. ...

Senior Platform Engineer

Hiring Organisation: SF Partners
Location: Bristol, United Kingdom
Employment Type: Permanent
Salary: £75000 - £90000/annum

month in office. SC Clearance eligibility required £75,000 - £90,000 + excellent benefits We are supporting one of the UK's largest Platform Engineering growth programmes, delivering secure, cloud-native platforms that underpin critical digital services used by millions every day. This is an exciting opportunity to join … highly skilled engineering community working across some of the UK’s most complex and innovative cloud environments. You will support development teams to build, deploy and operate modern applications at enterprise scale. You will remain hands-on with technology whilst influencing engineering standards, enhancing developer experience and evolving ...

Site Reliability Engineer — AWS & Observability

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

aspects of GitHub workflows and deployment pipelines Incident response – Lead incidents, run blameless post-mortems, and drive continuous improvement Enable developers – Mentor teams on SRE and observability practices, helping them quickly understand and resolve issues Leverage AI tooling – Use AI‐assisted development tools (e.g. GitHub Copilot) to accelerate infrastructure work … explore AI‐driven approaches to incident detection, root cause analysis, and remediation What We're Looking For Essential 3+ years in an SRE, Platform, or DevOps engineering role AWS services: CloudWatch, X-Ray, Lambda, API Gateway, S3, SQS, Aurora PostgreSQL, DynamoDB, CloudFront, VPC, IAM, Security Groups Python for scripting ...

DevSecOps Engineer

Hiring Organisation: 167 Solutions Ltd
Location: North West London, London, United Kingdom
Employment Type: Permanent
Salary: £90,000

+ Benefits Type: Permanent About the Company 167 Solutions is partnering with an innovative technology organisation that is scaling its cloud engineering and platform capabilities. We are seeking a hands-on DevSecOps Engineer who can embed security directly into the software development lifecycle while remaining actively involved in engineering … will be responsible for automating security controls, improving cloud security posture, developing CI/CD pipelines, and implementing security tooling within a fast-paced engineering environment. The successful candidate will have strong software engineering capabilities alongside cloud and security expertise. Key Responsibilities Design, build and maintain secure ...

DevSecOps Engineer

Hiring Organisation: 167 Solutions Ltd
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £40,000 - £70,000 per annum

+ Benefits Type: Permanent About the Company 167 Solutions is partnering with an innovative technology organisation that is scaling its cloud engineering and platform capabilities. We are seeking a hands-on DevSecOps Engineer who can embed security directly into the software development life cycle while remaining actively involved … engineering, automation, cloud infrastructure, and platform delivery. This is not a traditional security administration or governance role. We are looking for an engineer who writes code, builds automation, develops cloud-native solutions, and integrates security into modern software delivery practices. The Opportunity As a DevSecOps Engineer, you will work ...