101 to 116 of 116 Site Reliability Engineer Jobs in the UK

Data Reliability Engineer II — Cloud DB Ops & SRE (Hybrid)

Hiring Organisation
Jobleads-UK
Location
Belfast, Northern Ireland, United Kingdom
A leading derivatives marketplace in Northern Ireland is looking for a Data Reliability Engineer II to take responsibility for the reliability and performance of database systems. This role focuses on proactive incident management, performance tuning, and automation within a modern data ecosystem, utilizing technologies like GCP Cloud ...

Investment Banking SRE - automation, incidents, scale engineering, banking, payment - Contract

Hiring Organisation
Scope AT Limited
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Investment Banking SRE - automation, incidents, scale engineering, banking, payment - Contract Description: The SRE Lead designs and leads reliability engineering across banking and payments, establishing SRE standards, automation, and learning practices to improve resilience, reduce incidents, and scale engineering led operations. Leads Site Reliability Engineering (SRE) strategy, automation … reliability by design, and observability. Reduces incidents and improves recovery. Skilled in resilient engineering, risk control, and scaling operations across complex banking and payments environments. 8+ years experience. Inside IR35 - hybrid working (3 days a week in office) By applying to this job you are sending us your ...

Site Reliability Engineer - BACLJP00013172

Hiring Organisation
Huxley Associates
Location
Shortlands, Greater London, UK
client within Investment Banking is currently seeking an SRE Lead. Do you have the right skills and experience for this role? Read on to find out, and make your application. I'm working on an SRE Lead role within a banking/payments environment that I thought might … interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8 years' experience in SRE, a strong resilience engineering background, and the ability to scale operations in complex environments. Logistics: £600 p/ ...

Senior Software Engineer/SRE - BQL Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
Senior Software Engineer/SRE - BQL Reliability Engineering Location London Business Area Engineering and CTO Ref # 10049751 Description & Requirements Bloomberg runs on data. It’s our business and our product. BQL is the single API for all client-facing structured data at Bloomberg. As Bloomberg Terminal … research, and modeling. In terms of scale, BQL handles ~100 million requests hourly from ~100K active firms. The BQL Platform Observability team owns the reliability of the BQL platform, using observability as its primary lever. We ensure that the BQL ecosystem—spanning workload management layers, the query engine ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
them. About the Role SREs at Fluidstack sit at the core of our infrastructure, working across software, hardware, and operations to ensure the reliability and performance of our global GPU cloud. They partner closely with teams including networking, platform engineering, and data center operations to build systems that scale … nature of the AI ecosystem. Strong technical and interpersonal communication skills, a low ego, and a positive mental attitude. Minimum Requirements 5+ years of SRE, DevOps, Sysadmin, and/or HPC engineering experience. Great verbal and written communication skills in English. Experience deploying and operating Kubernetes and/or SLURM ...

Platform Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
England, United Kingdom
decisions clearly to non-technical stakeholders and customers. Uphold a culture of: do, document, automate. Willingness to cross-train with Platform Engineering/Platform SRE to fully support both our infrastructure and platform stacks. Willingness to cross-train with HPC Engineering, supported by NVIDIA, to enhance our HPC supportability offering … Requirements: 5+ years' proven experience in globally scaled, performance-intensive environments operating to a 24/7 support model in an SRE or equivalent role. 3+ years' experience running, deploying and optimising orchestration platforms, with a strong emphasis on Kubernetes. Expert-level Linux administration, especially Ubuntu distributions. Proficiency ...

Manager – Site Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
thinking. Role summary: You will be responsible for ensuring stability, resilience, and performance of our production systems while driving continuous improvement and SRE best practices across the platform. What you'll be doing: Service Ownership: Assume end-to-end accountability for Clearing production environment, ensuring high availability, optimal … incidents, leading resolution efforts, managing stakeholder communications, and driving root cause analysis and remediation. Team Leadership & Talent Development: Build and mentor a high-performing SRE team. Promote a culture of accountability, continuous improvement, and blameless postmortems to enhance operational excellence. Operational Excellence & SLA Compliance: Ensure consistency to response and resolution ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
will work with Development and Product Management to design and deliver new functionality. You will perform deep dives into both systemic and latent reliability issues; partner with software engineers across the organization to produce and roll out fixes. You will drive standardization efforts across multiple disciplines and services … solid understanding of continuous integration, deployment and operations concepts. You have production experience of managing Windows infrastructure running IIS workloads. Passion for resolving reliability issues and identifying strategies to mitigate them going forward. Automation mindset - if you can automate it, do it. Fluency in English. What you will gain ...

Junior Site Reliability Engineer

Hiring Organisation
Granite Recruitment and Consulting
Location
Bath, Somerset, South West, United Kingdom
Employment Type
Permanent
Salary
£45,000
Junior Site Reliability Engineer Bath/Hybrid - £35,000 - £45,000 A great opportunity for a Junior Site Reliability Engineer to join an organisation who will be transitioning towards a DevOps environment, and pick up relevant skills along the way. The role will be joining a leading software house in central Bath, building and maintaining infrastructure for their SaaS products. They are in the process of investing heavily in their technical environment and are looking for an engineer who can help with new tech implementations. Core responsibilities will include: Maintaining hosting infrastructure (ensuring availability of web, database and messaging systems); Managing servers (across development, test and production); Monitor and respond to infrastructure events (such as availability and capacity management); Be proactive with infrastructure upgrades. Does this sound like you? Experience managing servers in the cloud; Proactive, not reactive; Production systems awareness; Possibly a graduate with a placement year/some relevant experience, or someone working in a support/infrastructure position. The technical environment looks like this: AWS cloud infrastructure ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Responsibilities: Schedule and monitor real-time trading systems. Monitor open and close of markets as it relates to the system. Reduce the number of trading hiccups. Monitor and escalate real-time system alerts. Handle any ...

Site Reliability Engineer, GPUs in AI

Hiring Organisation
Radley James
Location
City of London, London, United Kingdom
London. The team of engineers and researchers comes from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic etc. They are looking for a Senior Systems Engineer to focus on cluster management, platform engineering that handles a high number of GPUs (currently in the 20k-40k range), monitoring/reliability ...

Site Reliability Engineering Manager

Hiring Organisation
F5 Consultants
Location
Reading, England, United Kingdom
SRE TECHNICAL LEAD Location: Berkshire - Hybrid 2-3 days onsite Salary: Up to £120,000 + 10% bonus Clearance: Active SC clearance required We have an exciting opportunity for an SRE Technical Lead/Manager to join a major UK critical infrastructure programme delivering large-scale cloud-native transformation … enterprise scale. In this role, you’ll take ownership of SRE strategy and platform reliability across complex Kubernetes and OpenShift environments, helping shape engineering standards, operational maturity, and long-term platform stability. You’ll work within a modern cloud-native environment leveraging Kubernetes, OpenShift, GitOps, service mesh, observability tooling ...

Senior Software Engineer/SRE - TRAX Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Senior Software Engineer/SRE – TRAX Observability Location: London Business Area: Engineering and CTO Ref #: 10049287 About TRAX TRade Automation and eXecution (TRAX) is part of Bloomberg Enterprise Products Engineering. We build trade automation solutions and multiple Execution Management Systems (EMSs) that enable clients to route orders, execute … client‐facing issues. Network & Stakeholder Exposure Collaborate with engineering and product teams across London, Frankfurt, Tel Aviv, and New York, as well as peer SRE teams focused on Scalability and Resilience. You’ll develop strong stakeholder management and communication skills. We’ll trust you to: Enhance and maintain systems that ...

Senior SRE & Observability Engineer – Trade Tech

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Bloomberg L.P. is seeking a Senior Software Engineer/SRE for the TRAX Observability team in London. This role involves enhancing systems for performance metrics, improving telemetry reliability, and collaborating with various teams across global offices. Candidates should have experience with high-level programming languages, Unix/Linux ...

SRE-NOC Engineer (24/7): Reliability, Automation & Observability

Hiring Organisation
Jobleads-UK
Location
United Kingdom
NICE is looking for an SRE – NOC to combine traditional Network Operations with engineering-driven reliability practices. The role emphasizes 24/7 service reliability, incident response, and operational automation. You will lead incident responses, monitor service health, implement automation to reduce toil, and ensure the operational readiness ...

Senior Cloud QA Engineer — Test Automation & SRE Alignment

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
Advantage Resourcing UK Ltd is seeking a Senior Software Engineer in Test on a temporary contract until March 2027. Located in Manchester, this position requires demonstrable experience in developing and testing cloud-native applications, alongside programming skills in Java or Python. Key responsibilities include defining testing processes, implementing frameworks ...