101 to 120 of 120 Site Reliability Engineer Jobs

Data Reliability Engineer II — Cloud DB Ops & SRE (Hybrid)

Hiring Organisation
Jobleads-UK
Location
Belfast, Northern Ireland, United Kingdom
leading derivatives marketplace in Northern Ireland is looking for a Data Reliability Engineer II to take responsibility for the reliability and performance of database systems. This role focuses on proactive incident management, performance tuning, and automation within a modern data ecosystem, utilizing technologies like GCP Cloud ...

Senior Site Reliability Engineer

Hiring Organisation
Trust In SODA
Location
Gloucestershire, England, United Kingdom
Senior Infrastructure SRE 👔 AI Infrastructure Startup 📍 Gloucester - remote working 💵 £75-95k + Equity Do you want to work for a pioneering tech company that’s redefining how AI infrastructure is built and scaled? Do you want to work in a business critical technical role, utilising all the latest technologies … have just received Series B funding which they will be putting towards a huge tech scale-out. They are now looking for a Senior SRE to join their Gloucester team and play a pivotal role in the ongoing scaling and optimisation of their platform. Required Skills: Linux (Kernal level troubleshooting ...

Site Reliability Engineer

Hiring Organisation
Computappoint
Location
City Of London, England, United Kingdom
base Location: City of London Hybrid pattern: 3 days onsite a week The role: A global financial markets client is looking for an SRE to join its Production Services team. You'll support mission-critical trading, clearing, and market data platforms - environments where reliability genuinely isn't optional. … role blends application support, platform engineering and SRE practice. It suits someone who leans toward automation and observability over reactive firefighting. Responsibilities: Managing OpenShift and Kubernetes clusters across physical, virtual, and containerised environments Operating observability stacks ( Grafana , Prometheus, Splunk) and driving proactive monitoring Automating operational tasks using Python , Bash ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
will be responsible for the continued development of our monitoring systems and use them to proactively identify and communicate performance, reliability, security and cost issues. You will assist in responding to incidents and the remediation of vulnerabilities in our platform. You will also identify, plan and implement improvements … commitment to ISO 27001 compliance. Minimum Bachelor 2:1 degree in computer science or a related field 2+ years experience in a professional DevOps, SRE, Platform Engineering or similar role Self-motivated with strong problem-solving and analytical skills Experience using and configuring monitoring tools, ideally Grafana and Prometheus ...

Site Reliability Engineer (Python)

Hiring Organisation
Mphasis
Location
City of London, London, United Kingdom
need an experienced SRE to focus predominantly on automation, optimization, and process re-engineering using AI for the Market Risk Platform. Strong Python and provable agentic AI delivery Primary Objectives: Eliminate Operational toil and recurring manual work through durable automation Re-engineer support/change processes to reduce handoffs … outcomes. Productionize with monitoring, logging and post incident learnings feeding back into the agent/tooling Observability (enablemen for automation) Required skills & Experience Senior SRE experience on distributed systems and batch/intraday workloads in a production environment. Strong Python Provable agentic AI experience showing Tool integration, guard rails, evaluation ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
them. About the Role SREs at Fluidstack sit at the core of our infrastructure, working across software, hardware, and operations to ensure the reliability and performance of our global GPU cloud. They partner closely with teams including networking, platform engineering, and data center operations to build systems that scale … nature of the AI ecosystem. Strong technical and interpersonal communication skills, a low ego, and a positive mental attitude. Minimum Requirements 5+ years of SRE, DevOps, Sysadmin, and/or HPC engineering experience. Great verbal and written communication skills in English. Experience deploying and operating Kubernetes and/or SLURM ...

Platform Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
England, United Kingdom
decisions clearly to non-technical stakeholders and customers Uphold a culture of: do, document, automate Willingness to cross train with Platform Engineering/Platform SRE to fully support both our infrastructure and platform stacks. Willingness to cross train with HPC Engineering, supported by NVIDIA to enhance our HPC supportability offering … Requirements 5+ Years Proven experience in globally scaled, performance-intensive environments operating to a 24/7 support model in an SRE or equivalent role 3+ years experience in both running, deploying and optimising orchestration platforms with a strong emphasis on Kubernetes Expert-level Linux administration, especially Ubuntu distributions Proficiency ...

Manager – Site Reliability Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
thinking. ## ## Role summary:You will be responsible for ensuring stability, resilience, and performance of our production systems while driving continuous improvement and SRE best practices across the platform.## What you'll be doing:* **Service Ownership**Assume end-to-end accountability for Clearing production environment, ensuring high availability, optimal … incidents, leading resolution efforts, managing stakeholder communications, and driving root cause analysis and remediation.* **Team Leadership & Talent Development**Build and mentor a high-performing SRE team. Promote a culture of accountability, continuous improvement, and blameless postmortems to enhance operational excellence.* **Operational Excellence & SLA Compliance**Ensure consistency to response and resolution ...

Site Reliability Engineer: Cloud, Monitoring & Security

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
Cambridge Wireless Ltd) in Cambridge is looking for a DevOps Engineer to enhance monitoring systems, respond to incidents, and improve cloud infrastructure. The role requires a minimum of a Bachelor's degree and at least 2 years of experience in a relevant position. Ideal candidates will have experience with ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
will work with Development and Product Management to design and deliver new functionality. You will perform deep dives into both systemic and latent reliability issues; partner with software engineers across the organization to produce and roll out fixes. You will drive standardization efforts across multiple disciplines and services … solid understanding of continuous integration, deployment and operations concepts. You have production experience of managing Windows Infrastructure running IIS workloads Passion for resolving reliability issues and identify strategies to mitigate going forward. Automation mindset - if you can automate it, do it. Fluency in English What you will gain ...

Junior Site Reliability Engineer

Hiring Organisation
Granite Recruitment and Consulting
Location
Bath, Somerset, South West, United Kingdom
Employment Type
Permanent
Salary
£45,000
JuniorSiteReliabilityEngineerBath/Hybrid-£35,000-£45,000 AgreatopportunityforaJuniorSiteReliabilityEngineertojoinanorganisationwhowillbetransitioningtowardsaDevOpsenvironment,andpickuprelevantskillsalongtheway. TherolewillbejoiningaleadingsoftwarehouseincentralBath,buildingandmaintaininginfrastructurefortheirSaaSproducts.Theyareintheprocessofinvestingheavilyintheirtechnicalenvironmentandarelookingforanengineerwhocanhelpwithnewtechimplementations. Coreresponsibilitieswillinclude: Maintaininghostinginfrastructure(ensuringavailabilityofweb,databaseandmessagingsystems) Managingservers(acrossdevelopment,testandproduction) Monitorrespondtoinfrastructureevents(suchasavailabilityandcapacitymanagement) Beproactivewithinfrastructureupgrades Doesthissoundlikeyou? Experiencemanagingserversinthecloud Proactive,notreactive Productionsystemsawareness Possiblyagraduatewithaplacementyear/somerelevantexperience,orsomeoneworkinginasupport/infrastructureposition Thetechnicalenvironmentlookslikethis: AWS cloudinfrastructure ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Responsibilities Schedule and monitor real-time trading systems Monitor open and close of markets as it relates to the system Reduce the number of trading hiccups Monitor and escalates real-time system alerts Handle any ...

Site Reliability Engineers (DV Security Clearance)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
safe and secure. Our work spans cloud, on‐prem and hybrid environments, underpinned by strong cyber and engineering best practices. We are seeking experienced Site Reliability Engineers to join our cross‐functional teams supporting secure, cloud‐based and big‐data platforms. Working closely with clients and internal delivery ...

Site Reliability Engineers (SRE)

Hiring Organisation
Dns Info Ltd
Location
London, United Kingdom
Employment Type
Any
Salary
GBP Annual
Location: 5 days onsite Domain: Banking/Finance/Trading - Market Risk Skills: SRE experience with Python-based applications (not Java) Exposure to cloud technologies Trade Lifecycle/Market Risk/Risk platform experience Experience: Minimum 8+ years ...

SRE Observability Engineer

Hiring Organisation
Access Computer Consulting
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
£350 - £450/day
recruiting for an SRE Observability Engineer to work in London 2-3 days a week, remaining time remote. The role falls inside IR35 so you will be required to work through an umbrella company for the duration of the contract. This is a 6 month contract which will transfer ...

SRE-NOC Engineer (24/7): Reliability, Automation & Observability

Hiring Organisation
Jobleads-UK
Location
United Kingdom
NICE is looking for an SRE – NOC to combine traditional Network Operations with engineering-driven reliability practices. The role emphasizes 24/7 service reliability, incident response, and operational automation. You will lead incident responses, monitor service health, implement automation to reduce toil, and ensure the operational readiness ...

Production/Site Reliability Engineering Division - London / Amsterdam- Leading Global Market Maker

Hiring Organisation
eFinancialCareers
Location
Amsterdam, Noord-Holland, Netherlands
Employment Type
Permanent
Salary
EUR Annual
sustainable growth across various regions. Collaborating closely with traders, development, and operational teams, you will deploy, monitor, troubleshoot, and optimize trading infrastructure to enhance reliability, performance, and automation. Responsibilities Provide technical expertise and support to high-frequency trading floors, troubleshooting complex issues and resolving incidents in a fast-paced … trading products, systems, and exchanges, ensuring readiness for events such as corporate actions or special announcements. Architect and maintain services to maximize reliability, ensuring the trading systems operate within performance thresholds. Collaborate with development teams to establish best practices and ensure the long-term stability of production systems, even ...

Azure DevOps Engineer — SRE & AI Transformation

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
BigHand Limited is seeking a professional to work on a greenfield AI transformation programme in Greater London. The ideal candidate will have extensive hands-on experience with Microsoft Azure Cloud services, Kubernetes, and infrastructure automation ...

SRE Release Engineer: Automate & Optimize Deployments

Hiring Organisation
Jobleads-UK
Location
Bristol, England, United Kingdom
Deepstreamtech is seeking a Release Engineer to work within the SRE Release Engineering team in Bristol. The role focuses on developing engineering solutions to enable frequent deployment of high-quality code. Key responsibilities include optimizing testing, deploying and releasing processes across engineering teams, maintaining existing tooling, and ensuring best ...

Cloud Platform & SRE Principal Engineer (GCP)

Hiring Organisation
Jobleads-UK
Location
City of Westminster, England, United Kingdom
Kingfisher is looking for a Principal Engineer - Cloud to set the technical direction for their Cloud Platform and SRE capabilities in the City of Westminster. This hands-on leadership role will involve defining the vision for cloud platforms and compute services, mentoring engineers, and driving complex initiatives with significant ...