Site Reliability Engineering Jobs in London

101 to 125 of 169 Site Reliability Engineering Jobs in London

Front End Software Engineer, Console

London, United Kingdom
Evroc
shaping the user experience of our innovative cloud technology. Your contributions will not only focus on coding but will also influence the architecture of our applications and the overall engineering culture. This is an opportunity to work at the forefront of cloud technology and define how our users interact with our platform. What You'll Tackle Develop User-Centric … high performance and responsiveness of applications through efficient code practices and performance tuning. Create a scalable framework for building our UI which allows us to spend most of our engineering time on a small number of highly differentiated areas. Contribute to Code Quality: Participate in code reviews and contribute to the development of best practices to enhance code quality … Design: Experience with responsive web design principles and frameworks (e.g., Tailwind CSS) to create visually appealing interfaces. Developer-Centered Approach: Experience in building UIs for a developer, DevOps or SRE audience. Testing Frameworks: Proficiency in using testing frameworks such as Playwright and Jest to ensure high code quality and user experience. Version Control: Proficient in using Git for version control More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Head of Production Management Resiliency - Director | London, UK (London)

London, UK
Hybrid / WFH Options
Citi
and minimizing client impact duration. Responsibilities: Implement Enhanced Testing and Recovery: Oversee the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days. Implement and oversee Data Recovery testing, ensuring applications can recover critical data from backup solutions within the defined Impact Tolerance (ITOL). Drive … of established resiliency patterns. Leverage cloud-native services and features to enhance application resiliency. This includes services for auto-scaling, load balancing, and disaster recovery. Explore and implement chaos engineering practices to proactively identify and address system weaknesses under stress. Proactive Vulnerability Management: Proactively identify vulnerabilities through regular architecture reviews, comprehensive scenario testing, and foundational testing. Document and demonstrate … requirements. Provide regular updates to senior management on the status of resilience initiatives and key performance indicators. Key Qualifications: Relevant professional software engineering experience, in particular in SRE roles. Expertise analyzing complex application, database, network, and OS issues across distributed, large-scale, customer-facing systems. Strong communication skills and ability to work effectively across multiple business and …
Employment Type: Full-time
Posted:

Senior Cloud Security Engineer

London, United Kingdom
Hybrid / WFH Options
Miro Group
and the collaborative experiences that enable our global customers to innovate with confidence. Operating as part of the broader Infrastructure organization, the Cloud Security team partners closely with key engineering groups including Networking, Compute, and Observability to embed security deeply across Miro's cloud environment. The team also maintains strong alignment with our peers in the Security organization-such … audits of the cloud, infrastructure and data security Perform security reviews (architecture review, design review, code review) of the cloud, containers and infrastructure changes Work closely with DevOps/SRE teams to provide security expertise Build automation to perform security and compliance at scale in the cloud Oversee the monitoring of Cloud security posture, and ensuring vulnerabilities are managed properly More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - Met Office

London, United Kingdom
Microsoft Corporation
most exciting products in Microsoft Azure, passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? Azure Customer Experience (CXP) team is searching for a customer-obsessed Site Reliability Engineer to work on an HPC environment, who can drive reliability engineering excellence and embody our culture of inclusiveness, growth-mindset, and unwavering dedication to … observability, monitoring, alerting, and toil reduction form the foundations of our code and we work with teams across Microsoft and external customers to ensure success. We work on exciting engineering challenges in a fun and supportive environment, with access to cutting-edge technology surrounded by world-class engineers. Qualifications In-depth technical experience in software engineering, network engineering … pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. UK Baseline Personnel Security Standards; UK Security Clearance Responsibilities Collaborating closely with the existing SRE teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLOs and averting incidents altogether when possible. Collaborating with the customers to understand their …
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - Quant Hedge Fund - £275k

City of London, London, United Kingdom
Paragon Alpha - Hedge Fund Talent Business
who have attracted talent from rival hedge funds and big tech firms alike, due to their sophisticated tech infrastructure and great work life balance. They're looking for a (SRE) Site Reliability Engineer to come and join the Infra team and act as a SME within Cloud, Automation and DevOps. The role would entail helping to streamline and … of trading/research applications into production. Stack: Python, AWS, Kubernetes, Linux The company is open to people outside of finance, the emphasis is for an expert and passionate SRE, who can bring fresh perspectives on automation and scalability to the firm. If you're keen to find out more, please do apply More ❯
Posted:

Site Reliability Engineer - Quant Hedge Fund - £275k

London Area, United Kingdom
Paragon Alpha - Hedge Fund Talent Business
who have attracted talent from rival hedge funds and big tech firms alike, due to their sophisticated tech infrastructure and great work life balance. They're looking for a (SRE) Site Reliability Engineer to come and join the Infra team and act as a SME within Cloud, Automation and DevOps. The role would entail helping to streamline and … of trading/research applications into production. Stack: Python, AWS, Kubernetes, Linux The company is open to people outside of finance, the emphasis is for an expert and passionate SRE, who can bring fresh perspectives on automation and scalability to the firm. If you're keen to find out more, please do apply More ❯
Posted:

Senior Software Engineer/SRE - Observability

London, United Kingdom
Avature
Senior Software Engineer/SRE - Observability Location London Business Area Engineering and CTO Ref # Description & Requirements The Role As a Senior Software Engineer/SRE in Bloomberg's Market Data group, you'll help ensure our real-time data infrastructure is reliable, automated, and observable at massive scale. Our systems process hundreds of billions of financial ticks per … the Market Data Ontology and Knowledge Graph, enabling rich context, traceability, and insight across the platform. Our systems power operational awareness, automation, and strategic investment decisions. We partner across engineering, platform, and infrastructure to define standards, surface gaps, and drive reliability across one of the world's most critical real-time data platforms. What You'll Do Expand … , Mathematics, or equivalent practical experience Commercial experience and proficiency in Python or C++ Experience designing, scaling, and operating distributed systems. Strong collaboration and communication skills-especially across infrastructure, SRE, and product teams. We'd Love to See Experience with semantic technologies: ontologies, RDF, or graph databases (e.g., Neo4j, RDF4J). Familiarity with ETL or EIS platforms like Apache Camel More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer/SRE - Observability London, GBR Posted today

London, United Kingdom
Bloomberg L.P
Senior Software Engineer/SRE - Observability Location London Business Area Engineering and CTO Ref # Description & Requirements The Role As a Senior Software Engineer/SRE in Bloomberg's Market Data group, you'll help ensure our real-time data infrastructure is reliable, automated, and observable at massive scale. Our systems process hundreds of billions of financial ticks per … the Market Data Ontology and Knowledge Graph, enabling rich context, traceability, and insight across the platform. Our systems power operational awareness, automation, and strategic investment decisions. We partner across engineering, platform, and infrastructure to define standards, surface gaps, and drive reliability across one of the world's most critical real-time data platforms. What You'll Do Expand … , Mathematics, or equivalent practical experience Commercial experience and proficiency in Python or C++ Experience designing, scaling, and operating distributed systems. Strong collaboration and communication skills-especially across infrastructure, SRE, and product teams. We'd Love to See Experience with semantic technologies: ontologies, RDF, or graph databases (e.g., Neo4j, RDF4J). Familiarity with ETL or EIS platforms like Apache Camel More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Excellence Lead

London, United Kingdom
Hybrid / WFH Options
Trili
boundaries, passionate about crypto and working in a high intensity environment then you're exactly the type of person who will succeed here. The Role We are seeking an Engineering Excellence Lead to own strategic initiatives that enhance the performance, productivity, and quality of our engineering organisation. This role reports directly to the Head of Engineering and … partners closely with Engineering Managers and Product managers to ensure our technology teams operate at the highest levels of effectiveness and agility. You will drive continuous improvement across engineering processes, tools, metrics, and delivery frameworks. With a team of 100+ talented engineers across multiple domains and geographies (mostly France and UK), you will be instrumental in scaling best … training, and coaching for engineering staff. Technology Enablement: Evaluate and deploy tools - especially AI - that support engineering productivity, observability, and collaboration. Work closely with DevOps, QA, and SRE teams to align infrastructure and operational excellence with engineering needs. Own key vendor relationships, evaluation of partnerships and represent technology on committees What you'll need: Proven success in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Operations Manager

London, United Kingdom
Hybrid / WFH Options
Trili
boundaries, passionate about crypto and working in a high intensity environment then you're exactly the type of person who will succeed here. The Role We are seeking an Engineering Operations Manager to own strategic initiatives that enhance the performance, productivity, and quality of our engineering organisation. This role reports directly to the Head of Engineering and … partners closely with Engineering Managers and Product managers to ensure our technology teams operate at the highest levels of effectiveness and agility. You will drive continuous improvement across engineering processes, tools, metrics, and delivery frameworks. With a team of 100+ talented engineers across multiple domains and geographies (mostly France and UK), you will be instrumental in scaling best … training, and coaching for engineering staff. Technology Enablement: Evaluate and deploy tools - especially AI - that support engineering productivity, observability, and collaboration. Work closely with DevOps, QA, and SRE teams to align infrastructure and operational excellence with engineering needs. Own key vendor relationships, evaluation of partnerships and represent technology on committees What you'll need: Proven success in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

VP, Engineering (Campaign Management)

London, United Kingdom
Group M Worldwide Inc
Title: VP Engineering, Activation WHO WE ARE Choreograph is WPP's global data products and technology company. We're on a mission to transform marketing by building the fastest, most connected data platform that bridges marketing strategy to scaled activation. We work with agencies and clients to transform the value of data by bringing together technology, data and analytics … of thinkers, builders, creators and problem solvers are over 1,000 strong, across 20 markets around the world. WHO WE ARE LOOKING FOR This position is a hands-on engineering leadership role. Internally, the role is accountable for the successful technical delivery of many products within the Activation product area, entailing hands-on architecture and guidance for the implementation … techniques. You will influence the management of technical debt across projects; making the right calls between balancing pragmatic delivery and compromising implementation patterns. You will help guard our central SRE practice and ensure adoption within the Activation product area. You will drive innovation; coaching engineers to experiment and choose appropriate technologies to fit into our ecosystem - this requires you to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Restaurant Technology Problem Manager

London, United Kingdom
Hybrid / WFH Options
McDonald's Corporation
What will my accountabilities be? As a Restaurant Technology Problem Manager, you will be responsible for owning ITIL Problem Management and Service Data Analytics whilst managing a team of Site Reliability Engineers and Level 3 technical subject matter experts. Your accountabilities could be refined as follows: Own ITIL Problem & Change Management Take ownership of ITIL Problem Management activities … whilst also providing early insight to impacting problems and remediation activities. Communication and Reporting Produce reports and presentations to improve communication back into the business, providing insights into system reliability, performance and improvement initiatives. Work closely with functions within the Technology team and our respective suppliers to communicate top impacting problems and work collaboratively to identify avenues for remediation … and addressing root causes of system issues. Experience managing and contributing to mid-large projects related to system reliability improvements. Knowledge of Site Reliability Engineering (SRE) Practices: including error budgeting, service level objectives (SLOs), and service level indicators (SLIs). Demonstrated ability to collaborate with cross-functional teams, including developers and system administrators. Hands-on experience More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Global Banking & Markets - Production Support Engineer - Associate - London

London, United Kingdom
WeAreTechWomen
Goldman Sachs, our Engineers don't just make things - we make things possible. Change the world by connecting people and capital with ideas. Solve the most challenging and pressing engineering problems for our clients. Join our engineering teams that build massively scalable software and systems, architect low latency infrastructure solutions, proactively guard against cyber threats, and leverage machine … learning alongside financial engineering to continuously turn data into action. Create new businesses, transform finance, and explore a world of opportunity at the speed of markets. Engineering is at the critical center of our business, and our dynamic environment requires innovative strategic thinking and immediate, real solutions. Goldman Sachs Engineers are innovators and problem-solvers, building solutions in … who evolve, adapt to change, and thrive in a fast-paced global environment. Want to push the limit of digital possibilities? Start here. Fixed Income, Currency, and Commodities (FICC) Engineering plays a key role in the firm's ability to provide liquidity and execution services for institutional clients around the world, two important revenue drivers for the firm. In More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer

London, United Kingdom
Techwaka
released our first product offering: Mindgard AI Security Labs, and continue to build a team of engineers to join us on our journey. The role: You will join an engineering team that is building and maintaining our flagship AI Red Teaming Platform. You'll help customers adopt AI while staying secure from the new security threats that affect AI … Kubernetes. Confident working with data pipelines and a variety of databases. You'll stand out if you: Have experience building cyber security products. Have experience working in startups. Have experience as a site reliability engineer or similar role. Have experience working with extreme programming practices such as pair programming, TDD, trunk-based development, and close customer collaboration. Are able to work onsite from …
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Devops Engineer

City of London, London, England, United Kingdom
Lynx Recruitment Ltd
DevOps Engineer wanted for a leading financial institute based out of Central London, Greater London and other UK locations. I need candidates with a solid Infrastructure Engineering or SRE background, Network Configs, Security Layers, Troubleshooting, Azure technologies, AKS Container/Docker builds, CI/CD pipelines, Scripting, IaC or Bicep deployments, SQL Databases and On-prem to cloud moves.
Employment Type: Full-Time
Salary: £55,000 - £95,000 per annum, Inc benefits
Posted:

Associate Director - Application Development (London)

London, UK
WeAreTechWomen
of geographically diversified software engineers responsible to develop scalable solutions by working directly with product development team. Our team culture is oriented towards equality in the realm of software engineering irrespective of hierarchy promoting innovation. One should feel empowered to iterate over ideas and experimentation without being afraid of failure. Responsibilities and Impact: You will enable S&P ratings … business to provide best in class end-to-end analytical, commercial and customer experience by building feature rich solutions including big data engineering, analytics, business intelligence and workflow capabilities. Complete SDLC: architecture, design, development and support of tech solutions Active participation in all scrum ceremonies, follow AGILE best practices effectively. Play a key role in the development team to … engineering, SQL/NOSQL databases, Big Data, and Analytics Lead SAFe by example Gathers and refines requirements by engaging with stakeholders, product manager, and QA manager DevOps/SRE improvements and ensures continuous progress is made Enables developers by developing required architecture, design, and reference implementation About S&P Global Ratings At S&P Global Ratings, our analyst-driven More ❯
Employment Type: Full-time
Posted:

Associate Director - Application Development

London, United Kingdom
WeAreTechWomen
of geographically diversified software engineers responsible to develop scalable solutions by working directly with product development team. Our team culture is oriented towards equality in the realm of software engineering irrespective of hierarchy promoting innovation. One should feel empowered to iterate over ideas and experimentation without being afraid of failure. Responsibilities and Impact: You will enable S&P ratings … business to provide best in class end-to-end analytical, commercial and customer experience by building feature rich solutions including big data engineering, analytics, business intelligence and workflow capabilities. Complete SDLC: architecture, design, development and support of tech solutions Active participation in all scrum ceremonies, follow AGILE best practices effectively. Play a key role in the development team to … engineering, SQL/NOSQL databases, Big Data, and Analytics Lead SAFe by example Gathers and refines requirements by engaging with stakeholders, product manager, and QA manager DevOps/SRE improvements and ensures continuous progress is made Enables developers by developing required architecture, design, and reference implementation About S&P Global Ratings At S&P Global Ratings, our analyst-driven More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Cloud Security Engineer Software Development Oslo, HQ

London, United Kingdom
reMarkable
believe that you Have experience with scripting and programming in one or more languages. You have worked as a software developer in a software team, or maybe as a site reliability engineer, and somewhere down the line sparked an interest and passion for security that you would like to pursue and make the center of your career. In More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
LinuxRecruit
Has anyone actually ever given you a good description of what SRE is? Recently I've met dozens of companies implementing an SRE function. Half are just rebranding an ops team (because Ops ain't cool), some don't want to call the additional silo they have created 'DevOps' (because apparently that's the wrong thing to do) so they … re calling it SRE and the rest actually don't really know how to describe what they're doing. And if you can't describe it simply, you don't know what it is, chief (because 'Google do it' isn't the right answer). That was until today, when I met a company who actually whiteboarded their vision … process rather than the build. We discussed Kubernetes, Prometheus and API Gateways. Most importantly, they spoke like they knew what the hell they were on about. Not just about SRE, but on the whole Engineering process. This is a company at the top of their game, who are about to introduce a brand new monetisation model to a …
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Different Technologies Pty Ltd
application performance - identifying, and implementing, improvements to application performance and stability. Collaborate with the design and implementation of the desired pipelines and process for deployment to production environment. The SRE will work closely with Platform and Software domains to ensure continuous improvement of performance and stability whilst adhering to standards. Undertake ad-hoc projects and other activities as required. Key … Accountabilities and Activities Contribute to the SRE function including: Drive evolution of the DevOps/GitOps toolchain, promoting improvements to streamline the software delivery process and showing improvements through metrics. Accountable for halting or stopping a project/product if the solution is not technically acceptable. Responsible for producing and maintaining documentation relating to application design, integration processes, testing procedures … to create operational run and playbooks. Integration with Domains including: Collaborating with Domains to plan, design, test and maintain the application. Design patterns for any component or structure under SRE responsibility. Implementation of components such as Monitoring and Logging. Manage the runbook preparations of Domains. Liaise and support other teams on work items including: Developing, refining, and tuning integrations between More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer, Compute Germany, Netherlands, United Kingdom

London, United Kingdom
Hybrid / WFH Options
vercel.com
looking for experienced SREs to help grow our small team into a global footprint that can provide expert engagement across our core serving systems. As an early member of the SRE team you will report directly to the Director of Managed Infrastructure and play a foundational role in expanding our SRE practice, integrating reliability principles more deeply into Vercel's … Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management. About You: At least 3 years' experience in an SRE role, or at least 5 years' experience in an adjacent role (e.g. platform engineering), operating in a scaled environment. Firm grasp of the SRE philosophy and mindset, with practical … experience working on or directly with SRE teams that have proactively engaged in system design and improvement. Strong sense of accountability and commitment to problem solving, backed by a curiosity to dig deep and identify root causes. Willingness to proactively engage with development teams to influence the course of software design and operational practices. Capability to manage risk, make decisions …
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
London Stock Exchange Group
Site Reliability Engineer. Locations: IND-BLR-Divyasree Technopolis. Time type: Full time. About LSEG: The London Stock Exchange Group (LSEG) is a global financial markets infrastructure and data provider headquartered in London, UK. Established in 2007, though its core institution-the … on SQL Server and SSIS today, we're actively exploring cloud-native platforms-your voice will help guide that transition. Collaborative Environment: Work multi-functionally with guides in data engineering, DevOps, and analytics in a culture that values curiosity, accountability, and continuous improvement. Tech that Matters: You'll support systems that drive real-time business decisions, impact thousands of …
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Teya Services Ltd
mission, and comprehensive benefits. Your Mission Provide self-service cloud-native products for delivery teams while matching business requirements such as security, compliance, cost and reliability. As a Senior SRE, you will: Take part in the design, development, deployment and management of infrastructure products Evangelize the best practices around observability, reliability, security and performance Help the company grow faster More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data & Reporting SRE

London, United Kingdom
Hybrid / WFH Options
ZILO
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … role, drives our progress and creates real impact. If you're ready to shape the future, let's talk. We are seeking an experienced Site Reliability Engineer (SRE) with deep subject-matter expertise in data processing and reporting. In this role, you will own the reliability, performance, and operational excellence of our real-time and batch data … and trace data to pinpoint failure points across AWS, Flink, Kafka, and Python layers. Lead post-incident reviews: identify root causes, document findings, and drive corrective actions to closure. Reliability & Monitoring Design, implement, and maintain robust observability for data pipelines: dashboards, alerts, distributed tracing. Define SLOs/SLIs for data freshness, throughput, and error rates; continuously monitor and optimize. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Head of Product Engineering (London)

London, UK
Hybrid / WFH Options
Kroo Bank Ltd
This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board. As the Head of Product Engineering, you will be responsible for designing, implementing, and maintaining our innovative banking products and services. You will lead independent, product-aligned squads of full-stack engineers who build on the shared platform provided … Technology & Product Vision: Collaborate closely with the CTO and key stakeholders to co-create and align on a unified technical vision for a shared platform that empowers diverse product engineering teams. Be a passionate champion of our technology and product vision, inspiring your teams to undertake and deliver on ambitious projects. Operational Leadership & Execution: Oversee the day-to-day … operations of product engineering, ensuring seamless delivery of our current account, savings products, personal loans, and an expanding suite of banking offerings. Establish and maintain a stable organisational and technology baseline that enables engineers to build, test, deploy products efficiently while upholding high quality standards. Champion, protect and evolve shared engineering culture. Maintain and evolve engineering principles More ❯
Employment Type: Full-time
Posted:
Site Reliability Engineering
London
10th Percentile: £65,000
25th Percentile: £81,250
Median: £105,000
75th Percentile: £118,125
90th Percentile: £138,750