London, England, United Kingdom Hybrid / WFH Options
Natobotics
Join to apply for the SiteReliability Engineer (SRE) role at Natobotics . Role: SRE Lead Location: Birmingham, UK (Hybrid, 2-3 days WFO) Contract: 3 months (Possible extension) Are you a skilled SiteReliability Engineer (SRE) with experience in maintaining scalable and reliable infrastructure? We … re looking for a proactive leader with a passion for automation, incident management, and system optimization. Key Skills Required: 5+ years of SRE or similar experience Expertise in Cloud Platforms (SIEM technologies preferred) Proficiency in Python or Bash scripting Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible) Familiarity … maintenance Ensure SIEM data sources remain healthy and troubleshoot logging issues Additional Details: Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting #J-18808-Ljbffr More ❯
London, England, United Kingdom Hybrid / WFH Options
OSB Group
between UK and India offices will be required. What you will be doing: As Group Head of Cloud & Platforms , you will have a solid engineering and Azure cloud architecture background to drive the cloud transformation strategy, evolving cloud governance frameworks, optimising costs and ensuring regulatory compliance through agile deliver. … UK payment journeys with appropriate redundancy Ensure all cloud deployments meet security and compliance requirements for payment card processing and core banking functions Establish engineering standards and practices that ensure security, scalability and reliability for mission-critical banking workloads across hybrid environments Drive the adoption of DevOps practices … and SiteReliabilityEngineering models for payment processing systems What's in it for you? Base salary up to c£140,000 Car allowance of £7,500 Enhanced family-focused benefits Annual bonus opportunity up to 40% + LTIPs 30 days annual leave + bank holidays Please More ❯
growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a SiteReliability Engineer III at JPMorgan Chase within the AIML Data Platform Team, you will solve complex and broad business problems with simple and … and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability … junior engineers. Required qualifications, capabilities, and skills Formal training or certification on SiteReliabilityEngineering concepts and applied experience Expertise in SRE principles, reliability, scalability, and performance of application and infrastructure. Expertise in programming with Python and Infrastructure as Code tools such as Terraform. Experience working More ❯
Job Description Job Title: Senior SiteReliability Engineer (SRE) Location: London, UK – Onsite (5 days/week) Employment Type: Permanent Salary: Up to £80,000 per annum (Gross) About the Role: We are seeking a highly skilled and motivated SiteReliability Engineer (SRE) to join our … London-based team. This role is ideal for someone passionate about service reliability, scalability, and performance. As an SRE, you will collaborate with development and operations teams to automate infrastructure, enhance observability, and reduce manual processes (TOIL) to improve overall system health. Key Responsibilities: Design, build, and maintain scalable … Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience). 8+ years of relevant experience in SRE, DevOps, or Infrastructure Engineering roles. #J-18808-Ljbffr More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description Who we are looking for A Junior SiteReliability Engineer, who will improve system reliability, observability and performance through strong engineering and assist with incident resolution and operational excellence. Supported by our sitereliabilityengineering team, you will work to integrate … reliability and observability practices into the Software Development Life Cycle (SDLC). With support from central teams, you will help foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands and enhance overall performance. You will ensure the health, performance … and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including instrumentation with tools such as OpenTelemetry, improving logging practices, and developing features for maintainability. You will also assist in creating tools and automation for effective service management. More ❯
Stoke-on-Trent, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description Who we are looking for A Junior SiteReliability Engineer, who will improve system reliability, observability and performance through strong engineering and assist with incident resolution and operational excellence. Supported by our sitereliabilityengineering team, you will work to integrate … reliability and observability practices into the Software Development Life Cycle (SDLC). With support from central teams, you will help foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands and enhance overall performance. You will ensure the health, performance … and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including instrumentation with tools such as OpenTelemetry, improving logging practices, and developing features for maintainability. You will also assist in creating tools and automation for effective service management. More ❯
the job poster from Xcede A technology-focused, multi-strat investment firm, operating at the cutting edge of their industry, is looking for a Site … Reliability Engineer to join their highly skilled, innovative team. Essential skills: Strong proficiency in Python for infrastructure and automation Hands-on experience in SRE, DevOps or production engineering roles Deep understanding of monitoring, incident response workflows, and system architecture Productive approach to improving systems and reducing technical debt … operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting). You will also: apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs; lead incident response and implement improvements; collaborate More ❯
London, England, United Kingdom Hybrid / WFH Options
TieTalent
between UK and India offices will be required. What You Will Be Doing As Group Head of Cloud & Platforms , you will have a solid engineering and Azure cloud architecture background to drive the cloud transformation strategy, evolving cloud governance frameworks, optimising costs and ensuring regulatory compliance through agile deliver. … UK payment journeys with appropriate redundancy Ensure all cloud deployments meet security and compliance requirements for payment card processing and core banking functions Establish engineering standards and practices that ensure security, scalability and reliability for mission-critical banking workloads across hybrid environments Drive the adoption of DevOps practices … on any grounds. Nice-to-have skills Azure Cloud Security DevOps London, England Work experience Agile Coach/Scrum Master SiteReliability (SRE) Cloud Architect Languages English Seniority level Seniority level Director Employment type Employment type Full-time Job function Job function Information Technology Industries Technology, Information and More ❯
Bolton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
required: Yes col-narrow-right Job Views: 3 Posted: 06.06.2025 Expiry Date: 21.07.2025 col-wide Job Description: Who we are looking for A Junior SiteReliability Engineer, who will improve system reliability, observability and performance through strong engineering and assist with incident resolution and operational excellence. … Supported by our sitereliabilityengineering team, you will work to integrate reliability and observability practices into the Software Development Life Cycle (SDLC). With support from central teams, you will help foster a culture where these principles are integral to development. Your contributions will ensure … meet user demands and enhance overall performance. You will ensure the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including instrumentation with tools such as OpenTelemetry, improving logging practices, and developing features for maintainability. More ❯
London, England, United Kingdom Hybrid / WFH Options
Wayve Technologies Ltd
The role We're on the lookout for a SiteReliability Engineer (SRE) with a thirst for innovation and a desire to establish Operational Excellence and best practices. You'll be instrumental in fortifying the backbone of our AI-driven autonomous vehicles, ensuring they're robust, resilient, and … our efficiency, aiming to make manual interventions a thing of the past. About you In order to set you up for success as an SRE at Wayve, we’re looking for the following skills and experience. Essential Over 8 years experience in SiteReliabilityEngineering or a More ❯
what matters. We are in it for the long term, come join us on this journey. As a Senior SiteReliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring … monitoring of those systems, for building tooling and automation to reduce TOIL and for responding to incidents as part of our 24/7 SRE on-call team. ReliabilityEngineering at Board Intelligence The SRE team: Strives to provide the highest standards of Availability, Scalability, Performance and Security … and responds to incidents as part of a 24/7 rota Key responsibilities of the role We're looking for a great Senior SRE to be a hands on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve More ❯
London, England, United Kingdom Hybrid / WFH Options
Algolia
and also internal teams. We're looking for a Tier Three Support Engineer to partner with our R&D Engineering teams including the SRE (SiteReliability Engineer) team in order to go deep into the technical details of the product investigating the most difficult and complicated technical … and also internal teams. We're looking for a Tier Three Support Engineer to partner with our R&D Engineering teams including the SRE (SiteReliability Engineer) team in order to go deep into the technical details of the product investigating the most difficult and complicated technical … dashboards in Datadog Conducting professional and empathetic conversations with customers to gather information, troubleshoot, and resolve their technical obstacles. Submitting bug reports to the Engineering team for problems needing attention. Partnering with Product Teams and Engineering to develop subject matter expertise and serve as a Subject Matter Expert More ❯
Social network you want to login/join with: SiteReliability Engineer (SRE) - Crypto High-Frequency Trading, Slough Client: Selby Jennings Location: Slough, United Kingdom Job Category: Other EU work permit required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description: We are looking for a … SiteReliability Engineer (SRE) to help design and build automation, configuration, and deployment tooling for our high-frequency trading (HFT) platform. This role ensures our trading systems remain highly available, scalable, and robust, supporting a fast-paced environment. Responsibilities: Develop scalable tools for automation, deployment, and infrastructure management. … Enhance system performance, reliability, and efficiency through automation. Manage AWS infrastructure, ensuring smooth configuration and deployment. Implement observability tools for monitoring and debugging. Ensure fault tolerance, redundancy, and high availability of trading systems. Support infrastructure for C++ and Rust-based trading systems, ensuring seamless integration. Qualifications: Strong programming skills More ❯
Social network you want to login/join with: SiteReliability Engineer (SRE) - Crypto High-Frequency Trading, London Client: Selby Jennings Location: London, United Kingdom Job Category: Other EU work permit required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description: We are looking for a … SiteReliability Engineer (SRE) to help design and build automation, configuration, and deployment tooling that underpins our high-frequency trading (HFT) platform. This role is crucial for ensuring our trading systems remain highly available, scalable, and robust, supporting the fast-paced environment. What You'll Be Doing Develop … scalable production tools to automate deployment, monitoring, and infrastructure management. Improve system performance, reliability, and efficiency through automation and tooling. Manage AWS-based infrastructure, ensuring seamless configuration and deployment. Implement observability tools for monitoring, debugging, and performance insights. Ensure fault tolerance, redundancy, and high availability across trading systems. Support More ❯
disciplines. We're always on the lookout for energetic, creative people to join our team. Your New Role SiteReliabilityEngineering (SRE) team members work with our Global Content Delivery teams to deliver exabytes of content for our brands globally. The SRE has a highly skilled combination … of engineering and operations skills and is focused on automating and improving operations. Their job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end-users early, often, & fast. They are central to the reputation … deployments. Coordinate with relevant teams to build useful tools to support network operations (internal and external). Qualifications and Experience The Essentials: Passionate about SRE, DevOps, Automation, and infrastructure platforms. Understand the mechanical sympathy between software workloads and the demand it places on the underlying hardware. Working knowledge of non More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
Job Title: Senior SiteReliability Engineer (SRE) Location: Leeds (Hybrid - c. 1-2 days per week) Salary: £60,000 - £80,000 + benefits Why Apply? This is a fantastic opportunity for a seasoned Senior SiteReliability Engineer to take a lead role in shaping the infrastructure … most innovative businesses in their market. Working with cutting-edge technology, this role offers high-impact challenges, meaningful collaboration, and excellent career progression. Senior SRE Responsibilities Manage and optimise cloud infrastructure to ensure scalability, high availability, and security. Design and implement robust CI/CD pipelines for efficient product delivery. … like GitlabCI, Terraform/OpenTofu, Ansible, and scripting languages such as PowerShell or Python. Champion infrastructure best practices and mentor junior team members. Senior SRE Requirements Extensive experience in SRE or DevOps roles within high-availability, cloud-native environments. Strong expertise with AWS (including EKS, MSK, RDS, VPC design, encryption More ❯
SiteReliability Engineer with Python Our Client looking to bring on a sitereliability engineer to help deploy, manage, troubleshoot, and enhance our complex cloud-based set of internal tools and externally managed services for a variety of users across our wide-ranging organization. You will … have at least 7 to 10 years hands-on expertise working as a SiteReliability Engineer. You will work closely with IT, product, and engineering to extend and maintain this set of tools and services and to help debug and resolve problems. In addition, the ideal candidate … Actively lead any critical issue post-mortem processes, including coordination of any meetings and further steps to take Qualifications -?7+ years experience with software engineering, software development, and/or system operations -?Experience debugging complex problems and implementing timely cost-effective solutions -?Experience designing, building, and operating large-scale More ❯
SiteReliability Engineer (SRE) Remote (UK) £85,000 – £105,000 (DoE) We’re a growing FinTech scale-up and we’re on the lookout for an experienced SiteReliability Engineer to join our remote-first engineering team. Things are moving fast here, and as we … continue to grow; reliability, automation, and scalability have never been more important to us. You will be our first SRE so a strong background in implementing SRE best practices would be Ideal. You will know what good looks like and strive to continuously improve automation, availability and resilience. This … tooling using AWS, Terraform, Docker, and CI/CD pipelines. Supporting and evolving our container-based architecture (we use ECS and Fargate). Driving SRE best practices: SLIs/SLOs, error budgets, reducing toil, and improving observability. Using (and hopefully enjoying!) tools like Datadog, Prometheus, Grafana, and Nix to support More ❯
London, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
Direct message the job poster from Spectrum IT Recruitment Account Manager @ Spectrum IT | Recruitment, Customer Service SiteReliability Engineer Southampton HQ Hybrid Working Cloud, SaaS, AWS, We are working alongside one of our longstanding clients in helping them recruit a SiteReliability Engineer. The company deliver … credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in … or PowerShell Solid grasp of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure-as-code tools like CloudFormation or Terraform In-depth knowledge of CI/CD principles and hands-on experience More ❯
SiteReliability Engineer/DevOps Engineer page is loaded SiteReliability Engineer/DevOps Engineer Apply locations Farringdon time type Full time posted on Posted 9 Days Ago job requisition id R94904 SiteReliability Engineer/DevOps Engineer Are you enthusiastic about designing and … managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division ( ) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data … joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively. About Role: We are a high-performing systems engineering team operating in a fast-paced enterprise environment, focused on modernising our infrastructure while upholding strict security and compliance standards. Our engineers work with More ❯
SiteReliability Engineer/DevOps Engineer page is loaded SiteReliability Engineer/DevOps Engineer Apply locations Farringdon time type Full time posted on Posted 9 Days Ago job requisition id R94904 SiteReliability Engineer/DevOps Engineer Are you enthusiastic about designing and … managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division ( https://www.lexisnexisip.com ) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property … joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively. About Role: We are a high-performing systems engineering team operating in a fast-paced enterprise environment, focused on modernising our infrastructure while upholding strict security and compliance standards. Our engineers work with More ❯
SiteReliability Engineer Remote - Canada, Americas/Engineering We offer The Tyk API Management platform is helping to drive the connected world and power new products and services. We're changing the way that organisations connect any number of their systems and services.Whether internal, external, public or … like an environment that you believe could work for you then read on to find out more. The role: We're looking for a SiteReliability Engineer to manage, maintain, improve and provide support on our platform. You will be curious by nature, always looking for ways to … be advocate of continuous improvement Reliability of our new global Tyk Cloud platform Automation of operations and support Writing and maintaining documentation on SRE processes and policies Recommending and implementing ways of driving operational efficiency and driving down our cost to run, without impacting service Assisting in penetration testing More ❯
Applications Support & SiteReliability Team Lead (3779) Be among the first 25 applicants About GBG Enabling safe and rewarding digital lives for genuine people, everywhere. We make it our mission to ensure more genuine people have digital access to opportunities, and businesses have access to more genuine people. … on diverse and reliable data to create a single point of truth for identity and address verification. About The Team This role leads the SiteReliability/Apps support Engineering team within Documents & Biometrics, responsible for ensuring GBG delivers a world-class experience for all our customers … and team members globally. The team is a 2nd line technical function, providing a gateway service between 1st line Customer Support and 3rd line Engineering for supported products and services. They are customer-oriented, knowledgeable, process-driven, and technically skilled, proactively supporting, maintaining, and monitoring live services. The role More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description SiteReliability Engineer (SRE) - Kubernetes, Observability, Prometheus, Dynatrace, OpenTelemetry Role Overview This is a fantastic opportunity with a consulting company seeking to fill multiple SRE roles. You will play a key role in managing client platforms with a strong emphasis on observability and Kubernetes expertise. Joining … visits will be required for meetings, which will be fully funded. Requirements Minimum of 2 years' commercial experience in a Platform/DevOps/SRE role At least 6 months' experience specifically as a SiteReliability Engineer (SRE) Solid experience with Observability tools such as Prometheus and Grafana … Strong exposure to Kubernetes Must have resided in the UK for over 5 years to obtain Security Clearance Salary & Benefits The salary for the SRE roles is negotiable based on experience, with an expected starting point of £55,000 basic, along with an excellent benefits package and comprehensive training opportunities. More ❯
Join us as a SiteReliability Engineer In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services You’ll enjoy significant stakeholder interaction, working … chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development What you'll do As our SiteReliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve system … and environment reliability. You’ll define SLOs, SLIs and error budgets that support finding the right balance between risk reliability and continuous improvement. You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll scale systems sustainably through mechanisms like automation More ❯