Are you passionate about infrastructure automation and engineering? Do you have experience of building reliable scalable systems, or previous knowledge of Service Reliability Engineering? Do you have experience working hands-on with automation approaches and tools in an infrastructure engineering or operations capacity? Do you want … to be deeply involved in exciting strategic automation and want to contribute to our cloud transformation for Aviva? We are looking for a passionate Cloud engineering professional to join our diverse and growing team to shape and actively contribute to the future of Cloud Service Desk and Service Reliability engineering teams in Aviva. A bit about the job: Manage BAU tasks to support Cloud Hosting Platform Services in AWS and Microsoft Azure, including incident and problem management, patching, rebuilding, recoverability, and ensuring security compliance Oversee performance of the sourcing partner, taking action to improve service delivery for More ❯
Site Reliability Engineer - 12 months Contract Pay Range This range is provided by Tenth Revolution Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Position Details Role … Site Reliability Engineer Type: 12-month Contract Pay: Up to £675 per day (Inside IR35) Location: Wokingham (2 days per week in-office, travel expenses covered) Security Clearance: BPSS check and ability to gain SC Clearance (covered by the client) Job Description Our client is seeking Site Reliability Engineers to work on exciting projects with their end customer. Skills and Requirements OpenShift Version control systems such as Git CI/CD tools such as Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity Scripting languages such as PowerShell, Bash L1 to L3 networking Logging and monitoring systems, visualization tools More ❯
Category: Other EU work permit required: Yes Job Views: 2 Posted: 23.05.2025 Expiry Date: 07.07.2025 Job Description: Job Title: Site Reliability Engineer (SRE) – High-Frequency Trading Infrastructure Location: Onsite – New York City, London, or Singapore Our Client, a leading high-frequency trading firm, is seeking a Site Reliability Engineer (SRE) to architect and build next-generation production tools and infrastructure for their ultra-low-latency trading platform. This is a high-impact, mission-critical role focused on reliability, scalability, and performance in one of the most competitive and technologically advanced industries. About the Role … This opportunity is ideal for an experienced SRE who thrives in production-critical environments. The successful candidate will join a high-caliber team of engineers and work on automating, scaling, and securing systems that drive global trading operations. Key Responsibilities Design and develop scalable production tools for deployment, monitoring, and More ❯
Location: Job Category: Other - EU work permit required: Yes Job Views: 3 Posted: 31.05.2025 Expiry Date: 15.07.2025 Job Description: Site Reliability Engineer #2494 Position Summary: Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a … Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. You will build and maintain highly available systems, support and optimize CI/CD pipelines, and identify optimal solutions for our products. Collaboration with development, DevOps, and other teams is essential … Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related field or equivalent experience. 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles. Deep understanding of AWS services and modules. Strong Linux administration and troubleshooting skills. Experience with More ❯
Job Description Play a key role in ensuring system reliability at one of the world’s most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the CORPORATE DATA & ANALYTICS SERVICE Team, you will use technology to solve business … problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you’ll also have the opportunity to collaborate with cross-functional teams to continually improve your knowledge of JPMorgan Chase’s business and relevant technologies. … diagnosing, and resolving incidents, working with others to address root causes. Recognize toil within your role and proactively work towards eliminating it through systems engineering or application code updates. Understand observability patterns and strive to implement and improve service level indicators, objectives, monitoring, and alerting solutions for optimal transparency More ❯
complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled Site Reliability Engineer with a strong focus … using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain … days every 3 years) Private health insurance Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting More ❯
The Site Reliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failure scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities Write high-quality More ❯
growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Azure Cloud team, you will solve complex and broad business problems with simple and straightforward … and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities Guide and assist others in designing appropriate solutions and gaining consensus from peers where appropriate. … software engineers and teams to design and implement deployment approaches using automated CI/CD pipelines. Design, develop, test, and implement solutions for availability, reliability, and scalability in applications. Implement infrastructure, configuration, and network as code for applications and platforms. Work with stakeholders and team members to resolve complex More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Junior Site Reliability Engineer, London Client: Trust In SODA Location: London, United Kingdom Job Category: Other - EU work permit required: Yes Job Views: 6 Posted: 16.06.2025 Expiry Date: 31.07.2025 Job Description: Role: Junior Site Reliability Engineer Sector … scale Site Reliability Engineering project involving cutting-edge serverless technology? We are partnering with an innovative InsureTech business seeking a Junior SRE to join their team and lead this exciting project. The ideal candidate will demonstrate leadership and management experience with technologies including: Programming/Scripting (.NET More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
Snr. Site Reliability Engineer (Remote position located in Leeds/Sheffield, United Kingdom) Sheffield, United Kingdom About KnowBe4 KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to … manage the ongoing problem of social engineering by helping them train employees to make smarter security decisions, every day. Fortune has ranked us as a best place to work for women, for millennials, and in technology for four years in a row! We have been certified as a "Great … every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4. KnowBe4’s Site Reliability Engineers help ensure that our platforms are reliable, secure, scalable, and efficient. They work alongside other engineers in a fast-paced, agile More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
Snr. Site Reliability Engineer (Remote position located in Leeds/Sheffield, United Kingdom) Sheffield, United Kingdom About KnowBe4 KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to … manage the ongoing problem of social engineering by helping them train employees to make smarter security decisions, every day. Fortune has ranked us as a best place to work for women, for millennials, and in technology for four years in a row! We have been certified as a "Great … every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4. KnowBe4’s Site Reliability Engineers help ensure that our platforms are reliable, secure, scalable, and efficient. They work alongside other engineers in a fast-paced, agile More ❯
Vacancy for Snr Site Reliability Engineer (SRE) at Preservica Abingdon/Remote, UK About You You have a proven track record in DevOps and software development, with a passion for creating reliable solutions to deploy software at scale and speed. You are eager to challenge the status quo … growing, so self-motivation, organization, and the ability to multitask and prioritize are crucial. The Role Serve as a primary visionary for DevOps/Site Reliability Engineering across the entire technology organization. Eliminate process bottlenecks to enable frictionless, reliable, and high-velocity feature development through automation of More ❯
Site Reliability Engineer (Python scripting Preferred) We are looking for a skilled and adaptable Site Reliability Engineer (SRE) to join our team. This role is a blend of scripting and operational responsibilities, ideal for someone who enjoys both building automation and engaging in hands-on support … to ensure system reliability and performance. London hybrid working - Contract Opportunity - London Hybrid Must-haves Python scripting - They could take someone with Go Automation experience Prometheus/Grafana/PromQL CI/CD AWS Splunk Key Responsibilities Develop and maintain automation scripts, primarily in Python (Go experience More ❯
London, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
this role, you will ensure the reliability, performance, and scalability of real-time trading systems by applying Site Reliability Engineering (SRE) principles. You will engage directly with Traders, Strats, and Development teams to optimize trading workflows, troubleshoot complex issues, and drive continuous improvement in both processes … are maintained and monitored to provide a stable environment for all users. You will help develop and mentor junior team members to foster an engineering culture that seeks to automate and reduce manual effort to minimize risk and costs. What we’ll offer you A healthy, engaged and well … optimisation and capacity resource management to ensure efficient use of resources and cost-effective solutions Delivering both business and technology related benefits by aligning reliability engineering practices with business goals and partnering with developers to design and deploy scalable fault-tolerant solutions to meet evolving business needs Your More ❯
We're on a mission to democratize audio creation by building world-class audio infrastructure for our customers. As a Site Reliability Engineer, you'll play a key role in improving our platform's developer operations, including observability, monitoring, and overall reliability. You … will be part of a cross-functional team dedicated to implementing robust DevOps practices and enhancing infrastructure and site reliability engineering (SRE). A customer-focused mindset is essential, as the team collaborates closely with stakeholders to ensure solutions meet business and user needs. In addition to More ❯
Bradford, Yorkshire, United Kingdom Hybrid / WFH Options
Freemans Grattan Holdings (fgh)
our customer journey. Working collaboratively with a team of transformation experts you will have the flexibility to leverage your professional experience to solve computer engineering issues across a variety of technical areas, dependent on where your interests lie. Innovation is key as we look for new ideas which will … in a DevOps, or Site Reliability Engineering building high-traffic, high availability systems. Experience with site reliability engineering (SRE) principles and monitoring tools, including New Relic. Experience in website performance monitoring and tuning using tools such as Lighthouse and the ability to troubleshoot performance More ❯
can make a meaningful impact. See more about our culture on . Role Summary We are seeking a Lead Site Reliability Engineer (SRE) to drive our infrastructure team in their mission to build a reliable, fault tolerant and scalable infrastructure. You will be responsible for ensuring the reliability of sites in critical distributed environments and improving how our customers interact with our core products. Reporting line: Head of Engineering Location: What you will do As a Lead Site Reliability Engineer, you balance team supervision, project management, day-to-day operations on production systems with … across the team • Contribute to open-source projects, research publications, blog articles and conferences About you • 10+ years of experience in a DevOps/SRE role. • Experience with building and leading high-performing teams. • Experience with cloud computing and highly available distributed systems • Exposure to site reliability issues More ❯
Halian Technology is looking for a talented and driven Site Reliability Engineer (SRE) to join our growing technology team. In this role, you'll ensure the reliability, scalability, and performance of our digital platforms that support memorable customer experiences across the hospitality sector. You'll work alongside our engineering, product, and infrastructure teams to build high-availability systems and automated operations that support the future of digital hospitality. Key Responsibilities: Drive system reliability, availability, and performance through engineering excellence. Design and implement monitoring, alerting, and observability tools using platforms like Datadog. Automate operational tasks using scripting … standards. Optimise system resources for both performance and cost-effectiveness. Contribute to incident response and participate in on-call rotations. Track and improve key SRE metrics such as error rates, incident count, and monitoring coverage. What You'll Bring: 3+ years of experience in Site Reliability Engineering, DevOps More ❯
Bolton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Job Title: AI Ops Consultant/Site Reliability Engineer (SRE) Client: Opus Recruitment Solutions Location: Bolton, Greater Manchester, United Kingdom Job Category: Other EU Work Permit Required: Yes Job Views: 5 Posted: 14.06.2025 Expiry Date: 29.07.2025 Job Description: Are you looking to advance your career into Site Reliability Engineering (SRE) and AI Ops? We are partnering with an innovative company that has recently been acquired by a leading European AI Ops consultancy. They are expanding their UK presence to replicate their success across Europe and are seeking multiple Grafana and PagerDuty consultants to join … you will work alongside expert SREs on modern AI Ops projects. If you have a few years of experience as a System Engineer or SRE and want to gain hands-on experience with AI Ops, this is an excellent opportunity. The role offers a salary of £55,000 and the More ❯
Job Description The SRE Manager is responsible for leading the Site Reliability Engineering function across Europe, ensuring the reliability, scalability, and performance of critical infrastructure and services. This role plays a key part in the global follow-the-sun support model, working closely with the Global … SRE Leader to support platforms worldwide. The ideal candidate will bring strong technical leadership, deep subject matter expertise, and a passion for operational excellence to a high-impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring … a regional SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership Develop deep expertise in the Titanium trading platform to lead and support critical business operations. Oversee team workload, ensuring priorities align with business goals and resource capacity. Operational Excellence Champion initiatives that enhance system availability, scalability More ❯
London, England, United Kingdom Hybrid / WFH Options
Natobotics
Join to apply for the Site Reliability Engineer (SRE) role at Natobotics. Role: SRE Lead Location: Birmingham, UK (Hybrid, 2-3 days WFO) Contract: 3 months (Possible extension) Are you a skilled Site Reliability Engineer (SRE) with experience in maintaining scalable and reliable infrastructure? We … re looking for a proactive leader with a passion for automation, incident management, and system optimization. Key Skills Required: 5+ years of SRE or similar experience Expertise in Cloud Platforms (SIEM technologies preferred) Proficiency in Python or Bash scripting Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible) Familiarity … maintenance Ensure SIEM data sources remain healthy and troubleshoot logging issues Additional Details: Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: IT Services and IT Consulting More ❯
and resolution. Escalate incidents which cannot be resolved for more in-depth investigation and engagement with Alcidion’s Site Reliability Engineering (SRE), development and product teams as required. Ensure robust processes and procedures are in place and adhered to, for the efficient and consistent management of the … Support Manager/Head of Support for NHS customers in the EPR (or other complex systems) space. Experience supporting cloud hosted solutions alongside a Site-Reliability-Engineering/Managed Services team. Support-development focussed with a good attention to detail and a strong desire to assist customers More ❯
London, England, United Kingdom Hybrid / WFH Options
OSB Group
between UK and India offices will be required. What you will be doing: As Group Head of Cloud & Platforms, you will have a solid engineering and Azure cloud architecture background to drive the cloud transformation strategy, evolving cloud governance frameworks, optimising costs and ensuring regulatory compliance through agile delivery. … UK payment journeys with appropriate redundancy Ensure all cloud deployments meet security and compliance requirements for payment card processing and core banking functions Establish engineering standards and practices that ensure security, scalability and reliability for mission-critical banking workloads across hybrid environments Drive the adoption of DevOps practices … and Site Reliability Engineering models for payment processing systems What's in it for you? Base salary up to c£140,000 Car allowance of £7,500 Enhanced family-focused benefits Annual bonus opportunity up to 40% + LTIPs 30 days annual leave + bank holidays Please More ❯
Principal Site Reliability Engineer iwoca London, United Kingdom Posted 6 days ago Hybrid Job Permanent Competitive Principal Site Reliability Engineer - Core Systems Hybrid in London or Remote within the UK The company Imagine a world where every small business has the power to thrive. … of funding one million businesses. The role will focus on complex data systems and flows that power multiple internal products and services. Our biggest reliability challenges aren't sudden spikes in users or data volume - they're the accumulating complexity, interconnectivity, and constant evolution of these … internal systems. So, you'll need to empathise with different teams' pressures, have a practical, problem-solving mindset, and do some far-sighted SRE work. You'll combine hands-on work with technical leadership to: Influence without authority - understand our systems, advocate for reliability improvements, and build relationships More ❯
growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the AIML Data Platform Team, you will solve complex and broad business problems with simple and … and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability … junior engineers. Required qualifications, capabilities, and skills Formal training or certification on Site Reliability Engineering concepts and applied experience Expertise in SRE principles, reliability, scalability, and performance of application and infrastructure. Expertise in programming with Python and Infrastructure as Code tools such as Terraform. Experience working More ❯