Site Reliability Engineer, City of London. Location: City of London, United Kingdom. Job Category: Information Technology. EU work permit required: Yes. Job Reference: BBBH64028_1750084692. Job Views: 6. Posted: 16.06.2025. Expiry Date: 31.07.2025. Job Description: Whitehall Resources require a Site Reliability Engineer to work with a key client on a 6-month initial contract. *This role will involve on-site work in London 3 days per week. *Inside IR35. *This role will require some on-call work. The Role: As a Site Reliability/DevOps Engineer, you will play a critical role in managing cloud infrastructure, ensuring the reliability of production systems, and improving end-to-end deployment pipelines. This role combines deep operational responsibilities with a strong focus on automation, observability, and continuous improvement. You will be responsible for maintaining high system availability, enabling rapid delivery through CI/ …
Site Reliability Engineer/DevOps Engineer at hackajob. Posted 19 hours ago. hackajob is collaborating with … LexisNexis UK to connect them with exceptional tech professionals for this role. Site Reliability Engineer/DevOps Engineer: Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division provides international patent content and a suite … RDS, Azure VMs, Azure Functions). Maintaining and improving system documentation and operational procedures. Mentor team members and contribute to a culture of learning and inclusion. Continuously improving infrastructure reliability and reducing manual work (TOIL). Participating in incident response and root cause analysis. Why Join Us? Join our team and contribute to a culture of innovation, collaboration, and …
Site Reliability Engineer/DevOps Engineer. Location: Farringdon. Time type: Full time. Posted 9 days ago. Job requisition ID: R94904. Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division (https://www.lexisnexisip.com) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights … RDS, Azure VMs, Azure Functions). Maintaining and improving system documentation and operational procedures. Mentor team members and contribute to a culture of learning and inclusion. Continuously improving infrastructure reliability and reducing manual work (TOIL). Participating in incident response and root cause analysis. Why Join Us? Join our team and contribute to a culture of innovation, collaboration, and …
Senior Site Reliability Engineer (SRE), Wokingham. Client: Leap29. Location: Wokingham, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Reference: 08fd37c63578. Job Views: 4. Posted: 29.06.2025. Expiry Date: 13.08.2025. Job Description: Senior Site Reliability Engineer (SRE). Location: Wokingham (2 days a week onsite). Type: Inside IR35. Rate: £80.00 an hour DOE. We’re seeking a Senior Site Reliability Engineer to play a key role in the stability, scalability, and performance of critical platforms and applications. This is a leadership-level position suited to individuals who can move seamlessly between code, infrastructure … tools, and teams to ensure platform reliability and enable continuous improvement in how software is built, released, and operated. What You’ll Be Responsible For: As a Senior SRE, you’ll lead initiatives that: Ensure availability, latency, and performance of mission-critical systems across cloud and hybrid environments. Architect observability solutions (monitoring, logging, alerting) that detect and prevent failures …
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Ltd
problems. As of today, the Camlin operation spans over 20 countries across the globe. Job Overview: We are seeking a dedicated and experienced Site Reliability Engineer (SRE) to join our dynamic team. The SRE will be responsible for ensuring the reliability, performance, and availability of our critical systems and services. This role requires a blend of … software engineering and operations skills to build and run large-scale, distributed, fault-tolerant systems. Key Responsibilities: System Reliability and Performance: Design, implement, and maintain scalable and reliable infrastructure. Monitor system performance, detect issues, and ensure maximum uptime. Develop and implement strategies for disaster recovery and data backup. Automation and Tooling: Automate repetitive tasks to improve efficiency and reduce … Conduct post-incident reviews to identify root causes and prevent recurrence. Develop and maintain incident response protocols and playbooks. Collaboration and Communication: Work closely with development teams to integrate reliability into the software development lifecycle. Communicate effectively with stakeholders about system status and health. Provide guidance and mentorship to junior team members. Security and Compliance: Ensure systems comply with …
Service Reliability Engineer (SRE)/DevOps Engineer. A great opportunity for an SRE Engineer to be working with one of Europe's largest tech companies. The Service Reliability Engineer combines excellent software engineering, IT operations skills and database expertise. This role will help software development teams reliably and consistently deliver applications by …
London, England, United Kingdom Hybrid / WFH Options
Gorgias
Senior Site Reliability Engineer at Gorgias. Gorgias is the conversational AI platform for ecommerce that drives sales and resolves support inquiries. Trusted by over … product recommendations. Gorgias, where every customer interaction feels personal, support becomes sales, and conversations shape success. Relocate to either: Paris, Lisbon or Belgrade. Relocation and Visa provided. About The SRE Team: We are seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join our team. As an SRE at Gorgias, you will play a … crucial role in ensuring the reliability, scalability, and performance of our systems, enabling the seamless delivery of our products and services. The SRE team at Gorgias maintains the core infrastructure and services that make up the heart of our product. We have the privilege to work with high throughput systems and TB-scale data stores serving billions of queries …
We are seeking a foundational member for the Cloud Infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities … ensure cost efficiency. Ensure the security and compliance of our systems, adhering to industry standards and regulations. Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement. Stay current with emerging technologies and industry trends to improve our site reliability practices. Is this you? Proven expertise in Site Reliability … Kubernetes) and orchestration tools. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) for maintaining system health and performance. Ability to lead and mentor junior engineers in reliability and system optimization best practices. Excellent communication skills for effective collaboration with cross-functional teams and stakeholders. Proactive in identifying and mitigating potential system failures and performance issues. Preferred …
London, England, United Kingdom Hybrid / WFH Options
Durlston Partners
Base pay range: $250,000.00/yr - $300,000.00/yr. Senior Site Reliability Engineer | Remote (EU/UK) | High-Performance Trading. A leading trading firm operating at scale in the digital asset space is hiring a Senior Site Reliability Engineer to help scale, secure, and optimise its global trading infrastructure. This is a remote-first role open to engineers across the UK and EU. This isn’t just another DevOps job; they’re looking for a tinkerer. Someone who enjoys getting deep into the internals of systems, thrives on debugging tough problems, and constantly looks … Industries: Capital Markets, IT Services and IT Consulting, and Financial Services.
the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud … associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities: Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate. Collaborates … teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines. Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications. Implements infrastructure, configuration, and network as code for the applications and platforms in your remit. Collaborates with technical experts, key stakeholders, and team …
this company would be a great experience as their employees work in a supportive and autonomous environment. If you are looking for a challenging, yet rewarding role as a Site Reliability Engineer, this is the opportunity for you. What You'll Be Doing: Designing, creating, and delivering technical infrastructure code or services to improve the performance of …
Senior Cloud/SRE Engineer at LexisNexis. Posted 6 days ago. Site Reliability Engineering/DevOps Engineer: Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in … area or product line. It contributes directly to project plans, schedules, and methodologies for implementing cross-functional software assets and infrastructure. Responsibilities include cloud platform design across multiple systems, SRE activities, mentoring less-experienced team members, and collaborating with users, customers, and stakeholders to translate their requirements into effective solutions. Additionally, it focuses on fostering a culture of innovation and … and orchestration tools (e.g., Docker, Kubernetes/EKS). Proficiency in scripting languages (e.g., Python, Bash, TypeScript, PowerShell). Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., DataDog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent problem-solving and stakeholder management …
we can make a meaningful impact. See more about our culture on https://mistral.ai/careers. About The Job: Mistral AI is seeking an Applied AI Engineer focused on DevOps to facilitate the adoption of its products among customers and collaborate with them to address complex technical challenges. Applied AI Engineers, ML Infra at Mistral AI … in English • You hold a Bachelor's or Master's degree in Computer Science, Engineering, or a related field • You have 2+ years of experience in a DevOps or Site Reliability Engineering role • You're experienced with deploying and managing AI-based products in production environments • You are fluent in Python • You have experience with containerization technologies such … You hold strong communication skills with an ability to explain complex technical concepts in simple terms to technical and non-technical audiences. Ideally you have: • Experience as a Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect, or Technical Product Manager • Familiarity with AI frameworks such as PyTorch or TensorFlow • Contributions to open-source projects, particularly in …
Site Reliability Engineer/DevOps Engineer. Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division (https://www.lexisnexisip.com) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the … RDS, Azure VMs, Azure Functions). Maintaining and improving system documentation and operational procedures. Mentor team members and contribute to a culture of learning and inclusion. Continuously improving infrastructure reliability and reducing manual work (TOIL). Participating in incident response and root cause analysis. Why Join Us? Join our team and contribute to a culture of innovation, collaboration, and …
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
Site Reliability Engineer. Location: London … Hybrid (3 days WFH). Salary Range: Up to £140,000. Annapurna is working on behalf of a pioneering technology company to recruit a Site Reliability Engineer (SRE). This is a unique opportunity to play a vital role in developing cutting-edge AI systems that power autonomous vehicle technology. What to Expect: The SRE will be instrumental … in ensuring the stability, resilience, and efficiency of complex autonomous systems. This is a role for someone who thrives on innovation, loves solving infrastructure and reliability challenges, and wants to play a significant role in shaping the future of AI-driven mobility. Key responsibilities include: Ensuring smooth and continuous operation of autonomous vehicle systems in real-world environments. Developing …
Manchester, England, United Kingdom Hybrid / WFH Options
Sectigo
Site Reliability Engineer, Sectigo, Manchester, England, United Kingdom. Job Description: We are looking for a Site Reliability Engineer to join our growing global team at Sectigo. The Site Reliability Engineer will design and implement solutions to reduce toil and ensure reliability of our critical services at Sectigo. This is a full-time and remote position, with the ideal candidate located within 1-hour of vehicle commute … distance from Manchester, U.K. Here are the core functions, responsibilities, and expectations for this role: Ensure the reliability of our critical products and services by meeting or exceeding SRE objectives. Instantiate and maintain production infrastructure using Infrastructure as Code and Configuration Management tools. Build and maintain proper monitoring of our services by utilizing centralized logging and time series databases. …
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
to gemstone supplies. They have a presence in London, Hong Kong, Amsterdam, and as well in Mumbai and now in New York in 2001. About the role: As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution along with team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first-class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team you would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with …
Dundee, Angus, United Kingdom Hybrid / WFH Options
Ivanti
of Ivanti's SaaS product offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response procedures, application and infrastructure monitoring … and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This role requires that applicants reside in Scotland and be authorized to work in the United … a pivotal role in shaping the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing: Researching, maintaining, and contributing to automation of SRE tools and processes. Contributing to solutions toward reducing toil within SRE. Participating in code review and analysis with SRE peers. Composing and reviewing contributions to SRE documentation such as system …
Dundee, Scotland, United Kingdom Hybrid / WFH Options
Ivanti
of Ivanti’s SaaS product offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response procedures, application and infrastructure monitoring … and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This role requires that applicants reside in Scotland and be authorized to work in the United … a pivotal role in shaping the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing: Researching, maintaining, and contributing to automation of SRE tools and processes. Contributing to solutions toward reducing toil within SRE. Participating in code review and analysis with SRE peers. Composing and reviewing contributions to SRE documentation such as system …
London, England, United Kingdom Hybrid / WFH Options
Blockchain Ventures
we share the passion to code, create, and ultimately build an open, accessible and fair financial future, one piece of software at a time. We are looking for a Site Reliability Engineer to join our Core team to encourage infrastructure best practices across our organization that would allow us to securely scale a distributed financial platform that touches … of people a day. Our distributed financial platform tackles some of the most interesting problems in crypto for millions of our customers and continues to grow rapidly. The SRE team at Blockchain combines software and systems engineering to provide a platform that abstracts complexity for increased security, reliability, and rapid product delivery. The SRE organization at Blockchain is … a rapid, secure, and scalable manner. WHAT YOU WILL DO: You will play a critical role in evolving our infrastructure as we develop solutions to complex technical problems involving reliability, latency, bandwidth, and security. You will be an integral part of improving observability, monitoring, and alerting throughout the platform. You will help coordinate work across different areas of the …