Operations Site Reliability Engineer. Locations: United Kingdom-Bristol-Almondsbury-Hempton Court. Time type: Full time. Posted 30+ days ago. Job requisition ID: R022662. Please note: 1. If you are a first-time user, please create your candidate login account before you apply for a job. (Click Sign … Provide feedback and coaching to upstream teams (both internal and vendors) to reduce escalations and to continually improve overall experience for customers. Professional Experience Required: A degree in Systems Engineering, Computer Science or related fields with related experience preferred. 5+ years of experience administering Linux systems. Strong hands-on experience of variants of Linux distros. 2+ years Operational experience … salary, generous bonus scheme, equity package, competitive company pension, employee stock purchase plan (ESPP), Private Medical Insurance (individual or family), Life Assurance scheme (up to 4x salary), ample on-site parking. This role will need to participate in weekend and holiday on-call support as and when required. Broadcom is proud to be an equal opportunity employer. We will …
are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability … and SOPs. Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure. Other duties as needed. About You: 7+ years' experience in Site Reliability Engineer roles. 3+ years' experience with an object-oriented language (preferably Java, .NET or C++). Expert+ level Linux administration, scripting, and troubleshooting. Demonstrable knowledge of Observability tools …
Join us as a Senior Site Reliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them. To be successful as … a Senior Site Reliability Engineer - Oracle you should have experience with: Oracle Enterprise Manager (OEM), Oracle Internet Directory (OID), Oracle database performance tuning - SME. Deep understanding of LDAP protocols and directory services. SQL optimization. Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise … strategic thinking and digital and technology, as well as job-specific technical skills. This role will be based in our Knutsford campus. Purpose of the role: To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities: Availability, performance, and scalability …
City Of Westminster, London, United Kingdom Hybrid / WFH Options
Track24 Limited
or New Relic to gain monitoring and performance insights. Incident Management: Establish and oversee monitoring and incident management processes to ensure system reliability. Site Reliability Engineering (SRE): Perform SRE duties to ensure system availability, performance, and scalability. Application Support: Work closely with application teams to support application deployment and performance monitoring. We use AWS internally, however are …
everyone can do their best work. Whether you're building on our platform, supporting our customers, or shaping our story: You can just ship things. About the Role: As SRE Manager, you will lead the creation and operation of a 24/7 Site Reliability Engineering function for Vercel. Your primary goal is to act as the … If you're located beyond that distance, the role is fully remote. For location-specific details, please connect with our recruiting team. What You Will Do: Build & nurture the SRE team at Vercel, holding a high bar for technical work and teamwork. Build rapport with each member of the team and support them as they level up their skills. Define … directly with executive leadership to communicate risks and opportunities and influence cross-engineering prioritization. Partner more specifically with CDN and Compute engineering teams to define and manage SRE-driven project initiatives that improve the robustness and operational efficiency of the company's most critical serving systems. About You: At least 5 years' experience in an SRE role, or …
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
to gemstone supplies. They have a presence in London, Hong Kong, Amsterdam, and also in Mumbai, and now in New York in 2001. About the role: As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution and team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first-class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team you would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with …
Salford, Manchester, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
drive continuous improvement as we transition to cloud-native technologies. You'll challenge the status quo and push boundaries by working closely with the DevOps COE and the wider engineering community. Join us as an innovator as we enter the next phase of our transformation journey. We're looking for passionate and curious technology specialists with innovative minds who … and compliance principles into architecture and development, ensuring alignment with regulatory and risk frameworks. DevOps & Quality Engineering - Practical experience with DevOps or Site Reliability Engineering (SRE), including automation, CI/CD, and quality assurance practices. Leadership & Mentorship - Leads cross-functional teams, drives delivery, coaches others, and fosters a culture of continuous improvement and development. Business Acumen … future trends, drives change initiatives, and shapes technology roadmaps to deliver long-term value. It would be great if you had any of the following: Infrastructure as Code & Cloud Engineering - Hands-on experience with tools like Terraform, Chef, Puppet, and Ansible, combined with exposure to cloud platforms such as GCP, AWS, Azure, or ICP/OCP. CI/CD …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
drive continuous improvement as we transition to cloud-native technologies. You'll challenge the status quo and push boundaries by working closely with the DevOps COE and the wider engineering community. Join us as an innovator as we enter the next phase of our transformation journey. We're looking for passionate and curious technology specialists with innovative minds who … and compliance principles into architecture and development, ensuring alignment with regulatory and risk frameworks. DevOps & Quality Engineering - Practical experience with DevOps or Site Reliability Engineering (SRE), including automation, CI/CD, and quality assurance practices. Leadership & Mentorship - Leads cross-functional teams, drives delivery, coaches others, and fosters a culture of continuous improvement and development. Business Acumen … future trends, drives change initiatives, and shapes technology roadmaps to deliver long-term value. It would be great if you had any of the following: Infrastructure as Code & Cloud Engineering - Hands-on experience with tools like Terraform, Chef, Puppet, and Ansible, combined with exposure to cloud platforms such as GCP, AWS, Azure, or ICP/OCP. CI/CD …
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Lorien
Junior Site Reliability Engineer. Hybrid - Manchester x2 days a week. Salary up to £45,000 + Bonus. The Company: Lorien Global are supporting a growing business based in Manchester City Centre as they expand their Support Services team. With an exciting pipeline of work ahead, they're looking to hire an experienced Junior Site Reliability Engineer …
Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Ordnance Survey Limited
hear from you. Essential Criteria: Good knowledge of Azure Cloud hosting technologies. Experience with PostgreSQL databases (including PostGIS spatial extension). Good understanding of Site Reliability Engineering (SRE) and software engineering best practices. Experience investigating the root cause of failures to understand why they have occurred and propose/enact solutions, and work with external suppliers if …
love to hear from you. Good knowledge of Azure Cloud hosting technologies. Experience with PostgreSQL databases (including PostGIS spatial extension). Good understanding of Site Reliability Engineering (SRE) and software engineering best practices. Experience investigating the root cause of failures to understand why they have occurred and propose/enact solutions, and work with external suppliers if … anyway. You might be the ideal candidate for this or other roles within Ordnance Survey. Skills required for this role: Information Technology (IT) Systems, Infrastructure Services, Process Improvements, Software Engineering, Technical Consulting, Waterfall Model. Support Engineer (Full-time) - Salary - £41,892.00 to £48,874.00 (dependent on experience) - Hybrid working. Join Our Dynamic API and Data Distribution Team! Are you passionate …
you have demonstrated your capability as a Support Engineer you will be able to choose the path you want to take into Software Engineering, either product development or SRE (site reliability engineering). This makes the role ideal for someone looking to take their first steps in a software engineering career. The role: Part of … cares about customer experience, dealing with other Cronofy team members across the business, and ensures that customers have positive experiences with Cronofy. This role will also work alongside the Engineering team on a range of product development projects that will, in the long term, aid the ability to support customers on technical product-related issues. This role reports to … First-class documentation. In-depth understanding of Cronofy's API and Scheduler tool to provide a high level of customer support, resolve issues and find practical solutions. Engaging with the Engineering/Development team to provide appropriate technical solutions, often through the full development lifecycle. Working on development projects as part of the Product Engineering team. Requirements: Experience with …
software-defined networking principles. Embed zero-trust principles and user-centric design into all remote connectivity services. Align remote connectivity architecture with broader enterprise network, security, and cloud strategies. Engineering & Operations: Lead the engineering, deployment, and lifecycle management of remote access solutions such as Cisco AnyConnect, Zscaler, and other mainstream VPN … platforms. Drive automation of remote access provisioning, policy enforcement, and configuration management through Infrastructure as Code (IaC) and zero-touch deployment practices. Apply Site Reliability Engineering (SRE) principles to improve performance, availability, and troubleshooting. Establish observability practices across all access points with real-time metrics, logs, and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and … segmentation, and endpoint-based access control. Proven ability to scale remote connectivity solutions to tens of thousands of users and devices. Experience with IaC, network automation, observability tooling, and SRE methodologies. Preferred Qualifications: Certifications such as CCNP, CCIE, PCNSE, Zscaler Certified, or equivalent. Familiarity with secure hybrid work and cloud networking models. Background in network performance optimization, user-centric design …
occasional travel to Scotland. Employment Type: 6-month contract. Rate: £550 per day, outside of IR35. Role Overview: Morgan Hunt are seeking an experienced Site Reliability Engineer (SRE)/Unix Infrastructure Engineer to support the deployment, migration, and optimisation of critical infrastructure services. The role involves ensuring high availability, disaster recovery readiness, and automation-driven improvements across RHEL …
faceted and complex environment. This ICG-Tech "business aligned tech" resource will collaborate with ICG-Markets "the business", CTI "Enterprise Infrastructure", and CISO "Security" teams driving technical architecture and engineering of CTB and End of Vendor Support (EoVS) initiatives where we choose to innovate. Role Overview/What will you do: Reporting to the head of HPAi, the Systems … Architect will be responsible for ensuring best-in-breed technology architecture and engineering are in alignment with business requirements. This will be done in collaboration with CISO and CTI Compute resources aligned to HPAi who represent Enterprise and CISO priorities and standards. Active collaboration with all parties is required to be successful. The resource should be able to effectively … as an example. Kernel-bypass technology knowledge (such as Solarflare OpenOnload/TCPDirect/ef_vi, RoCE, DPDK). Prior experience in a similar Site Reliability Engineering (SRE), DevOps, distributed computing, or systems engineering/administration role. Ability to context switch to work on several projects at once. Knowledge of JIRA, SCRUM, Sprints. This job description provides a high …
hub right in the heart of the city - with real backing, big ambition and the support of an established, award-winning company. You'll be part of a new engineering chapter in Manchester, helping shape how we grow and build the team from day one, and create software and platforms with a purpose: to make finance fairer and more … to date. Designing scalable, self-service tools to reduce operational toil. Supporting infrastructure observability through metrics, tracing, and alerting. Working closely with product teams to foster a culture of reliability engineering. About you: 4+ years in a Platform/Site Reliability Engineering or similar role. Solid Linux admin skills (we use FlatcarOS & Amazon Linux). Experience …
team of passionate thinkers, innovators, and dreamers - and help us connect people and build communities to create economic opportunity for all. About the team and the role: As a Site Reliability Engineer at eBay, you'll play a key role in managing major incidents and the overall health of our services, making sure they are both resilient and … high-performing. You'll create strategies for availability and reliability, enhance domain ecosystem observability, and support a shift toward a more engineering-focused culture. Your contributions will ensure that eBay's technology remains cutting-edge and reliable for our global community. What you will accomplish: Proactive Monitoring: Continuously monitor the health of eBay's critical services to identify … and address potential issues before they escalate. Solution Development: Collaborate with Architecture, Engineering, and Operations teams to develop solutions that ensure high site availability, reliability and performance. Collaborative Problem Solving: Work closely with partner teams to resolve recurring technical issues, onboard new alerts, and develop high-quality Standard Operating Procedures (SOPs). Automation and Process Enhancement: Identify …
Mission: We unleash the potential of organisations through the science of board effectiveness, building better businesses and benefiting society. The Opportunity: As a Senior Site Reliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs … be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce toil and for responding to incidents as part of our 24/7 SRE on-call team. The SRE team: Strives to provide the highest standards of Availability, Scalability, Performance and Security for our Software as a Service environments across multiple cloud vendors and … work. Proactively monitors our platform and responds to incidents as part of a 24/7 rota. Key responsibilities of the role: We're looking for a great Senior SRE to be a hands-on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve: Hands-on work with technical …
in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention. What you'll do: Design, implement, and manage observability solutions using industry-leading tools such … mindset with a passion for continuous improvement and knowledge sharing. Certifications: Dynatrace Associate & Pro, Splunk Core Certified Power User. Desirable Experience: DevOps or Site Reliability Engineering (SRE) experience. Automation with Terraform or similar tools. Building CI/CD pipelines. Experience with Docker and Kubernetes for packaging and deployment. Ability to adapt to new technologies in fast-paced …
impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention. What you'll do: Design, implement, and manage observability solutions using industry-leading tools … A proactive mindset with a passion for continuous improvement and knowledge sharing. Certifications: Dynatrace Associate & Pro, Splunk Core Certified Power User. DevOps or Site Reliability Engineering (SRE) experience. Automation with Terraform or similar tools. Experience with Docker and Kubernetes for packaging and deployment. Ability to adapt to new technologies in fast-paced environments. Please note that all …
become the UK's most loved retirement expert. Purpose: As a Senior Application Support Engineer, you will play a crucial role in powering our Retail applications by partnering with engineering and business teams to build deep technical and business expertise. You'll be the go-to expert across a diverse, modern, and complex technology landscape, ensuring seamless support and … with a broad range of technologies, including: Practical experience with performance monitoring tools such as Dynatrace or equivalent. Skills & Knowledge: Solid understanding of Site Reliability Engineering (SRE) principles, including incident management, monitoring, alerting, and performance tuning. Strong knowledge of Software Development Lifecycle (SDLC) processes. Familiarity with incident management platforms like ServiceNow, PagerDuty, or similar tools. Excellent analytical … e.g., annuities, equity release) is advantageous. Experience with automation and scripting to improve manual processes (e.g., PowerShell, Bash). Familiarity with agile methodologies and experience working in DevOps/SRE-driven environments. Company Benefits: A competitive salary, pension scheme and life assurance, along with 25 days' annual leave plus an additional day on us for your birthday. Private Medical Cover …
the opportunity for employees with a range of experience to develop their skills while shaping our platform and making meaningful contributions to the firm. Role: Development Operations Engineer. DevOps Engineering requires engagement across development, product and operational teams to ensure Droit's enterprise software products are successfully built, tested and supported in production. Successful candidates will demonstrate skills in … build engineering, site reliability engineering, and client support. At Droit, DevOps is involved in the full lifecycle of product development, from inception to deployment, optimization, and daily operation. As a DevOps Engineer, you will implement solutions that increase the velocity of product delivery, while identifying and resolving problems before they become production issues. As a member …