Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom Hybrid / WFH Options
DWP Digital
secure solutions across projects and initiatives which are transforming how government works. You'll use your knowledge of areas such as software development and sitereliabilityengineering to help keep DWP safe and compliant and you'll translate this to our brilliant project teams. The scale of More ❯
preston, lancashire, north west england, united kingdom Hybrid / WFH Options
DWP Digital
secure solutions across projects and initiatives which are transforming how government works. You'll use your knowledge of areas such as software development and sitereliabilityengineering to help keep DWP safe and compliant and you'll translate this to our brilliant project teams. The scale of More ❯
sunderland, tyne and wear, north east england, united kingdom Hybrid / WFH Options
DWP Digital
secure solutions across projects and initiatives which are transforming how government works. You'll use your knowledge of areas such as software development and sitereliabilityengineering to help keep DWP safe and compliant and you'll translate this to our brilliant project teams. The scale of More ❯
+ Shares + Benefits We are hiring to enhance our Security and Compliance efforts. In this role, you will collaborate with Senior Leadership and SiteReliabilityEngineering to support our GRC initiatives. We provide a low latency, high throughput distributed messaging system to the e-trading Fixed More ❯
This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to support platforms worldwide. We are looking for SRE talent with experience in an On-Prem/Datacenter environment. The ideal candidate will bring strong technical leadership, experience in … impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring a SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration Partner with Software Engineering, Infrastructure, Operations More ❯
This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to support platforms worldwide. We are looking for SRE talent with experience in an On-Prem/Datacenter environment. The ideal candidate will bring strong technical leadership, experience in … impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring a SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration Partner with Software Engineering, Infrastructure, Operations More ❯
s possible in home healthcare. Together, we will continue to make a meaningful difference in the lives of millions. Your mission? As a Senior SRE , you'll contribute to influence and shape both the strategy and implementation of our evolving observability capabilities across the Birdie system; you'll leverage OpenTelemetry … and SRE practices to support squads in proactively identifying issues before they impact customers; You'll play a vital role in building and maintaining our Incident Management and On-Call "experience", building automations and driving practices that unify critical system operations and make OOH support run smoothly; You'll be … championing reliability and stability across the Engineering Organisation, instilling SRE principles within teams and leaving your mark on our product. You'll scale into acting as a "Tech Lead", developing our Disaster Recovery strategy, supporting Platform and Product in defining and executing targeted improvements that cross-functionally achieve More ❯
CI/CD systems to ensure our teams have access to the best tools available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will … Use your sitereliability expertise to design, operate and support Preqin's infrastructure, middleware and internal services. Improving their performance … availability, scalability, latency and efficiency. Drive technical excellence in everything we do, fostering a culture of data-driven reliability, monitoring and automation, following SRE best-practices. Work alongside development teams to design and build scalable and high available services, while establishing effective build frameworks for continuous deployment and self More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
production environment by monitoring availability and taking a holistic view of system health Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward … getting ahead of customer needs, and innovating to continually improve Provide primary operational support and engineering for multiple large distributed software applications How will you make an impact? Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding Partner with development … Participate in system design consulting, platform management, and capacity planning Create sustainable systems and services through automation and uplifts Balance feature development speed and reliability with well-defined service level objectives Have you got what it takes? 3-6 years of working experience in a similar role, with a More ❯
Software engineering is at the heart of what we do here at giffgaff. Our agile engineering teams build and support a wide variety of applications and services. These combine to create our unique user experience on the giffgaff website, enable a whole range of awesome features via modern … We want you to share your opinions on how we are doing things - and help us get better! The Role We're looking for SRE engineers with passion and energy, a strong desire to learn and improve and a commitment to testing and excellence. You'll have to make tough … logging and tracing (Prometheus, EFK, Alertmanager, Jaeger/Zipkin) Troubleshooting in complex environments using the monitoring tools Establishing and measuring SLIs and SLOs with engineering teams Participate in periodic 24x7 on-call duties Build and manage systems, infrastructure and applications through automation (Terraform, Ansible) CI/CD tools: Jenkins More ❯
thinkers, builders, creators, and problem solvers are over 1,000 strong, across 20 markets around the world. ABOUT CHOREOGRAPH OPTIMIZATION We're a Product & Engineering team within Choreograph that's focused on optimizing the impact that advertising has for audiences and the brands that we work with. We do … outcomes for brands and audiences. As a Senior Engineer, you will work side by side with a Product Manager, a Design Lead, and your engineering team to design, build, deliver, and operate purposeful solutions. The role is a hands-on development role, involving contributing to product strategy, crafting scalable … billion ad impressions across the globe. You'll play a leading role in significantly scaling this further. WHAT YOU WILL DO Lead an engineering team, overseeing and participating in all stages of development, delivery, and operations, together with Product and Design leaders. Architect solutions that are simple, scalable, reliable More ❯
Seeking a versatile and proactive Platform Engineer/SiteReliability Engineer (SRE) to join our team. You will be a cornerstone of our infrastructure and DevOps practice, responsible for designing, supporting, and improving our cloud-based platforms. Collaborating closely with developers, you will play a critical role in … in maintaining our security posture and ensuring the company successfully achieves and maintains Cyber Essentials Plus certification annually. Operational Excellence: Drive automation, improve system reliability, and optimize performance across the platform. Required Skills and Experience (Must-Haves): Proven hands-on experience designing, building, managing, and supporting infrastructure within Microsoft More ❯
Type: Full-time Location Type: On-site Location: London, England, United Kingdom Salary: Not disclosed Description As a critical and trusted member of the Systems Engineering team, you'll be working side-by-side with software engineers to design and deliver mission critical services and systems. You'll … experience with any of Python, Go, PHP, Ruby, Node.js; Experience with incident response and a security-focused mindset; University Degree in Computer Science/Engineering (or related field) is preferred; Additional Benefits: Fast-track your career growth - our meritocratic culture is known for promoting from within and producing industry More ❯
culture. We believe in celebrating one another's achievements, open communication, and transparent leadership. About the Position The DevOps Engineer role reports to the SRE Lead, working as part of a small team providing and maintaining the development tools (CI/CD tool chain), providing DevOps excellence and best practice More ❯
Systems Reliability Engineers use a software engineering approach to architect, design, automate, monitor, and build applications at scale. This includes operating and engineering software with close business segment alignment to deliver platforms through efficient, effective and resilient architectures. … SREs are talented engineers that are focused on improving quality through a data driven approach: instrumentation, automation, and functional/unit testing. Responsibilities: The SRE will help create, build and deliver new technologies or platforms. This will include consultation, designing, building, and supporting development pipelines, automating infrastructure and operations, creating … on load test runs against a moderately complex system. Demonstrates exceptional troubleshooting methodology, including the ability to author and instruct new methodologies to the SRE team. Independently resolve moderately to highly complex system and application incidents. Able to identify and propose system and application fixes for performance bottlenecks. Able to More ❯
ARM, Pulumi Building secure applications and infrastructure Strong communication skills, with the ability to convey and or understand complex technical concepts clearly and concisely SRE skills including observability and telemetry monitoring HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul) Containerisation using Docker, Kubernetes, OpenShift & Helm Programming skills using languages such as More ❯
We have an exciting and rewarding opportunity for you to take your software engineering career to the next level. As a Lead SiteReliability Engineer at JPMorgan Chase within CCB, you are an integral part of an agile team that works to enhance, build, and deliver trusted … design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems. Support the engineering teams in building fault-tolerant, scalable applications by engaging in design discussions, RFCs and code reviews. Drive decisions that influence the product design, application More ❯
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Cheltenham with possible 24/7 call out when on rota Security Clearance: Must … practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes. Understanding of agile software development principles and practices, with the ability to collaborate in a fast-paced, evolving environment. Knowledge of or understanding More ❯
seeking a DevOps Engineer for floLIVE . About the role In this project, the DevOps Engineer will play a pivotal role in ensuring the reliability of the infrastructure, optimizing processes, and maintaining security. You will be responsible for managing and maintaining Linux-based systems (CentOS, Ubuntu) to ensure their … delivery processes in a fast-growing startup. We're looking for you if you have: 5+ years of previous experience as a DevOps or SiteReliability Engineer. Proven experience in System Administration, specifically with Linux environments (CentOS, Ubuntu). Strong expertise in networking, including routing protocols, network design More ❯
Pipeline team to contribute Modules and Incremental improvements to the Core Pipeline, Core Services, and Core Operating team's Libraries and Services. Collaborate with SRE team members to ensure development and operations work is delivered in full and on time (agile/product sprints). Write and maintain systems/ More ❯
Pipeline team to contribute Modules and Incremental improvements to the Core Pipeline, Core Services, and Core Operating team's Libraries and Services. Collaborate with SRE team members to ensure development and operations work is delivered in full and on time (agile/product sprints). Write and maintain systems/ More ❯
in banking, healthcare, insurance, and more. This is a full time role, with the ability to work flexibly with our global teams. As an SRE Engineer, you will be part of a Platform Team dedicated to making Gravitee's products best in class both in terms of product and infrastructure. More ❯
This is a Vice President position within Platform ReliabilityEngineering and Management leveraging SRE Principles and Practices based out of London. This role is looking for a multi skilled professional with strong technical leadership, people management skills to deliver critical services ensuring a highly stable, reliable, and resilient … to eliminate manual day to day support activities; scope and create automation for deployment, management and visibility of our services. Extensive experience with implementing SRE principles in the organization such as SLOs/SLIs and TOIL measurement Implement best practices for building successful monitoring and alerting systems. Experience with Observability … platforms like Datadog and open telemetry is desired. You will work closely with engineering/development teams to design, build, and maintain systems and help them decide on products to use, schema design and query tuning. Extensive troubleshooting abilities across the stack QUALIFICATIONS Required Skills: Bachelor's degree or More ❯
This is a Vice President position within Platform ReliabilityEngineering and Management leveraging SRE Principles and Practices based out of London. This role is looking for a multi skilled professional with strong technical leadership, people management skills to deliver critical services ensuring a highly stable, reliable, and resilient … to eliminate manual day to day support activities; scope and create automation for deployment, management and visibility of our services. Extensive experience with implementing SRE principles in the organization such as SLOs/SLIs and TOIL measurement Implement best practices for building successful monitoring and alerting systems. Experience with Observability … platforms like Datadog and open telemetry is desired. You will work closely with engineering/development teams to design, build, and maintain systems and help them decide on products to use, schema design and query tuning. Extensive troubleshooting abilities across the stack QUALIFICATIONS Required Skills: Bachelor's degree or More ❯
the ability to "work from anywhere" for two weeks of the year Paid one month sabbatical after four years' employment Role Overview Luminance's SRE team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a … response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer satisfaction. With a focus on automation, scalability, reliability and security, the team enable Luminance to ensure a performant, seamless experience for its users. You will join a small, dynamic team of creative More ❯