Type: Full-time
Location Type: On-site
Location: London, England, United Kingdom
Salary: Not disclosed
Description: As a critical and trusted member of the Systems Engineering team, you'll be working side-by-side with software engineers to design and deliver mission-critical services and systems. You'll be …
The SRE Manager is responsible for leading the Site Reliability Engineering function across Europe, ensuring the reliability, scalability, and performance of critical infrastructure and services. This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to … impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring a regional SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership: Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence: Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration: Partner with Software Engineering, Infrastructure, Operations, Security …
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
production environment by monitoring availability and taking a holistic view of system health Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward … Participate in system design consulting, platform management, and capacity planning Create sustainable systems and services through automation and uplifts Balance feature development speed and reliability with well-defined service level objectives Have you got what it takes? 3-6 years of working experience in a similar role, with a … Python, Go, Java, C#) and experience with scripting languages (e.g., Bash, PowerShell). Deep understanding of cloud computing platforms (e.g., AWS) and of the working and reliability constraints of some of the prominent services (e.g., EC2, ECS, Lambda, DynamoDB). Experience with infrastructure as code tools such as CloudFormation and Terraform. Deep …
Senior Cloud/SRE Engineer Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About the Team: The LexisNexis Intellectual Property (IP) division provides international patent content and a suite of online and analytic tools that … contributes directly to project plans, schedules, and methodologies for implementing cross-functional software assets and infrastructure. Responsibilities include cloud platform design across multiple systems, SRE activities, mentoring less-experienced team members, and collaborating with users, customers, and stakeholders to translate their requirements into effective solutions. Additionally, it focuses on fostering … Kubernetes/EKS). Proficiency in scripting languages (e.g., Python, Bash, TypeScript, PowerShell). Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., DataDog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent …
This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to support platforms worldwide. We are looking for SRE talent with experience in an On-Prem/Datacenter environment. The ideal candidate will bring strong technical leadership, experience in … high-impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring an SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership: Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence: Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration: Partner with Software Engineering, Infrastructure, Operations, Security …
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Cheltenham with possible 24/7 call out when on rota Security Clearance: Must … practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes. Understanding of agile software development principles and practices, with the ability to collaborate in a fast-paced, evolving environment. Knowledge of or understanding …
the ability to "work from anywhere" for two weeks of the year Paid one month sabbatical after four years' employment Role Overview Luminance's SRE team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a … response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer satisfaction. With a focus on automation, scalability, reliability and security, the team enables Luminance to ensure a performant, seamless experience for its users. You will join a small, dynamic team of creative …
Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including … infrastructure. Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues. Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design. Drive a culture of intolerance to manual activity which results in a highly automated environment …
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we …
development team Be on call and respond to incidents promptly Who We Are Looking For Have at least 5 years of experience as an SRE or DevOps Build infrastructure with Murphy's Law in mind Are keen on monitoring atmospheric pressure in the data center Trust people but automate all …
What if your coding skills could do more than just build software? What if they could make entire systems faster, smarter, and unbreakable? Bringing your skills in software development, you'll work on internal tooling and automation, using Golang, Python …
production environment by monitoring availability and taking a holistic view of system health Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward … Participate in system design consulting, platform management, and capacity planning Create sustainable systems and services through automation and uplifts Balance feature development speed and reliability with well-defined service-level objectives …
Platform Engineering Delivery Lead - In-depth experience and understanding of Observability and SRE Practices Permanent Hybrid - London, flexible working, generally 1-2 days per week onsite in London Bridge About GlobalLogic GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and most forward … an experienced, senior-level Platform Engineering leader with a hands-on engineering background and in-depth experience and understanding of Observability and SRE Practices. In this role you will need to be able to articulate the business benefits that Observability and SRE provide to our clients and take … effectively with business growth and demand. Process Optimization: Evaluate and provide recommendations to improve existing business processes for optimal efficiency. Technical Expertise Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or …
leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications. We are looking for an experienced Lead for our SRE, InfraSec team to guide the security of our cloud-based infrastructure. As a Lead SRE, you will be very hands-on technically while also directly … adheres to the highest security standards. They build essential security infrastructure and implement controls that reinforce the platform's security posture. This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large-scale solutions. This team is deeply involved … implement, and manage cloud-native security tools and platforms for endpoint security, identity management (IAM), and CSPM Qualifications: Experience: 7+ years of experience in SRE, infrastructure engineering or a similar role, with a strong focus on security work, ideally with 2+ years in a leadership or senior engineering role Security Mindset …
We have an exciting opportunity for an SRE/DevOps Engineer to join a leading software house. Reporting to the Head of Engineering, this role offers the chance to stay hands-on and participate in strategic decisions. We want someone with fresh ideas and supporting experience who enjoys collaborating … with a cross-functional team to develop real-world solutions and positive user experiences. Responsibilities: Ownership and implementation across multiple projects of our SRE strategy, which focuses on maturity, scalability, resilience, security and automation. Manage and maintain environments to ensure high availability and security. Build and maintain infrastructure as code … identify and address performance bottlenecks, security vulnerabilities, and system failures. Automate manual tasks to improve operational efficiency and reduce technical debt. Collaborate with other SRE engineers to ensure resilience and scalability across the platform. Work with our teams to directly influence and drive the adoption of SRE best practices and …