Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. In this role, you’ll build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for … of endpoints. Experience and Education: Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field/experience. 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles. Deep understanding of AWS and its various modules and services. Strong background in Linux administration and troubleshooting. Proven experience in implementing …
applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. Our team is globally located, focused on ensuring production stability, automation, reliability and observability. We are looking for solution-oriented, commercially minded, customer-focused individuals, used to working in an agile environment who want to be a part of building something … the firm. Your work will have a massive impact, both on us as a company, as well as our clients and our business partners around the world. As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology - Market Risk, you will solve complex and broad business problems with simple and straightforward solutions. Through code … associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities: Able to drive the continuous improvement of reliability, monitoring and alerting for our mission-critical microservices. Reduce toil by …
Site Reliability Engineer (DV Security Clearance). Position Description: CGI was recognised in the Sunday Times Best Places to Work List 2025 and has been named one of the 'World's Best Employers' by Forbes magazine. We offer a competitive salary, excellent pension, private healthcare, plus a share scheme (3.5% + 3.5% matching) which makes you a member … agencies' most challenging problems. Our teams work alongside our clients to help them understand how to exploit technologies to maintain competitive advantage. Our systems are engineered for performance, security, reliability and scalability; built with modern CI and CD tooling and techniques. We are currently looking for an experienced cloud infrastructure engineer to join our team - being able to …
the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role: We’re looking for Site Reliability Engineers who can help us build, operate, and maintain high-performance, scalable, and reliable services for our production infrastructure, across both cloud & on-prem environments. Site Reliability Engineers combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. Our team strives to automate processes wherever possible, using whichever tools are best for the job. You’ll be the experts for the environments that you operate infrastructure in, helping partner teams build …
London, England, United Kingdom Hybrid / WFH Options
Thought Machine
Magazine named us one of the world’s most innovative fintechs, and the Financial Times recognised us as one of Europe’s fastest-growing companies in 2023. The Client Site Reliability Engineer role in Infrastructure, Client Services will be responsible for enabling and supporting our clients to deliver a best in class cloud native implementation of Thought … infrastructure, from presales to production at scale. This role supports clients in their cloud infrastructure preparation, deployment, optimisation and troubleshooting. Duties: Hands-on cloud infrastructure consulting, both on client site and remote. Working with customers and external partners to design and prepare suitable cloud infrastructure to ensure Thought Machine Vault products can be tested and run successfully at scale. … systems outside of Vault to empower holistic digital transformation in collaboration with Thought Machine Client Architects. Supporting and troubleshooting client, SaaS and internal cloud infrastructure both remotely and on site, including by promoting and deploying suitable monitoring, logging and alerting tools. Working closely with internal product and engineering teams to ensure client feedback is incorporated into improvements to the …
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … re ready to shape the future, let's talk. Job Description: As a Go Developer at ZILO Technologies, you will play a crucial role in maintaining and enhancing the reliability, performance, and scalability of our platform. You will be responsible for addressing defect fixes, implementing small changes, and contributing to ongoing enhancements of our Go-based microservices stack. Key … platform. Implement small changes and enhancements to improve system functionality and performance. Contribute to the design, development, and deployment of microservices in a Go environment. Monitor system performance and reliability, proactively addressing potential issues. Develop and maintain automation tools to streamline operational processes. Participate in on-call rotations to ensure 24/7 system availability and rapid incident response.
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … you're ready to shape the future, let's talk. Job Description: As a Developer at ZILO Technologies, you will play a crucial role in maintaining and enhancing the reliability, performance, and scalability of our platform. You will be responsible for addressing defect fixes, implementing small changes, and contributing to ongoing enhancements of our Java-based microservices stack. Key … platform. Implement small changes and enhancements to improve system functionality and performance. Contribute to the design, development, and deployment of microservices in a Java environment. Monitor system performance and reliability, proactively addressing potential issues. Develop and maintain automation tools to streamline operational processes. Participate in on-call rotations to ensure 24/7 system availability and rapid incident response.
Key Accountabilities: Working with Curve’s engineering teams to support the infrastructure they need and the platforms on which their services run. Observing our platforms and services to measure reliability, find areas for improvement, and discover any risks to the stability or security of our systems. Maintaining new and existing infrastructure with code, by writing well-designed Terraform modules … them and prevent them from happening again. Sharing your work and talking about it within the Platform and Engineering team, to spread knowledge and be an ambassador for good site reliability practices. Deploying innovative new tools to help accelerate engineers and make their lives easier, giving them more time to focus on what they are building. Documenting and … driving the adoption of engineering best practices across the wider Engineering team. Demonstrating ownership of all initiatives from concept to launch, and embodying unwavering commitment and reliability, with a genuine willingness to contribute and address challenges. Projects/initiatives that we want you to contribute to, or lead the charge on: Helping Curve scale to many millions of customers …
innovative technical solutions for government agencies’ most challenging problems. Our teams work alongside clients to exploit technologies for maintaining a competitive advantage. Our systems are engineered for performance, security, reliability, and scalability, built with modern CI and CD tooling and techniques. We are currently seeking an experienced cloud infrastructure engineer to join our team. Being able to think …
Are you an experienced Senior DevOps/Site Reliability Engineer looking for your next contract role? Join one of the world's leading IT services, consulting, and business solutions organizations. Founded in 1968, the company consistently ranks among the top global IT service providers. With a presence in over 50 countries, the company has built a reputation … across industries including banking, healthcare, telecommunications, and retail. The leading consultancy firm has partnered with a global technology leader and they are currently seeking an experienced Senior DevOps/Site Reliability Engineer to join the team. Additionally, this role provides a hybrid working arrangement based in London. Ready to make a move? Get in touch and apply …
We have an exciting and rewarding opportunity for you to take your software engineering career to the next level. As a Lead Site Reliability Engineer at JPMorgan Chase within CCB, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable …
through all the related job information below. We have an exciting and rewarding opportunity for you to take your software engineering career to the next level. As a Lead Site Reliability Engineer at JPMorgan Chase within CCB, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology …
Site Reliability Engineer (SRE) - Crypto High-Frequency Trading, Slough. Client: Selby Jennings. Location: Slough, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 2. Posted: 06.06.2025. Expiry Date: 21.07.2025. Job Description: We are looking for a Site Reliability Engineer (SRE) to help … role ensures our trading systems remain highly available, scalable, and robust, supporting a fast-paced environment. Responsibilities: Develop scalable tools for automation, deployment, and infrastructure management. Enhance system performance, reliability, and efficiency through automation. Manage AWS infrastructure, ensuring smooth configuration and deployment. Implement observability tools for monitoring and debugging. Ensure fault tolerance, redundancy, and high availability of trading systems. … IaC) tools like Terraform or Ansible. Experience in low-latency or high-performance environments. Proactive problem-solving skills and team collaboration. We seek talented engineers passionate about automation and reliability, thriving in a high-performance environment, working across development, trading, and infrastructure teams to optimize system performance.
Site Reliability Engineer (SRE) - Crypto High-Frequency Trading, London. Client: Selby Jennings. Location: London, United Kingdom. Job Category: Other. EU work permit required: Yes. Job Views: 2. Posted: 06.06.2025. Expiry Date: 21.07.2025. Job Description: We are looking for a Site Reliability Engineer (SRE) to help … highly available, scalable, and robust, supporting the fast-paced environment. What You'll Be Doing: Develop scalable production tools to automate deployment, monitoring, and infrastructure management. Improve system performance, reliability, and efficiency through automation and tooling. Manage AWS-based infrastructure, ensuring seamless configuration and deployment. Implement observability tools for monitoring, debugging, and performance insights. Ensure fault tolerance, redundancy, and …
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
play a critical role in ensuring system reliability, scalability, and performance across both AWS and Azure environments. This is your opportunity to lead cloud-native transformation and embed SRE best practices into engineering at scale. What you’ll be doing as their Site Reliability Engineer: You’ll be the go-to expert for designing and maintaining … CI/CD pipelines to reduce toil and accelerate deployment frequency. Build observability into everything: own monitoring, alerting, and incident response to minimize MTTR and improve system health. Champion SRE culture and reliability-focused engineering: help shape sustainable engineering practices, SLAs, SLOs, and error budgets. Contribute across the stack with flexibility in tooling: experience with Python, Go, or TypeScript … dental insurance. 25 days annual leave + bank holidays. R&D and personal training budgets. And much more... This is an incredibly rare chance for a seasoned, high-performing SRE to leave your mark on high-impact transformation projects in a business that’s truly committed to doing things the right way.
London, England, United Kingdom Hybrid / WFH Options
Explore Group
firm is redefining how financial institutions handle compliance and data, leveraging automation, AI, and modern cloud infrastructure. They’re now looking for a Site Reliability Engineer (SRE) to join their London-based team. This is a hybrid role with 2 days per week in the office, offering the best of both collaboration and flexibility. Role: Site Reliability Engineer (SRE). Location: London (Hybrid – 2 days/week in-office). Term: Permanent. Key Responsibilities: Design and manage scalable, cloud-based infrastructure (AWS). Drive automation, monitoring, and CI/CD best practices. Collaborate with engineering to ensure system reliability and performance. Lead incident response and implement proactive improvements. Key Requirements: Hands-on experience in SRE … Full-time. Job function: Information Technology. Industries: IT Services and IT Consulting.
Site Reliability Engineer - Healthcare Technology. UK | Hybrid | Full-time | Permanent. We're working with a leading healthtech company to find a Site Reliability Engineer to support and optimise the platforms … behind critical clinical systems. This is a hybrid role offering flexibility, technical challenge, and the chance to make a direct impact on healthcare delivery. You'll join a collaborative SRE team focused on maintaining cloud and on-premise environments, improving deployment pipelines, reducing manual work, and supporting project delivery. You'll work closely with internal teams across software development, support … and delivery. Key Technologies Include: Azure, AWS, GCP; Kubernetes, Terraform, Azure DevOps; Linux and Windows Server. We're looking for enthusiastic people with experience in SRE or DevOps roles, particularly in environments using containerised and cloud-based applications. Strong communication skills and the ability to work across teams are essential. Applicants must have the right to live and work in …
Market-leading salary and annual discretionary bonus. Pension contributions, in addition to Health Insurance, Life Assurance. What You’ll Be Doing: It's not just DevOps, it's true SRE. SLAs, SLOs, SLIs, error budgets, and incident tooling are at the centre of the role. Own and evolve observability frameworks, define resilience strategies, and contribute directly to the Java backend … AKS, Application Insights, Terraform). You’ll be the bridge between software engineering and operations, directly influencing architectural decisions. What You’ll Bring: 8+ years in production engineering or SRE roles. Expertise in monitoring, alerting, and incident tooling (Prometheus, Grafana, OpenTelemetry, ELK, etc.). Experience with Azure, Kubernetes, and scalable systems in high-uptime environments (fintech/crypto preferred).
Leeds, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description: Site Reliability Engineer. We're hiring for a Senior Site Reliability Engineer on behalf of a leading-edge client that’s redefining real-time analytics and infrastructure at scale. This is a fantastic opportunity to work with a passionate, technically strong team delivering critical infrastructure to support live data pipelines, scalable compute … high-growth, high-performance environment. Role Highlights: Salary: Up to £85,000. Location: Hybrid (Leeds). Tech: AWS, Kubernetes, IaC, Observability, CI/CD. Focus: Infrastructure scalability, automation, performance, and reliability. What You’ll Be Doing: Designing and scaling cloud infrastructure with a strong emphasis on AWS (EKS, MSK, DynamoDB, RDS). Driving automation with Terraform/OpenTofu, scripting (Python, PowerShell … performance tuning, and cost optimisation strategies. Collaborating with engineers and data scientists to ensure platform reliability and rapid iteration. What We’re Looking For: Solid experience as an SRE or DevOps Engineer in cloud-first environments. Strong command of AWS infrastructure, container orchestration with Kubernetes, and modern CI/CD tooling. Proficiency in scripting (Python or PowerShell) for …
Site Reliability Engineer role at Grid Dynamics. Hybrid position with on-calls. We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability of the client’s critical Data Platform solutions. In this role, you will be instrumental in providing dedicated support and maintaining the health of the data infrastructure.
regulated solutions to institutional finance, this firm is redefining how digital assets are secured and managed. As part of their expansion, they’re looking to find a hands-on Site Reliability Engineer to join their mission-critical engineering team. The Role: Our client is building a next-generation digital assets platform using Java Spring microservices on Azure … and annual discretionary bonus. Pension contributions, in addition to Health Insurance, Life Assurance. 25 days annual leave. What You’ll Be Doing: It's not just DevOps, it's true SRE. SLAs, SLOs, SLIs, error budgets, and incident tooling are at the centre of the role. Own and evolve observability frameworks, define resilience strategies, and contribute directly to the Java backend … AKS, Application Insights, Terraform). You’ll be the bridge between software engineering and operations, directly influencing architectural decisions. What You’ll Bring: 8+ years in production engineering or SRE roles. Deep Java/Spring experience. Expertise in monitoring, alerting, and incident tooling (Prometheus, Grafana, OpenTelemetry, ELK, etc.). Experience with Azure, Kubernetes, and scalable systems in high-uptime environments …
working to deliver mission critical solutions helping to keep the nation safe, secure and prosperous. Working on leading edge technology solutions including AI/DS, Cyber, Cloud, DevOps/SRE, Platform Engineering. We have secured long term work, across the full spectrum, on the latest framework with the client, which provides the springboard for our ongoing growth and development in … this domain, so join us on what will be an incredible growth journey. Role: As an SRE Engineer, within our National Security Business, you thrive on providing technical leadership; working closely and embedding with customers to maintain the operational integrity of mission critical services in a National Security environment. You’re an SRE Engineer that provides technical leadership … simple add-ons to multiple vendors; you’ll be working directly with customers to ensure successful delivery of software rollouts that enable the nation's safety. As our next SRE Engineer, you’ll operate with customer availability as a priority; catching issues early and ensuring continued operational integrity of services that are strategically important to the nation. Whilst developing …