A Developer possesses a unique skill set that synergises well with SiteReliabilityEngineering (SRE). With a strong foundation in Golang development, valuable expertise is brought to the table, enabling contributions to innovative solutions for complex monitoring, automation, and capacity management challenges. As a SiteReliability Engineer, you can shape the way this company manages intricate automation and monitoring solutions. At the forefront of a new era in technology, you will ensure critical applications run seamlessly with minimal intervention, collaborating closely with Software Development and Platform teams to optimise system performance for this … However, proficiency in programming languages such as Golang, Python, or JavaScript is essential! If you are a Golang Engineer looking to transition into the SRE world, or vice versa, this is an opportunity you won't want to miss More ❯
do your best work. Learn more at . We are looking for experienced people who are competent in the cloud and knowledgeable about the SRE (sitereliabilityengineering) domain. The team The Core Architecture Team (CAT) produces and manages the core technology, methodologies, and frameworks that underpin More ❯
Global SiteReliability Engineer Location: London About Us Founded in 2013, GSR is a leading market maker and programmatic trading firm in the fast-evolving world of cryptocurrency trading. With over 200 employees across seven countries, we provide billions of dollars in liquidity daily to cryptocurrency protocols and … to be deeply embedded in every major sector of the cryptocurrency ecosystem. About the Role We are seeking a SiteReliability Engineer (SRE) to design, optimize, and support highly available systems across our global trading infrastructure. As part of GSR's SRE team, you will manage a multi … infrastructure, including: Networking & Exchange Connectivity Linux Systems & Kubernetes Administration Microservice Orchestration & Observability Disaster Recovery & Security Optimization Your mission is to improve latency, scalability, and reliability, ensuring GSR remains a best-in-class market maker. We value engineers who drive automation, reduce friction, and enhance developer velocity through better tooling More ❯
nurture others and learn from them, then this is your challenge! The Team The Infrastructure as a Service (IaaS) team aims at upholding the reliability and scalability we expect from Algolia's infrastructure for its critical systems and products. Our focus is on enabling teams across Algolia to leverage … this infrastructure while keeping it under control through an always increasing level of automation. The Opportunity The Senior SiteReliability Engineer position within the IaaS team provides a dynamic opportunity for a professional with foundational experience in maintaining and optimising scalable infrastructures. This role specifically … concentrates on three key areas: Server and container hosting, cloud and network expertise and flawless observability. As a Senior SiteReliability Engineer (SRE) , you will play a pivotal role in designing, implementing, and maintaining highly available, scalable, and fault-tolerant systems. Your work will directly impact the effectiveness More ❯
a powerful robotics stack that includes computer vision, motion planning, SLAM, controls, estimation, and secure communications. ABOUT THE JOB We are looking for a SiteReliability Engineer for our growing team in London, UK. The role will require the balance of validating changes across multiple products, robots, software … engineering and test teams, as well as pushing forward new features for short suspense customer requirements. This person will … work with a team that will develop processes & tools that allow Anduril to develop and release safety critical systems quickly and reliably. A strong SRE will have a proven track record of developing CI/CD pipelines and a background in development, operations, and automated test coverage infrastructure, both on More ❯
disciplines. We're always on the lookout for energetic, creative people to join our team. Your New Role SiteReliabilityEngineering (SRE) team members work with our Global Content Delivery teams to deliver exabytes of content for our brands globally. The SRE has a highly skilled combination … of engineering and operations skills and is focused on automating and improving operations. Their job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end-users early, often, & fast. They are central to the reputation … deployments. Coordinate with relevant teams to build useful tools to support network operations (internal and external). Qualifications and Experience The Essentials: Passionate about SRE, DevOps, Automation, and infrastructure platforms. Understand the mechanical sympathy between software workloads and the demand it places on the underlying hardware. Working knowledge of non More ❯
part of a team operating global services, handling the requests of hundreds of millions of Apple customers. This kind of scale presents unique challenges. SRE teams at Apple support the full infrastructure stack; from individual API performance to network traffic management. Responsibilities will be both broad and deep. SRE teams … of Apple's payment services including Apple Pay. Good ideas are heard and results are rewarded. As a valued member in our Wallets & Payments SRE team, you'll be on a team whose mission is to build and continuously improve Apple's most critical payment platform services. Do you like … analytical problem solving and analytical thinking skills. Ability to clearly and accurately communicate day-to-day operations to ensure detailed hand-off to regional SRE teams. Preferred Qualifications Prior experience in supporting large-scale banking or payment systems, is a plus. Expertise in API design and interface technologies. Expertise with More ❯
our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role We're looking for SiteReliability Engineers who can help us build, operate, and maintain high-performance, scalable, and reliable services for our production infrastructure, across both cloud … on-prem environments. SiteReliability Engineers combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. Our team strives to automate processes wherever possible, using whichever tools are best for the job. You'll … experts for the environments that you operate infrastructure in, helping partner teams build & configure their software to operate reliably within. We strongly believe in engineering teams being responsible for the operations of their services in production. In this role, you'll work closely with engineers to advocate and participate More ❯
SiteReliability Engineer - DevOps Engineer 18 Month Contract PAYE - Fully Remote/or Hybrid based in Midlands if preferred. The role We are working with one of the finest gaming studios in the industry and are on the lookout for … an exceptional SiteReliability Engineer who can bring their expertise and unique thinking to help make their team even stronger! As an SRE , the main purpose is solving for scale through collaboration and automation, bringing engineering principles to infrastructure and operational problems. Work closely with the different More ❯
our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role We're looking for SiteReliability Engineers who can help us build, operate, and maintain high-performance, scalable, and reliable services for our production infrastructure. SiteReliability Engineers combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. Our team strives to automate processes wherever possible, using whichever tools are best for the job. We strongly believe in engineering teams being More ❯
databases and we want to grow that number, along with delivering more features without compromising from reliability and scalability. This is where our SRE team comes into the picture. The SRE team is responsible for managing Neon's multi-region, multi-cloud deployment in close collaboration with the broader … engineering team. All the features we want to implement can only reach our customers if the changes are delivered reliably, which means the SRE team plays a significant role in defining our pace of development. Successful candidates will get the opportunity to contribute to the effort of evolving Neon … cloud and infrastructure topics Be ready to join an on-call rotation We're looking for someone who has 4+ years experience working in SiteReliabilityEngineering Experience with cloud infrastructure components in Azure and/or AWS Experience in a complex Linux infrastructure environment Experience focusing More ❯
We are seeking talented Senior SiteReliability Engineers to join our growing SRE team! You will tackle complex challenges by designing and implementing scalable, reliable infrastructure and services that power the future of customer engagement technology. In this pivotal role, you'll leverage your extensive expertise in backend … systems and infrastructure management to enhance the performance and reliability of our platforms. Your contributions will directly influence the shaping of architecture and operational excellence needed for our product to thrive. Some things you'll do Architect and maintain critical infrastructure to enable Customer.io to scale and handle real … processing of billions of messages. Strategically plan and implement infrastructure growth to meet evolving demands and repeatability. Streamline and automate processes for efficiency and reliability, removing manual toil. Participate in on-call rotations to swiftly address availability incidents and support technical engineers with customer-related issues. Develop observability to More ❯
SiteReliability Engineer (SRE) - Data Platform London, England, United Kingdom Software and Services At Apple, we believe that innovation flourishes in an environment where ideas are challenged, collaboration is encouraged, and technology is pushed to its limits. This environment is only possible when diverse minds come together, bringing … innovation in everything we do. Imagine what you could accomplish here! Join Apple and help us make the world a better place. As an SRE on our team, you'll be responsible for architecting, optimizing, and scaling distributed storage and analytics systems. You'll collaborate closely with development teams to … of distributed systems, beyond individual components. We firmly believe in ownership, with software engineers accountable for the code they write. Description The Apple Services Engineering (ASE) organization builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, and Maps). At ASE, we More ❯
Platform Engineering Delivery Lead - In-depth experience and understanding of Observability and SRE Practices Permanent Hybrid - London, flexible working, generally 1-2 days per week onsite in London Bridge About GlobalLogic GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and … and senior level Platform Engineering leader with a hands-on engineering background and an in-depth experience and understanding of Observability and SRE Practices. In this role you will need to be able to articulate the business benefits that Observability and SRE provide to our clients and take … effectively with business growth and demand. Process Optimization: Evaluate and provide recommendations to improve existing business processes for optimal efficiency. Technical Expertise Observability and SRE Practices: In-depth understanding of observability and SiteReliabilityEngineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir More ❯
real-time data, set us apart as the leader in payments. We're on the hunt for an exceptional SiteReliability Engineer (SRE) to join our dedicated team. As an SRE at Paymentology, you'll be the superhero responsible for maintaining, improving, and ensuring the high availability, scalability … and service quality levels. Contribute to the design of reliable cloud infrastructure and implement reusable cloud-uptime components as code. Regularly review and optimise SRE practices, tools, and methodologies to enhance overall system reliability and team efficiency. Observability and Automation: Contribute to the design, implementation, and maintenance of observability … a culture of reliability. Requirements Bachelor's Degree in Computer Science, Information Technology, or related field. A minimum of 3 years in a dedicated SRE role, as well as 5+ years of prior software development experience. Comprehensive understanding of large-scale distributed platform architecture. Extensive hands-on cloud experience, particularly More ❯
SiteReliability Engineer | Inside IR35 | Hybrid - 2 Days Onsite London | 6 Month Contract Our client a multinational and respected consultancy is hiring for a Lead SiteReliability Engineer with expertise in AWS and DevOps Tools for a new project in the Public Sector. Technical Skills/ More ❯
london, south east england, United Kingdom Hybrid / WFH Options
RP International
SiteReliability Engineer | Inside IR35 | Hybrid - 2 Days Onsite London | 6 Month Contract Our client a multinational and respected consultancy is hiring for a Lead SiteReliability Engineer with expertise in AWS and DevOps Tools for a new project in the Public Sector. Technical Skills/ More ❯
are seeking Senior Cloud & Application Security Engineers to help our client define and implement its cloud security strategy. If you're an experienced Security Engineering professional excited to work with cutting-edge technology and collaborate with diverse teams, we want to hear from you! Key Skills: Strong understanding of … manage multiple security projects effectively. Responsibilities Security Strategy: Define and execute cloud security strategy, partnering with platform and SiteReliabilityEngineering (SRE) teams to build robust infrastructure that supports our business. Perimeter Security: Establish platform perimeter security by implementing controls at ingress and egress points, including creating … security services, including certificate authorities, encryption services, insecure configuration scanners, and security control canaries. Key Requirements: Essential: 5+ years of experience in cloud security engineering, particularly with AWS and Azure, and at least 2+ years in software development. Desired: Ability to work independently, take initiative, and maintain a keen More ❯
are seeking Senior Cloud & Application Security Engineers to help our client define and implement its cloud security strategy. If you're an experienced Security Engineering professional excited to work with cutting-edge technology and collaborate with diverse teams, we want to hear from you! Key Skills: Strong understanding of … manage multiple security projects effectively. Responsibilities Security Strategy: Define and execute cloud security strategy, partnering with platform and SiteReliabilityEngineering (SRE) teams to build robust infrastructure that supports our business. Perimeter Security: Establish platform perimeter security by implementing controls at ingress and egress points, including creating … security services, including certificate authorities, encryption services, insecure configuration scanners, and security control canaries. Key Requirements: Essential: 5+ years of experience in cloud security engineering, particularly with AWS and Azure, and at least 2+ years in software development. Desired: Ability to work independently, take initiative, and maintain a keen More ❯
which pronouns you use (For example: she/her, he/him, they/them, etc). At Bumble, SiteReliability Engineers (SRE) are responsible for ensuring the reliability, scalability and performance of software systems while bridging the gap between development, security and operations. We proactively manage … infrastructure provisioning. Monitor system health and performance, identifying and fixing issues Respond to system outages, troubleshooting root causes and implementing preventative measures Collaborate with engineering teams and security engineers to improve system reliability, security and performance Participate in on-call rotations Create and maintain documentation to improve knowledge … must Proficiency in at least Python or Golang programming languages Experience with CI/CD pipelines Strong proficiency with Kubernetes architecture Prior experience in SRE, System administration or DevOps roles Strong proficiency with Linux/Unix operating systems, including hands-on experience in configuration and troubleshooting Proficiency with using Puppet More ❯
enable and empower industries at a global scale. About the Team: The global Production Operations group is integral to ensuring the operational stability and reliability of our … worldwide 24x7 on-premises and cloud environments. As the first line of defense this team has ownership of operations engineering. Collaborating closely with IT, SRE, Network, and Data engineering teams, and key stakeholders across business, product, and software engineering teams. We play a crucial role in maintaining systems … issues, providing both internal and external teams with technical support and ensuring the issue remains in custody until resolution. Collaborate with product and software engineering teams to relay operational insights and requirements. Automation, Tooling & Research Continuously identify opportunities for optimization and present findings to technical leads and management. Research More ❯
leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications. We are looking for an experienced Lead for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Lead SRE, you will be very hands-on technically while also directly … adheres to the highest security standards. They build essential security infrastructure and implement controls that reinforce the platform's security posture. This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large scale solutions. This team is deeply involved … implement, and manage cloud-native security tools and platforms for endpoint security, identity management (IAM), and CSPM Qualifications: Experience: 7+ years of experience in SRE, infrastructure engineering or similar role, with a strong focus on security work, with ideally 2+ years in a leadership or senior engineering role More ❯
A prestigious, technology-driven hedge fund is seeking a highly skilled SiteReliability Engineer (SRE) to join their global infrastructure team. This is a unique opportunity to work in a high-performance, low-latency trading environment where technology is at the heart of the firm’s competitive edge. … critical role in ensuring the performance, reliability, and scalability of the systems that power the fund’s trading and research platforms. As an SRE, you will work closely with software engineers and investment teams to build automation-first solutions that support the firm’s most advanced strategies. Key Responsibilities … across the business. Design and implement automation to eliminate manual tasks and reduce operational risk. Collaborate with software and investment teams to embed the SRE mindset early in the development lifecycle. Ideal Candidate: SRE with experience working with data systems Ability to program (structured, OOP, and TDD) using one or More ❯
A prestigious, technology-driven hedge fund is seeking a highly skilled SiteReliability Engineer (SRE) to join their global infrastructure team. This is a unique opportunity to work in a high-performance, low-latency trading environment where technology is at the heart of the firm’s competitive edge. … critical role in ensuring the performance, reliability, and scalability of the systems that power the fund’s trading and research platforms. As an SRE, you will work closely with software engineers and investment teams to build automation-first solutions that support the firm’s most advanced strategies. Key Responsibilities … across the business. Design and implement automation to eliminate manual tasks and reduce operational risk. Collaborate with software and investment teams to embed the SRE mindset early in the development lifecycle. Ideal Candidate: SRE with experience working with data systems Ability to program (structured, OOP, and TDD) using one or More ❯
the role: We are looking for a highly capable and experienced SiteReliability Engineer to join our growing tech team. As an SRE you will be a hands-on coach for the development teams maintaining and improving our solutions' reliability. You will be part of our DevOps team … but spend most of your time working closely with the engineering teams. Our ideal candidate will be passionate about best practices within technology teams, fully supportive of what the group is doing, and who wishes to make a difference. Responsibilities: Work with the development teams to build robust and More ❯