Insight Global is looking for an Operations Site Reliability Engineer to help with global operational support for a leading infrastructure software product company’s customer-facing SaaS products. You will be part of a team of engineers that demonstrates superb technical competency, operates mission-critical infrastructure and ensures the highest levels of availability (24x7x365), performance and security. This More ❯
Als Site Reliability Engineer bij Conclusion Mission Critical beheer je en bouw je mee aan IT-oplossingen voor klanten met een grote impact op de maatschappij Even opletten, Site Reliability Engineer. Bij Conclusion Mission Critical krijg je een baan met inhoud. Werk dat ertoe doet, zoals: Zorgen voor stabiele en veilige IT-platformen, zodat softwareteams zich kunnen … slimme IT-oplossingen. Wat krijg je van ons In deze rol krijg je een contract voor minimaal 32 uur per week. Omdat wij erop vertrouwen dat jij uitblinkt als Site Reliability Engineer bieden we daarbovenop (op basis van een fulltime dienstverband): Een salaris dat past bij je kennis en ervaring; Een bonusregeling; Direct een vast contract; Een leaseauto More ❯
Halian Technology looking for a talented and driven Site Reliability Engineer (SRE) to join our growing technology team. In this role, youll ensure the reliability, scalability, and performance of our digital platforms that support memorable customer experiences across the hospitality sector. Youll work alongside our engineering, product, and infrastructure teams to build high-availability systems and automated operations … and participate in on-call rotations. Track and improve key SRE metrics such as error rates, incident count, and monitoring coverage. What Youll Bring: 3+ years of experience in Site Reliability Engineering, DevOps, or equivalent roles. Strong skills in cloud-based infrastructure (Azure or AWS) using IaC practices. Hands-on experience building and managing CI/CD pipelines and More ❯
As a Site Reliability Engineer at Conclusion Mission Critical, you manage and help build IT solutions for clients that have a major impact on society. Just a heads-up, DevOps Engineer: at Conclusion Mission Critical, your role makes a real impact. Work that truly matters, like: Ensuring stable and secure IT platforms so that software teams can … the energy domain. What we offer In this role, you'll receive a contract for at least 32 hours per week. Because we trust you to excel as a Site Reliability Engineer, we also offer (based on a full-time contract): A salary that matches your skills and experience; A bonus scheme; A permanent contract from day one More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled Site Reliability Engineer with a strong focus on Google Cloud Platform (GCP More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled Site Reliability Engineer with a strong focus on Google Cloud Platform (GCP More ❯
Social network you want to login/join with: Site Reliability Engineer (Datadog), London col-narrow-left Client: JLL Location: London, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: 148a2230002a Job Views: 28 Posted: 17.06.2025 Expiry Date: 01.08.2025 col-wide Job Description: JLL empowers you to shape a brighter way . … you’re looking to apply your relevant experience to a new industry, join our team as we help shape a brighter way forward. JLL is looking for an Observability Engineer to assist in the support and administration of the Datadog monitoring platform. The focus of this role will be ensuring the reliability, scalability and efficiency of Datadog for monitoring … across the organization. Configure AIOps capabilities to deliver noise reduction, event correlation and enhanced Root Cause Analysis. Experience & Education Minimum 5 years of experience working as a Observability/Site Reliability Engineer supporting Network, Infrastructure and applications. Previous experience developing ansible scripts to support automation. Previous experience supporting and administrating an AIOps platform (i.e. Watchdog, Moogsoft, Big Panda More ❯
product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the internet. As a Senior Site Reliability Engineer on Reddit’s Infrastructure SRE team, you’ll use your knowledge of distributed systems and architecture to improve the reliability and performance of Reddit’s engineering … services, which are used by Reddit engineering teams to build, deploy, and operate Reddit. Deliver software to improve the availability, scalability, latency, and efficiency of observability components. Identify and engineer away risk across Reddit’s systems. Automate : Take repetitive, manual, or risky tasks and automate them out of existence. Build tools and integrate systems to support Reddit’s evolution. … performance, reduce cost, and improve the experience for millions of users Contribute upstream changes to the open source projects we use Qualifications 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role. Proficiency in one or more programming languages. We’re predominantly writing code in Go and Python. Experience with Kubernetes and Cloud More ❯
As a Site Reliability Engineer (SRE) , you'll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the highest levels of government. If you think you can see yourself delivering our mission and pursuing our goals with us, then check out the job description below! What you'll do: Monitor platform … with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation. Experience with container orchestration tools (Rancher, OpenShift, etc.) Willing to work in downtown Washington, DC on client site at least 3 days per week. A Bachelor's degree and an active TS SCI clearance. SALARY RANGE: $139,000 - $221,000 The salary range for this position is More ❯
The Site Reliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our platform is built on Google Kubernetes Engine (GKE) and utilizes several other … discussing performance analysis, cost analysis, and operational metrics Preferred Qualifications Experience designing, analyzing, and troubleshooting distributed systems Experience maintaining Kubernetes clusters in a production environment Previous experience as a Site Reliability Engineer, DevOps Engineer, or similar role Pendo was founded in 2013 by former product managers, who combined their heads and hearts to build something they wanted More ❯
eager to expand their skills while working on an exciting new venture. Your work will have a significant impact on our company, clients, and business partners worldwide. As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology - Market Risk, you will address complex business problems with simple solutions. Using code and cloud infrastructure, you will More ❯
Principal Site Reliability Engineer iwoca London, United Kingdom Apply now Posted 6 days ago Hybrid Job Permanent Competitive Principal Site Reliability Engineer - Core Systems Hybrid in London or Remote within the UK The company Imagine a world where every small business has the power to thrive. That's the world we're building at iwoca. Small … with Kubernetes, PostgreSQL hosted in AWS RDS, and Snowflake. A track record of shaping incident processes, on-call practices, or sharing reliability ownership across multiple teams. Deep understanding of site reliability principles and applying them to databases, including observability and limiting the impact of long-running or resource-heavy queries. Experience with infrastructure automation, like setting up monitoring and More ❯
and future states of the organisation and make faster, more informed decisions. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney. Role: Principal Site Reliability Engineer You will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure. You will collaborate across product, platform, and operations More ❯
Social network you want to login/join with: Senior Site Reliability Engineer, London col-narrow-left Client: MongoDB Location: London, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: eddbd57b71de Job Views: 3 Posted: 17.06.2025 Expiry Date: 01.08.2025 col-wide Job Description: MongoDB’s mission is to empower innovators to create … growing product, MongoDB Realm- our serverless platform offering that allows developers to build apps on MongoDB without managing any infrastructure, and our newest offering, Atlas Data Lake. The Cloud Site Reliability Engineering Team designs and builds the global infrastructure on which we deploy our services. As our customers grow and globalize, our services must satisfy demands for low-latency More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
Snr. Site Reliability Engineer (Remote position located in Leeds/Sheffield, United Kingdom) Sheffield, United Kingdom About KnowBe4 KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to manage the ongoing problem of social engineering by helping them … person, we strive to make every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4. KnowBe4’s Site Reliability Engineers help ensure that our platforms are reliable, secure, scalable, and efficient. They work alongside other engineers in a fast-paced, agile development environment, and share solutions to … Airbrake Cloud-native infrastructure in AWS - ECS, Lambda, Step Functions, SNS/SQS, Transit Gateway, Aurora, DynamoDB, CloudFront, S3, AppSync, API Gateway, and many more. Responsibilities: Work with other Site Reliability Engineers to build highly scalable and resilient applications and infrastructure in AWS Maintain and improve extensible infrastructure-as-code using Terraform Learn, maintain, and improve our existing deployment More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
Snr. Site Reliability Engineer (Remote position located in Leeds/Sheffield, United Kingdom) Sheffield, United Kingdom About KnowBe4 KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to manage the ongoing problem of social engineering by helping them … person, we strive to make every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4. KnowBe4’s Site Reliability Engineers help ensure that our platforms are reliable, secure, scalable, and efficient. They work alongside other engineers in a fast-paced, agile development environment, and share solutions to … Airbrake Cloud-native infrastructure in AWS - ECS, Lambda, Step Functions, SNS/SQS, Transit Gateway, Aurora, DynamoDB, CloudFront, S3, AppSync, API Gateway, and many more. Responsibilities: Work with other Site Reliability Engineers to build highly scalable and resilient applications and infrastructure in AWS Maintain and improve extensible infrastructure-as-code using Terraform Learn, maintain, and improve our existing deployment More ❯
your best. Be a part of a company that is part of the community; driven to improve our future and protect our freedom. We are looking for experienced Systems Engineer/Site Reliability Engineer (SRE) to join our technology-based program supporting a key Government customer. The Systems Engineer/SRE provides subject expertise and guidance … the development, testing, and implementation of technical solutions. Determining whether technical solutions meet defined requirements. The SRE may also provide Agile DevOps support to mission critical systems. The Systems Engineer/SRE may have the opportunity to build strong systems, software, and cloud environments and provide operations and maintenance for critical systems. The candidate will provide technical expertise and … of projects through all aspects of the software development lifecycle including scope and work estimation, architecture and design, coding, and unit testing. ABC Required Education, Experience, & Skills The Systems Engineer will support the team in the following activities (including but not limited to): Ensuring reliability, getting systems back to steady-state as quickly as possible Eliminating toil, automating wherever More ❯
HQ or the wider global organisation, you'll be a part of collaborative, high-performing teams, creating cutting-edge software, platforms, and infrastructure. The Role Join us as a Site Reliability Engineer and help us build the future of data sovereignty! We're seeking an SRE passionate about creating high-performance, scalable, and reliable services for our production … implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the Acra platform for high availability and fault tolerance. This includes ensuring resilience against Cloud Availability Zone outages and the ability to gracefully handle node failures. Guarantee 99.9% uptime More ❯
Principal Site Reliability Engineer - Core Systems Hybrid in London or Remote within the UK The company Imagine a world where every small business has the power to thrive. That's the world we're building at iwoca. Small businesses aren't just statistics - they're the heartbeat of our communities, the character of our high streets, and the … with Kubernetes, PostgreSQL hosted in AWS RDS, and Snowflake. A track record of shaping incident processes, on-call practices, or sharing reliability ownership across multiple teams. Deep understanding of site reliability principles and applying them to databases, including observability and limiting the impact of long-running or resource-heavy queries. Experience with infrastructure automation, like setting up monitoring and More ❯
Indianapolis, Indiana, United States Hybrid / WFH Options
Eli Lilly and Company
around the world. Come tackle complex challenges and ensure the reliability of critical applications to help patients! Lilly's Software Product Engineering team is actively looking for a Lead Site Reliability Engineer (SRE). Are you ready to own and complete complicated and technically ambitious tasks? Are you passionate about technology with extensive experience in observability, AWS, Kubernetes … everyone is informed about the status of applications and any incidents that occur. Provide guidance and training to junior team members, helping them develop their skills and knowledge in site reliability engineering. Stay current with industry trends and emerging technologies and be willing to adapt and innovate to improve application reliability and performance. Always prioritize the needs and experiences More ❯
learners in more than 175 countries. Every Preply lesson sparks change, fuels ambition, and drives progress that matters. Meet the team! As a member of the Platform tribe, the Site Reliability Engineer (SRE) at Preply combines software development, infrastructure operations and business skills to run a large-scale, fault-tolerant, global language education platform. The SRE ensures that More ❯
London, England, United Kingdom Hybrid / WFH Options
Sporty
AWS, Kubernetes, Docker, Prometheus, Grafana, Security, Python etc You should apply if you have 4+ years experience in a SRE or DevOps position, or if you're a Software Engineer looking to transition then that's also great! You're a veteran in AWS technologies Experience deploying and releasing into new regions You've managed multiple Kubernetes clusters in More ❯
top AI computing platform. We equip engineers with the tools to deploy AI that is fast, secure, affordable, and built to scale. Whether they need powerhouse GPU hardware on-site or the flexibility of cloud-based solutions, we've got the horsepower to make it happen. Lambda's AI Cloud has been adopted by the world's leading companies More ❯
Wheely is not a traditional ride-hailing company. We are building a platform with user privacy at its core while successfully scaling a five-star service to millions of rides across multiple cities. Over the past few years, we have More ❯
Want a fast-paced, rewarding career at a fast-growing, global tech company? Luminance is a young AI company that is growing rapidly: today, Luminance's technology is helping over 600 customers in 70 countries globally. With ambitious growth plans More ❯