London, England, United Kingdom Hybrid / WFH Options
Free-Work UK
Posted 2 days ago. Staff Database Reliability Engineer/DRE (MySQL, NoSQL, RDBMS), London/WFH, to £120k. Do you have experience of working on highly scalable, data intensive systems … platforms with 176 million active users and ambitious growth plans to become the world's number one music streaming service. As a Staff Database Reliability Engineer/DRE you will collaborate with software engineering teams, acting as an internal consultant to provide expertise on databases, helping to choose …
London, England, United Kingdom Hybrid / WFH Options
Proton
Systems & Reliability Engineer (VPN). Posted 1 week ago. Join Proton … Our infrastructure spans thousands of servers worldwide, operating on a distributed architecture that connects our suite of applications across platforms. As a Systems and Reliability Engineer on the VPN Infrastructure team, among several areas, you'll be at the forefront of our battle against censorship regimes worldwide. You … need privacy and security most. What you will do: Infrastructure Management: Oversee and optimize our rapidly expanding global network of thousands of servers. Automation & Reliability: Design and implement automation systems that ensure 99.95%+ uptime across our infrastructure. Problem Solving: Serve as an active member of the on-call …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Cambridge GaN Devices
Senior/Principal Reliability Engineer (Power Devices). Team: R&D Engineering. Location: Cambridge. Contract: Permanent (full time or part time). About CGD: Making Sustainable Power Electronics Possible. A spin-out of Cambridge University, Cambridge GaN Devices (CGD) is a fabless semiconductor company that develops a range of … friendly and game-changing company is for you, we want to hear from you. The Opportunity: CGD is looking for a Senior/Principal Reliability Engineer to join our growing engineering team. The Senior/Principal Reliability Engineer will primarily be responsible for all reliability aspects of CGD's GaN power devices and ICs. This includes designing the reliability experiments, analysing the results, and developing reliability models. The Senior/Principal Reliability Engineer will also develop new electrical reliability tests, specifically for GaN ICs, and gain insight into …
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description: Staff Database Reliability Engineer/DRE (MySQL, NoSQL, RDBMS), London/WFH, to £120k. Do you have experience of working on highly scalable, data intensive systems? You could be progressing your career at one of Europe's fastest growing audio content platforms with 176 million active … users and ambitious growth plans to become the world's number one music streaming service. As a Staff Database Reliability Engineer/DRE you will collaborate with software engineering teams, acting as an internal consultant to provide expertise on databases, helping to choose which to use for a … seriously cool and offer a sociable environment with a range of perks and experiences. About you: You have experience in a similar role (Database Reliability Engineer (DRE) or Site Reliability Engineer (SRE) position). You have a deep knowledge of MySQL including on-prem and cloud tooling e.g. …
London, England, United Kingdom Hybrid / WFH Options
CrowdStrike Holdings, Inc
Sr. Software Engineer - Reliability, Ireland (Remote). Locations: Ireland - Remote; United Kingdom - London. Time type: Full time. Posted: Today. Job requisition ID: R20922. As a global leader in cybersecurity, CrowdStrike protects the people, processes and … data as we continue to experience dramatic growth in the usage of our products. What You'll Do: Develop and maintain services to meet reliability and scalability demands. Develop and enhance monitoring services. Brainstorm, define, and build collaboratively with members across multiple teams. Obsess about learning, and champion the …
Localize is seeking a Platform Reliability Engineer to join our growing engineering team. As Localize expands, the scalability, reliability, and performance of our infrastructure and applications have become paramount. This role is dedicated to overseeing and managing all aspects of Localize's technical infrastructure, databases, software tools … and to implementing systems for effective monitoring, alerting, and maintenance. You will be responsible for the scalability, stability, reliability, and performance of the Localize platform. This role will also support DevOps and enhance systems used by the engineering team to improve productivity. Key Responsibilities: Oversee and manage Localize's … infrastructure across AWS and Cloudflare. Ensure the scalability, reliability, performance, and security of Localize's data stores, specifically Redis and MongoDB, through effective configuration, monitoring, query optimization, and backup management. Oversee and automate the deployment process. Own and improve monitoring of uptime and performance using tools such as Bugsnag, Datadog …
London, England, United Kingdom Hybrid / WFH Options
Wayve Technologies Ltd
The role: We're on the lookout for a Site Reliability Engineer (SRE) with a thirst for innovation and a desire to establish Operational Excellence and best practices. You'll be instrumental in fortifying the backbone of our AI-driven autonomous vehicles, ensuring they're robust, resilient, and … up for success as an SRE at Wayve, we’re looking for the following skills and experience. Essential: Over 8 years' experience in Site Reliability Engineering or a similar role, especially in a production environment. Expertise in Python, C++, or Rust, with a solid foundation in cloud computing platforms …
Staff Software Engineer, AI Reliability Engineering. London, UK. About Anthropic: Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed … researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: Anthropic is seeking talented and experienced Reliability Engineers, including Software Engineers and Systems Engineers with experience and interest in reliability, to join our team. We will be defining and achieving reliability metrics for all of Anthropic's internal and external products and services. While significantly improving reliability for Anthropic's services, we plan to use the developing capabilities of modern AI models to reengineer the way we work. This team will be a critical part of Anthropic's mission …
Mountain View, California, United States Hybrid / WFH Options
LinkedIn
… our infrastructure teams and share the knowledge with a broader community while making a real impact within our company. As a Sr. Staff Software Engineer, you will be a key technical leader and role model within the organization. We are looking for a technical lead who designs and develops … infrastructure, IaaS, ML/AI infrastructure, storage, graph, stream-processing, Pub/Sub systems, Kubernetes, and platforms. Suggested Skills: Distributed Systems, Technical Leadership, Infrastructure Reliability, Systems Infrastructure. LinkedIn is committed to fair and equitable compensation practices. The pay range for this role is $191,000 to $315,000. Actual …
We are looking for a motivated and tech-savvy Reliability Support Engineer ready to join our B2B SaaS company and become the wizard that makes any platform issue disappear. With around 100 employees, our team is mainly based in Malaga, Spain, but we are incorporated in the US …
About the Role: CrowdStrike is looking to hire a Senior Engineer for the Data Services team to help us take our database systems to the next level. We're looking for a highly technical, hands-on engineer who loves to work with data plane services like Cassandra, ElasticSearch …
About the Role: CrowdStrike is looking to hire an experienced Engineer for the Data Services team to help us take our database systems to the next level. We're looking for a highly technical, hands-on engineer who loves to work with data plane services like Cassandra, ElasticSearch …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
… will power billions of devices. We are looking for another hands-on safety architect to join the team. Are you a forward-thinking, passionate engineer motivated by the challenge of shaping and deploying ground-breaking technology? If so, we look forward to learning more about you! Job Description: As … a safety architect you will conduct investigations to understand the safety/reliability performance and limitations of both existing technologies and new ideas. Features or methodologies which show promise will be adopted for forthcoming CPU development projects. You will describe the results of your investigations and support the CPU … developing or verifying hardware using Verilog and associated technologies. In Return: You will work directly with talented engineers across the company on safety and reliability solutions for next-generation hardware. Your work will have a direct impact on our bottom line and the ability to deliver improvements for our …
London, England, United Kingdom Hybrid / WFH Options
CoreWeave
… will be expected to attend onboarding training at our US Headquarters for up to 2 weeks within their first month of employment. The Fleet Reliability Operations team is responsible for the day-to-day provisioning, management and uptime of CoreWeave’s ever-expanding fleet of server nodes. Playing a … entropy to maximize the number of nodes CoreWeave can deliver to customers. We seek curious, creative, and persistent problem solvers to join our Fleet Reliability Operations team. This individual will join a team of committed engineers working to deploy nodes as fast as they can be racked and turned …