St. Louis, Missouri, United States Hybrid / WFH Options
Futran Tech Solutions Pvt. Ltd
Sr System Reliability Engineer (Automation) Location: St Louis, MO (Hybrid, 2 days onsite/week) Primary Skills: ITIL/ITSM, Linux, CI/CD pipelines, Jenkins, shell scripting, any monitoring tool (Splunk/Dynatrace preferred), troubleshooting Secondary Skills: REST API, Bitbucket, Maven, SQL Preferably a candidate with 5 to 7 years of experience. The Role: Plan, manage, and oversee … they are live by measuring and monitoring availability, latency, and overall system health. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Work with a global team spread across tech hubs in multiple geographies and time zones. Ability to share knowledge and explain processes and procedures to others. Able …
New York City (Manhattan), New York, United States Hybrid / WFH Options
CoreWeave
Senior Hardware Reliability Engineer, GPU & PCIe … exciting challenges in the industry. CoreWeave powers the creation and delivery of the intelligence that drives innovation. What You'll Do: CoreWeave is seeking a highly skilled and motivated Engineer, focusing on GPU and PCIe troubleshooting, to join our Hardware Engineering team, reporting to the Hardware Engineering Manager. In this role, you will play a crucial part in the … Partner with external vendors on failure analysis. Track component RMAs. Develop and maintain hardware/firmware management services. Automate all aspects of the server hardware lifecycle. Serve as the senior point of contact for hardware escalation and troubleshooting. Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture. Create and maintain accurate documentation of hardware designs …
the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description Sr. Site Reliability Engineer, Product Reliability Engineering - Middleware Shape the future of global payments while growing your career About The Role We're looking for a curious and innovative Product … Reliability Engineer to join our Middleware team at Visa. Here, you'll be part of a diverse group of problem-solvers who ensure billions of transactions flow seamlessly across the world's largest payment network. What You'll Do Design for Reliability: Architect and implement solutions that keep Visa's middleware services running with Always On availability … other Promotes work-life balance and sustainable on-call rotations Encourages innovation and experimentation Champions personal growth and career development Impact & Growth In this role, you'll: Shape the reliability standards for global payment systems Mentor and be mentored by talented engineers Drive automation and observability initiatives Influence architectural decisions Build systems that process billions in transactions This is …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Cambridge GaN Devices
Senior/Principal Reliability Engineer (Power Devices) Team - R&D Engineering Location - Cambridge Contract - Permanent (Full time or part time) About CGD Making Sustainable Power Electronics Possible A spin-out of Cambridge University, Cambridge GaN Devices (CGD) is a fabless semiconductor company that develops a range of energy-efficient GaN-based power devices to make greener … growth, and if the prospect of joining a friendly and game-changing company is for you, we want to hear from you. The Opportunity CGD is looking for a Senior/Principal Reliability Engineer to join our growing engineering team. The Senior/Principal Reliability Engineer will primarily be responsible for all reliability aspects of CGD's GaN power devices and ICs. This includes designing reliability experiments, analysing the results, and developing reliability models. The Senior/Principal Reliability Engineer will also develop new electrical reliability tests, specifically for GaN ICs, and gain insight into their reliability by collaborating with TCAD and multi …
San Francisco, California, United States Hybrid / WFH Options
Samsara
of a recently public company, you'll have the autonomy and support to make an impact as we build for the long term. About the role: Samsara's Hardware Reliability team enables an exceptional customer experience by delivering reliable hardware, identifying opportunities in product performance, and engaging cross-functionally to resolve key issues. Samsara's Senior Hardware Reliability Engineer will design quality processes that guarantee the high-quality standards of Samsara's hardware. The Reliability Engineer is responsible for implementing and executing comprehensive reliability strategies/plans that span the entire product development lifecycle from concept to warranty repair. The Reliability Engineer quickly aggregates and analyzes test, field performance, and manufacturing … role, you will: Work cross-functionally with partners in Hardware/Firmware Engineering, Operations, and Support to ensure our customers have a best-in-class product experience. Design in reliability through the product development process and own product reliability modeling, testing, and validation. Set reliability requirements and allocate reliability targets to subsystems. Lead product quality investigations …
California, Missouri, United States Hybrid / WFH Options
Cloudflare Inc
and Infrastructure teams, contributing to the evolution of database platform architecture, access control, and automation strategies. They will play a critical role in enhancing PostgreSQL infrastructure and improving database reliability, automation, and tooling. If you are passionate about PostgreSQL, automation, and database tooling, we'd love to hear from you! Role Responsibilities Build, deploy, and manage PostgreSQL databases in production … s enterprise customers at no cost. Athenian Project: In 2017, we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration. Since the project, we've provided services to more than 425 local government election websites …
About the Role: CrowdStrike is looking to hire a Senior Engineer to the Data Services team to help us take our database systems to the next level. We're looking for a highly technical, hands-on engineer who loves to work with data plane services like Cassandra, ElasticSearch, and Kafka, and is comfortable building automation around large …
Charlotte, North Carolina, United States Hybrid / WFH Options
Total Quality Logistics
General information Country USA State North Carolina City Charlotte Descriptions & requirements About the role: We are seeking a Senior Database Reliability Engineer (DBRE) to enhance the reliability of our critical database infrastructure using a software-engineering-based approach. This role will be a direct member of our Database Administration Team, acting as both an individual contributor … toil. Stay updated with emerging technologies and recommend enhancements. On-call, but working on ways to avoid on-call. What you need: 5+ years of experience in database administration, reliability engineering, or database engineering 5+ years of experience with scripting languages (Python, Bash, PowerShell) for automation Experience working in high-transaction, large-scale, 24/7/365 database …
Cincinnati, Ohio, United States Hybrid / WFH Options
Total Quality Logistics
General information Country USA State Ohio City Cincinnati Descriptions & requirements About the role: We are seeking a Senior Database Reliability Engineer (DBRE) to enhance the reliability of our critical database infrastructure using a software-engineering-based approach. This role will be a direct member of our Database Administration Team, acting as both an individual contributor as … toil. Stay updated with emerging technologies and recommend enhancements. On-call, but working on ways to avoid on-call. What you need: 5+ years of experience in database administration, reliability engineering, or database engineering 5+ years of experience with scripting languages (Python, Bash, PowerShell) for automation Experience working in high-transaction, large-scale, 24/7/365 database …
Tampa, Florida, United States Hybrid / WFH Options
Total Quality Logistics
General information Country USA State Florida City Tampa Descriptions & requirements About the role: We are seeking a Senior Database Reliability Engineer (DBRE) to enhance the reliability of our critical database infrastructure using a software-engineering-based approach. This role will be a direct member of our Database Administration Team, acting as both an individual contributor as … toil. Stay updated with emerging technologies and recommend enhancements. On-call, but working on ways to avoid on-call. What you need: 5+ years of experience in database administration, reliability engineering, or database engineering 5+ years of experience with scripting languages (Python, Bash, PowerShell) for automation Experience working in high-transaction, large-scale, 24/7/365 database …
Hawthorne, California, United States Hybrid / WFH Options
SpaceX
exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. HARDWARE RELIABILITY ENGINEER (AVIONICS) The Build Reliability team at SpaceX is an interdisciplinary team committed to the reliability of avionics hardware. The breadth of exposure will span across … solar, etc.) as you collaborate with other core groups at SpaceX, supporting the hardware from design to production and operation. You use data to understand the factors that drive reliability and support quality objectives in design and development teams to drive improvements in reliability in manufacturing, testing, and flight. You can dive deep into the details to identify … to execute on the mission to make humanity interplanetary. RESPONSIBILITIES: Responsible partner with core electrical design engineering and avionics manufacturing teams through product development and production to improve product reliability by applying best practices to designs and establishing repeatable process control Support new product development by prototyping early development hardware to learn the design limitations through a HASS style …
Mountain View, California, United States Hybrid / WFH Options
LinkedIn
Flink, and GraphQL - come join our infrastructure teams and share the knowledge with a broader community while making a real impact within our company. As a Sr. Staff Software Engineer, you will be a key technical leader and role model within the organization. We are looking for a technical lead who designs and develops technology to serve business and … distributed systems related to data infrastructure, IaaS, ML/AI infrastructure, storage, graph, stream-processing, Pub/Sub systems, Kubernetes, and platforms. Suggested Skills: -Distributed Systems -Technical Leadership -Infrastructure Reliability -Systems Infrastructure LinkedIn is committed to fair and equitable compensation practices. The pay range for this role is $191,000 to $315,000. Actual compensation packages are based on …