ESPN and ESPN+, and much more. Innovation: We develop and execute groundbreaking products and techniques that shape industry norms and enhance how audiences experience sports, entertainment & news. The Data Reliability Engineering team for Disney's Product and Data Engineering team is responsible for maintaining and improving the reliability of Disney Entertainment's big data platform, which processes hundreds … of terabytes of data and billions of events daily. Job Summary: The Senior Software Engineer will help us in the ongoing mission of delivering outstanding services to our users allowing Disney Entertainment to be more data-driven. You will work closely with our partner teams to monitor and drive improvements for reliability and observability of their critical … be required to build high quality data models and products that monitor and reports on data pipeline health and data quality. Work closely with all members of the Data Reliability Engineering team to set project deliverables, review design documents, perform code reviews and help mentor junior members of the team. Collaborate with engineering teams to improve, maintain, performance tune More ❯
The teams' primary responsibilities are the enablement of new technology or modules and to provide engineering support working with the foundry. The team helps establishing design rules, design manual, reliability and qualification test vehicles, process assumptions, computational lithography, process tolerances, wafer acceptance criteria, patterning solutions, tool selection, process development for new structures and materials. Additionally, the team analyzes inline … data and generates appropriate engineering line controls. The PIC ReliabilityEngineer will support the development and qualification of new silicon photonic and electronic products in a Foundry environment. The PIC ReliabilityEngineer will work on completing reliability tasks, creating and maintaining qualification plans, process and design improvements, supporting tape-outs and design for reliability rules revisioning. Responsibilities: Own reliability processes for photonic semiconductor components, implement changes to the design rules for reliability improvements. Maintain/write reliability plans and documents. Work with test and foundry reliability teams to complete reliability testing and qualifications. Review and assess reliability reports and plans with design team and Foundry. Drive interactions More ❯
The teams' primary responsibilities are the enablement of new technology or modules and to provide engineering support working with the foundry. The team helps establishing design rules, design manual, reliability and qualification test vehicles, process assumptions, computational lithography, process tolerances, wafer acceptance criteria, patterning solutions, tool selection, process development for new structures and materials. Additionally, the team analyzes inline … data and generates appropriate engineering line controls. The PIC ReliabilityEngineer will support the development and qualification of new silicon photonic and electronic products in a Foundry environment. The PIC ReliabilityEngineer will work on completing reliability tasks, creating and maintaining qualification plans, process and design improvements, supporting tape-outs and design for reliability rules revisioning. Responsibilities: Own reliability processes for photonic semiconductor components, implement changes to the design rules for reliability improvements. Maintain/write reliability plans and documents. Work with test and foundry reliability teams to complete reliability testing and qualifications. Review and assess reliability reports and plans with design team and Foundry. Drive interactions More ❯
SeniorReliability Sustainment Systems Engineer Job Category: Engineering Time Type: Full time Minimum Clearance Required to Start: None Employee Type: Regular Percentage of Travel Required: Up to 10% Type of Travel: Continental US The Opportunity: CACI is seeking a highly motivated SeniorReliability Sustainment Engineer to join our team supporting NavalX within the Office … possess strong interpersonal and communication skills to build trusted relationships with stakeholders and ensure the delivery of high-quality technical support within a cost-constrained environment. Responsibilities: As a SeniorReliability Sustainment Engineer, you will develop and implement strategies to enhance the reliability and maintainability of systems throughout their lifecycle. Key duties include: Analyzing performance and … failure data to identify reliability improvement opportunities. Developing and executing preventive maintenance strategies and reliability improvement plans. Conducting root cause analysis (RCA) and recommending corrective actions. Creating and maintaining reliability models and forecasts for new and legacy systems. Collaborating with design, manufacturing, and sustainment teams to integrate reliability best practices. Leading reliability-focused training for More ❯
Req ID: 123583 Region: Americas Country: United States State/Province: Texas City: Richardson Summary The SeniorReliabilityEngineer, works in cross functional teams with designers, customers and manufacturing engineering and project leaders to ensure products designed can meet reliability specifications. Define the reliability testing strategy, reliability test plan and conduct tests. Complete a … stress based MTBF analysis of products, thus providing initial reliability predictions. Detailed Description Performs tasks such as, but not limited to, the following: Lead reliability analysis and work with the design teams to ensure the products designed can meet reliability specification. Work with the design team and customers to understand and define reliability requirements of products. … Conduct design review and close loops with the design team to fix design issues. Define the reliability testing strategy, reliability test plan and conduct test. Complete a stress based MTBF analysis of the product, thus providing initial reliability predictions. Coordinate third-party performance and regulatory testing. Train relevant manufacturing partner personnel to conduct the test. Perform failure More ❯
Job Title: Site ReliabilityEngineer (SRE) Location: Dallas, Austin, or Omaha (Onsite) Compensation: $55-$60 per hour (W2/Visa) Job Summary: Charles Schwab is seeking a Site ReliabilityEngineer (SRE) to support our industry-leading Order Management System (OMS) within a highly performant and scalable compute environment. In this role, you will be responsible for … work across multiple teams. Ability to work effectively both independently and in a team environment. Detail-oriented with a high sense of urgency. Experience with incident response and service reliability best practices. More ❯
key performance indicators (KPIs), and make data-driven decisions to enhance process effectiveness Adept at creating detailed reports, presentations, and summaries tailored to various audiences, from technical teams to senior leadership. Excellent verbal and written communication abilities to clearly convey complex technical information to both technical and non-technical stakeholders. Experience in leading change management initiatives within an organization. … high-tech manufacturing industries. Project management certification (e.g., PMP) or experience leading projects from initiation to completion. JOB DESCRIPTION HM will also consider more experienced candidates as well The ReliabilityEngineer will assume the role of Process Manager and be responsible for overseeing and optimizing the Issues Triage and Investigation (ITI) processes, which directs our issues handling and … between cross-functional teams, and ensuring that the process effectively facilitate the resolution of issues. The Process Manager will work closely with teams such as Fleet Management, Quality, Operations, Reliability Testing and Field Service to drive continuous improvements. Key Responsibilities: Process Development & Optimization: Design, implement, and continually refine the ITI processes and cross-functional communication. Identify bottlenecks, inefficiencies, and More ❯
New York City (Manhattan), New York, United States Hybrid / WFH Options
CoreWeave
Senior Hardware ReliabilityEngineer, GPU & PCIe Join to apply for the Senior Hardware ReliabilityEngineer, GPU & PCIe role at CoreWeave Senior Hardware ReliabilityEngineer, GPU & PCIe 1 week ago Be among the first 25 applicants Join to apply for the Senior Hardware ReliabilityEngineer, GPU & PCIe role at … exciting challenges in the industry. CoreWeave powers the creation and delivery of the intelligence that drives innovation. What You'll Do CoreWeave is seeking a highly skilled and motivated Engineer, focusing on GPU and PCIe troubleshooting, to join our Hardware Engineering team, reporting to the Hardware Engineering Manager. In this role, you will play a crucial part in the … Partner with external vendors on failure analysis Track component RMAs Develop and maintain hardware/firmware management services. Automate all aspects of the server hardware lifecycle. Serve as the senior point of contact for hardware escalation and troubleshooting. Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture. Create and maintain accurate documentation of hardware designs More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Cambridge GaN Devices
Senior/Principal ReliabilityEngineer (Power Devices) Team - R&D Engineering Location - Cambridge Contract - Permanent (Full time or part time) About CGD Making Sustainable Power Electronics Possible A spin-out of the Cambridge University, Cambridge GaN Devices (CGD) is a fabless semiconductor company that develops a range of energy-efficient GaN-based power devices to make greener … growth, and if the prospect of joining a friendly and game-changing company is for you, we want to hear from you. The Opportunity CGD is looking for a Senior/Principal ReliabilityEngineer to join our growing engineering team. The Senior/Principal ReliabilityEngineer will primarily be responsible for all reliability aspects of CGD's GaN power devices and ICs. This includes designing the reliability experiments as well as analysis and developing reliability models. The Senior/Principal ReliabilityEngineer will also develop new electrical reliability tests, specifically for GaN ICs, and gain insight into their reliability by collaborating with TCAD and multi More ❯
Senior Service ReliabilityEngineer - Apple Data Platform Austin, Texas, United States Software and Services Summary Posted: Dec 03, 2024 Role Number: The Apple Services Engineering team (ASE) is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple … bigger part of Apple's business than ever before, these teams remain small, and multi-functional, offering greater exposure to the array of opportunities here. Description As a Service ReliabilityEngineer, you will be responsible for providing the platform for mission critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to … RPC API and services using Golang OR Python. Minimum Qualifications Bachelor's Degree in Computer Science, an engineering-related field, or equivalent related experience. 5+ years in a Service Reliability Engineering, DevOps, or Infrastructure focused role Must be an expert and have in-depth professional experience working with Kubernetes and AWS Preferred Qualifications Experience working on multiple cloud environment More ❯
Overview Location Requirements: This role is Remote or Hybrid to one of our offices: Austin, Naperville or Wayne. Frontline has a dynamic career growth opportunity for a Senior Database ReliabilityEngineer (DBRE) to help develop, build and support efficient database solutions for our custom applications. Acting as a key member of the Data Operations Team, this individual … the need arises. This individual will proactively identify opportunities for automation, improved efficiency, and elimination of redundant processes on all levels of operational and development-focused database work. The Senior Database ReliabilityEngineer takes pride in their work and has a sense of ownership for the systems that they support. We have a great work environment here More ❯
NVIDIA and the Artificial Intelligence industry. As the computational power increases with every GPU generation, developing efficient and reliable systems is an imperative. We are looking for a System ReliabilityEngineer to join NVIDIA's existing Reliability Engineering team, involved in NVIDIA's diverse system product range specifically Graphics and High-Performance Computing printed circuit boards and … Data Center Servers. What you'll be doing: Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from Concept to End-of-Life phase. Establish, deliver and maintain product reliability standards and metrics for NVIDIA's new system technologies, using existing tools and processes or developing new as required. Participate in … product and engineering design reviews, assess the reliability budget of products/designs, and inspire changes that enhance product reliability. Interface and interact with all pertinent engineering groups, suppliers, and partners ensuring the desired reliability is achieved using Design for Reliability (DfR) methods including FMEA and DoE approaches. Define and implement Reliability Plans & Specifications. Provide reliabilityMore ❯
San Francisco, California, United States Hybrid / WFH Options
Samsara
of a recently public company, you'll have the autonomy and support to make an impact as we build for the long term. About the role: Samsara's Hardware Reliability team enables an exceptional customer experience by enabling reliable hardware, identifying opportunities in product performance, and engaging cross-functionally to resolve key issues. Samsara's Senior Hardware ReliabilityEngineer will design quality processes that guarantee the high-quality standards of Samsara's hardware. The Reliabilityengineer is responsible for implementing and executing comprehensive reliability strategies/plans that span the entire product development lifecycle from concept to warranty repair. The Reliabilityengineer quickly aggregates and analyzes test, field performance, and manufacturing … role, you will: Work cross-functionally with partners in Hardware/Firmware Engineering, Operations, and Support to ensure our customers have a best-in-class product experience. Design in reliability through the product development process and own product reliability modeling, testing, and validation. Set reliability requirements and allocate reliability targets to subsystems. Lead product quality investigations More ❯
world where every person is free to move and pursue their dreams. In this position In this highly interdisciplinary role, you will work with multiple teams and help set reliability targets at the system and subsystem level. You will oversee the design validation test campaigns throughout different stages of vehicle development from proof of concept to the launch. You … to support an entire vehicle. Qualifications You'll have B.S. in Mechanical, Electrical, or Chemical Engineering, Material Science, applied Physics or Chemistry with 5+ years of relevant experience in reliability engineering, test design, and failure analysis Advanced knowledge of reliability engineering mathematics including but not limited to: different statistical distributions used in reliability analyses, reliability requirement … campaigns Proven ability to operate effectively in a fast-paced environment Even better, you may have M.S. or Ph.D. in Mechanical or Electrical Engineering Automotive experience or coursework in Reliability Engineering Experience with Microsoft Office, Confluence, Jira, instrumentation, and measurement of powertrain systems Strong communication skills Software skills: Python, Cloud Computing (Google Cloud, AWS, Azure), SQL, Power BI, Tableau More ❯
world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ THE ROLE: Join our passionate Reliability team and play a critical role in ensuring the highest quality of AMD products for our customers. You'll tackle challenging tasks, leveraging your expertise to develop, design, and … implement innovative software and product reliability test regimens. Get hands-on with the latest technologies that power our hyper-scale cloud services, contributing to a cutting-edge environment with a direct impact on the user experience. This is a unique opportunity to leverage your knowledge of computer systems and hardware to contribute to the development of cutting-edge hyperscale … cloud services. You'll collaborate with program management, vendors, and design engineers, acting as the internal consultant on all reliability matters. This is more than just a job; it's a chance to make a real impact on the future of cloud computing. AMD's Data Center Platform Engineering Group is designing, building and delivering innovative technology infrastructure enabling More ❯
Job Description: Job Title: ReliabilityEngineer, Power Electronics Location : Fremont, CA Job Type : Full-Time (New-Hire) Job Level: SeniorEngineer Job Description: We are seeking a skilled and detail-oriented Reliability (DfR) Engineer to join our team. This role will focus on ensuring the long-term reliability, durability, and performance of Power … Electronic products. In this role, you will be responsible for representing reliability during the design, development and deployment lifecycle of our products. You will work closely with the design and validation teams to optimize product life and performance under real-world conditions. Key Responsibilities: Design for Reliability: Implement Design for Reliability (DFR) methodologies to proactively identify and … mitigate potential failure modes in power electronic systems. Ensure designs meet industry standards for thermal, electrical, and mechanical performance. Product Requirements : Interpret product and system level requirements. Help define reliability requirements early in the design process and ensure that the product's use case, operation and environment are sufficiently specified. Risk Analysis : Use first principles approach to identify risk More ❯
whatever gives you purpose. We're united by purpose, inspired by you. About the Team: The Database Administration (DBA) team at UKG plays a critical role in ensuring the reliability, performance, security, and availability of our diverse suite of products and services. We are a collaborative and dynamic team responsible for managing and optimizing various database systems, with a … strong emphasis on PostgreSQL, across different environments and product lines. We embrace automation, infrastructure-as-code principles, and a proactive approach to database management. About the Role: As a Senior Cloud Database Engineer, you will be a leader in the design, implementation, and management of sophisticated cloud database systems, with a primary focus on PostgreSQL and/or … forefront of industry trends and technologies in cloud database management, driving innovation and recommending strategic improvements. About You: Basic Qualifications: 4-7+ years of experience as a Database Engineer or similar role, with a focus on cloud environments. 4-7+ years of experience with PostgreSQL and\or MSSQL database architecture, implementation, and management. Advanced proficiency in Python More ❯
whatever gives you purpose. We're united by purpose, inspired by you. About the Team: The Database Administration (DBA) team at UKG plays a critical role in ensuring the reliability, performance, security, and availability of our diverse suite of products and services. We are a collaborative and dynamic team responsible for managing and optimizing various database systems, with a … strong emphasis on PostgreSQL, across different environments and product lines. We embrace automation, infrastructure-as-code principles, and a proactive approach to database management. About the Role: As a Senior Cloud Database Engineer, you will be a leader in the design, implementation, and management of sophisticated cloud database systems, with a primary focus on PostgreSQL and/or … forefront of industry trends and technologies in cloud database management, driving innovation and recommending strategic improvements. About You: Basic Qualifications: 4-7+ years of experience as a Database Engineer or similar role, with a focus on cloud environments. 4-7+ years of experience with PostgreSQL and\or MSSQL database architecture, implementation, and management. Advanced proficiency in Python More ❯
Big Data) Sr System ReliabilityEngineer St Louis, Missouri (Hybrid, Resource is required in St Louis and ready to travel to Missouri office whenever required ) Skillset required: ITSM, Production Support, Hadoop, Hive, Spark, Nifi, Impala, Secondary Skill: Git/BitBucket, Jenkins, Maven, Artifactory, and Chef The Role • Plan, manage, and oversee all aspects of a Production Environment for … they are live by measuring and monitoring availability, latency and overall system health. • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity. • Work with a global team spread across tech hubs in multiple geographies and time zones. • Ability to share knowledge and explain processes and procedures to others. Requirements More ❯
the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description Sr. Site ReliabilityEngineer, Product Reliability Engineering - Middleware Shape the future of global payments while growing your career About The Role We're looking for a curious and innovative Product … ReliabilityEngineer to join our Middleware team at Visa. Here, you'll be part of a diverse group of problem-solvers who ensure billions of transactions flow seamlessly across the world's largest payment network. What You'll Do Design for Reliability: Architect and implement solutions that keep Visa's middleware services running with Always On availability … other Promotes work-life balance and sustainable on-call rotations Encourages innovation and experimentation Champions personal growth and career development Impact & Growth In this role, you'll: Shape the reliability standards for global payment systems Mentor and be mentored by talented engineers Drive automation and observability initiatives Influence architectural decisions Build systems that process billions in transactions This is More ❯
We are currently seeking a Senior Electronic Hardware ReliabilityEngineer to join our Product Reliability team in Oskaloosa. This role will lead reliability-engineering efforts-including planning, analysis, and testing activities-for both new product development and sustaining engineering projects. The goal? Drive continuous improvement in product and system reliability through a hands-on … collaborative approach. Duties: Including but not limited to: Collaborate with product development teams to integrate design-for-reliability principles into new systems. Provide technical leadership for planning and executing reliability evaluations, ensuring alignment with real-world use conditions. Partner with data scientists to define reliability test deliverables and conduct Design of Experiments (DoE) for investigating reliability concerns. Analyze failure data, identify root causes and trends, and recommend corrective actions with cross-functional teams. Lead efforts to refine and enhance internal reliability engineering processes. Assess the impact of field failures on service costs across a product's lifecycle. Assist with the development and evaluation of warranty cost models. Support data analysis related to field performance More ❯
Are you passionate about driving innovation in technology and solving complex problems? Join our Hardware Reliability Team as a Senior Hardware Development Engineer and contribute to the systems that power some of the most advanced cloud computing infrastructures in the world! Our AWS Hardware Engineering team is at the forefront of system innovation, directly impacting services such … as EC2, S3, EBS, DDB, Outpost, and CloudFront. We are dedicated to optimizing quality, performance, reliability, and cost, and we want individuals who are ready to tackle challenging projects! In this exhilarating role, you will: Enhance the reliability of hardware used in Amazon servers, overseeing every phase from design and testing to manufacturing and operations. Utilize extensive production … suppliers in addressing Amazon's server hardware needs through thorough auditing, modeling, and innovative engineering solutions. Make a positive impact on customers and the environment by enhancing server hardware reliability and performance. Actively participate in operations to identify and resolve critical issues. Provide technical leadership to various teams, sharing expertise to elevate their productivity. Requirements: Bachelor's degree in More ❯
Senior Automation Engineer , MAN2 Reliability Maintenance Engineering Job ID: Amazon UK Services Ltd. Here at Amazon we are looking to hire an experienced Senior Automation Engineer to join the team at our Fulfillment Center (FC) in MAN2, Warrington. The Senior Automation Engineer will ensure that Safety comes first in all Facilities efforts. This … being self-directed, and the ability to multi-task and deliver results in a dynamic environment. Key job responsibilities The following roles and responsibilities are required for a successful Senior Automation Engineer: • Prepare specifications and technical detail to fully define performance on equipment, material and services • Perform PLC control level issue diagnosis • Follow change management process and develop … system and equipment configuration changes and keep the documentation updated on the SharePoint site. A day in the life The following roles and responsibilities are required for a successful Senior Automation Engineer: • Prepare specifications and technical detail to fully define performance on equipment, material and services • Perform PLC control level issue diagnosis • Follow change management process and develop More ❯
diverse, and engaged team creates a collaborative environment that openly celebrates all cultures and affords personal and professional growth opportunities. Learn . Overview of Job Function: Verint's Sr. ReliabilityEngineer is responsible for all aspects of the development and operational reliability of platforms and applications. In this highly skilled, hands-on role, our Sr. ReliabilityEngineer ensures the scalability, availability, performance, and reliability of cloud-based systems and participates in and leads the design, development, testing, deployment, monitoring, and support of cloud-native solutions, while also serving as a subject matter expert for customer implementation and cloud platform support. This Sr. ReliabilityEngineer works closely with a global team of … engineers to build robust, observable, and resilient systems that meet business objectives, following DevOps and SRE best practices. The engineer also contributes to continuous integration and deployment (CI/CD) processes, incident response, and postmortem analysis, while mentoring junior engineers and driving process improvements. Principal Duties and Essential Responsibilities: Ongoing evaluation (test) of feature design - proactively work with others More ❯
diverse, and engaged team creates a collaborative environment that openly celebrates all cultures and affords personal and professional growth opportunities. Learn . Overview of Job Function: Verint's Sr. ReliabilityEngineer is responsible for all aspects of the development and operational reliability of platforms and applications. In this highly skilled, hands-on role, our Sr. ReliabilityEngineer ensures the scalability, availability, performance, and reliability of cloud-based systems and participates in and leads the design, development, testing, deployment, monitoring, and support of cloud-native solutions, while also serving as a subject matter expert for customer implementation and cloud platform support. This Sr. ReliabilityEngineer works closely with a global team of … engineers to build robust, observable, and resilient systems that meet business objectives, following DevOps and SRE best practices. The engineer also contributes to continuous integration and deployment (CI/CD) processes, incident response, and postmortem analysis, while mentoring junior engineers and driving process improvements. Principal Duties and Essential Responsibilities: Ongoing evaluation (test) of feature design - proactively work with others More ❯