functional teams to deliver high-impact solutions in a fast-paced, research-driven environment. Key Responsibilities Design and implement scalable, distributed systems to support AI-driven workloads, ensuring high performance and reliability. Develop robust software solutions using Python (and potentially C++) to address complex technical challenges in AI and distributed computing. Work within a larger team to rapidly develop … distributed systems, machine learning, or a related field. Experience in the full lifecycle of developing, deploying, and maintaining large-scale cloud production systems, demonstrating expertise in scalability, reliability, and performance optimization More ❯
functional teams to deliver high-impact solutions in a fast-paced, research-driven environment. Key Responsibilities Design and implement scalable, distributed systems to support AI-driven workloads, ensuring high performance and reliability. Develop robust software solutions using Python (and potentially C++) to address complex technical challenges in AI and distributed computing. Work within a larger team to rapidly develop … distributed systems, machine learning, or a related field. Experience in the full lifecycle of developing, deploying, and maintaining large-scale cloud production systems, demonstrating expertise in scalability, reliability, and performance optimization More ❯
machine learning systems. Key Responsibilities: Rapid Prototyping: Collaborate with a multidisciplinary team to quickly develop and test proof-of-concept prototypes, validating research ideas in real-world, industrial environments. Performance Optimization: Perform detailed profiling and optimization of operating systems and large-scale distributed systems, leveraging a range of heterogeneous hardware (e.g., CPU, NPU) for enhanced performance. Research Publications: Transform … Technical Documentation: Create and maintain comprehensive technical documentation outlining research methods, design decisions, and implementation specifics to ensure reproducibility and support internal knowledge sharing. Stakeholder Engagement: Communicate research progress, performance insights, and key outcomes to internal stakeholders through clear and engaging presentations. Required: Bachelor's or Master's degree in Computer Science or a related technical field. Be comfortable …/or machine learning (NeurIPS, ICML, ICLR). Experience in the full lifecycle of developing, deploying, and maintaining large-scale cloud production systems, demonstrating expertise in scalability, reliability, and performance optimization. More ❯
machine learning systems. Key Responsibilities: Rapid Prototyping: Collaborate with a multidisciplinary team to quickly develop and test proof-of-concept prototypes, validating research ideas in real-world, industrial environments. Performance Optimization: Perform detailed profiling and optimization of operating systems and large-scale distributed systems, leveraging a range of heterogeneous hardware (e.g., CPU, NPU) for enhanced performance. Research Publications: Transform … Technical Documentation: Create and maintain comprehensive technical documentation outlining research methods, design decisions, and implementation specifics to ensure reproducibility and support internal knowledge sharing. Stakeholder Engagement: Communicate research progress, performance insights, and key outcomes to internal stakeholders through clear and engaging presentations. Required: Bachelor's or Master's degree in Computer Science or a related technical field. Be comfortable …/or machine learning (NeurIPS, ICML, ICLR). Experience in the full lifecycle of developing, deploying, and maintaining large-scale cloud production systems, demonstrating expertise in scalability, reliability, and performance optimization. More ❯