networking, virtualization, cloud, etc.). Strong technical troubleshooting in multi-platform, distributed environments. Strong understanding of distributed storage systems. Expertise in Linux/Unix administration. Deep understanding of networking (Infiniband, Ethernet, DPDK, UCX), cloud computing, and distributed storage. Proficiency in Python, Bash, and experience with automation scripting for system monitoring and troubleshooting. Knowledge of POSIX, NFS, S3 protocols, log management
Altair Grid Engine. Proficiency in developing workflows for application builds and testing. Experience with setting up CUDA, OpenMPI, TensorFlow, and PyTorch. Familiarity with cloud services and technologies. Knowledge of InfiniBand or other fast interconnect technologies. A plus: Understanding of drug development processes and workflows commonly encountered in biotech/pharma R&D environments. Strong communication skills, both verbal and written.
Staff Software Engineer, AI Reliability Engineering London, UK About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our