Senior Software Engineer II, Inference
- Hiring Organisation: CoreWeave
- Location: Sunnyvale, California, United States
- Employment Type: Permanent
- Salary: USD Annual
streaming token delivery. Proven track record of improving tail latency (P95/P99) and service reliability through metrics-driven work.

Preferred: Contributions to inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve, TorchServe). Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies. Leading multi-team initiatives ...