26 to 33 of 33 Permanent Ray Jobs

AI Systems Research Engineer - LLM Optimisation

Hiring Organisation: Project People
Location: United Kingdom

large-scale inference and data pipelines, focusing on KV cache management, heterogeneous memory scheduling, and high-throughput inference serving using frameworks like vLLM, Ray Serve, and modern PyTorch Distributed systems. Scalable Model Serving Infrastructure : Develop and evaluate frameworks that enable efficient multi-tenant, low-latency, and fault-tolerant AI serving … computing, or large-scale AI infrastructure are also welcome At least 2 years of experience with LLM inference/serving framework optimization (vLLM/Ray Serve/TensorRT-LLM/PyTorch) Hands-on experience with distributed KV cache optimization Familiarity with GPU and how they execute LLMs Strong knowledge ...

Senior Software Engineer - HPC Cost Optimization & Efficiency

Hiring Organisation: Zoox
Location: Foster City, California, United States
Employment Type: Permanent
Salary: USD Annual

forecasting models and budget management tools for capacity planning Qualifications Experience optimizing large-scale distributed systems for cost and efficiency Experience with Ray.io, particularly Ray Core and Ray Data Experience with Kubernetes, particularly for heterogeneous workloads and cost optimization Experience with cloud cost management on AWS (Cost Explorer) or similar ...

Data Platform Engineer

Hiring Organisation: Top Engineer
Location: San Francisco, California, United States
Employment Type: Any
Salary: USD Annual

Data Platform Engineer At this technology driven investment firm, "data is everything" isn't a talking point. It's literally the job. You will own the data layer that powers quantitative researchers, data scientists, and ...

Staff Software Engineer, Applied Training

Hiring Organisation: CoreWeave
Location: New York City, New York, United States
Employment Type: Permanent
Salary: USD Annual

company doing large-scale training. Familiarity with agentic AI: RL training with rollouts, agent evaluation, sandbox isolation for running untrusted code. Background with Slurm, Ray, or similar workload orchestration. Opinions on where they fall short. Experience with container runtimes, isolation (gVisor, Kata), or serverless platforms. OSS contributions to Kubernetes SIGs … Ray, PyTorch, or similar. Wondering if you're a good fit? We believe in investing in our people and value candidates who can bring their diverse experiences to our teams - even if you aren't a 100% skill or experience match. Why CoreWeave? At CoreWeave, we work hard, have ...

Senior Software Engineer II, Applied Training

Hiring Organisation: CoreWeave
Location: New York City, New York, United States
Employment Type: Permanent
Salary: USD Annual

Associate Software Engineer, Graduate

Hiring Organisation: Encord
Location: San Francisco, California, United States
Employment Type: Permanent
Salary: USD Annual

Python, Go, C#, C++, Java, Ruby on Rails, or JavaScript Familiarity with any of the following is a bonus: React, Kubernetes, GCP, PyTorch, CUDA, Ray Tech stack We are technology agnostic at Encord and not looking for experience across all of these - as long as you're open to learning … please apply. Backend: Python Frontend: TypeScript and React Deployment: Kubernetes Infrastructure: GCP Machine Learning: PyTorch, CUDA, Ray Why Encord Competitive salary, commission, and meaningful equity in a high-growth start-up Clear, accelerated growth opportunities as the company scales rapidly Strong in-person culture: 4-5 days/week ...

Senior Software Engineer II, AI Workload Orchestration

Hiring Organisation: CoreWeave
Location: Sunnyvale, California, United States
Employment Type: Permanent
Salary: USD Annual

native platform for admitting, scheduling, and operating AI workloads at scale. This platform integrates multiple orchestration and scheduling frameworks such as Kueue, Volcano, and Ray to support modern AI training and inference workflows. It complements SUNK (Slurm on Kubernetes) by providing a Kubernetes-first, cloud-native orchestration layer with deep … operational metrics Comfortable owning services in production and participating in on-call rotations Preferred: Experience with Kubernetes-native orchestration frameworks such as Kueue, Volcano, Ray, Kubeflow, or Argo Workflows Familiarity with GPU-based workloads, ML training, or inference pipelines Knowledge of scheduling concepts such as quota enforcement, pre-emption ...

Data Scientist - Senior

Hiring Organisation: GRVTY
Location: Springfield, Virginia, United States
Employment Type: Permanent
Salary: USD Annual

Job Description Job Description What You'll Be Owning: We are seeking a highly skilled Senior Data Scientist to support mission-critical intelligence operations. The ideal candidate brings deep analytic expertise, advanced data science capabilities ...