AI Infrastructure Architect
Responsibilities:
Design a unified AI Infra & Serving architecture platform for composite AI workloads such as LLM Training & Inference, RLHF, Agent, and Multimodal processing. This platform will integrate inference, orchestration, and state management, defining the technical evolution path for Serverless AI + Agentic Serving
Design a heterogeneous execution xkybehq framework across CPU/GPU/NPU for agent me
Before applying for this role, please read the following information about this opportunity found below. Please click on the apply button to read the full job description