AI Platform & Cloud Engineer
- Hiring Organisation
- Axle
- Location
- bethesda, maryland, united states
clusters and public cloud environments (GCP/AWS) are utilized effectively. Refactoring & Model Serving : Transform experimental code (Jupyter Notebooks, R scripts) developed by NLP and Omics researchers into robust, containerized software packages. Deploy and optimize model inference servers (e.g., vLLM, Triton Inference Server) to expose AI models as reliable internal …/EKS) and using GitOps workflows (ArgoCD/Flux). Knowledge of Graph Database administration (Neo4j) and object storage architectures. Familiarity with Serverless event processing (Cloud Functions) and ML Engineering concepts (quantization, distillation, serving via Triton/vLLM). Disclaimer: The above description is meant to illustrate the general ...