Tuning: Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF
Inference & Acceleration: Serve models using vLLM, DeepSpeed, Triton, TensorRT
Multi-Agent Orchestration: Work with LangChain, AutoGen, CrewAI, DSPy and similar tools
Cloud & MLOps (AWS): Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS
Full-Stack Integration: Build APIs (FastAPI, Flask) and …
serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration frameworks.
Multi-Agent Systems: Develop and integrate agentic capabilities using frameworks such as LangChain, CrewAI, AutoGen, and DSPy.
AWS Cloud & MLOps: Deploy scalable machine learning workloads on AWS using services like SageMaker, Bedrock, Lambda, S3, DynamoDB, ECS, and EKS.
Nottingham, England, United Kingdom Hybrid / WFH Options
Digital Waffle
focus
Proficiency in Python and/or TypeScript, and experience with production-grade systems
Familiarity with LLMs, AI agents, or orchestration frameworks (OpenAI, Anthropic, LangChain, etc.)
Strong grasp of data modelling, cloud infrastructure (preferably AWS), and modern APIs
Experience with relational and NoSQL databases (e.g. PostgreSQL, MongoDB)
Problem-solver who …