vector databases), AutoGPT Data Engineering & ML Pipelines: Apache Airflow, MLflow, Kubeflow, dbt, Prefect Cloud & Deployment Platforms: AWS SageMaker, Azure ML, Google Vertex AI APIs & Orchestration: OpenAI API, Anthropic Claude, Weaviate, FastAPI (for AI applications) MLOps & Experimentation: Weights & Biases, DVC (Data Version Control), Docker, Kubernetes General 2+ years of professional experience in relevant fields. Experience mentoring, coaching, or teaching others in More ❯
modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS SageMaker More ❯
modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS SageMaker More ❯
Cambridge, Cambridgeshire, England, United Kingdom Hybrid / WFH Options
Ascent Sourcing Ltd
experience with memory frameworks like Mem0 or Letta. Understanding and production experience with traditional RAG and agentic RAG architectures. Strong grasp of embedding systems and vector databases (e.g., Pinecone, Weaviate, FAISS). Production experience with LiveKit or similar audio/video platforms. What You’ll Be Doing Building and deploying scalable AI features, agents, and services powered by LLMs. Prototyping More ❯
and maintain AI microservices using Docker, Kubernetes, and FastAPI, ensuring smooth model serving and error handling; Vector Search & Retrieval: Implement retrieval-augmented workflows: ingest documents, index embeddings (Pinecone, FAISS, Weaviate), and build similarity search features. Rapid Prototyping: Create interactive AI demos and proofs-of-concept with Streamlit, Gradio, or Next.js for stakeholder feedback; MLOps & Deployment: Implement CI/CD pipelines … tuning LLMs via OpenAI, HuggingFace or similar APIs; Strong proficiency in Python; Deep expertise in prompt engineering and tooling like LangChain or LlamaIndex; Proficiency with vector databases (Pinecone, FAISS, Weaviate) and document embedding pipelines; Proven rapid-prototyping skills using Streamlit or equivalent frameworks for UI demos. Familiarity with containerization (Docker) and at least one orchestration/deployment platform; Excellent communication More ❯