Central London, London, United Kingdom Hybrid / WFH Options
Staffworx Limited
LLM integrations). Exposure to AI ethics, data privacy, and compliance regulations. Prior experience in multi-agent systems or autonomous AI workflows. Hands-on experience with vector databases (Pinecone, Weaviate, FAISS) and AI embeddings. Remote WorkingSome remote working CountryUnited Kingdom LocationWC1 Job TypeContract or Permanent Start DateApr-Jul 25 Duration9 months initial or permanent Visa RequirementApplicants must be eligible to More ❯
or LLM-powered applications in production environments. Proficiency in Python and ML libraries such as PyTorch, Hugging Face Transformers , or TensorFlow. Experience with vector search tools (e.g., FAISS, Pinecone, Weaviate) and retrieval frameworks (e.g., LangChain, LlamaIndex). Hands-on experience with fine-tuning and distillation of large language models. Comfortable with cloud platforms (Azure preferred), CI/CD tools, and More ❯
or LLM-powered applications in production environments. Proficiency in Python and ML libraries such as PyTorch, Hugging Face Transformers , or TensorFlow. Experience with vector search tools (e.g., FAISS, Pinecone, Weaviate) and retrieval frameworks (e.g., LangChain, LlamaIndex). Hands-on experience with fine-tuning and distillation of large language models. Comfortable with cloud platforms (Azure preferred), CI/CD tools, and More ❯
modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS SageMaker More ❯
modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS SageMaker More ❯
vector databases), AutoGPT Data Engineering & ML Pipelines: Apache Airflow, MLflow, Kubeflow, dbt, Prefect Cloud & Deployment Platforms: AWS SageMaker, Azure ML, Google Vertex AI APIs & Orchestration: OpenAI API, Anthropic Claude, Weaviate, FastAPI (for AI applications) MLOps & Experimentation: Weights & Biases, DVC (Data Version Control), Docker, Kubernetes General 2+ years of professional experience in relevant fields. Experience mentoring, coaching, or teaching others in More ❯
and maintain AI microservices using Docker, Kubernetes, and FastAPI, ensuring smooth model serving and error handling; Vector Search & Retrieval: Implement retrieval-augmented workflows: ingest documents, index embeddings (Pinecone, FAISS, Weaviate), and build similarity search features. Rapid Prototyping: Create interactive AI demos and proofs-of-concept with Streamlit, Gradio, or Next.js for stakeholder feedback; MLOps & Deployment: Implement CI/CD pipelines … tuning LLMs via OpenAI, HuggingFace or similar APIs; Strong proficiency in Python; Deep expertise in prompt engineering and tooling like LangChain or LlamaIndex; Proficiency with vector databases (Pinecone, FAISS, Weaviate) and document embedding pipelines; Proven rapid-prototyping skills using Streamlit or equivalent frameworks for UI demos. Familiarity with containerization (Docker) and at least one orchestration/deployment platform; Excellent communication More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
City of London, London, Finsbury Square, United Kingdom
The Portfolio Group
Full-Stack Integration : Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js. Vector Databases & Search : Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch. Required skills & experience: 3-5+ years in machine learning and software development Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers Experience with More ❯
and LLMs (eg, OpenAI, Claude, Mistral, etc.) Develop Python-based services and APIs for integration with AI models Implement Retrieval-Augmented Generation (RAG) pipelines using vector databases (eg, FAISS, Weaviate) Collaborate with data scientists and DevOps to deploy AI models into production Contribute to prompt engineering and fine-tuning strategies for LLMs Ensure solutions meet governance, ethical, and security standards … experience as an AI Engineer or Machine Learning Engineer Strong Python development skills Hands-on experience with LangChain and other LLM frameworks Familiarity with vector databases (eg, FAISS, Pinecone, Weaviate) Experience working with OpenAI, Anthropic, or other generative AI APIs Ability to work independently in a remote team and deliver results to tight timelines Desirable Skills: Knowledge of cloud platforms More ❯
tools Cloud & MLOps (AWS): Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS Full-Stack Integration: Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js Vector Search: Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch Required skills & experience: 3-5+ years of experience in ML engineering and software development Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face Proven experience with More ❯
tools Cloud & MLOps (AWS): Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS Full-Stack Integration: Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js Vector Search: Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch Required skills & experience: 3-5+ years of experience in ML engineering and software development Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face Proven experience with More ❯
Our benefits Share Options (EMI) scheme 25 days annual leave, plus flexible bank holidaysand the opportunity to buy additional days Enhanced workplace Pension scheme - opt in salary sacrifice scheme Life Insurance (3x annual salary) Employee Assistance Programme (EAP) and workplace More ❯
background in Computer Science and Software Development Experience with complex RAG pipelines (without using Langchain or LlamaIndex) Tech stack includes: Foundation and open-source LLMs, vector databases (Pinecone, Qdrant, Weaviate), embeddings, Next.js, Vercel, MongoDB, Cohere reranking, multimodal parsing (e.g., Unstructured.io) This is a project-based, part-time, contract role. Must provide work samples (GitHub). Must be based in US More ❯