Central London, London, United Kingdom Hybrid / WFH Options
Staffworx Limited
custom LLM integrations). Exposure to AI ethics, data privacy, and compliance regulations. Prior experience in multi-agent systems or autonomous AI workflows. Hands-on experience with vector databases (Pinecone, Weaviate, FAISS) and AI embeddings. Remote Working: Some remote working; Country: United Kingdom; Location: WC1; Job Type: Contract or Permanent; Start Date: Apr-Jul 25; Duration: 9 months initial or permanent; Visa Requirement: Applicants must be eligible More ❯
Express, Next.js. Integrate ML models and embeddings into production pipelines using AWS SageMaker, Bedrock or OpenAI APIs. Build support systems for autonomous agents including memory storage, vector search (e.g., Pinecone, Weaviate) and tool registries. Enforce system-level requirements for security, compliance, observability and CI/CD. Drive PoCs and reference architectures for multi-agent coordination, intelligent routing and goal-directed … similar. Experience with secure cloud deployments and production ML model integration. Bonus Skills: Applied work with multi-agent systems, tool orchestration, or autonomous decision-making. Experience with vector databases (Pinecone, Weaviate, FAISS) and embedding pipelines. Knowledge of AI chatbot frameworks (Rasa, BotPress, Dialogflow) or custom LLM-based UIs. Awareness of AI governance, model auditing, and data privacy regulation (GDPR, DPA More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Staffworx
Express, Next.js. Integrate ML models and embeddings into production pipelines using AWS SageMaker, Bedrock or OpenAI APIs. Build support systems for autonomous agents including memory storage, vector search (e.g., Pinecone, Weaviate) and tool registries. Enforce system-level requirements for security, compliance, observability and CI/CD. Drive PoCs and reference architectures for multi-agent coordination, intelligent routing and goal-directed … similar. Experience with secure cloud deployments and production ML model integration. Bonus Skills: Applied work with multi-agent systems, tool orchestration, or autonomous decision-making. Experience with vector databases (Pinecone, Weaviate, FAISS) and embedding pipelines. Knowledge of AI chatbot frameworks (Rasa, BotPress, Dialogflow) or custom LLM-based UIs. Awareness of AI governance, model auditing, and data privacy regulation (GDPR, DPA More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Staffworx
Express, Next.js. Integrate ML models and embeddings into production pipelines using AWS SageMaker, Bedrock or OpenAI APIs. Build support systems for autonomous agents including memory storage, vector search (e.g., Pinecone, Weaviate) and tool registries. Enforce system-level requirements for security, compliance, observability and CI/CD. Drive PoCs and reference architectures for multi-agent coordination, intelligent routing and goal-directed … similar. Experience with secure cloud deployments and production ML model integration. Bonus Skills: Applied work with multi-agent systems, tool orchestration, or autonomous decision-making. Experience with vector databases (Pinecone, Weaviate, FAISS) and embedding pipelines. Knowledge of AI chatbot frameworks (Rasa, BotPress, Dialogflow) or custom LLM-based UIs. Awareness of AI governance, model auditing, and data privacy regulation (GDPR, DPA More ❯
and modern web frameworks. Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain). Mastery of prompt engineering and fine-tuning Large Language Models. Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies. Expert in building RAG (Retrieval-Augmented Generation) systems at scale. Strong experience with MLOps practices and model deployment pipelines. Proficient in cloud AI services (AWS More ❯
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
TEKsystems, Inc
management using frameworks such as LangChain, CrewAI, and Autogen. Engineer and tune prompts to enhance the performance and reliability of generative tasks. Design RAG systems using vector databases like Pinecone, Chroma, and PostgreSQL for contextual retrieval. Incorporate semantic search and embedding strategies for more relevant and grounded LLM responses. Utilize Guardrails to implement applications that adhere to responsible AI guidelines. More ❯
CI/CD: Experience with continuous integration and deployment tools such as GitLab, GitHub, or Jenkins. Database Management Vector Databases: Experience with (but not limited to) ChromaDB, Pinecone, PGVector, MongoDB, Qdrant, etc. NoSQL: Familiarity with NoSQL databases (e.g., MongoDB preferred). SQL: Experience working with SQL databases like PostgreSQL. Version Control Proficient in Git and version control platforms More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Capgemini
CI/CD: Experience with continuous integration and deployment tools such as GitLab, GitHub, or Jenkins. Database Management Vector Databases: Experience with (but not limited to) ChromaDB, Pinecone, PGVector, MongoDB, Qdrant, etc. NoSQL: Familiarity with NoSQL databases (e.g., MongoDB preferred). SQL: Experience working with SQL databases like PostgreSQL. Version Control Proficient in Git and version control platforms More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. • Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. • Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. • Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate More ❯
large-scale infrastructure, and modern backend development using Java, Python, Golang, Spring Boot, Flask, and Kubernetes. We focus on integrating RAG-powered LLMs, implementing advanced vector search (FAISS, Milvus, Pinecone), and building scalable and high-performance AI-driven solutions. You Might Be a Good Fit If You: Have deep hands-on software engineering expertise in Java or Python. Thrive in … applications using Java, Python, and modern backend frameworks. Integrate LLMs into enterprise-scale systems using internal frameworks and libraries. Design and implement vector search solutions using FAISS, Milvus, and Pinecone. Build scalable APIs and backend services using Spring Boot, Flask, and FastAPI. Optimize data storage and retrieval with PostgreSQL/MongoDB and distributed databases. Deploy and manage cloud-native applications … Succeed in This Role: Proficiency in Java or Python for backend development. Strong knowledge of Spring Boot, Flask, FastAPI, and API design. Experience with vector search frameworks (FAISS, Milvus, Pinecone). Expertise in Kubernetes and Docker for scalable deployment. Understanding of authentication & security frameworks (Spring Security, SSO). Hands-on experience with PostgreSQL and distributed storage. Experience with Maven or Gradle for More ❯
developer tools, open-source culture, and improving developer workflows. Excellent communication and collaboration skills in a remote-first environment. Experience contributing to open-source AI projects. Experience with LangChain, Pinecone, or similar AI frameworks/infrastructure. Past experience building AI features into developer platforms or tools. Benefits Our entire company is distributed, so we take remote work seriously. If you More ❯
NestJS). Designing secure, scalable database structures in PostgreSQL with Redis caching. Implementing event-driven architectures using Kafka or RabbitMQ. Integrating with OpenAI, Anthropic, and vector DBs like pgvector or Pinecone. Managing infrastructure using Docker, Kubernetes, and CI/CD pipelines. Leading conversations on architecture, performance, and scalability. Shipping work that’s immediately valuable to real customers. Competitive pay + revenue More ❯
London, England, United Kingdom Hybrid / WFH Options
Conquer AI
NestJS). Designing secure, scalable database structures in PostgreSQL with Redis caching. Implementing event-driven architectures using Kafka or RabbitMQ. Integrating with OpenAI, Anthropic, and vector DBs like pgvector or Pinecone. Managing infrastructure using Docker, Kubernetes, and CI/CD pipelines. Leading conversations on architecture, performance, and scalability. Shipping work that’s immediately valuable to real customers. Competitive pay + revenue More ❯
building complex architectures from MVP to production. Solid hands-on experience with AI/LLM applications and model deployment. Comfortable across front-end (HTML, CSS, Tailwind) and back-end (Pinecone, microservices, serverless). Brownie points: SEO know-how. Deeper AI/ML chops (Colab, Streamlit, FastAPI, PyTorch, etc.). Entrepreneurial streak and previous startup exposure. Ability to pivot quickly and learn on More ❯