in a client-facing, consultative role. Deep knowledge of microservices, distributed systems, and event-driven architectures. Strong foundation in AI/ML, especially LLMs, RAG , and vector databases (e.g. Pinecone, Weaviate). Experience with enterprise data integration (e.g. PostgreSQL, SharePoint, APIs, warehouses). Cloud architecture experience (AWS, Azure, or GCP). Proficiency in Python (strongly preferred), and familiarity with Docker More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
Unily
defining new patterns, processes, and tooling from the ground up. Experience with GitHub Actions, Terraform, or similar DevOps tooling. Familiarity with vector databases or hybrid search infrastructure (e.g., pgvector, Pinecone). Experience in data privacy, ethical AI, or governance frameworks. Familiarity with real-time applications using technologies like WebSockets or serverless edge functions. We are united by a shared purpose More ❯
Model Serving: Triton Inference Server, Hugging Face Inference Endpoints API Integration: OpenAI, Anthropic, Cohere, Mistral APIs LLM Frameworks: LangChain, LlamaIndex – for building LLM-powered applications Vector Databases: FAISS, Weaviate, Pinecone, Qdrant (Nice-to-Have) Retrieval-Augmented Generation (RAG): Experience building hybrid systems combining LLMs with enterprise data With a focus within Energy Trading, Oil & Gas, Financial Markets and Commodities, we More ❯
Model Serving: Triton Inference Server, Hugging Face Inference Endpoints API Integration: OpenAI, Anthropic, Cohere, Mistral APIs LLM Frameworks: LangChain, LlamaIndex – for building LLM-powered applications Vector Databases: FAISS, Weaviate, Pinecone, Qdrant (Nice-to-Have) Retrieval-Augmented Generation (RAG): Experience building hybrid systems combining LLMs with enterprise data With a focus within Energy Trading, Oil & Gas, Financial Markets and Commodities, we More ❯
Model Serving: Triton Inference Server, Hugging Face Inference Endpoints API Integration: OpenAI, Anthropic, Cohere, Mistral APIs LLM Frameworks: LangChain, LlamaIndex – for building LLM-powered applications Vector Databases: FAISS, Weaviate, Pinecone, Qdrant (Nice-to-Have) Retrieval-Augmented Generation (RAG): Experience building hybrid systems combining LLMs with enterprise data With a focus within Energy Trading, Oil & Gas, Financial Markets and Commodities, we More ❯
Model Serving: Triton Inference Server, Hugging Face Inference Endpoints API Integration: OpenAI, Anthropic, Cohere, Mistral APIs LLM Frameworks: LangChain, LlamaIndex – for building LLM-powered applications Vector Databases: FAISS, Weaviate, Pinecone, Qdrant (Nice-to-Have) Retrieval-Augmented Generation (RAG): Experience building hybrid systems combining LLMs with enterprise data With a focus within Energy Trading, Oil & Gas, Financial Markets and Commodities, we More ❯
london (city of london), south east england, united kingdom
Eaglecliff Recruitment
Model Serving: Triton Inference Server, Hugging Face Inference Endpoints API Integration: OpenAI, Anthropic, Cohere, Mistral APIs LLM Frameworks: LangChain, LlamaIndex – for building LLM-powered applications Vector Databases: FAISS, Weaviate, Pinecone, Qdrant (Nice-to-Have) Retrieval-Augmented Generation (RAG): Experience building hybrid systems combining LLMs with enterprise data With a focus within Energy Trading, Oil & Gas, Financial Markets and Commodities, we More ❯
Basingstoke, Hampshire, South East, United Kingdom Hybrid / WFH Options
iDPP
and AI agents) Build and maintain APIs, data pipelines, and backend components (primarily Python, FastAPI, or Flask) Engineer robust, containerised architectures using Docker and Kubernetes Work with ElasticSearch, Weaviate, Pinecone, and other vector databases Help deliver high-availability, fault-tolerant systems for mission-critical workloads What Were Looking For Degree in Computer Science, AI, or related field, or equivalent hands More ❯
in data engineering, ML engineering, or similar technical roles Strong Python skills and comfort working across complex ingestion workflows Experience managing NoSQL and vector databases at scale (MongoDB, Weaviate, Pinecone, etc.) Solid understanding of modern data pipeline tools (Airflow, Prefect, Dagster) Practical experience with LLM development, embeddings, and RAG architectures Familiarity with distributed systems and cloud platforms (AWS, GCP, or More ❯
in data engineering, ML engineering, or similar technical roles Strong Python skills and comfort working across complex ingestion workflows Experience managing NoSQL and vector databases at scale (MongoDB, Weaviate, Pinecone, etc.) Solid understanding of modern data pipeline tools (Airflow, Prefect, Dagster) Practical experience with LLM development, embeddings, and RAG architectures Familiarity with distributed systems and cloud platforms (AWS, GCP, or More ❯
southampton, south east england, united kingdom Hybrid / WFH Options
idpp
and AI agents) Build and maintain APIs, data pipelines, and backend components (primarily Python, FastAPI, or Flask) Engineer robust, containerised architectures using Docker and Kubernetes Work with ElasticSearch, Weaviate, Pinecone, and other vector databases Help deliver high-availability, fault-tolerant systems for mission-critical workloads What We’re Looking For Degree in Computer Science, AI, or related field, or equivalent More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
IT Graduate Recruitment
from senior ML engineers, AI researchers, and founders. Freedom to experiment with state-of-the-art models, tools, and frameworks. Modern tech stack (Python, LangChain, Hugging Face, OpenAI API, Pinecone, Kubernetes, etc.). Flexible working — remote-first culture with in-person team sessions for collaboration. Career acceleration — opportunities to own projects, lead development, and shape the product roadmap. An environment … Retrieval-Augmented Generation, Python, Data Science, AI Research, MLOps, Data Pipelines, Prompt Engineering, Model Fine-Tuning, Cloud Computing, AWS, Azure, Google Cloud, AI Infrastructure, Transformers, Reinforcement Learning, Vector Databases, Pinecone, Weaviate, Semantic Search, API Development, AI Deployment, Model Serving, AI Automation, Early Stage Startup, AI Startups, Tech Startup, Machine Intelligence, Applied AI, AI Applications, AI Innovation, AI Product Development, AI More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
record of shipping AI products into production (not just research or prototypes). Preferred tech stack - TypeScript, React, LangChain, and LLM orchestration. Experience with RAG systems , vector databases (e.g. Pinecone, PGVector, Weaviate), and model evaluation . Comfortable working in a fast-paced startup environment with a high degree of autonomy. Exposure to agentic systems, prompt optimisation, or multi-agent frameworks. More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
record of shipping AI products into production (not just research or prototypes). Preferred tech stack - TypeScript, React, LangChain, and LLM orchestration. Experience with RAG systems , vector databases (e.g. Pinecone, PGVector, Weaviate), and model evaluation . Comfortable working in a fast-paced startup environment with a high degree of autonomy. Exposure to agentic systems, prompt optimisation, or multi-agent frameworks. More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
record of shipping AI products into production (not just research or prototypes). Preferred tech stack - TypeScript, React, LangChain, and LLM orchestration. Experience with RAG systems , vector databases (e.g. Pinecone, PGVector, Weaviate), and model evaluation . Comfortable working in a fast-paced startup environment with a high degree of autonomy. Exposure to agentic systems, prompt optimisation, or multi-agent frameworks. More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
record of shipping AI products into production (not just research or prototypes). Preferred tech stack - TypeScript, React, LangChain, and LLM orchestration. Experience with RAG systems , vector databases (e.g. Pinecone, PGVector, Weaviate), and model evaluation . Comfortable working in a fast-paced startup environment with a high degree of autonomy. Exposure to agentic systems, prompt optimisation, or multi-agent frameworks. More ❯
london (city of london), south east england, united kingdom
SR2 | Socially Responsible Recruitment | Certified B Corporation™
record of shipping AI products into production (not just research or prototypes). Preferred tech stack - TypeScript, React, LangChain, and LLM orchestration. Experience with RAG systems , vector databases (e.g. Pinecone, PGVector, Weaviate), and model evaluation . Comfortable working in a fast-paced startup environment with a high degree of autonomy. Exposure to agentic systems, prompt optimisation, or multi-agent frameworks. More ❯
Guildford, England, United Kingdom Hybrid / WFH Options
Intellect Group
and future-ready. Key Responsibilities Develop and own the AI strategy and product roadmap . Lead the design, prototyping, and production of LLM-based applications (OpenAI, Hugging Face, LangChain, Pinecone). Build technical integrations with platforms such as Microsoft 365, Salesforce, and CMS tools. Define and embed AI governance standards , ensuring ethical and compliant use. Act as an AI thought More ❯
woking, south east england, united kingdom Hybrid / WFH Options
Intellect Group
and future-ready. Key Responsibilities Develop and own the AI strategy and product roadmap . Lead the design, prototyping, and production of LLM-based applications (OpenAI, Hugging Face, LangChain, Pinecone). Build technical integrations with platforms such as Microsoft 365, Salesforce, and CMS tools. Define and embed AI governance standards , ensuring ethical and compliant use. Act as an AI thought More ❯
z2bz0 years of experience in full stack development (React + Node or equivalent stack) Experience integrating and fine-tuning LLMs, embeddings, or vector search (e.g. OpenAI, LangChain, Pinecone, Lovable and Google’s Agent Development Kit or equivalent vide coding platforms) Strong frontend sensibility (you care how things feel and look , not just how they run) Comfort building MVPs that can More ❯
z2bz0 years of experience in full stack development (React + Node or equivalent stack) Experience integrating and fine-tuning LLMs, embeddings, or vector search (e.g. OpenAI, LangChain, Pinecone, Lovable and Google’s Agent Development Kit or equivalent vide coding platforms) Strong frontend sensibility (you care how things feel and look , not just how they run) Comfort building MVPs that can More ❯
z2bz0 years of experience in full stack development (React + Node or equivalent stack) Experience integrating and fine-tuning LLMs, embeddings, or vector search (e.g. OpenAI, LangChain, Pinecone, Lovable and Google’s Agent Development Kit or equivalent vide coding platforms) Strong frontend sensibility (you care how things feel and look, not just how they run) Comfort building MVPs that can More ❯
z2bz0 years of experience in full stack development (React + Node or equivalent stack) Experience integrating and fine-tuning LLMs, embeddings, or vector search (e.g. OpenAI, LangChain, Pinecone, Lovable and Google’s Agent Development Kit or equivalent vide coding platforms) Strong frontend sensibility (you care how things feel and look, not just how they run) Comfort building MVPs that can More ❯
london (city of london), south east england, united kingdom
WORK-SELF
z2bz0 years of experience in full stack development (React + Node or equivalent stack) Experience integrating and fine-tuning LLMs, embeddings, or vector search (e.g. OpenAI, LangChain, Pinecone, Lovable and Google’s Agent Development Kit or equivalent vide coding platforms) Strong frontend sensibility (you care how things feel and look, not just how they run) Comfort building MVPs that can More ❯