City of London, London, United Kingdom Hybrid / WFH Options
Bloc Recruitment
Build AI systems that improve accuracy, speed up assessments, and delight users Work with industry experts to turn messy claim processes into AI-driven workflows Deploy solutions using LLMs, LangChain, and RAG at scale What we're looking for 2+ years in AI/ML engineering or research Proven track record taking projects from idea → production → impact Bias to action More ❯
performing engineers while raising the quality bar. Collaborate : Translate product goals into clear technical execution through strong communication and cross-team alignment. Nice to Have Experience with orchestration frameworks ( LangChain, LangGraph ) and multi-agent workflows Knowledge of vector databases, RAG pipelines, and lightweight model hosting Ability to design data pipelines and feedback loops for improving AI-driven features Awareness of More ❯
of planning, reasoning, and task execution Develop agent communication protocols and coordination mechanisms Create robust error handling and recovery systems for agent workflows AI Infrastructure & Integration: Implement and optimize LangChain pipelines for agent reasoning and tool usage Design and maintain vector database solutions for knowledge retrieval and semantic search Build scalable data ingestion pipelines for structured and unstructured data Integrate … agent reasoning Implement efficient caching and data persistence strategies Required Technical Skills: Core AI/ML Frameworks Expert-level proficiency with LangGraph for agent workflow orchestration Deep experience with LangChain for LLM application development Hands-on experience with Google Cloud AI services (Vertex AI, Gemini, PaLM) Strong understanding of prompt engineering and LLM optimization techniques Data & Storage: Proficiency with vector More ❯
Hampton, Virginia, United States Hybrid / WFH Options
Iron EagleX, Inc
leveraging state-of-the-art frameworks. Apply advanced prompt engineering techniques such as few-shot learning, chain-of-thought reasoning, self-consistency, and retrieval-augmented generation. Utilize frameworks like LangChain and DSPy to design, test, and optimize prompts. Address token limitations, cost trade-offs, and model-specific constraints across various LLMs (ChatGPT, Claude, Llama, Mistral, etc.). Implement model fine … of 5 years of experience in machine learning, AI model development, or related fields. Proficiency in prompt engineering and LLM optimization techniques. Hands-on experience with frameworks such as LangChain, DSPy, and LangGraph. Strong understanding of AI workflow automation and agent-based architectures. Solid Python programming skills and experience with version control (Git). Familiarity with Agile development methodologies. Due More ❯
london, south east england, united kingdom Hybrid / WFH Options
asobbi
Support pre-sales efforts, workshops, and proof-of-concepts alongside go-to-market teams. AI & ML Engineering Implement and optimise AI/ML models using frameworks like PyTorch, HuggingFace, LangChain, and NVIDIA Triton. Fine-tune foundation models for domain-specific use cases. Deploy and maintain inference services using REST/gRPC APIs, containerised and distributed systems. Data & Knowledge Integration Build … talks, case studies) on real-world AI implementation. Required Qualifications Background in AI/ML engineering, applied AI, or technical solutions delivery. Strong experience with: Retrieval-Augmented Generation (e.g., LangChain, LlamaIndex, vector databases). LLM fine-tuning techniques (LoRA, PEFT, instruction tuning). Deploying models in production (Triton Inference Server, HuggingFace, Kubernetes). Advanced Python skills; bonus for experience in More ❯
slough, south east england, united kingdom Hybrid / WFH Options
asobbi
Support pre-sales efforts, workshops, and proof-of-concepts alongside go-to-market teams. AI & ML Engineering Implement and optimise AI/ML models using frameworks like PyTorch, HuggingFace, LangChain, and NVIDIA Triton. Fine-tune foundation models for domain-specific use cases. Deploy and maintain inference services using REST/gRPC APIs, containerised and distributed systems. Data & Knowledge Integration Build … talks, case studies) on real-world AI implementation. Required Qualifications Background in AI/ML engineering, applied AI, or technical solutions delivery. Strong experience with: Retrieval-Augmented Generation (e.g., LangChain, LlamaIndex, vector databases). LLM fine-tuning techniques (LoRA, PEFT, instruction tuning). Deploying models in production (Triton Inference Server, HuggingFace, Kubernetes). Advanced Python skills; bonus for experience in More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
asobbi
Support pre-sales efforts, workshops, and proof-of-concepts alongside go-to-market teams. AI & ML Engineering Implement and optimise AI/ML models using frameworks like PyTorch, HuggingFace, LangChain, and NVIDIA Triton. Fine-tune foundation models for domain-specific use cases. Deploy and maintain inference services using REST/gRPC APIs, containerised and distributed systems. Data & Knowledge Integration Build … talks, case studies) on real-world AI implementation. Required Qualifications Background in AI/ML engineering, applied AI, or technical solutions delivery. Strong experience with: Retrieval-Augmented Generation (e.g., LangChain, LlamaIndex, vector databases). LLM fine-tuning techniques (LoRA, PEFT, instruction tuning). Deploying models in production (Triton Inference Server, HuggingFace, Kubernetes). Advanced Python skills; bonus for experience in More ❯
trends within AI and ML, whilst planning, developing, and maintaining production. What Youll Bring 3+ years experience in Generative AI experience Proven experience with language-based Generative AI eg. LangChain, LangSmith, LangGraph, RAG, Proficiency in Python, data science libraries such as numpy, Pandas, scikit-learn Experienced in LLM frameworks such as LangChain Ability to build and create prototypes in a More ❯
and ML, whilst incorporating new ideas and techniques into the team. What Youll Bring 3+ years experience in Generative AI experience Proven experience with language-based Generative AI eg. LangChain, LangSmith, LangGraph, RAG, Proficiency in Python, data science libraries such as numpy, Pandas, scikit-learn Experienced in LLM frameworks such as LangChain Strong client-facing consulting or B2B experience Ability More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
MicroTECH Global Ltd
Hybrid Working Required - 3 Days On, 2 days Remote Our client are an AI-Native FinTech start-up based in London. Brief:We're building an AI-native financial operations platform. As an ML/LLM Engineer, you'll design More ❯
of-the-art tools and platforms. Essential Skills for the Role: Python (FastAPI): Proficiency in Python, particularly with the FastAPI framework, is vital to develop robust, scalable AI solutions. LangChain: Practical experience using LangChain to integrate and operationalize large language models. Google AI SDK & Azure Open AI SDK: Expertise with these SDKs to deploy and manage AI capabilities within GCP More ❯
of-the-art tools and platforms. Essential Skills for the Role: Python (FastAPI): Proficiency in Python, particularly with the FastAPI framework, is vital to develop robust, scalable AI solutions. LangChain: Practical experience using LangChain to integrate and operationalize large language models. Google AI SDK & Azure Open AI SDK: Expertise with these SDKs to deploy and manage AI capabilities within GCP More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
eTeam
a strong grasp of architectural principles, design patterns, and hands-on experience building scalable applications. Familiarity with Angular, Next.js, Node.js-based APIs, and GenAI frameworks (e.g., OpenAI, Hugging Face, LangChain) is a plus. This role demands a deep understanding of application compatibility, security, and API integration within modern cloud environments. Key Responsibilities: • Lead the design and development of robust, scalable … Bedrock). Preferred Skills: • Experience with Angular and hybrid frontend architectures. • Exposure to CI/CD pipelines and cloud-native development practices. • Familiarity with GenAI frameworks and libraries (e.g., LangChain, Transformers, OpenAI SDK). • Strong problem-solving and communication skills. • AWS certification or equivalent cloud experience is a plus. More ❯
milton, central scotland, united kingdom Hybrid / WFH Options
eTeam
a strong grasp of architectural principles, design patterns, and hands-on experience building scalable applications. Familiarity with Angular, Next.js, Node.js-based APIs, and GenAI frameworks (e.g., OpenAI, Hugging Face, LangChain) is a plus. This role demands a deep understanding of application compatibility, security, and API integration within modern cloud environments. Key Responsibilities: • Lead the design and development of robust, scalable … Bedrock). Preferred Skills: • Experience with Angular and hybrid frontend architectures. • Exposure to CI/CD pipelines and cloud-native development practices. • Familiarity with GenAI frameworks and libraries (e.g., LangChain, Transformers, OpenAI SDK). • Strong problem-solving and communication skills. • AWS certification or equivalent cloud experience is a plus. More ❯
paisley, central scotland, united kingdom Hybrid / WFH Options
eTeam
a strong grasp of architectural principles, design patterns, and hands-on experience building scalable applications. Familiarity with Angular, Next.js, Node.js-based APIs, and GenAI frameworks (e.g., OpenAI, Hugging Face, LangChain) is a plus. This role demands a deep understanding of application compatibility, security, and API integration within modern cloud environments. Key Responsibilities: • Lead the design and development of robust, scalable … Bedrock). Preferred Skills: • Experience with Angular and hybrid frontend architectures. • Exposure to CI/CD pipelines and cloud-native development practices. • Familiarity with GenAI frameworks and libraries (e.g., LangChain, Transformers, OpenAI SDK). • Strong problem-solving and communication skills. • AWS certification or equivalent cloud experience is a plus. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Transparent Technology
compliance sector. As a Solutions Engineer/AI Engineer/LLM Engineer , you'll work directly with clients, design POCs and production-ready solutions , and deliver impact using LLMs, LangChain, Python, and Azure . With 3+ years' experience, excellent communication skills, and a passion for Generative AI , you'll thrive in a hybrid role (2-3 days in London) where … a week. Apply by sending your CV today and Nathan Laidlaw from Transparent Tech will be in touch. Keywords: Solutions Engineer, AI Engineer, AI Consultant, Generative AI, Python, Azure, LangChain, LLMs, Hybrid London More ❯
Who are we? Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. More ❯
architecture and conduct technical reviews.Ensure seamless integration with enterprise systems and APIs.Act as a thought leader through publications, events, and industry presence.Track and experiment with emerging technologies (LLMs, AutoGPT, LangChain, CrewAI, RL).Prototype new tools and frameworks; drive innovation.Mentor junior consultants and deliver AI/agentic training content.Define and enforce ethical, regulatory-compliant AI standards.Implement governance for autonomous decision-making.What … based agents).Proven track record in building production-ready agentic systems, including multi-agent coordination and decision-making frameworks.Strong grasp of SDLC and software best practices.Familiarity with frameworks like LangChain, AutoGen, CrewAI.Systems-thinking approach to designing distributed, enterprise-integrated AI solutions.Proficiency in Python and cloud deployment using IaC.Experience deploying MS Copilot Agents for business solutions.(Desirable) Advanced degree in Software More ❯
Role: Python AIML developer Bill Rate: $86/hour C2C Location:Washington, DC Duration: 12+ months/long-term Interview Criteria: Telephonic + Skype Direct Client RequirementPython Programming:At least 5 years of hands-on experience with Python, particularly inframeworks More ❯
Python Programming: At least 5 years of hands-on experience with Python, particularly inframeworks like FastAPI, Django, Flask, and experience using AI frameworks.Access Control Expertise: Strong understanding of access control models such as Role-BasedAccess Control (RBAC) and Attribute-Based More ❯
vector stores (Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for classified materials with appropriate data sanitisation - Build monitoring and evaluation systems that operate within air-gapped infrastructure Essential Requirements … optimisation - Python expertise with offline dependency management and local package mirrors Technical Stack (All On-Premises) Models: Llama 3, Mistral, Qwen (locally hosted) Vector Stores: Chroma, FAISS, Milvus Orchestration: LangChain, LangGraph for agents Hosting: vLLM, TGI, Ollama on bare metal/private cloud Infrastructure: Air-gapped Kubernetes, local container registries Desirable Skills - Experience with defence/government IT security protocols More ❯
vector stores (Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for classified materials with appropriate data sanitisation - Build monitoring and evaluation systems that operate within air-gapped infrastructure Essential Requirements … optimisation - Python expertise with offline dependency management and local package mirrors Technical Stack (All On-Premises) Models: Llama 3, Mistral, Qwen (locally hosted) Vector Stores: Chroma, FAISS, Milvus Orchestration: LangChain, LangGraph for agents Hosting: vLLM, TGI, Ollama on bare metal/private cloud Infrastructure: Air-gapped Kubernetes, local container registries Desirable Skills - Experience with defence/government IT security protocols More ❯
vector stores (Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for classified materials with appropriate data sanitisation - Build monitoring and evaluation systems that operate within air-gapped infrastructure Essential Requirements … optimisation - Python expertise with offline dependency management and local package mirrors Technical Stack (All On-Premises) Models: Llama 3, Mistral, Qwen (locally hosted) Vector Stores: Chroma, FAISS, Milvus Orchestration: LangChain, LangGraph for agents Hosting: vLLM, TGI, Ollama on bare metal/private cloud Infrastructure: Air-gapped Kubernetes, local container registries Desirable Skills - Experience with defence/government IT security protocols More ❯