and high-quality production code, and reviews and debugs code written by others, with a focus on cloud-based systems using AWS and Python. Implements and optimizes RAG-based semanticsearch and LLM inference workflows using OpenAI and Claude models. Drives decisions that influence the product design, application functionality, and technical operations and processes. Serves as a function … capabilities. Practical cloud-native experience, specifically with AWS. Experience in Computer Science, Computer Engineering, Mathematics, or a related technical field. Preferred qualifications, capabilities, and skills Experience with RAG-based semanticsearch and LLM inference workflows using OpenAI and Claude models. Proven track record of proposing solutions independently and owning execution end-to-end in an individual contributor role. More ❯
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone … to accelerate the results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is The Role The Search Inference team is responsible for bringing performant, ergonomic, and cost effective … machine learning (ML) model inference to Search workflows. ML inference has become a crucial part of the modern search experience whether used for query understanding, semanticsearch, RAG, or any other GenAI use-case. Our goal is to simplify ML inference in Search workflows by focusing on large scale inference capabilities for embeddings and reranking More ❯
techniques (e.g. CatBoost, XGBoost, RoBERTa), and leveraging both structured and unstructured data sources. Experimenting with the integration and fine-tuning of models with Vector databases and embeddings to support semanticsearch, RAG (retrieval-augmented generation), and domain-specific applications. Working within a Data Mesh architecture, collaborating across domains to ensure scalable, interoperable data products; containerising solutions with Docker More ❯
for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. Integrate semanticsearch strategies and embeddings to improve LLM relevance and contextual understanding. Build demo-ready user interfaces with tools like Streamlit, Gradio, or React. Develop and maintain robust APIs … credibility with Google - experience working collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semanticsearch, vector databases, and memory management strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. Integrate semanticsearch strategies and embeddings to improve LLM relevance and contextual understanding. Build demo-ready user interfaces with tools like Streamlit, Gradio, or React. Develop and maintain robust APIs … credibility with Google - experience working collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semanticsearch, vector databases, and memory management strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI More ❯
for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. • Integrate semanticsearch strategies and embeddings to improve LLM relevance and contextual understanding. • Build demo-ready user interfaces with tools like Streamlit, Gradio, or React. • Develop and maintain robust APIs … credibility with Google - experience working collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semanticsearch, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. • Integrate semanticsearch strategies and embeddings to improve LLM relevance and contextual understanding. • Build demo-ready user interfaces with tools like Streamlit, Gradio, or React. • Develop and maintain robust APIs … credibility with Google - experience working collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semanticsearch, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. • Integrate semanticsearch strategies and embeddings to improve LLM relevance and contextual understanding. • Build demo-ready user interfaces with tools like Streamlit, Gradio, or React. • Develop and maintain robust APIs … credibility with Google - experience working collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semanticsearch, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI More ❯
with domain-specific knowledge. Prompt engineering and fine-tuning for tailoring model behavior to business-specific contexts. Use of embedding stores and vector databases (e.g., Pinecone, Redis, Azure AI Search) to support semanticsearch and recommendation systems. Building intelligent features like AI-powered chatbots , assistants , and question-answering systems using LLMs and conversational agents. Awareness of agentic … AI concepts — orchestrating multiple agents with specific tasks/goals in a collaborative, dynamic environment. Familiarity with tools and frameworks that enable LLM-based integrations such as LangChain , Semantic Kernel , or Azure OpenAI . Appreciation for ethical AI considerations including data privacy , security , and bias mitigation . Eagerness to explore emerging technologies and collaborate with AI/ML teams More ❯
with domain-specific knowledge. Prompt engineering and fine-tuning for tailoring model behavior to business-specific contexts. Use of embedding stores and vector databases (e.g., Pinecone, Redis, Azure AI Search) to support semanticsearch and recommendation systems. Building intelligent features like AI-powered chatbots , assistants , and question-answering systems using LLMs and conversational agents. Awareness of agentic … AI concepts — orchestrating multiple agents with specific tasks/goals in a collaborative, dynamic environment. Familiarity with tools and frameworks that enable LLM-based integrations such as LangChain , Semantic Kernel , or Azure OpenAI . Appreciation for ethical AI considerations including data privacy , security , and bias mitigation . Eagerness to explore emerging technologies and collaborate with AI/ML teams More ❯
data science, and behavioural modelling to drive meaningful insights and innovations. As AI agents become the norm in this industry, so does the need for efficient legal information retrieval, semanticsearch capabilities, and structured knowledge representation. Your work will bridge the gap between advanced search technologies and the complex information needs of the legal domain, making legal … engineering Legal knowledge base construction Specialised embedding methods for multimodal content in the legal domain Develop methods for retrieval and reasoning over legal knowledge bases and systems (e.g., hybrid search approaches combining symbolic and neural techniques, query understanding and rewriting for legal search) Perform fine-tuning and reinforcement learning to teach language models how to interact with new More ❯
AI engineering team. This role is at the cutting edge of applied LLM development—rapidly prototyping intelligent systems that leverage autonomous agents, reasoning workflows, RAG pipelines, and real-time semantic search. You’ll work across frameworks like LangChain, LangGraph, CrewAI, and LangFlow, designing smart, multi-step agents that can interact with APIs, memory modules, structured databases, and UIs. If … and innovate. As a Generative & Agentic AI Engineer, you will: Build fast, iterative prototypes using LLMs and agent frameworks Design and implement agent workflows, memory handling, RAG systems, and semanticsearch Integrate vector DBs (Pinecone, FAISS, Chroma), cloud AI services, and APIs Develop lightweight UIs (Streamlit, Gradio, React) and observability for demos Work hands-on with prompt engineering … tools Requirements: Strong Python development skills Hands-on experinece working with GCP and Google teams Experience with LangChain, LangGraph, CrewAI, Autogen, or similar frameworks Background in RAG, embeddings, vector search, and memory architecture Familiarity with cloud AI platforms and deploying models via APIs Comfortable building end-to-end prototypes (backend to simple UI) Max 2-month notice period ideally More ❯
AI engineering team. This role is at the cutting edge of applied LLM development—rapidly prototyping intelligent systems that leverage autonomous agents, reasoning workflows, RAG pipelines, and real-time semantic search. You’ll work across frameworks like LangChain, LangGraph, CrewAI, and LangFlow, designing smart, multi-step agents that can interact with APIs, memory modules, structured databases, and UIs. If … and innovate. As a Generative & Agentic AI Engineer, you will: Build fast, iterative prototypes using LLMs and agent frameworks Design and implement agent workflows, memory handling, RAG systems, and semanticsearch Integrate vector DBs (Pinecone, FAISS, Chroma), cloud AI services, and APIs Develop lightweight UIs (Streamlit, Gradio, React) and observability for demos Work hands-on with prompt engineering … tools Requirements: Strong Python development skills Hands-on experinece working with GCP and Google teams Experience with LangChain, LangGraph, CrewAI, Autogen, or similar frameworks Background in RAG, embeddings, vector search, and memory architecture Familiarity with cloud AI platforms and deploying models via APIs Comfortable building end-to-end prototypes (backend to simple UI) Max 2-month notice period ideally More ❯
City of London, London, United Kingdom Hybrid / WFH Options
trg.recruitment
CD, Docker, Kubernetes, and AWS Proven delivery of NLP/OCR projects with unstructured data Experience managing or mentoring engineers Bonus points for: Experience in regulated domains Knowledge of semanticsearch, graph/vector databases Java or C# background and understanding of SOLID principles 📩 Interested? Message me here or email mmatysik@trg-uk.com More ❯
CD, Docker, Kubernetes, and AWS Proven delivery of NLP/OCR projects with unstructured data Experience managing or mentoring engineers Bonus points for: Experience in regulated domains Knowledge of semanticsearch, graph/vector databases Java or C# background and understanding of SOLID principles 📩 Interested? Message me here or email mmatysik@trg-uk.com More ❯
to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. Who are we? Our mission is to scale intelligence to serve … humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to increasing More ❯
100x growth milestone. In this role, you will: Own the development and application of LLM-based features end-to-end Build and maintain evaluation frameworks for prompt iteration and semantic retrieval Optimise embeddings and vector stores for performance and scalability Interact with users to understand their problems and design solutions Stay up-to-date with the latest trends in … product Your background looks something like: Engineering experience at tech and product-driven companies Shipping LLM-based solutions to production Experience with inference and evaluation platforms Experience with RAG, semanticsearch, vector stores, embedding Building features end-to-end with TypeScript, React.js, and Node.js As a person, you Are first and foremost abuilder. Are excited towork in-personfrom More ❯
to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone … to accelerate the results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is the Role Are you looking to make a real impact and play a meaningful role in … focused on excellence; taking the initiative to improve both themselves and the team through continuous learning and questioning the status quo. What Will You Be Doing Architect next-generation search solutions: Design and implement cutting-edge solutions utilizing Elastic's latest advancements in AI, semanticsearch, and vector search, including RAG, vector databases, and other emerging More ❯
to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semanticsearch, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is More ❯
AI to accelerate the creation of market-ready products. Our products span global patent databases, scientific journals, technical literature, and custom enterprise solutions. By combining cutting-edge AI, deep semanticsearch, advanced analytics, and workflow automation, our platform empowers R&D, innovation, and IP teams across industries like engineering, life sciences, technology, and finance. With over … sales, product, and delivery teams to drive successful client outcomes. As a Senior Professional Services Consultant, you will also be responsible for supporting the delivery of quality proposals for search and research requests. You will also be responsible for collaborating and reviewing analysis, recommendations and client-facing deliverables. This is a hybrid position in our London, UK office with More ❯