large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
Surrey, England, United Kingdom Hybrid / WFH Options
psd group
SQL/NoSQL databases. Experience with API development, web scraping/crawling, and data integration. Practical exposure to on premise AI model deployment and open-source AI frameworks (e.g., HuggingFace, LangChain). Ability to assess AI tools and recommend optimization for performance improvements. Strong consulting and communication skills — able to translate technical concepts into business value. Bachelor More ❯
guildford, south east england, united kingdom Hybrid / WFH Options
psd group
SQL/NoSQL databases. Experience with API development, web scraping/crawling, and data integration. Practical exposure to on premise AI model deployment and open-source AI frameworks (e.g., HuggingFace, LangChain). Ability to assess AI tools and recommend optimization for performance improvements. Strong consulting and communication skills — able to translate technical concepts into business value. Bachelor More ❯
practices. What We’re Looking For Strong academic background in Computer Science, ML, AI, or a related field. Deep hands-on experience with Python and ML frameworks (PyTorch, TensorFlow, HuggingFace). Proven background in MLOps, including model versioning, CI/CD for ML, and production monitoring. Experience building robust backend systems and APIs to serve ML models More ❯
learning. Advanced Python Development Proficiency: Experience with OOP, and data-validation libraries such as Pydantic. Deep familiarity with Python and its ecosystem for AI/ML, including libraries likePyTorch, HuggingFace Transformers, andscikit-learn. Experience withdata manipulationusing libraries such asPandasandNumPy, and familiarity withparallelizationorasynchronous programming. Proficiency in Test-Driven Development (TDD) and an understanding of Python testing libraries such More ❯
with LLMs and AI-powered search systems. Design and implement Retrieval-Augmented Generation (RAG) pipelines using open-source tools like LangChain, Langflow, and DataStax Vector Search. Fine-tune LLMs (HuggingFace, OpenAI, Cohere, or similar) to improve response quality and domain-specific performance. Optimize embedding models for better semantic search and document retrieval. Integrate AI-driven solutions with More ❯
Java, and SQL/NoSQL databases, along with API development and data integration techniques. Practical experience with deploying AI models on-premise and utilizing open-source frameworks such as HuggingFace and LangChain. Qualifications: Bachelors or Masters degree in Computer Science, Data Science, Machine Learning, Engineering, or a related field. Benefits: 25 days annual leave plus a flexible More ❯
City of London, London, United Kingdom Hybrid / WFH Options
MBN Solutions
Degree in Computer Science, AI, or related field (or equivalent practical experience). Cloud, AI/ML, or CX vendor certifications (AWS/Azure/GCP, Genesys Cloud, OpenAI, HuggingFace, etc.) are a plus. Flexible/remote working options Private healthcare and life insurance Pension plan Cycle-to-work scheme 28 days paid annual leave (including company More ❯
Degree in Computer Science, AI, or related field (or equivalent practical experience). Cloud, AI/ML, or CX vendor certifications (AWS/Azure/GCP, Genesys Cloud, OpenAI, HuggingFace, etc.) are a plus. Flexible/remote working options Private healthcare and life insurance Pension plan Cycle-to-work scheme 28 days paid annual leave (including company More ❯
london, south east england, united kingdom Hybrid / WFH Options
MBN Solutions
Degree in Computer Science, AI, or related field (or equivalent practical experience). Cloud, AI/ML, or CX vendor certifications (AWS/Azure/GCP, Genesys Cloud, OpenAI, HuggingFace, etc.) are a plus. Flexible/remote working options Private healthcare and life insurance Pension plan Cycle-to-work scheme 28 days paid annual leave (including company More ❯
slough, south east england, united kingdom Hybrid / WFH Options
MBN Solutions
Degree in Computer Science, AI, or related field (or equivalent practical experience). Cloud, AI/ML, or CX vendor certifications (AWS/Azure/GCP, Genesys Cloud, OpenAI, HuggingFace, etc.) are a plus. Flexible/remote working options Private healthcare and life insurance Pension plan Cycle-to-work scheme 28 days paid annual leave (including company More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
MBN Solutions
Degree in Computer Science, AI, or related field (or equivalent practical experience). Cloud, AI/ML, or CX vendor certifications (AWS/Azure/GCP, Genesys Cloud, OpenAI, HuggingFace, etc.) are a plus. Flexible/remote working options Private healthcare and life insurance Pension plan Cycle-to-work scheme 28 days paid annual leave (including company More ❯