or Personalisation . Strong understanding of ML fundamentals and deep learning frameworks (e.g., PyTorch, TensorFlow, JAX). Hands-on experience with GenAI , including working with LLMs (e.g., OpenAI, Anthropic, HuggingFace models). Deep knowledge of MLOps, experimentation, and model evaluation techniques. Experience scaling ML platforms, pipelines, and tooling in cloud environments (e.g., AWS, GCP, Azure). Preferred: PhD or MS More ❯
london (city of london), south east england, united kingdom
oryxsearch.io
or Personalisation . Strong understanding of ML fundamentals and deep learning frameworks (e.g., PyTorch, TensorFlow, JAX). Hands-on experience with GenAI , including working with LLMs (e.g., OpenAI, Anthropic, HuggingFace models). Deep knowledge of MLOps, experimentation, and model evaluation techniques. Experience scaling ML platforms, pipelines, and tooling in cloud environments (e.g., AWS, GCP, Azure). Preferred: PhD or MS More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯
large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. More ❯