Arlington, Virginia, United States Hybrid / WFH Options
G2 Ops, Inc
using OpenAI API, RAG, or embedding models. Familiarity with prompt engineering for model fine-tuning or inference optimization. Understanding of vector databases (e.g., Qdrant, Pinecone) and semantic search techniques. Use of MLOps tools for CI/CD pipelines in AI (e.g., MLflow, Kubeflow, SageMaker). AI for Systems Engineering Experience …
be nice if you have: Hands-on experience with OpenAI's GPT-4o, o1, and Claude models from Anthropic. Familiarity with vector databases (e.g., Pinecone, Weaviate, or similar). Experience building applications with Docker and Kubernetes. Proven expertise in building highly secure, fault-tolerant APIs. Experience building high-performance, distributed …
San Francisco, California, United States Hybrid / WFH Options
esrhealthcare
LLMs) using frameworks such as OpenAI GPT or Anthropic Claude. Design and implement RAG pipelines for scalable, real-time applications leveraging vector databases like Pinecone, Weaviate, OpenSearch. Develop prompt engineering strategies to optimize model outputs for specific use cases. Design and deploy scalable ML models that integrate with existing systems. … experience with LLMs (e.g., OpenAI GPT models, Anthropic Claude) and fine-tuning techniques. Strong understanding of RAG architectures and vector database integration (e.g., OpenSearch, Pinecone, Weaviate). API Development: FastAPI, Flask, Django. Containerization: Docker, AWS ECS, Kubernetes. Cloud & Data Tools: Experience with cloud platforms such as AWS (SageMaker preferred), GCP …
London, South East England, United Kingdom Hybrid / WFH Options
twentyAI
Develop and optimise RAG pipelines using LangChain, LlamaIndex, or Haystack. Build ingestion workflows (OCR, chunking, embedding, semantic search) and integrate with vector databases (FAISS, Pinecone, Qdrant). Ensure seamless integration of GenAI services into business workflows, prioritising security, scalability, and compliance. Collaborate with cross-functional teams (data scientists, architects, engineers …
and improving developer workflows. Excellent communication and collaboration skills in a remote-first environment. Experience contributing to open-source AI projects. Experience with LangChain, Pinecone, or similar AI frameworks/infrastructure. Past experience building AI features into developer platforms or tools. Benefits Our entire company is distributed, so we take …
Docker; MySQL, MongoDB, Firebase, Redis, Elasticsearch; GCP, AWS, Azure, DigitalOcean. The Future is AI. We're building smarter with Generative AI: think vector embeddings, Pinecone, Weaviate, LangChain, Chainlink. If AI-powered development excites you, you'll feel right at home here.