nice if you have: Hands-on experience with OpenAI's GPT-4o, o1, and Claude models from Anthropic. Familiarity with vector databases (e.g., Pinecone, Weaviate, or similar). Experience building applications with Docker and Kubernetes. Proven expertise in building highly secure, fault-tolerant APIs. Experience building high-performance, distributed systems More ❯
San Francisco, California, United States Hybrid / WFH Options
esrhealthcare
using frameworks such as OpenAI GPT or Anthropic Claude. Design and implement RAG pipelines for scalable, real-time applications leveraging vector databases like Pinecone, Weaviate, Opensearch. Develop prompt engineering strategies to optimize model outputs for specific use cases. Design and deploy scalable ML models that integrate with existing systems. End … with LLMs (e.g., OpenAI GPT models, Anthropic Claude) and fine-tuning techniques. Strong understanding of RAG architectures and vector database integration (e.g., Opensearch, Pinecone, Weaviate). API Development: FastAPI, Flask, Django Containerization: Docker, AWS ECS, Kubernetes Cloud & Data Tools: Experience with cloud platforms such as AWS (SageMaker preferred), GCP Vertex More ❯
MySQL, MongoDB, Firebase, Redis, Elasticsearch GCP, AWS, Azure, DigitalOcean The Future is AI We're building smarter with Generative AI -think vector embeddings, Pinecone, Weaviate, Langchain, Chainlink. If AI-powered development excites you, you'll feel right at home here. More ❯
we use it to power our analytics) OCR engines (we use AWS Textract, GDocAI, and we have used tesseractOCR in the past) Prompt Engineering Weaviate (we use it for RAG in LLM powered tasks and for hybrid searches) Kubernetes (we run Weaviate and other specific services on Kubernetes) CircleCI DataDog More ❯