AI Prompt Engineer
- Hiring Organisation
- Staffworx Limited
- Location
- East London, London, United Kingdom
- Employment Type
- Contract
- Contract Rate
- market rates, outside IR35, remote first, UK but 1-2 days on site
KNative and managed inference endpoints. Optimize model performance with quantization, distillation, caching, batching and routing strategies. Youll Bring Strong Python skills, with experience usingTransformers,LangChain,LlamaIndex and the broader GenAI ecosystem. Deep understanding of LLM behavior, prompt optimization, embeddings, retrieval and data preparation workflows. Experience with vector DBs (FAISS, Pinecone … tuning open-source LLMs. Tech Stack LLMs: GPT-4/5, Claude 3.x, Gemini 2.x, Mistral Large, LLaMA 3, Cohere Command R+, DeepSeek Frameworks: LangChain, LlamaIndex, Haystack, AutoGen, CrewAI Tools: GitHub Copilot, Cursor, LangSmith, LangFuse, Weights & Biases, MLflow, Humanloop Cloud: Azure ML, AWS SageMaker, Google Vertex AI, Databricks, Modal Infra ...