AI Prompt Engineer
- Hiring Organisation: Staffworx
- Location: Greater London, England, United Kingdom
Docker, Kubernetes, Knative, and managed inference endpoints.
Optimize system performance with:
- Caching, batching, routing, and fallback strategies
- Quantization and distillation for efficient inference
- Cost, latency, and reliability optimization
Design resilient, observable GenAI systems suitable for production environments.
EXPERIENCE
Strong Python engineering skills with hands-on experience across ...