Machine Learning Engineer - Inference
- Hiring Organisation
- ConnexAI
- Location
- Manchester Area, United Kingdom
Optimise model performance across cloud platforms using frameworks like TensorRT and ONNX. Build and maintain robust API services using Python-based web frameworks (e.g., FastAPI). Manage containerisation and orchestration workflows using Docker and Kubernetes. Ensure system reliability through observability and monitoring tools like Prometheus, Grafana, and OpenTelemetry. Write concise ...