Staff Software Engineer
Staff Software Engineer – AI/ML (MLOps)
Engineering | UK Remote / London (hybrid working)
Overview
My client is a fast-growing, product-led technology company investing heavily in AI and machine learning. They are building advanced AI-driven products and are looking for a Staff Engineer to help scale their AI/ML and MLOps capability.
The Role
My client is seeking a Staff Software Engineer with strong experience in MLOps/DevOps and AI infrastructure.
This is a hands-on technical leadership role focused on designing, building, and scaling production-grade machine learning systems. You will work across the full lifecycle, from model development through to deployment and optimisation, while helping define best practices and technical direction.
Key Responsibilities
- Lead the design and delivery of AI/ML systems, including model training, serving, and deployment
- Own end-to-end ML lifecycle (experimentation → production)
- Build and optimise scalable data pipelines and infrastructure
- Collaborate with Engineering, Product, and DevOps teams
- Define and implement MLOps best practices and tooling
- Improve system performance, reliability, and scalability
- Mentor engineers and support technical development across the team
- Ensure high standards in data quality, security, and compliance
Skills & Experience
- Strong experience in AI/ML engineering and MLOps within production environments
- Background in technical leadership or senior engineering roles
- Proficiency in Python (or similar backend languages)
- Experience with ML infrastructure, pipelines, and model deployment
- Knowledge of LLMs and modern AI systems
- Experience working with cloud platforms (AWS, GCP, or Azure)
- Understanding of distributed systems and scalable architectures
- Experience with CI/CD, infrastructure as code, and microservices
- Strong collaboration and communication skills
- Experience working in Agile environments
- Degree in a relevant technical field
Tech Stack (indicative)
- Python, Java or similar
- Cloud platforms (AWS / GCP / Azure)
- Containerisation (Docker, Kubernetes)
- Infrastructure as Code (e.g. Terraform)
- Observability & ML tooling
- Modern LLM frameworks and tooling
- Relational and non-relational databases
Benefits
- Competitive salary
- 25 days’ annual leave + bank holidays
- Sick pay package
- Flexible / remote working
- Enhanced parental leave
- Pension scheme
- Career development opportunities
Additional Information
This role is not eligible for visa sponsorship.