Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
product teams to ship quickly and safely, applying SRE and DevSecOps practices throughout the software delivery lifecycle. What you'll be doing Design, build and operate the Azure-based Internal Developer Platform as a product, enabling self-service environment provisioning and repeatable golden paths. Develop and maintain Infrastructure as Code (Terraform and/or Bicep) modules … and progressive delivery. Introduce and run GitOps for Kubernetes (AKS preferred), patterns and multi-environment promotions. Own platform observability: metrics, logs and traces using AzureMonitor/LogAnalytics/ApplicationInsights, plus Datadog/Grafana where appropriate. Embed security by design: Azure Policy, Defender for Cloud, secrets management with Key … secrets, RBAC and workload identity. Experience with GitOps, and container build pipelines (e.g., ACR, OPA policies, image scanning). Working knowledge of observability tooling (AzureMonitor, LogAnalytics, ApplicationInsights, Datadog/Grafana) and alerting/response workflows. Understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well More ❯
Employment Type: Permanent, Part Time, Work From Home
of machine learning models in production. What you'll be doing as Lead ML Ops Engineer: Leading the design and implementation of robust ML Ops pipelines using Azure, Databricks, and Delta Lake Architecting and overseeing API services and caching layers (e.g., Azure Cache for Redis) Driving integration with cloud-based data storage … best practices for model deployment, monitoring, and lifecycle management Conducting performance tuning, load testing, and reliability engineering Managing CI/CD workflows and infrastructure as code via Azure DevOps and GitHub Mentoring junior engineers and fostering a culture of technical excellence and innovation What we're looking for from the Machine Learning Operations Lead: Proven experience … in ML Ops leadership, with deep expertise in Azure, Databricks, and cloud-native architectures Strong understanding of Postgres, Redis, Snowflake, and Delta Lake Architecture Hands-on experience with Docker, container orchestration, and scalable API design Excellent communication and stakeholder management skills Ability to drive strategic initiatives and influence technical direction Bonus: experience with AzureMore ❯
Employment Type: Permanent
Salary: £70000 - £90000/annum 25+bank, bonus + more
of machine learning models in production. What you'll be doing as Lead ML Ops Engineer: Leading the design and implementation of robust ML Ops pipelines using Azure, Databricks, and Delta Lake Architecting and overseeing API services and caching layers (e.g., Azure Cache for Redis) Driving integration with cloud-based data storage … best practices for model deployment, monitoring, and lifecycle management Conducting performance tuning, load testing, and reliability engineering Managing CI/CD workflows and infrastructure as code via Azure DevOps and GitHub Mentoring junior engineers and fostering a culture of technical excellence and innovation What we're looking for from the Machine Learning Operations Lead: Proven experience … in ML Ops leadership, with deep expertise in Azure, Databricks, and cloud-native architectures Strong understanding of Postgres, Redis, Snowflake, and Delta Lake Architecture Hands-on experience with Docker, container orchestration, and scalable API design Excellent communication and stakeholder management skills Ability to drive strategic initiatives and influence technical direction Bonus: experience with AzureMore ❯
orchestration, integration patterns (e.g., module federation, event-driven architecture), and distributed systems. Demonstrate deep expertise in React (including advanced patterns, SSR/SSG, Next.js), Node.js, TypeScript, and Azure cloud services. Define and enforce comprehensive testing strategies (unit, integration, E2E) using Jest, Testing Library, Playwright, and champion TDD/BDD practices. Architect and implement scalable, secure APIs … vulnerabilities. Architect logging, monitoring, and tracing strategies (OpenTelemetry, Prometheus, ApplicationInsights), and drive adoption of best practices for platform reliability. Architect and optimise CI/CD pipelines (Azure DevOps, GitHub Actions), automate quality gates, enable blue/green deployments, and drive continuous delivery. Set and enforce standards for Terraform/ARM templates, environment management, and disaster More ❯