City of London, London, United Kingdom Hybrid/Remote Options
Bloc Recruitment
system is secure, scalable, and sustainable. What You'll Do Define and drive the architectural vision across multiple engineering teams Guide system design for complex, cloud-native, containerised environments (Kubernetes, Terraform, Helm, Flux) Own the evolution of our data model and processing platforms Embed best practices in observability , security , and testing across the organisation Anticipate dependencies and unblock teams before … designing and delivering production systems at scale Expertise in modern backend architectures (Node.js, Python, Go) and fluency with frontend ecosystems (React, TypeScript) Deep experience with cloud-native infrastructure (Terraform, Kubernetes, Helm, Flux) Strong grasp of security, observability, and operational excellence Proven ability to influence and guide without formal authority Track record of mentoring and developing high-performing engineering teams Why More ❯
Job: Site Reliability Engineer (SRE) Location: Basildon (5 days on-site) What you'll do: Run and automate Kubernetes clusters on cloud platforms (AWS, Azure, or GCP). Use tools like Terraform to build and manage infrastructure. Monitor systems, find problems, and fix them (including an on-call rotation) click apply for full job details More ❯
Play a key role in shaping the future of digital services used by over 13 million users. Growing market-leader with excellent career development and training. Be part of a collaborative, forward-thinking digital development team. Were hiring a Software More ❯
Cegeka UK are seeking a DV cleared Application Developer with expertise in Python, PyTorch, Data Handling, Large Language Model (LLM), GitHub, Linux CLI, Docker and Kubernetes. Any additional knowledge of Artificial Intelligence (AI), Machine Learning (ML) and Natural Language Processing More ❯
My client an insurance SaaS provider start-up at an exciting time of growth is looking for a Platform Engineer who is accustomed to working in small-teams. To manage cloud infrastructure, CI/CD pipelines, and overall platform reliability. More ❯
Edinburgh, Scotland, United Kingdom Hybrid/Remote Options
CommuniTech Recruitment Group
Staff Engineer. AI Legal Tech Firm. Typescript/React/AWS. £80,000 +Amazing benefits. 3 Days a week Hybrid. Description As a Staff Engineer at my award winning client,, you’ll lead the delivery of large-scale features and More ❯
Software Engineer - £505/day inside IR35 - 6 month contract About the Role Are you a talented Software Engineer looking to take your career to new heights? As a Software Engineer, you will play a crucial role in developing, maintaining More ❯
many large brands and therefore it is a huge undertaking to drive best in practice technology, moving away from the Windows/IIS model towards .NET Core and then Kubernetes with images on Azure. The plan is for the stack to be run anywhere that Kubernetes can be deployed so they can run their choice of cloud solution. Responsibilities will More ❯
Kubernetes isn’t just a tool for you, it’s your domain. You build, scale, and secure clusters with precision, keeping complex systems running like clockwork. You’re well versed in Helm, GitOps, and different deployment strategies, choosing the right approach for each situation. All part of your toolkit, ready to be deployed when needed. In this role, you’ll … take ownership of optimising Kubernetes infrastructure at scale, boosting performance, security, and reliability. You’ll refine CI/CD pipelines, automate deployments, and troubleshoot issues to maintain peak uptime. Infrastructure as Code is second nature to you, and your Linux scripting skills will be put to work, especially with complex large scale cloud migrations. What’s in it for you … The chance to work on cutting edge Kubernetes projects, hands on with the latest tools, and the freedom to introduce new technologies. You’ll have access to fully funded training, conferences, and certifications. Maybe even becoming a certified Kubestronaut. Both a Leadership and hands-on nature will be required coming in at this level More ❯
Data Science Engineer - MLOPS, Machine Learning, AI, Artificial Intelligence, Azure, PyTorch, TensorFlow, LangChain, OpenAI, Docker, Kubernetes, GenAI, ETL We are actively working with a global law firm who are actively looking to bolster their IT team as they undergo a global-scale cloud transformation. At present they are looking to take on a new Data Science Engineer (MLOPS, Machine Learning … AI, Artificial Intelligence, Azure, PyTorch, TensorFlow, LangChain, OpenAI, Docker, Kubernetes, GenAI, ETL) to join their team on a permanent basis. this role we be responsible for the design, development and delivery of advanced analytics and AI solutions. This is a fantastic time to join a top-tier global law firm who have a long-stream of projects in the pipeline … alongside a diverse and collaborative team environment. To be considered for this Data Science Engineer (MLOPS, Machine Learning, AI, Artificial Intelligence, Azure, PyTorch, TensorFlow, LangChain, OpenAI, Docker, Kubernetes, GenAI, ETL) role, it's ideal you have: Ideal but not required law firm experience 2-4 years experience within AI/ML positions Knowledge of cloud platforms (Ideally Azure) AI/ More ❯
Minimum of 4 years' hands-on experience with containers, particularly in a Microsoft Azure environment. Demonstrated expertise in AZURE KUBERNETES SERVICES (AKS), Azure Container Instances (ACI), and Azure Container Apps (ACA). Strong ability to automate processes and create monitoring solutions using tools like GitLab CI, Kubecost, Azure DevOps, and other relevant technologies. Experience in designing and implementing secure and … scalable container solutions in Azure, leveraging tools such as Microsoft Entra ID, Azure CLI, and Azure Kubernetes Service. Familiarity with various Azure Cosmos DB APIs and PostgreSQL for database management and operation. Proficient in Helm charts and the Helm release process. Skilled in writing PowerShell and Bash scripts for automation More ❯
Minimum of 4 years' hands-on experience with containers, particularly in a Microsoft Azure environment. Demonstrated expertise in AZURE KUBERNETES SERVICES (AKS), Azure Container Instances (ACI), and Azure Container Apps (ACA). Strong ability to automate processes and create monitoring solutions using tools like GitLab CI, Kubecost, Azure DevOps, and other relevant technologies. Experience in designing and implementing secure and … scalable container solutions in Azure, leveraging tools such as Microsoft Entra ID, Azure CLI, and Azure Kubernetes Service. Familiarity with various Azure Cosmos DB APIs and PostgreSQL for database management and operation. Proficient in Helm charts and the Helm release process. Skilled in writing PowerShell and Bash scripts for automation More ❯
ROLE: As a Senior Platform Engineer , you'll support the data team by developing and maintaining secure, Linux-based data stacks. You'll play a key part in building Kubernetes environments, managing Spark clusters, and ensuring secure-by-design standards across all platform activity. This is a hands-on, technically varied role where you'll collaborate closely with security, data … and software teams to deliver high-assurance systems for national-level projects. SENIOR PLATFORM ENGINEER ESSENTIAL SKILLS: Excellent Linux engineering experience Strong Kubernetes and Docker knowledge Confident scripting in Python and Bash Experience with secure or air-gapped environments Familiarity with HPC or distributed data systems (e.g. Spark, Delta Lake) Knowledge of security, encryption, and compliance standards TO BE CONSIDERED … express consent for us to process and submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS: Senior Platform Engineer, Linux, Kubernetes, Docker, Python, Bash, Spark, Air-Gapped Systems, Secure-by-Design, NSD More ❯
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
East London, London, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
City of London, London, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Leeds, West Yorkshire, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Bolton, Greater Manchester, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Leigh, Greater Manchester, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Bury, Greater Manchester, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Central London / West End, London, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid/Remote Options
FRESH
of machine learning and cloud infrastructure, ensuring our models run efficiently, reliably, and cost-effectively at scale. Key Responsibilities Design, deploy, and manage ML infrastructure on AWS (SageMaker) and Kubernetes Build scalable pipelines for LLM and video generation models Optimise model inference, latency, and costs Implement monitoring, logging, and alerting for production systems Collaborate with ML teams to productionize new … and tooling for smoother ML operations Skills & Experience Strong Python skills and production coding experience Proven experience deploying/scaling LLMs or generative AI models Expertise in AWS SageMaker, Kubernetes, and CI/CD (Terraform/CloudFormation) Solid understanding of ML serving, optimisation, and cloud architecture Nice to Have: Video generation or computer vision experience, model compression, distributed training, and More ❯