and security initiatives across AWS. Mentor team members and help shape the future of the platform as the company scales. Tech Stack AWS (EKS, RDS, ElastiCache, Kinesis) Kubernetes & Docker Terraform, GitHub Actions, GitOps Ruby on Rails, Databricks Exposure to Datadog , Cloudflare , or scripting (Ruby, Python, Go) is a plus! About You Strong hands-on experience with Kubernetes (K8s) and containerisation. … Proficiency with Infrastructure as Code (Terraform). Solid understanding of AWS and CI/CD pipelines . Curious, collaborative, and passionate about building platforms that make a difference. This is a great opportunity to join a purpose-led tech company making a tangible social impact, while working with modern tools and practices in a truly supportive engineering culture. If this More ❯
high-performance tools and services to validate the reliability, performance, and correctness of ML data pipelines and AI infrastructure. Develop platform-level test solutions and automation frameworks using Python, Terraform, and modern cloud-native practices. Contribute to the platform’s CI/CD pipeline by integrating automated testing, resilience checks, and observability hooks at every stage. Lead initiatives that drive …/CD systems, GitHub Actions, Jenkins, or similar tools. Solid experience with AWS services (Lambda, S3, ECS/EKS, Step Functions, CloudWatch). Proficient in Infrastructure as Code using Terraform to manage and provision cloud infrastructure. Strong understanding of software engineering best practices: code quality, reliability, performance optimization, and observability. Preferred Qualifications Exposure to machine learning workflows, model lifecycle management More ❯
high-performance tools and services to validate the reliability, performance, and correctness of ML data pipelines and AI infrastructure. Develop platform-level test solutions and automation frameworks using Python, Terraform, and modern cloud-native practices. Contribute to the platform’s CI/CD pipeline by integrating automated testing, resilience checks, and observability hooks at every stage. Lead initiatives that drive …/CD systems, GitHub Actions, Jenkins, or similar tools. Solid experience with AWS services (Lambda, S3, ECS/EKS, Step Functions, CloudWatch). Proficient in Infrastructure as Code using Terraform to manage and provision cloud infrastructure. Strong understanding of software engineering best practices: code quality, reliability, performance optimization, and observability. Preferred Qualifications Exposure to machine learning workflows, model lifecycle management More ❯
high-performance tools and services to validate the reliability, performance, and correctness of ML data pipelines and AI infrastructure. Develop platform-level test solutions and automation frameworks using Python, Terraform, and modern cloud-native practices. Contribute to the platform’s CI/CD pipeline by integrating automated testing, resilience checks, and observability hooks at every stage. Lead initiatives that drive …/CD systems, GitHub Actions, Jenkins, or similar tools. Solid experience with AWS services (Lambda, S3, ECS/EKS, Step Functions, CloudWatch). Proficient in Infrastructure as Code using Terraform to manage and provision cloud infrastructure. Strong understanding of software engineering best practices: code quality, reliability, performance optimization, and observability. Preferred Qualifications Exposure to machine learning workflows, model lifecycle management More ❯
high-performance tools and services to validate the reliability, performance, and correctness of ML data pipelines and AI infrastructure. Develop platform-level test solutions and automation frameworks using Python, Terraform, and modern cloud-native practices. Contribute to the platform’s CI/CD pipeline by integrating automated testing, resilience checks, and observability hooks at every stage. Lead initiatives that drive …/CD systems, GitHub Actions, Jenkins, or similar tools. Solid experience with AWS services (Lambda, S3, ECS/EKS, Step Functions, CloudWatch). Proficient in Infrastructure as Code using Terraform to manage and provision cloud infrastructure. Strong understanding of software engineering best practices: code quality, reliability, performance optimization, and observability. Preferred Qualifications Exposure to machine learning workflows, model lifecycle management More ❯
Oxfordshire, England, United Kingdom Hybrid / WFH Options
Humand Talent
you love optimising platforms, scaling Kubernetes, and automating everything from deployment to compliance - this one’s for you. What You’ll Be Doing End-to-end AWS ownership using Terraform, Helm & CloudFormation Manage Kubernetes (EKS/OpenShift) and streamline CI/CD with GitHub Actions & Argo CD Lead security reviews and implement GuardDuty, IAM, and monitoring best practices Drive cost … Mentor teams on DevSecOps, automation, and cloud resilience What You’ll Bring 4+ years in DevOps/SRE with deep AWS experience Strong Kubernetes skills (EKS/OpenShift), IaC (Terraform), and CI/CD pipelines Scripting with Bash, Python, or Go Bonus: MLOps, Prometheus, AWS Karpenter, compliance experience Why Join A collaborative, supportive environment where engineers are trusted and empowered More ❯
london (city of london), south east england, united kingdom
Insight International (UK) Ltd
high-performance tools and services to validate the reliability, performance, and correctness of ML data pipelines and AI infrastructure. Develop platform-level test solutions and automation frameworks using Python, Terraform, and modern cloud-native practices. Contribute to the platform’s CI/CD pipeline by integrating automated testing, resilience checks, and observability hooks at every stage. Lead initiatives that drive …/CD systems, GitHub Actions, Jenkins, or similar tools. Solid experience with AWS services (Lambda, S3, ECS/EKS, Step Functions, CloudWatch). Proficient in Infrastructure as Code using Terraform to manage and provision cloud infrastructure. Strong understanding of software engineering best practices: code quality, reliability, performance optimization, and observability. Preferred Qualifications Exposure to machine learning workflows, model lifecycle management More ❯
Responsibilities Design, build, and manage a secure, scalable platform environment in AWS. Implement and maintain container orchestration using Kubernetes. Develop automation tooling, scripts, and infrastructure-as-code solutions (Python, Terraform, etc.). Collaborate with cross-functional teams to design solutions aligned with business and regulatory requirements. Apply strong networking knowledge to optimise performance, security, and reliability. Ensure compliance with financial … Infrastructure as Code (IaC) environments Strong awareness of security, compliance, and resilience in financial or regulated sectors Desirable: Experience introducing or working with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
Responsibilities Design, build, and manage a secure, scalable platform environment in AWS. Implement and maintain container orchestration using Kubernetes. Develop automation tooling, scripts, and infrastructure-as-code solutions (Python, Terraform, etc.). Collaborate with cross-functional teams to design solutions aligned with business and regulatory requirements. Apply strong networking knowledge to optimise performance, security, and reliability. Ensure compliance with financial … Infrastructure as Code (IaC) environments Strong awareness of security, compliance, and resilience in financial or regulated sectors Desirable: Experience introducing or working with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
you love optimising platforms, scaling Kubernetes, and automating everything from deployment to compliance - this one's for you. What You'll Be Doing End-to-end AWS ownership using Terraform, Helm & CloudFormation Manage Kubernetes (EKS/OpenShift) and streamline CI/CD with GitHub Actions & Argo CD Lead security reviews and implement GuardDuty, IAM, and monitoring best practices Drive cost … Mentor teams on DevSecOps, automation, and cloud resilience What You'll Bring 4+ years in DevOps/SRE with deep AWS experience Strong Kubernetes skills (EKS/OpenShift), IaC (Terraform), and CI/CD pipelines Scripting with Bash, Python, or Go Bonus: MLOps, Prometheus, AWS Karpenter, compliance experience Why Join A collaborative, supportive environment where engineers are trusted and empowered More ❯
Responsibilities Design, build, and manage a secure, scalable platform environment in AWS. Implement and maintain container orchestration using Kubernetes. Develop automation tooling, scripts, and infrastructure-as-code solutions (Python, Terraform, etc.). Collaborate with cross-functional teams to design solutions aligned with business and regulatory requirements. Apply strong networking knowledge to optimise performance, security, and reliability. Ensure compliance with financial … Infrastructure as Code (IaC) environments Strong awareness of security, compliance, and resilience in financial or regulated sectors Desirable: Experience introducing or working with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Alexander Edward James Consulting Limited
to critical issues. Requirements Proven experience managing and scaling Azure cloud environments . Strong background in DevOps practices , automation, CI/CD, and setting up infrastructure-as-code (e.g., Terraform, ARM, Bicep) Hands-on expertise in disaster recovery planning and execution . Solid understanding of information security , including ISO 27001 frameworks and Networking. Familiarity with monitoring, logging, and alerting tools … Consulting and Services Employment Type Full-time Job Functions Information Technology Skills Networking Management Resource Management Cloud Infrastructure Business Continuity Planning Problem Solving Software as a Service (SaaS) Infrastructure TerraformMore ❯
london (city of london), south east england, united kingdom
develop
Responsibilities Design, build, and manage a secure, scalable platform environment in AWS. Implement and maintain container orchestration using Kubernetes. Develop automation tooling, scripts, and infrastructure-as-code solutions (Python, Terraform, etc.). Collaborate with cross-functional teams to design solutions aligned with business and regulatory requirements. Apply strong networking knowledge to optimise performance, security, and reliability. Ensure compliance with financial … Infrastructure as Code (IaC) environments Strong awareness of security, compliance, and resilience in financial or regulated sectors Desirable: Experience introducing or working with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
Responsibilities Design, build, and manage a secure, scalable platform environment in AWS. Implement and maintain container orchestration using Kubernetes. Develop automation tooling, scripts, and infrastructure-as-code solutions (Python, Terraform, etc.). Collaborate with cross-functional teams to design solutions aligned with business and regulatory requirements. Apply strong networking knowledge to optimise performance, security, and reliability. Ensure compliance with financial … Infrastructure as Code (IaC) environments Strong awareness of security, compliance, and resilience in financial or regulated sectors Desirable: Experience introducing or working with AI/LLM solutions Familiarity with Terraform, Ansible, GitLab CI/CD, or similar tools Exposure to financial services or other highly regulated industries Experience with observability stacks (Prometheus, Grafana, ELK, etc. More ❯
and optimizing multi-agent AI pipelines with 4+ agents for scale, reasoning, and automation. Implementing and maintaining multi-region AWS cloud infrastructure using Infrastructure as Code (IaC) tools like Terraform . Ensuring system reliability, security hardening (IAM, GuardDuty), and comprehensive observability for all AI services. Driving zero-touch deployment using CI/CD tools like GitHub Actions and Kubernetes (EKS … and proficiency in Bash/Go for automation. Deep familiarity with core AWS services (EC2, VPC, IAM, S3, ALB/ELB, ECR/ECS). Solid experience with IaC (Terraform) and containerisation (Docker) . Experience in CI/CD engineering using GitHub Actions or Argo CD . What Will Make You Stand Out Prior research or applied experience in edge … SEO Keywords for Search Senior AI Engineer, Full-Stack AI, Agentic AI, LangGraph, LangChain, Multi-Agent System, Large Language Models, LLMs, Machine Learning, MLOps, Data Scientist, Artificial Intelligence, AWS, Terraform, Kubernetes, EKS, Docker, Containerisation, Python, Go, Bash, CI/CD, GitHub Actions, Argo CD, Edge AI, Offline AI, MOD Clearance, Security Clearance, Sagemaker, Kubeflow, ZenML, AI Architect, Solution Architect. More ❯
and release cycles. Deploy, maintain, and optimize Docker-based microservices. Troubleshoot production issues, ensuring uptime and documenting processes on the internal wiki. Automate deployments, testing processes, and infrastructure provisioning (Terraform, Ansible, GitHub Actions). Implement monitoring and observability solutions for proactive issue detection. Provide occasional support for internal IT infrastructure (e.g., laptops, printers, office networking). Occasionally maintain and support …/pfSense firewall management VPN configuration (OpenVPN, WireGuard) Authentik (SSO/identity management) Docker containerization Python scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service More ❯
and release cycles. Deploy, maintain, and optimize Docker-based microservices. Troubleshoot production issues, ensuring uptime and documenting processes on the internal wiki. Automate deployments, testing processes, and infrastructure provisioning (Terraform, Ansible, GitHub Actions). Implement monitoring and observability solutions for proactive issue detection. Provide occasional support for internal IT infrastructure (e.g., laptops, printers, office networking). Occasionally maintain and support …/pfSense firewall management VPN configuration (OpenVPN, WireGuard) Authentik (SSO/identity management) Docker containerization Python scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service More ❯
and release cycles. Deploy, maintain, and optimize Docker-based microservices. Troubleshoot production issues, ensuring uptime and documenting processes on the internal wiki. Automate deployments, testing processes, and infrastructure provisioning (Terraform, Ansible, GitHub Actions). Implement monitoring and observability solutions for proactive issue detection. Provide occasional support for internal IT infrastructure (e.g., laptops, printers, office networking). Occasionally maintain and support …/pfSense firewall management VPN configuration (OpenVPN, WireGuard) Authentik (SSO/identity management) Docker containerization Python scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service More ❯
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
london (city of london), south east england, united kingdom
Amber Labs
large-scale AI services and agentic workflows across government systems. Key Responsibilities Design and provision secure, scalable cloud infrastructure (AWS, Azure, or GCP) using Python-based Infrastructure as Code (Terraform or Pulumi). Build and optimise CI/CD pipelines to automate the deployment of AI applications and models (MLOps/LLMOps). Containerise workloads using Docker and manage deployments … for AI/ML workloads using MLOps or LLMOps practices. Excellent scripting and automation skills in Python (e.g. Boto3, SDKs). Proven experience with Python-based IaC frameworks (Pulumi, Terraform, CDKs). Hands-on experience building CI/CD pipelines for AI deployments (Github Actions, MLFlow, ZenML, or similar). Deep understanding of containerisation and orchestration tools (Docker, Kubernetes). More ❯
Oxford, Oxfordshire, South East, United Kingdom Hybrid / WFH Options
EFCI Group Ltd
networking, storage, security). Manage environment provisioning, maintenance, and synchronization (P2T/T2T) for the Oracle Fusion Cloud landscape. Automation & DevOps: Develop and maintain Infrastructure-as-Code (IaC) using Terraform or OCI Resource Manager. Implement CI/CD pipelines for Fusion extensions and integrations using tools like Jenkins, GitLab, or Azure DevOps. Fusion Environment Control: Manage and support the technical … operations and environment maintenance for Oracle Fusion Cloud (ERP/HCM/SCM) . Strong knowledge of DevOps, CI/CD, and Infrastructure as Code (IaC) using tools like Terraform, Ansible , etc. Proficiency in Linux/Unix environments , scripting ( Bash, Python ), and version control ( Git ). Demonstrable experience managing Fusion quarterly updates, patches, and related integrations. Strong troubleshooting, analytical, and More ❯