and container orchestration (ECS, EKS, or Kubernetes) Experience setting up CI/CD pipelines using GitHub Actions or similar tools Familiarity with monitoring and alerting tools (e.g. Prometheus, Grafana, CloudWatch, Sentry, DataDog) A security-first mindset when designing and managing infrastructure Nice to Haves Experience working in regulated or high-trust environments Knowledge of zero-downtime deployment patterns and More ❯
in production. Good to Have: Experience with Scala for data applications. Familiarity with serverless/event-driven architectures. Experience designing scalable, low-latency data services. Exposure to Datadog or CloudWatch monitoring tools. Nice to Have: Experience with LLM-powered applications or OpenAI APIs. Professional experience in a similar environment or high-scale system. Key Roles and Responsibilities: Own More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
Flyway, Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS CloudWatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, CloudBees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
Flyway, Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS CloudWatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, CloudBees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging More ❯
and practices. A solid understanding of networking protocols and concepts (TCP/IP, DNS, SSL/TLS, routing, etc.). Proficient with AWS services including EC2, ELB, VPC, IAM, CloudWatch, S3, VPC Lattice, Transit Gateway, VPN and more. Practical knowledge of DevOps tools: Git, Jenkins, Docker, Ansible, Terraform. Strong scripting skills (Bash, Python, or equivalent). Candidates must be More ❯
CD pipelines and lead Infrastructure as Code (Terraform, CloudFormation). Implement DevSecOps best practices to meet HIPAA, SOC 2, and ISO 27001 requirements. Monitor system performance and availability using CloudWatch, Prometheus, Grafana, and related tooling. Collaborate with engineering, security, and product teams to drive end to end reliability. Qualifications Experience 6+ years of DevOps/SRE experience in cloud More ❯
in automation, monitoring, and deployment processes. What we're looking for Experience with AWS services (ECS, S3, RDS, Lambda, CloudFront, etc.). Skilled in monitoring tools such as Datadog, CloudWatch, and Grafana. Familiarity with Docker, ECS, Kubernetes, or similar containerisation tech. Competence in scripting or coding with Bash, Python, or Node.js. Experience with Infrastructure as Code (Terraform, Pulumi, etc. More ❯
AWS services Strong knowledge of Infrastructure as Code (Terraform) Proficiency with containers and orchestration (Kubernetes, Docker, Helm) Experience with automation and configuration management Observability expertise using tools such as CloudWatch, ELK, Grafana Basic programming in at least one language (eg, Python, Go, Java, etc.) Experience designing and managing CI/CD pipelines What's on Offer Flexible working: Hybrid More ❯
include: An automation mindset Exposure to test automation, test-driven development (TDD) and agile delivery practices Understanding of monitoring and observability tools such as AppDynamics, ELK, AWS CloudWatch, AWS X-Ray etc. Strong communication, verbally and in writing, able to tailor messages appropriately to the audience You may be assessed on the key critical skills relevant for success More ❯
above Proven experience in designing, developing, and optimizing backend systems for data processing at scale within AWS Hands-on experience with AWS services such as ALB, S3, ElastiCache, IAM, CloudWatch, DynamoDB, EKS Knowledge and experience of Kubernetes is advantageous Experience in building scalable, distributed REST/HTTP-based microservices on hybrid cloud environments Experience with high concurrency applications, multithreading More ❯
Croydon, London, United Kingdom Hybrid / WFH Options
Jane's Group
Gateway AWS VPN, Zscaler ZPA, AWS Identity Centre, Microsoft Entra AWS Backup, AWS Systems Manager, Security Hub, GuardDuty, Microsoft Defender Windows/Linux Operating Systems and Active Directory CloudWatch, CloudTrail, OpenSearch, Grafana, Kinesis Lambda, EKS, ECS and EC2 DocumentDB, DynamoDB, RDS, S3, OpenSearch EventBridge, Steps, Glue This position offers a hybrid working model, allowing for flexibility in your More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Jane's Group
Gateway AWS VPN, Zscaler ZPA, AWS Identity Centre, Microsoft Entra AWS Backup, AWS Systems Manager, Security Hub, GuardDuty, Microsoft Defender Windows/Linux Operating Systems and Active Directory CloudWatch, CloudTrail, OpenSearch, Grafana, Kinesis Lambda, EKS, ECS and EC2 DocumentDB, DynamoDB, RDS, S3, OpenSearch EventBridge, Steps, Glue This position offers a hybrid working model, allowing for flexibility in your More ❯
and maintaining tools that support data science and MLOps/LLMOps workflows. Collaborate with Data Scientists to deploy, serve, and monitor LLMs in real-time and batch environments using Amazon SageMaker, Bedrock Implement Infrastructure-as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild … Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate in pair programming, and advocate for clean code, modular design, and maintainable architecture. Collaborate with architects and stakeholders …/MLOps experience with a strong focus on building and delivering scalable infrastructure for ML and AI applications using Python and cloud native technologies Experience with cloud services, especially Amazon Web Services (AWS) - SageMaker, Bedrock, S3, EC2, Lambda, IAM, VPC, ECS/EKS. Proficiency in Infrastructure-as-Code using AWS CDK or CloudFormation. Experience implementing and scaling MLOps workflows More ❯
deployment pipelines using GitHub Actions and Kubernetes. Maintain robust monitoring, alerting, and CI/CD systems. Skills/Must have: Strong hands-on experience with AWS (eg EC2, EKS, CloudWatch, Lambda). Background in incident, change, and problem management; comfortable with on-call rotations. Expertise in Prometheus, Grafana, and Splunk; solid knowledge of PromQL. Proficient in Scripting/programming More ❯
responsibilities will include: Supporting and managing RedHat Linux systems (v6 & v7) Automating via Shell scripting in Linux/UNIX environments Working with AWS services such as EC2, S3, and CloudWatch Managing infrastructure using tools like Kubernetes and Terraform Administering storage (LVM, RAID) and working with Dell server hardware (BIOS, iDRACs, cabling) WHAT YOU NEED TO HAVE To be successful More ❯
and security principles. Familiarity with infrastructure as code (IaC) tools and configuration management frameworks (e.g. Terraform). Knowledge of monitoring and logging tools (e.g. Prometheus, Grafana, ELK Stack, AWS CloudWatch) for infrastructure and application monitoring. Excellent problem-solving skills, attention to detail, and ability to work independently and collaboratively in a fast-paced environment. Effective communication skills, both written More ❯
deeply with engineers on system design and infrastructure. Familiarity with cybersecurity principles, identity/access management, and secure APIs. Experience with AWS services (e.g., EC2, RDS, S3, Lambda, IAM, CloudWatch). Proficiency in Python scripting for basic automation or data parsing tasks. Awareness of latency-critical systems and the ability to make trade-offs between speed, scalability, and reliability. More ❯
adopt new MLOps practices. What We're Looking For - 3+ years of experience in MLOps, DevOps, or ML infrastructure roles. - Deep familiarity with AWS services, especially SageMaker, S3, Lambda, CloudWatch, IAM, and optionally Glue or Athena. - Strong experience with MLflow, experiment tracking, and model versioning. - Proven experience setting up and managing a feature store, and driving best practices for More ❯
understanding of public cloud services adoption at scale. Expert-level understanding of AWS/GCP Observability across: Proficiency in working with cloud-native APIs from AWS (e.g. AWS Config, CloudWatch) and GCP (e.g. Cloud Asset Inventory, Cloud Monitoring) Experience with Python to automate API integrations and data workflows Experience working with structured data pipelines (e.g. JSON ingestion, normalization, tagging More ❯
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
itecopeople
environment. Key Skills & Experience Required: Proven background as an AWS DevOps Engineer in commercial environments Strong expertise in AWS core services including EC2, ECS, EKS, S3, IAM, Lambda, CloudFormation, CloudWatch, RDS, Route 53, and VPCs Solid experience with Kubernetes and containerisation using Docker and Amazon EKS Infrastructure as Code (IaC) using Terraform and/or CloudFormation CI/… CD tools such as GitHub Actions, GitLab CI, Jenkins, or CodePipeline Monitoring and logging with tools like CloudWatch, ELK Stack, Prometheus, Grafana, or similar Scripting in Python, Bash, or similar Good understanding of networking, security groups, load balancers, and general cloud security best practices Typical Tasks: As a key member of the team, you will: Design, build, and manage More ❯
Delivering Large-scale, Long-term IT Projects for the Public Sector. Key skills & experience: Good experience with AWS technologies (e.g., EC2, RDS, ELB, EBS, EFS, S3, VPC, Glacier, IAM, CloudWatch, KMS) to develop and maintain an AWS-based cloud solution, with an emphasis on best practice cloud security. Provisioning infrastructure using Terraform and VMs with tools such as Vagrant. … configuration management tools. Strong scripting skills (e.g., Shell, Python, PowerShell, Perl, Java) and automation skills. Thorough knowledge of Jenkins and pipelines using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g. Grafana, Alertmanager, Prometheus, Node Exporter). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired More ❯
City, Cardiff, United Kingdom Hybrid / WFH Options
SRT Marine Systems PLC
is not a large-scale IaC effort and so the sophistication of the approach must be proportionate to the complexity of the problem. Configure monitoring, logging, and alerting using CloudWatch, CloudTrail, and related services. Alternatively, use our existing monitoring and observability tooling. Work with stakeholders to understand non-prod use cases and tailor infrastructure accordingly. Ensure cost optimization and … architecture, procedures, and deployment steps. Required Skills & Experience: Proven experience as a Cloud Engineer, DevOps Engineer, or Infrastructure Engineer in AWS environments. Hands-on experience with EC2, VPC, IAM, CloudWatch, S3, EBS, Route 53. Experience with Infrastructure as Code tools (Terraform preferred). Familiarity with Linux/Unix system administration. Strong understanding of cloud security and access control. Ability More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
SRT Marine Systems PLC
is not a large-scale IaC effort and so the sophistication of the approach must be proportionate to the complexity of the problem. Configure monitoring, logging, and alerting using CloudWatch, CloudTrail, and related services. Alternatively, use our existing monitoring and observability tooling. Work with stakeholders to understand non-prod use cases and tailor infrastructure accordingly. Ensure cost optimization and … architecture, procedures, and deployment steps. Required Skills & Experience: Proven experience as a Cloud Engineer, DevOps Engineer, or Infrastructure Engineer in AWS environments. Hands-on experience with EC2, VPC, IAM, CloudWatch, S3, EBS, Route 53. Experience with Infrastructure as Code tools (Terraform preferred). Familiarity with Linux/Unix system administration. Strong understanding of cloud security and access control. Ability More ❯