within automated pipelines to ensure code quality and reliability; Experience of work with Docker for containerisation and application packaging; Experience of implementing and managing monitoring solutions, with experience in Prometheus and Grafana for observability and alerting. Experience of implementing and managing robust security practices, including Encryption (TLS) and Secret Management in the Cloud; Experience of leveraging GitLab API for advanced More ❯
React Proficiency in Backend - Java v17+/Spring/Spring Boot Proven experience designing and implementing cloud-based architectures (AWS, Azure or GCP) Ability to implement monitoring solutions (e.g. Prometheus, Grafana) Experience implementing security best practices in DevOps Processes (e.g. Infrastructure as code, vulnerability management and compliance) Aptitude to work in Agile environments Strong communication skills Nice to have: Immediate More ❯
code tools (e.g., Terraform , Helm , GitLab CI ) Solid understanding of cloud platforms (preferably AWS or Azure) Strong scripting abilities (e.g., Bash, Python, or Go) Familiarity with monitoring tools (e.g., Prometheus, Grafana) and security controls Nice to Have: Experience with service mesh technologies (Istio, Linkerd) Knowledge of networking fundamentals and zero trust architectures Prior involvement in large-scale infrastructure migrations Contract More ❯
Birmingham, West Midlands, England, United Kingdom Hybrid / WFH Options
Bullion By Post
and optimise Linux-based infrastructure supporting a large-scale e-commerce platform Build and maintain deployment pipelines and infrastructure as code using Ansible Monitor performance and system health using Prometheus and Grafana Strengthen security, backups, and compliance Lead incident response, root cause analysis, and post-mortems Collaborate with development teams on CI/CD workflows and scalable architecture Document internal More ❯
troubleshooting and scripting languages such as Python, Go, or Bash. Experience with Kubernetes security, including workload isolation, RBAC, and network policies, containerisation, orchestration, and Kubernetes observability tools (e.g., Falco, Prometheus, Grafana). Experience with infrastructure-as-code and configuration management tools (e.g., Terraform, Helm, ArgoCD). Eligibility to obtain UK Developed Vetting (DV) security clearance; British Citizenship is required for More ❯
in Linux/Unix systems administration combined with strong SQL skills and proficiency in scripting languages such as Python or Java. Demonstrated experience with monitoring and observability tools including Prometheus, Grafana, Splunk, Geneos, OpenTelemetry or Corvil is highly desirable. Familiarity with cloud platforms as well as containerisation technologies like Kubernetes or Docker alongside CI/CD pipeline management is important More ❯
healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and More ❯
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Netcompany UK Limited
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology More ❯
building testable systems by design Nice-to-Haves: Exposure to regulated environments (e.g., BFSI, healthcare, public sector) Experience with performance, security, or chaos testing Familiarity with observability tooling (e.g., Prometheus, Grafana, OpenTelemetry) Knowledge of contract testing, mocking, or service virtualization Mindset & Cultural Fit A builder's mindset, focused on enabling early, frequent, and safe delivery through automated confidence A belief More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated experience More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated experience More ❯
Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
Engineering role Hands-on expertise with AWS, Docker, Kubernetes, and Terraform Strong understanding of CI/CD tools and modern development workflows Experience implementing monitoring and observability tools (e.g., Prometheus, Grafana) Solid grasp of cloud security, IAM, and best practices for operational excellence Proactive problem solver with excellent troubleshooting skills Strong communication and collaboration skills, ideally within Agile teams Comfortable More ❯
pipelines and lead Infrastructure as Code (Terraform, CloudFormation). Implement DevSecOps best practices to meet HIPAA, SOC 2, and ISO 27001 requirements. Monitor system performance and availability using CloudWatch, Prometheus, Grafana, and related tooling. Collaborate with engineering, security, and product teams to drive end to end reliability. Qualifications Experience 6+ years of DevOps/SRE experience in cloud environments (AWS More ❯
Engineering role Hands-on expertise with AWS, Docker, Kubernetes, and Terraform Strong understanding of CI/CD tools and modern development workflows Experience implementing monitoring and observability tools (e.g., Prometheus, Grafana) Solid grasp of cloud security, IAM, and best practices for operational excellence Proactive problem solver with excellent troubleshooting skills Strong communication and collaboration skills, ideally within Agile teams Comfortable More ❯
Experience working with a CI/CD driven hybrid environment Knowledge of IAC principles and continuous development Pipeline configuration and deployment - Ansible Automation Platform Experience with Monitoring and observability - Prometheus/Grafana/Cloudwatch Experience of AWS Cloud based services Configuration and deployment of common AWS services, VPC, EC2, S3, NLB AWS automation - Terraform, CDK Shell scripting, specifically Bash and More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with More ❯
storage infrastructure configuration and deployment. Develop Infrastructure-as-Code (IaC) solutions (e.g., using Terraform, Ansible) for scalable and repeatable storage provisioning. Integrate monitoring dashboards and alerting systems (e.g., Grafana, Prometheus, ELK) to ensure visibility into storage health and performance. Collaborate with infrastructure, platform, and cloud teams to align automation with operational goals. Ensure solutions meet enterprise standards for security , resilience More ❯
to low-code platforms (e.g., Retool) for rapid application development. Experience in DevOps practices, including infrastructure-as-code (IaC), monitoring, alerting, and incident management. Familiarity with observability tools (Grafana, Prometheus) and APM tools (New Relic, Datadog). Knowledge of microservices architecture, event-driven design, and scalability best practices. Experience implementing data compliance standards (GDPR, ISO 27001). Find.co is an More ❯
Development: Expertise in microservices architecture and RESTful API design. Programming Frameworks: Experience with ASP.NET Core and C#, or alternatively Java/Python. Monitoring & Alerting Tools: Experience with tools like Prometheus, Grafana, or ELK stack. Database Skills: Experience in database design and programming using MSSQL or Oracle, or similar. AI Exposure: Experience or interest in applying AI to business and technology More ❯
Code (IaC): Hands-on experience using Terraform for provisioning and managing cloud infrastructure. Proficient in version control, particularly with GitHub. Monitoring & Observability: Proficient with monitoring and alerting tools (e.g., Prometheus, Grafana, CloudWatch) to track pipeline and infrastructure health. Strong troubleshooting skills for resolving CI/CD pipeline issues and optimizing pipeline performance. Branching & Source Control Strategies: Advanced knowledge of branching More ❯
for improvement Take pride in building and operating scalable, reliable, secure systems Are comfortable with ambiguity and rapid change Preferred skills and experience: Familiar with monitoring tools such as Prometheus, Grafana, or similar 5+ years building core infrastructure Experience running inference clusters at scale Experience operating orchestration systems such as Kubernetes at scale Benefits & perks (UK full-time employees): Generous More ❯
troubleshooting and scripting languages such as Python, Go, or Bash. Experience with Kubernetes security, including workload isolation, RBAC, and network policies, containerisation, orchestration, and Kubernetes observability tools (e.g., Falco, Prometheus, Grafana). Experience with infrastructure-as-code and configuration management tools (e.g., Terraform, Helm, ArgoCD). United Kingdom Security Vetting Developed Vetting (DV) clearance. Preferred qualifications: Certifications in Security (e.g. More ❯
pipelines including working with repos, build automation tools, build orchestration and environment automation. e.g. Jenkins, GitHub, GitLab, CloudFormation, Others Experience in implementing tools for logging, monitoring and alerting. e.g. Prometheus, Splunk, CloudWatch, Nagios Experience in creating and automating virtual machines in public and private clouds An understanding or experience of high availability, business continuity and disaster recovery solutions in the More ❯