Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
platforms (AWS, Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong scripting skills (e.g., Bash More ❯
Agile teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team More ❯
agile development methodologies such as Scrum or Kanban Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation Familiarity with monitoring and logging tools such as Prometheus, Grafana, or ELK Stack Experience with machine learning and artificial intelligence technologies Desirable Certifications Strong proficiency in at least one of the following AWS certifications: AWS Certified Solutions Architect - Associate AWS More ❯
Flux) Knowledge of IaC and configuration management tools (Terraform, OpenTofu, Crossplane, Pulumi, Ansible, CloudFormation) Strong problem-solving experience, focusing on automation Production experience with Monitoring and Observability tools (Prometheus, Grafana, Datadog, Thanos, New Relic, Open Telemetry) Understanding of Cloud Networking concepts (Mesh Networking, NAT, Load Balancers, SSL Certificates and TLS termination, API Gateways, proxies, etc) Strong written and verbal communication More ❯
languages, such as C#, Python, Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem solving We encourage you More ❯
pipelines to ensure code quality and reliability; Experience of work with Docker for containerisation and application packaging; Experience of implementing and managing monitoring solutions, with experience in Prometheus and Grafana for observability and alerting. Experience of implementing and managing robust security practices, including Encryption (TLS) and Secret Management in the Cloud; Experience of leveraging GitLab API for advanced automation, integration More ❯
and protocols - TCP/IP, DNS, HTTP Experience of deploying Continuous Integration solutions An awareness of security considerations in web application deployment Monitoring/Logging aka ELK, Prometheus/Grafana etc Strong AWS knowledge - EC2, EKS, RDS, Aurora, networking, cost management If you'd like to discuss this DevOps Engineer in more detail, please send your updated CV to and More ❯
Kubernetes , Docker/Containers , Ansible , Packer , Terraform , Linux variants and command line. Programming Skills: Proven ability in at least one language (e.g., JavaScript, Python, Java). Desirable: Experience with Grafana , Prometheus , Loki stack , Kubernetes certifications , web technologies , or AWS certifications. Benefits: Competitive salary, generous pension, private medical, flexible working, and professional development opportunities Rates depend on experience and client requirements More ❯
C++. Experience with CI/CD tools like Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity. Scripting skills in PowerShell, Bash. Familiarity with observability and monitoring tools such as Prometheus, Grafana, Splunk. Experience with containerization tools like Docker, Kubernetes, OpenShift, EC2 containers. Analytical and creative problem-solving skills. We encourage you to apply, even if you don't meet all the More ❯
languages, such as C#, Python, Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Analytical and creative approach to problem solving We encourage you to apply , even if you don't meet all of More ❯
deployment processes. Excellent communication and leadership skills, with the ability to collaborate effectively with cross-functional teams and provide technical mentorship. Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and cloud cost optimisation strategies. Preferred Skills: AWS certification(s) (e.g., AWS Certified Solutions Architect, AWS Certified Security - Specialty). Experience with other cloud platforms (e.g., Azure, GCP More ❯
language (Python, Bash, etc.). Familiarity with containerization and orchestration tools (Kubernetes). Exposure to infrastructure as code (Terraform) concepts. Familiarity with monitoring, logging, and security tools (e.g., Prometheus, Grafana, Splunk, BQL). Experience supporting either Windows or Linux environments. Cyber Security: Basic understanding of cyber security principles and best practices. Interest in learning about and working with secrets management More ❯
Linux/Unix systems administration combined with strong SQL skills and proficiency in scripting languages such as Python or Java. Demonstrated experience with monitoring and observability tools including Prometheus, Grafana, Splunk, Geneos, OpenTelemetry or Corvil is highly desirable. Familiarity with cloud platforms as well as containerisation technologies like Kubernetes or Docker alongside CI/CD pipeline management is important for More ❯
Proficiency in Backend - Java v17+/Spring/Spring Boot Proven experience designing and implementing cloud-based architectures (AWS, Azure or GCP) Ability to implement monitoring solutions (e.g. Prometheus, Grafana) Experience implementing security best practices in DevOps Processes (e.g. Infrastructure as code, vulnerability management and compliance) Aptitude to work in Agile environments Strong communication skills Nice to have: Immediate availability More ❯
and lead Infrastructure as Code (Terraform, CloudFormation). Implement DevSecOps best practices to meet HIPAA, SOC 2, and ISO 27001 requirements. Monitor system performance and availability using CloudWatch, Prometheus, Grafana, and related tooling. Collaborate with engineering, security, and product teams to drive end to end reliability. Qualifications Experience 6+ years of DevOps/SRE experience in cloud environments (AWS, Azure More ❯
infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation More ❯
infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation More ❯
tools (e.g., Terraform , Helm , GitLab CI ) Solid understanding of cloud platforms (preferably AWS or Azure) Strong scripting abilities (e.g., Bash, Python, or Go) Familiarity with monitoring tools (e.g., Prometheus, Grafana) and security controls Nice to Have: Experience with service mesh technologies (Istio, Linkerd) Knowledge of networking fundamentals and zero trust architectures Prior involvement in large-scale infrastructure migrations Contract Details More ❯
for storage infrastructure configuration and deployment. Develop Infrastructure-as-Code (IaC) solutions (e.g., using Terraform, Ansible) for scalable and repeatable storage provisioning. Integrate monitoring dashboards and alerting systems (e.g., Grafana, Prometheus, ELK) to ensure visibility into storage health and performance. Collaborate with infrastructure, platform, and cloud teams to align automation with operational goals. Ensure solutions meet enterprise standards for security More ❯
business requirements Essential Requirements Specialist Knowledge: Demonstrable experience in observability engineering, infrastructure monitoring, or event management roles Experience with traditional and modern observability stacks such as SCOM, SolarWinds, Prometheus, Grafana and Elastic Stack (ELK) Hands-on experience with BMC Helix Operations Manager, TrueSight, or similar enterprise monitoring platforms Solid understanding of AIOps concepts, including event correlation, noise reduction, anomaly detection More ❯