Cloud architecture Excellent communication skills: must be able to mentor, coach and work effectively with others Experience in using and reacting to AWS's observability tools, particularly around product performance, security and costs Proven administrative experience of Azure Proven administrative experience of Atlassian Suite Strong Linux administration and networking skills More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Modix International
Actions). Strong troubleshooting skills for cloud infrastructure and application performance. Knowledge of cloud security, compliance , and identity management . Experience with monitoring and observability tools (New Relic, Splunk). A continuous improvement mindset and a desire to optimize systems for security, performance, and cost. AWS Certifications (e.g., AWS Certified More ❯
of container orchestration and scaling, with experience in managing and troubleshooting workloads. Experience of managing Kubernetes clusters, service mesh and hosted workloads. Proficient in observability and monitoring tools, including configuring alerts, creating dashboards, and conducting root cause analysis. Some of the tools we use are: Grafana, Prometheus, Elastic, Splunk. Configuring More ❯
experience working with: Cloud-native development such as AWS, Azure, GCP (we use Azure) and concepts like event-driven architecture or microservices. Monitoring and observability tools such as Azure Monitor, Prometheus, Grafana, or similar Infrastructure as Code (Terraform, Bicep, Pulumi, etc.) Performance tooling and scalable ML frameworks (e.g. NVIDIA ML More ❯
secure or regulated environments (e.g. Defence, Government, Critical National Infrastructure). Desirable: Familiarity with cloud platforms such as AWS, Azure, or OpenStack. Experience with observability tooling (e.g. Prometheus, Grafana, ELK stack). Exposure to infrastructure security principles and compliance frameworks. What’s in It for You: Salary from £80,000+ More ❯
equivalent and/or appropriate experience Experience: 8+ years of experience in virtualization, containerization, build, and deployment Extensive experience with SCM, CICD, instrumentation, and observability tools Proficiency with Git, GitHub, Azure DevOps Proficiency with major IaC technologies such as Terraform, Pulumi, or Bicep Proficiency with programming and scripting languages such More ❯
reigate, south east england, United Kingdom Hybrid / WFH Options
Client Server
such as Pulumi and Terraform You have experience of building and improving CI/CD pipelines for product teams You have experience with cloud observability (logging, tracing, metrics, monitoring and alerting) You have experience with Containerisation - Docker, Kubernetes You have strong scripting skills with PowerShell and/or C# .Net More ❯
Real Time data processing. Experience with automated testing frameworks and continuous delivery tools like Jenkins, GitLab CI, or CircleCI. Understanding of performance monitoring and observability tools such as CloudWatch , Prometheus , or Datadog . Interested? Please Apply! Golang Go AWS Kubernetes Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading More ❯
appropriate technology choices. Own and manage our monitoring and diagnostics. You will work with the engineering teams to ensure we have best in class observability, alerting, SLOs and cost attribution for our software stack. Own and manage the CI/CD infrastructure for our products. Deliver against challenging technical problems More ❯
DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to More ❯
DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to More ❯
london, south east england, united kingdom Hybrid / WFH Options
Parser
DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to More ❯
Kubernetes, Helm and Docker. Experience with storing, searching and filtering large scale data. Experience in operationally managing software components/service once live, including: observability best practises, logging best practises, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in More ❯
solutions using Entra SSO Automate infrastructure deployment using Terraform and Azure DevOps Maintain high-availability web hosting services for marketing campaigns Lead monitoring and observability initiatives Optimize cloud resources for cost-effectiveness Provide technical leadership and mentoring to the team What You Will Bring To The Role You're not More ❯
and understand design documentation using C4, sequence diagrams and workflows. Excellent problem-solving skills and attention to detail. Solid understanding of logging, monitoring and observability to understand if software is functioning as required. Strong communication and teamwork skills. Preferred Skills: Experience with cloud platforms e.g., AWS, Azure, Google Cloud. Knowledge More ❯
Kubernetes, Helm and Docker. Experience with storing, searching and filtering large scale data. Experience in operationally managing software components/service once live, including: observability best practices, logging best practices, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in More ❯
s degree in Computer Science, Engineering, related field, or equivalent experience - Experience in designing and implementing comprehensive monitoring solutions using Prometheus, Grafana, and other observability tools - Experience in managing and orchestrating containerized applications using Docker and Kubernetes - Experience in building and maintaining CI/CD pipelines using modern tools (Jenkins More ❯
phases. Experience implementing infrastructure-as-code, CI/CD, and efficient release management strategies. Experience collaborating with DevOps and SRE teams, utilising monitoring and observability tools to drive continuous improvement. Proven ability to assess and improve system performance through techniques such as load testing, caching strategies, and optimisation of database More ❯
as Shell, Bash, or Python A working knowledge of Linux, networking, routing & firewalls etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps Proficient in Infrastructure as Code technologies (eg CloudFormation, CDK, Terraform) Source controlling your code with Version Control Systems, for example Git, Mercurial, Perforce Great More ❯
with containerization and orchestration (Docker, Kubernetes, AKS). Knowledge of Azure networking, security, and identity management . Hands-on experience in monitoring, logging, and observability tools in Azure. Strong understanding of Git branching strategies and version control . Experience in Windows/Linux server administration in cloud environments. Familiarity with More ❯
experience in monitoring, alerting, and logging. Familiarity with Infrastructure as Code (Terraform) and CI/CD pipelines (Jenkins, Azure DevOps, etc.). Experience with observability tools like Dynatrace, Stackdriver, Cloud Operations Suite, Cloud Monitoring, and Cloud Logging. Ability to mentor engineers, troubleshoot complex system issues, and improve automation to reduce More ❯
Bash, or Python Solid understanding of Linux systems, networking, routing, and firewall configurations A deep grasp of AWS operational best practices, particularly in monitoring, observability, and FinOps Expertise in Infrastructure as Code (IaC) tools such as CloudFormation, CDK, and Terraform Additionally, it would be advantageous to have experience with: AWS More ❯
Bash, or Python Solid understanding of Linux systems, networking, routing, and firewall configurations A deep grasp of AWS operational best practices, particularly in monitoring, observability, and FinOps Expertise in Infrastructure as Code (IaC) tools such as CloudFormation, CDK, and Terraform Additionally, it would be advantageous to have experience with: AWS More ❯
Docker, Terraform, Local Stack Data: JPA/Hibernate, PostgreSQL/MongoDB/MySQL, Kafka Architecture: Microservices Architecture (high availability), REST APIs Security: PKIs, KeyCloak, Observability & Monitoring Testing: JUnit (or similar i.e. Mockito), TDD, Integration Tests & End-To-End Testing Best Practices: SOLID principles Good communication and interpersonal skills working with More ❯
and delivery (CI/CD) pipeline enhancements to support cost-efficient deployments. Assist in the development and automation of processes to improve infrastructure scalability, observability, and performance. Participate in incident resolution, root cause analysis, and post-mortem reviews with a focus on improving resilience and cost-effectiveness. Identify and advocate More ❯