Puppet to implement Infrastructure as Code. Experience of using static code analysis tools, such as BlackDuck. Able to use and manage other monitoring tools, such as Nagios, SolarWinds, Grafana, Prometheus, etc. Experience of resolving complex issues using your debugging skills. Strong communication skills, including the ability to explain technical concepts to non-technical colleagues. Able to listen and take advice More ❯
and orchestration tools like Docker, Kubernetes, AKS, and Helm. Programming skills in Python, Java, PowerShell, or Go, with understanding of REST APIs. Experience with observability tools such as DataDog, Prometheus, Splunk, Elasticsearch, Grafana, Azure Monitor. Experience with CI/CD tools like Git, Terraform, Jenkins. Azure cloud expertise in mission-critical environments. Additional qualifications: Azure cloud certification. Understanding of operating More ❯
CI/CD pipelines using Cloud Build, GitLab, Jenkins, or ArgoCD. Implement Istio, ingress controllers, network policies, and GCP IAM to secure microservice communications. Monitor and optimize systems using Prometheus, Grafana, and Cloud Operations Suite. Collaborate with platform, DevOps, and security teams in a multi-tenant Kubernetes ecosystem. Troubleshoot container performance, networking, and scaling issues. Provide architectural documentation and technical More ❯
as Python, Java Spring Boot, or .NET. Deep knowledge of software applications and technical processes, with emerging expertise in specific technical disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Proficiency with CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containerization and orchestration tools like Docker, Kubernetes, ECS. More ❯
a mentorship or managerial position. Strong knowledge of cloud platforms (AWS, GCP, Azure) and modern infrastructure technologies (Kubernetes, Docker, Terraform). Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Deep understanding of networking, databases, and distributed systems. Strong communication, collaboration, and More ❯
MySQL, Redis, Elasticsearch, RabbitMQ, Consul, Docker, and Kubernetes. It is our mission to build highly resilient, dynamically scaling, self-healing systems by automating and monitoring everything using Terraform, Puppet, Prometheus, Grafana, Kibana, and Jenkins. Requirements: Strong understanding of operating systems, networking, and systems architecture; Strong experience working with Linux, as well as database, web, and file servers at scale in More ❯
e.g., Python, Bash). Familiarity with containerization and orchestration tools (e.g., Kubernetes). Experience with infrastructure as code (e.g., Terraform). Knowledge of monitoring, logging, and security tools (e.g., Prometheus, Grafana, Splunk). Support experience in Windows or Linux environments. Cyber Security: Basic understanding of cybersecurity principles and best practices. Interest in learning about secrets management solutions. Awareness of security More ❯
London, England, United Kingdom Hybrid / WFH Options
Causaly Inc
Terraform and infrastructure-as-code principles. • Experience managing Kubernetes clusters and containerized applications. • Familiarity with CI/CD pipelines and modern DevOps tooling. • Experience with observability tools (e.g. Datadog, Prometheus, Grafana). • Strong scripting skills (e.g. Bash, Python). • Ability to contribute to architectural planning and system design. • Excellent problem-solving and analytical skills, especially in complex or ambiguous environments. More ❯
Jenkins, GitLab, Terraform. Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk. Preferred Qualification: Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. More ❯
Newcastle upon Tyne, Northumberland, United Kingdom
as CloudFormation, AWS CDK, Ansible to automate infrastructure provisioning, environment setup and software deployment. Experience implementing system monitoring and alerting using tools such as CloudWatch, AppDynamics, Kibana, Splunk or Prometheus. Experience with one or more Public/Private cloud offerings and with Virtualisation Technologies. Knowledge of RESTful APIs, how to consume them and how to invoke/engage with them More ❯
Strong expertise in Infrastructure as Code, particularly Terraform. Proven experience with CI/CD pipelines and Azure DevOps. Proficiency in containerization and Kubernetes orchestration. Experience with monitoring tools like Prometheus and Grafana. Advanced PowerShell scripting capabilities. Microsoft certifications. Strong knowledge of Windows Server infrastructure. Experience supporting multi-language, European offices. What Sets You Apart: Passion for technology and continuous learning More ❯
by thought leaders like Martin Fowler. Hands-on experience building and maintaining complex CI/CD pipelines, preferably with GitHub Actions. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, Google Cloud's operations suite). A solid understanding of networking principles and cloud security best practices. Experience with other cloud platforms like Amazon Web Services (AWS) or Microsoft More ❯
automation skills. Thorough knowledge of Jenkins and pipelines using Groovy scripts. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alertmanager, Prometheus, and Node Exporter). Ability to analyse and resolve complex infrastructure resource and application deployment issues. Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Knowledge of More ❯
troubleshooting experience in BGP/OSPF routing in multi-VRF environments. Experience in Python scripting and automation tools (Ansible, Terraform, Jinja2, Netbox, Git, GitLab CI/CD). Experience with Prometheus, Alertmanager, Grafana. Knowledge of firewall policies, IPsec VPNs, SSL, and security best practices. Data Centre/Colo experience. Experience working with Linux. Excellent problem-solving skills and attention to More ❯
with data warehousing, ideally with Snowflake. Experience with cloud platforms (AWS, GCP, or Azure). Knowledge of DevOps practices and CI/CD pipelines. Experience with monitoring tools such as Prometheus and Grafana. Experience with database migration and upgrade strategies. What’s in it for you? We offer our employees more than just competitive compensation. Our team benefits include: Competitive pay More ❯
London, England, United Kingdom Hybrid / WFH Options
Ikerian
preferred), GitHub Actions, Jenkins, etc. Basic understanding of cloud networking concepts, including VPC, Subnets, and Load Balancing. Familiarity with monitoring and observability tools for cloud environments, such as Grafana, Prometheus, OpenSearch, and the ELK stack. Strong analytical and problem-solving skills, with a proactive approach to challenges. A genuine interest in staying updated with new AWS services and features, integrating More ❯
London, England, United Kingdom Hybrid / WFH Options
Hard Rock Digital
with data warehousing, ideally with Snowflake. Experience with cloud platforms (AWS, GCP, or Azure). Knowledge of DevOps practices and CI/CD pipelines. Experience with monitoring tools such as Prometheus and Grafana. CockroachDB certification or advanced training. Experience with database migration and upgrade strategies. What’s in it for you? We offer our employees more than just competitive compensation. Our More ❯
London, England, United Kingdom Hybrid / WFH Options
Igbaffiliate
Implement and maintain CI/CD pipelines using GitLab. Collaborate with development teams to improve system reliability and performance. Establish and maintain monitoring for all our systems using Prometheus, Graylog, Tempo, and Grafana. Manage and optimise database operations for MySQL, MariaDB, and PostgreSQL. Provide insights from industry best practices when necessary. Who we're looking for: Fluent spoken and More ❯
scripting language (Python, Bash, etc.). Familiarity with containerization and orchestration tools (Kubernetes). Exposure to infrastructure as code (Terraform) concepts. Familiarity with monitoring, logging, and security tools (e.g., Prometheus, Grafana, Splunk, BQL). Experience supporting either Windows or Linux environments. Cyber Security: Basic understanding of cyber security principles and best practices. Interest in learning about and working with secrets More ❯
London, England, United Kingdom Hybrid / WFH Options
Ten Lifestyle Group
. Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building agentic workflows (e.g. More ❯
London, England, United Kingdom Hybrid / WFH Options
EDB
Kubernetes API objects and their relationships, proficiency in writing and managing YAML files for Kubernetes resources, use of Helm charts for managing Kubernetes applications. Use of monitoring tools like Prometheus, Grafana, and ELK stack, knowledge of logging solutions and troubleshooting techniques, experience with Persistent Volumes and Persistent Volume Claims, and knowledge of dynamic provisioning and storage classes. 2+ years of More ❯
Engineering Experience: Proven experience in building and scaling observability platforms in a cloud-native environment. Observability Expertise: Deep understanding of observability pillars (metrics, logs, traces) and related tools (e.g., Prometheus, Grafana, OpenTelemetry, Jaeger, Kibana Elastic Stack). AI/ML Proficiency: Hands-on experience integrating ML/AI models into observability systems to drive advanced insights, anomaly detection, and predictive More ❯