london, south east england, United Kingdom Hybrid / WFH Options
Annapurna
and container technologies like Docker and Kubernetes. Deep understanding of networking, distributed systems, and databases. Expertise in monitoring and observability tools such as DataDog, Prometheus, Grafana, ELK stack, or Splunk. Excellent communication skills and a meticulous approach to problem-solving. Desirable Experience: Familiarity with Azure. Experience working in the autonomous More ❯
be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape DevOps roadmaps More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Noir
be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape DevOps roadmaps More ❯
Docker, and Kubernetes. It is our mission to build highly resilient, dynamically scaling, self-healing systems by automating and monitoring everything using Terraform, Puppet, Prometheus, Grafana, Kibana, and Jenkins. Requirements: Strong understanding of operating systems, networking, and systems architecture; Strong experience working with Linux, as well as database, web, and More ❯
of the following: Terraform, Ansible, AWS CDK, CloudFormation. Experience with CI/CD pipelines ex. Jenkins/Spinnaker. Experience with monitoring tools such as Prometheus, Grafana, Splunk and Datadog. Proven programming/scripting skills with some of the modern programming languages like Python. Solid software design, problem solving and debugging More ❯
Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki. Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies More ❯
Code, particularly Terraform Proven experience with CI/CD pipelines and Azure DevOps Proficiency in containerization and Kubernetes orchestration Experience with monitoring tools like Prometheus and Grafana Advanced PowerShell scripting capabilities Microsoft certifications Strong knowledge of Windows Server infrastructure Experience supporting multi-language, European offices What Sets You Apart Passion More ❯
Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki. Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies More ❯
infrastructure using Kubernetes, Terraform, and CI/CD tools such as Github Actions. Proven experience in using and setting up monitoring tools such as Prometheus, Grafana, New Relic, or similar, as well as log management systems (ELK stack, etc). Object-Oriented Programming skills. Good understanding of Agile/Scrum More ❯
/CD pipelines, infrastructure as code, and cloud automation (Azure preferred). Expertise in Docker, Kubernetes (desirable), Terraform (desirable), Git, and monitoring tools (ELK, Prometheus, or Application Insights). Proficiency in scripting, ideally with PowerShell and Python. Strong communication skills—able to influence, mentor, and challenge the status quo. Be More ❯
/CD pipelines, infrastructure as code, and cloud automation (Azure preferred). Expertise in Docker, Kubernetes (desirable), Terraform (desirable), Git, and monitoring tools (ELK, Prometheus, or Application Insights). Proficiency in scripting, ideally with PowerShell and Python. Strong communication skills—able to influence, mentor, and challenge the status quo. Be More ❯
City Of London, England, United Kingdom Hybrid / WFH Options
Harrington Starr
with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Harrington Starr
with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next More ❯
and ADDS . Strong understanding of IP Networking and physical network setups. Desirable Skills and Experience: Exposure to tools such as CommVault , Nagios , Kibana , Prometheus , or Splunk . Ability to identify and communicate technical issues effectively, including applying root cause analysis. Strong communication and presentation skills, with the ability to More ❯
/service once live, including: observability best practises, logging best practises, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in Python. Experience in data modelling and design patterns; in-depth knowledge of relational databases (PostgreSQL) and familiarity with data More ❯
containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross More ❯
containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross More ❯
Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki. Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies More ❯
/service once live, including: observability best practices, logging best practices, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in Python. Experience in data modelling and design patterns; in-depth knowledge of relational databases (PostgreSQL) and familiarity with data More ❯