London, England, United Kingdom Hybrid / WFH Options
Deel
Familiarity with GitOps and CI/CD methodologies, utilizing tools like ArgoCD, GitHub Actions, or Jenkins. Knowledge and hands-on experience with alerting and monitoring tools such as DataDog, Prometheus, and Grafana. Results-driven mindset with a strong commitment to task completion. Ability to drive projects from inception to implementation. A self-motivated individual capable of proactive problem-solving and More ❯
Reading, England, United Kingdom Hybrid / WFH Options
BJSS
AZ CLI, Python, Bash, Ruby, or Groovy. Detailed knowledge of configuration management technologies like DSC, Puppet, Chef, SaltStack, or Ansible. Thorough knowledge of logging and monitoring tools such as Prometheus, Grafana, Azure Monitor, Azure Log Analytics, Azure App Insights, New Relic, or Datadog. Knowledge of cloud platform design patterns and best practices. Knowledge of common database and caching technologies, such More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stealth AI Startup
Advocacy for CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python More ❯
Advocacy for CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python More ❯
one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.) Experience with troubleshooting More ❯
components interaction, engineering and testing (e.g. NMS applications, controllers, orchestrators, supervisory systems, etc.). Experience and understanding of kafka messaging bus. Experience in using monitoring tools like Nagios, Grafana, Prometheus and Kibana is desired. Deployment environment: Kubernetes, Docker, microservices. Experience on Talos Kubernetes is an advantage. Deployment experience in cloud-based environment AWS/Azure/GCP/OpenShift is More ❯
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Ability to analyse and resolve complex infrastructure resource and application deployment issues. Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Knowledge of More ❯
Jenkins etc.) Extensive experience with infrastructure-as-code tools and related technologies: Docker, Ansible, Terraform, Kubernetes, EKS, GKE Extensive experience with monitoring and alerting in cloud based environments: Grafana, Prometheus, Kibana, Elasticsearch, InfluxDB, CloudWatch, Stackdriver BONUS POINTS Demonstrated involvement in Open source projects, blockchain-related communities and online spaces Interest and involvement in the Ethereum space that goes beyond crypto More ❯
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. Benefits High competitive More ❯
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. Benefits High competitive More ❯
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. High competitive salary. More ❯
models Additional skills that are a plus: Programming languages such as Scala, Rust, Go, Angular, React, Kotlin Database management with PostgreSQL Experience with ElasticSearch, observability tools like Grafana and Prometheus What this role can offer Opportunity to deepen understanding of AI and Data Science applications Mentorship and support from colleagues to apply your talents Career growth and development opportunities Knowledge More ❯
support, incident management, and ITSM processes (ServiceNow/Jira/Slack) Collaborate on DevOps initiatives: CI/CD (Jenkins/Terraform), containers (Docker/K8s), and monitoring (Grafana/Prometheus) Employee Lifecycle & Security Automate onboarding/offboarding workflows (account provisioning, access controls) Partner with HR/InfoSec to enforce IAM policies and compliance standards Develop IT operational playbooks and knowledge More ❯
support, incident management, and ITSM processes (ServiceNow/Jira/Slack) Collaborate on DevOps initiatives: CI/CD (Jenkins/Terraform), containers (Docker/K8s), and monitoring (Grafana/Prometheus) Employee Lifecycle & Security Automate onboarding/offboarding workflows (account provisioning, access controls) Partner with HR/InfoSec to enforce IAM policies and compliance standards Develop IT operational playbooks and knowledge More ❯
scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes). Ability More ❯
AWS (EC2, Lambda), Azure (AKS), or Google Cloud (BigQuery). DevOps and Automation: CI/CD: Jenkins, GitLab CI/CD, or CircleCI. IaC: Terraform or AWS CloudFormation. Monitoring: Prometheus, Grafana, or Datadog. Seniority level Seniority level Director Employment type Employment type Full-time Job function Job function Information Technology Industries Technology, Information and Media and Financial Services Referrals increase More ❯
scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes). Ability More ❯
scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise in setting up and maintaining monitoring systems (e.g., Prometheus, Grafana). Some other highly valued skills may include: Experience with cloud platforms (e.g., AWS, Azure, Google Cloud). Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes). Ability More ❯
Newcastle upon Tyne, England, United Kingdom Hybrid / WFH Options
Byggfakta UK Group
Professional (D) Strong stakeholder communication and vendor management experience (D) Demonstrable experience in leading cross-functional teams and fostering DevOps/FinOps culture (D) Familiarity with observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack) and incident management processes. (D) MISSION & VISION Mission; By using our unique data, insights and software solutions, our customers in the construction industry will sell more More ❯
London, England, United Kingdom Hybrid / WFH Options
Humanoid
Skills Experience managing GPU servers, containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross-functional collaboration. High competitive salary. More ❯
tools Knowledge of cost optimisation strategies for cloud resources Experience with advanced Azure services such as Azure Kubernetes Service (AKS). Trade certifications Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in maintain disaster recovery and high-availability solutions for critical systems What We Offer At Netcompany, we believe in empowering our senior engineers More ❯