hybrid team environment (3 days a week onsite in London) Experience with Terraform, Kubernetes, or CI/CD pipelines Familiarity with observability tooling (e.g. Prometheus, Grafana, Datadog) Experience mentoring or leading other engineers More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Searchability NS&D
in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV More ❯
and managing secure, automated CI/CD pipelines (GitHub Actions, ArgoCD) ✅ Automating provisioning and scaling for Redis, Kafka, and PostgreSQL ✅ Implementing observability and monitoring (Prometheus, Grafana, Loki, etc.) ✅ Managing identity and access control frameworks (Keycloak or similar) ✅ Championing infrastructure security best practices : RBAC, secrets management, hardening ✅ Collaborating with backend and More ❯
slough, south east england, united kingdom Hybrid / WFH Options
WMtech
GenAI, LLMs, and multimodal systems Architecture: Microservices, RESTful APIs, async programming Infrastructure: Docker, Terraform, GitHub Actions, GCP (preferred) Datastores: MongoDB, Redis Monitoring/Tooling: Prometheus, Grafana, Sentry The role is remote with occasional travel Ready to lead and build with purpose? If you're excited by the idea of applying More ❯
at the DevOps Engineer level. Previous experience with incidents, change and problem management. Strong background in setup and operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL. Proficient in one or more languages of Python, Go, Bash, SQL. Familiar with GitHub, GitOps, container orchestration, and More ❯
Solid experience in Cloud DevOps (e.g. AWS, Azure, GCP), including services like EKS, ECS, Lambda, etc. Proficiency with observability platforms such as Grafana, Kibana, Prometheus, Datadog, Splunk , or similar. Strong knowledge of RegEx, Lucene, PromQL . Proven track record of leading technical teams and owning the end-to-end onboarding More ❯
SQL, MongoDB, or Postgres. Proficiency with Linux and Windows command lines (e.g. Bash, PowerShell). Experience with monitoring large systems using tools like Grafana, Prometheus, ELK, and Splunk. Knowledge of Agile methodologies and tools like Atlassian. Strong troubleshooting skills across various levels of the application stack. Familiarity with ITIL processes. More ❯
and containerized microservices Proven track record building low-latency data aggregation solutions and large-scale data platforms Hands-on with SRE observability tools (e.g., Prometheus, Grafana, ELK) and CI/CD pipelines (GitHub Actions, Azure DevOps) Strong communicator—comfortable influencing both technical teams and senior stakeholders Passionate about up-skilling More ❯
Grass Valley, Appear, MediaKind). In-depth knowledge of routing protocols (OSPF, BGP). Familiarity with network monitoring and management systems (e.g., Zabbix, Grafana, Prometheus, Netbox, OpenNMS). Experience with automation and scripting (e.g., Python, Ansible). Degree in Engineering, Telecommunications, Computer Science, or related field. Relevant industry certifications. This More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Prism Digital
version, and manage infrastructure as code across multiple environments. GitHub Actions & OIDC – build and maintain automated CI/CD pipelines with secure authentication. Datadog, Prometheus or similar – implement logging, metrics, and alerting for robust observability – the interim CTO is keen to hear your recommendation(s) on tooling and implementation strategy. More ❯
within high performance computing environments Excellent understanding of containerised architectures and engineering in Kubernetes Strong understanding of DevOps platforms and tooling (Ansible, Terraform, Helm, Prometheus, Grafana etc.) Strong Linux engineering and integration experience with significant on prem experience Financial markets or scientific computing experience preferred Degree educated or higher from More ❯
of DevOps tools Kubernetes, Jenkins, Gitlab, Terraform, and more. -Optimising automation and performance. -Champion containerisation and high performance base images. -Elevate monitoring systems Zabbix, Prometheus, Thanos ensuring 24/7 operational excellence. -Secure infrastructure access management, balancing innovation with ironclad security. They're offering a career defining role in a More ❯
experience in Go (Golang) , Java , Kotlin , JavaScript/Node.js , or Python Strong hands-on experience with Kubernetes (K8s) and OpenShift Experience with MongoDB , Kafka , Prometheus , OpenTelemetry , Grafana Familiarity with tools like Helm , Kustomize , Terraform , and Vault Proven experience with hybrid cloud environments (on-prem + public cloud) Ability to explain More ❯
to deliver cross-functional automation solutions involving Network Engineering and DevOps teams Demonstrate solid understanding of infrastructure monitoring and visualization tools, including Kibana, Splunk, Prometheus, and Grafana Maintain up-to-date knowledge of industry trends and new technologies to ensure automation practices remain advanced and relevant Possess strong foundational networking More ❯
similar technologies Demonstrated ability to lead cross-functional teams and operate within complex enterprise ecosystems Familiarity with monitoring, observability, and platform telemetry tools (e.g., Prometheus, Grafana, Azure Monitor) Exceptional communication and stakeholder engagement skills to partner with business, technical, and governance teams Experience managing platform SLAs, incident management, and continuous More ❯
3.10+), JavaScript and TypeScript for frontend Tools: RabbitMQ and ZeroMQ for messaging; PostgreSQL for data storage; Websockets for frontend communication Environment: Linux servers Observability: Prometheus, Grafana, Zabbix Benefits: Working alongside other extremely talented and driven engineers Extremely lucrative salary, bonus upto 30% and excellent benefits Greenfield Python/Golang work More ❯
3.10+), JavaScript and TypeScript for frontend Tools: RabbitMQ and ZeroMQ for messaging; PostgreSQL for data storage; Websockets for frontend communication Environment: Linux servers Observability: Prometheus, Grafana, Zabbix Benefits: Working alongside other extremely talented and driven engineers Extremely lucrative salary, bonus upto 30% and excellent benefits Greenfield Python/Golang work More ❯
big fans of Azure Pipelines! Some of our services are migrating away from TeamCity and Octopus Deploy Our observability stack is Splunk, Grafana and Prometheus You As a software engineer, you will be: Part of a cross-functional team working with Product Managers, Testers and DevOps engineers Writing well-tested More ❯
slough, south east england, united kingdom Hybrid / WFH Options
CipherTek Recruitment
data points, either via APIs or other appropriate methods, to ensure real-time decision-making capabilities. Integrate with Instrumentation Platforms : Integrate the platform with Prometheus and Geneos for continuous monitoring, diagnostics, and system health checks. Desired technical Skills: Java Expertise: Extensive experience with Core Java , focusing on low-level performance More ❯
Manage cloud infrastructure on Azure, ensuring high availability, scalability, and security Set up and maintain monitoring, logging, and alerting solutions using tools such as Prometheus and Grafana Collaborate with development teams to optimize AI application deployment and operational performance Ensure compliance with security standards and best practices in cloud and … containerisation and orchestration technologies such as Docker and Kubernetes In-depth knowledge of cloud platforms, especially Azure Experience with monitoring and logging tools like Prometheus and Grafana Familiarity with supporting AI/ML workloads is preferred Strong problem-solving skills and ability to work collaboratively in a fast-paced environment More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Wallet in Telegram
DevOps engineer, handling routine tasks and being on-call for production issues Resolve production and development issues, leveraging strong troubleshooting skills Adjust and rewrite Prometheus alert expressions to be non-flapping and algorithmic .Requirements Understanding of networking fundamentals Proficiency in Linux OS, including system metrics and filesystems Experience with PostgreSQL …/NLB) Skilled in container orchestration using Docker and Kubernetes Experience with CI/CD processes, specifically with GitLab Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EKF/OpenSearch Experience with Infrastructure as Code (IaC) using Ansible and Terraform Scripting abilities in Shell and Python More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Maxwell Bond
and as their Senior Developer, you’ll work with Java and Spring Boot. They’ve got their own internal cloud and tooling's, using Prometheus and Grafana. As their Senior Developer, you’ll be: A key advocate for best coding practice, standards, and innovation. TDD approach is key for this More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Maxwell Bond
Senior Developer, you’ll work with Java (17+), SpringBoot, Kotlin and NoSQL databases . They’ve got their own internal cloud and toolings using Prometheus and Grafana. As their Senior Developer, you’ll be: A key advocate for best coding practice, standards, and innovation. TDD approach is key for this More ❯
Collaborate with developers on performance tuning and troubleshoot cross-region network issues. Monitoring & Observability Set up and manage monitoring, alerting, and logging using OpenSearch, Prometheus, and Grafana. Develop dashboards for real-time network insights. Collaboration & Knowledge Sharing Partner with developers, traders, and data engineers to align infrastructure with business needs. … GitLab CI/CD pipelines and GitOps principles Knowledge of container orchestration platforms like Kubernetes (EKS) Experience with monitoring and observability tools including OpenSearch, Prometheus, and Grafana Understanding of security best practices and AWS CIS Benchmark standards Experience with low-latency network design and optimization Strong verbal communication and documentation More ❯