Hounslow, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
/technologies as possible: AWS Cloud and AWS Services Containerisation with Docker and/or Kubernetes Terraform Strong CI/CD (GitOps, ArgoCD, CircleCI etc) knowledge Monitoring experience with Prometheus and Grafana Linux and Network concepts Experience on Crypto and/or Trading platforms is also a requirement The role can offer remote working anywhere in the UK. #J More ❯
operating infrastructure on AWS and other providers Operating MongoDB (or other document database) clusters Operating Redis (or other key-value storage) clusters Administering Linux servers Maintaining distributed software Operating Prometheus and Grafana Operating logging collection and analysis systems Working hours within 16:00pm - 4:00am UTC Skills: Kubernetes & containers (advanced) AWS/EKS (advanced) Linux (advanced) Terraform and IaC in … general (proficient) Helm (proficient) Go and/or Python (familiar) MongoDB (or similar) Redis (or similar) Monitoring - prometheus, grafana, thanos (familiar) Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.) Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP) Proactive, energetic, innovative and change oriented Nice to have: GCP or Azure Bare metal infrastructure engineering API management More ❯
response and recovery workflows, and ensure systems meet internal SLOs/SLAs and reliability targets. Support the migration from our legacy ELK stack to a modern observability platform using Prometheus, Mimir, Grafana, Honeycomb, Loki, Quickwit , and OpenTelemetry . Contribute to knowledge sharing and the ongoing development of best practices in observability across the organisation. What you'll need: 4+ years … languages such as Python, Java, JavaScript , or Ruby . Familiarity with Kubernetes , AWS , and infrastructure-as-code tools such as Terraform Experience working with observability tools and platforms (e.g. Prometheus, Grafana, ELK, Honeycomb, Loki , or similar). A strong interest in developer experience and platform tooling, with the ability to empathise with engineering teams as internal customers. Excellent communication skills More ❯
London, England, United Kingdom Hybrid / WFH Options
WunderGraph, Inc
but is not limited to: Architecting, building, and operating the core cloud-native infrastructure for WunderGraph Cosmo, primarily using Go and Kubernetes. Owning and evolving our observability stack (OpenTelemetry, Prometheus, ClickHouse) and the infrastructure supporting our AI-driven features to ensure deep, actionable insights into our systems. Building and optimizing CI/CD pipelines to improve build times, automate quality … architecture, distributed systems, and the challenges of running high-performance API gateways. Familiarity with GraphQL Federation is a significant plus. Experience building or managing modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ClickHouse). A self-starter attitude and a leader’s mindset: you are comfortable with ambiguity, can identify and solve ill-defined problems, and don’t need hand-holding. More ❯
Hounslow, London, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
members and work independently across technical tasks What You'll Need Languages & Tools: Python, Ansible (C++, Go a plus), Git, Jira, Confluence Cloud & Infrastructure: Azure, Kubernetes, OpenShift Monitoring: Splunk, Prometheus, Grafana Databases: Oracle (OCA/OCP a plus) Environments: Linux/Unix Strong debugging, problem-solving, and collaboration skills Proven experience in DevOps and service reliability roles Interested? Apply now More ❯
Hounslow, Middlesex, England, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions Ltd
members and work independently across technical tasks What You'll Need Languages & Tools: Python, Ansible (C++, Go a plus), Git, Jira, Confluence Cloud & Infrastructure: Azure, Kubernetes, OpenShift Monitoring: Splunk, Prometheus, Grafana Databases: Oracle (OCA/OCP a plus) Environments: Linux/Unix Strong debugging, problem-solving, and collaboration skills Proven experience in DevOps and service reliability roles Interested? Apply now More ❯
Strong understanding of software design patterns , clean code practices, and Agile methodologies Nice to Have: Experience with GraphQL or gRPC Exposure to monitoring/logging tools (e.g., CloudWatch, ELK, Prometheus) Knowledge of security best practices in API and cloud development Familiarity with data streaming using Kafka or Kinesis #J-18808-Ljbffr More ❯
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on Digital or Technology More ❯
experience with AWS services Proven knowledge of Kubernetes and containerised application delivery Infrastructure as Code with Terraform CI/CD pipelines using GitLab or Drone Monitoring and logging – Grafana, Prometheus, or CloudWatch Experience in secure environments – knowledge of IAM, Vault, networking Active UK*C or Enhanced DV Clearance is a must (sole British nationals only) TO BE CONSIDERED: Please apply More ❯
experience with AWS services Proven knowledge of Kubernetes and containerised application delivery Infrastructure as Code with Terraform CI/CD pipelines using GitLab or Drone Monitoring and logging – Grafana, Prometheus, or CloudWatch Experience in secure environments – knowledge of IAM, Vault, networking Active UK*C or Enhanced DV Clearance is a must (sole British nationals only) TO BE CONSIDERED: Please apply More ❯
building internal tooling and services Hands-on experience with AWS, Kubernetes, Docker, and modern CI/CD pipelines Familiarity with infrastructure-as-code (e.g., Terraform) and observability tooling (e.g., Prometheus, Grafana) Comfortable working on distributed systems and improving developer workflows A product mindset and a collaborative approach to problem-solving Experience with Kafka, gRPC, or open-source contributions is a More ❯
Birmingham, West Midlands (County), United Kingdom
Syntax Consultancy Ltd
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g. Grafana, Alert Manager, Prometheus, Node exporter ). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on Digital or Technology projects More ❯
Birmingham, West Midlands (County), United Kingdom
Syntax Consultancy Ltd
automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g. Grafana, Alert Manager, Prometheus, Node exporter ). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on Digital or Technology projects More ❯
Have : Experience with Event-Driven Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this More ❯
Helm, Bash, Python). • Solid understanding of microservices, zero-trust security, mTLS, RBAC, and network policies. • Experience with CI/CD tools, logging (e.g., Fluentd, Loki), and monitoring (e.g., Prometheus, Grafana). About us Ascendion is a Global, leading provider of AI-first software engineering services, delivering transformative solutions across North America, APAC, and Europe. We are headquartered in New More ❯
Helm, Bash, Python). • Solid understanding of microservices, zero-trust security, mTLS, RBAC, and network policies. • Experience with CI/CD tools, logging (e.g., Fluentd, Loki), and monitoring (e.g., Prometheus, Grafana). About us Ascendion is a Global, leading provider of AI-first software engineering services, delivering transformative solutions across North America, APAC, and Europe. We are headquartered in New More ❯
with infrastructure automation and configuration management (Chef, Puppet, or Ansible) Experience with distributed storage and different storage protocols Knowledge of observability in distributed systems (e.g., Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana) Excellent written and verbal communication skills Eagerness to continuously learn new technologies and develop professionally Ability to work well in a fast-paced environment The minimum base salary for More ❯
Helm, Bash, Python). Solid understanding of microservices, zero-trust security, mTLS, RBAC, and network policies. Experience with CI/CD tools, logging (e.g., Fluentd, Loki), and monitoring (e.g., Prometheus, Grafana). About us Ascendion is a global, leading provider of AI-first software engineering services, delivering transformative solutions across North America, APAC, and Europe. We are headquartered in New More ❯
infrastructure across on-prem and AWS Administer and optimise Kubernetes clusters and containerised pipelines Implement and maintain Infrastructure as Code using Terraform Improve observability and resilience using tools like Prometheus Manage and monitor GitLab CI/CD pipelines for multi-platform builds (Linux, Windows, macOS) Collaborate with engineering teams to optimise developer workflows and apply DevOps best practices Set clear More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Anson McCade
Desirable Experience Delivery of secure software in government, defence, or other regulated sectors Hands-on cloud-native development and deployment Knowledge of logging and monitoring tools such as DataDog, Prometheus, or StackDriver Experience working with product lifecycle tooling and engineering in complex domains If you’re looking to focus on real engineering work that drives meaningful outcomes and want to More ❯
e.g., Cloud, artificial intelligence, Android, etc.) Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker Familiarity More ❯
services. Experience with Terraform for infrastructure as code. Proficiency with Git and managing multiple repositories. Experience with scripting languages like Bash or Python. Knowledge of monitoring tools such as Prometheus, Grafana, or CloudWatch. Understanding of Git branching strategies and source control management. Preferred Skills Familiarity with AI/ML-driven build automation tools and predictive analytics for CI/CD. More ❯
London, England, United Kingdom Hybrid / WFH Options
Wayve Technologies Ltd
or Azure). Expert of CI/CD processes, containerization (Docker, Kubernetes), and a deep understanding of networking, distributed systems, and databases. Expert with monitoring and troubleshooting utilities (DataDog, Prometheus, Grafana, ELK stack, Splunk, Humio, etc.). Exceptional problem-solving skills and a detail-oriented mindset, coupled with outstanding communication abilities. Desirable Experience with Azure, a background in autonomous vehicles More ❯
equivalent preferred). Experience with cloud-based hosting platforms like AWS, Azure, or GCP and/or experience with hardware-based environments. Familiarity with monitoring systems using tools like Prometheus and writing health checks. Proficiency with one programming language, such as Java, Go, Python, JavaScript, or similar languages. #J-18808-Ljbffr More ❯
experience with either Python or Go Building CI/CD pipelines and automation of various parts of the stack Self-hosting and maintaining observability tools such as Grafana/Prometheus It would be great if you also have experience with one or more Edge/IoT infrastructure (Yocto, IoT devices provisioning, over-the-air updates..) Remote management of on-prem More ❯