Hemel Hempstead, Hertfordshire, United Kingdom Hybrid / WFH Options
Eckoh
and automation tooling (Gitlab experience preferable). Experience with "infrastructure as code" (Terraform, CloudFormation), containerisation (Docker), and orchestration (Kubernetes). Proficiency with observability and monitoring solutions (e.g., CloudWatch, Prometheus, Grafana, Splunk). Strong understanding of cloud-native development practices and agile ways of working. Confident conducting peer code reviews and providing constructive technical feedback. Desirables: Experience designing solutions in multi More ❯
Hemel Hempstead, Hertfordshire, South East, United Kingdom Hybrid / WFH Options
Eckoh PLC
and automation tooling (Gitlab experience preferable). Experience with 'infrastructure as code' (Terraform, CloudFormation), containerisation (Docker), and orchestration (Kubernetes). Proficiency with observability and monitoring solutions (e.g., CloudWatch, Prometheus, Grafana, Splunk). Strong understanding of cloud-native development practices and agile ways of working. Confident conducting peer code reviews and providing constructive technical feedback. Desirables: Experience designing solutions in multi More ❯
or similar. Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, or Datadog or NewRelic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with proficiency More ❯
up and managing monitoring, metrics, and alerting systems Experience operating production-grade services at scale Great to have: Experience with tools such as: Terraform, SaltStack, MongoDB, Elasticsearch, Kafka, Prometheus, Grafana or HashiCorp Vault Experience with securing applications, services, and data, including authentication, authorization, TLS, and encryption Exposure to Kubernetes (administering, deploying, or developing apps on K8s clusters) Understanding of compliance More ❯
and service flow mappings aligned to engineering needs. Help teams craft complex DQL queries to extract meaningful insights from telemetry data. Support observability design and migration efforts from Prometheus, Grafana, and CloudWatch to Dynatrace. Advise on RBAC models and data access strategies based on team structure and security requirements. Assist in monitoring strategy for Kubernetes-based workloads, especially in hybrid More ❯
skills, with the ability to present complex technical concepts clearly to diverse audiences. Some familiarity with containerization technologies (Docker, Podman, Kubernetes) and cloud (AWS, GCP) application deployment including monitoring (Grafana). Desire to work in a rapidly evolving startup environment where you will have a high impact. Strong academic background with at least a Bachelor's degree in a technical More ❯