working with one or more of the following tools: Kibana, OpenSearch, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFx, Instana, Splunk, Honeycomb, Jaeger. Hands-on experience with Infrastructure as Code (Terraform/Ansible). Hands-on experience in technical integrations (OpenTelemetry/Fluentd/Fluent Bit/Filebeat/…
private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus. Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana, Nagios, Zabbix. Familiarity with Big Data tools: Hadoop, HDFS, Spark, HBase. Ability to write code in Go, Python, Bash, or Perl for automation. Work…
Infrastructure as Code (IaC): Proficiency with IaC tools such as Terraform or CloudFormation. Distributed Tracing: Experience with distributed tracing tools like Jaeger or OpenTelemetry for debugging microservices. Security: Strong knowledge of securing microservices, Kubernetes clusters, and cloud-based applications. Additional Information: We believe that coming together as…
London, South East England, United Kingdom Hybrid / WFH Options
DeFinitive
development (GitHub Actions/GitOps preferred). Strong PostgreSQL management skills, including scaling and optimisation. Familiarity with observability tools (e.g. Prometheus, Grafana, Loki, Tempo, Jaeger). Strong problem-solving, communication, and teamwork abilities. Bonus points for: Experience with identity providers (Okta, Auth0, Cognito, Keycloak) and authentication protocols. Any knowledge of…
Proficiency in using Puppet for configuration management, automation, and system provisioning. Hands-on experience with monitoring and observability platforms such as Grafana, Prometheus, Elasticsearch, and Jaeger. Experience with cloud architectures such as GCP or AWS. Familiarity with SQL databases and broker systems such as Kafka. You are a solution-oriented professional…
role in managing and optimising microservice communications, ensuring seamless integration and performance across various platforms and technologies. Responsibilities: Utilise monitoring tools such as Splunk, Jaeger, Kiali, xMatters, AppDynamics, and Grafana to ensure system performance and reliability. Manage file transfer servers for efficient sending and receiving of files. Automate the scheduling…
retrieval-augmented generation). Experience with ML-specific CI/CD pipelines and model governance best practices. Familiarity with monitoring and observability tools like Jaeger, Prometheus, Grafana, or Datadog. Experience working in startups or fast-paced teams, balancing rapid iteration with production-grade reliability. We believe we offer career-defining…