Permanent Datadog Jobs in Essex

1 to 25 of 58 Permanent Datadog Jobs in Essex

Site Reliability Engineer

chelmsford, east anglia, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or More ❯
Posted:

Site Reliability Engineer

basildon, east anglia, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or More ❯
Posted:

Site Reliability Engineer

colchester, east anglia, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or More ❯
Posted:

Senior Site Reliability Engineer

basildon, east anglia, United Kingdom
Cipher7
Azure is a plus). Deep understanding of Container Orchestration technologies such as Kubernetes and Docker . Proficiency in monitoring and logging tools including: Datadog , Splunk , Dynatrace , AppDynamics , Prometheus , Grafana , ELK Stack , CloudWatch , Gremlin , ThousandEyes . Experience with Terraform , Jenkins , GitLab CI , PostgreSQL , Redis , and Kong API Gateway . Solid More ❯
Posted:

Senior Site Reliability Engineer

chelmsford, east anglia, United Kingdom
Cipher7
Azure is a plus). Deep understanding of Container Orchestration technologies such as Kubernetes and Docker . Proficiency in monitoring and logging tools including: Datadog , Splunk , Dynatrace , AppDynamics , Prometheus , Grafana , ELK Stack , CloudWatch , Gremlin , ThousandEyes . Experience with Terraform , Jenkins , GitLab CI , PostgreSQL , Redis , and Kong API Gateway . Solid More ❯
Posted:

Senior Site Reliability Engineer

colchester, east anglia, United Kingdom
Cipher7
Azure is a plus). Deep understanding of Container Orchestration technologies such as Kubernetes and Docker . Proficiency in monitoring and logging tools including: Datadog , Splunk , Dynatrace , AppDynamics , Prometheus , Grafana , ELK Stack , CloudWatch , Gremlin , ThousandEyes . Experience with Terraform , Jenkins , GitLab CI , PostgreSQL , Redis , and Kong API Gateway . Solid More ❯
Posted:

Head of SRE and Production Engineering

basildon, east anglia, United Kingdom
SS&C Technologies
pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
Posted:

Head of SRE and Production Engineering

chelmsford, east anglia, United Kingdom
SS&C Technologies
pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
Posted:

Head of SRE and Production Engineering

colchester, east anglia, United Kingdom
SS&C Technologies
pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
Posted:

Software Engineer – Backend (On-site)

colchester, east anglia, United Kingdom
Zettafleet
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
Posted:

Software Engineer – Backend (On-site)

basildon, east anglia, United Kingdom
Zettafleet
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
Posted:

Software Engineer – Backend (On-site)

chelmsford, east anglia, United Kingdom
Zettafleet
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
Posted:

Site Reliability Engineer

chelmsford, east anglia, United Kingdom
DNSINFOLTD
with cloud platforms Primarily in AWS Cloud (e.g., AWS, GCP, Azure) and Container Orchestration (e.g., Kubernetes, Docker). Proficiency in Monitoring and Logging Tools: Datadog, Splunk, Dynatrace, AppDynamics, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Cloude Watch, Gremlin, Thousand Eyes. Terraform, Jenkins, GitLab CI, PostgreSQL, Redis, Kong API. Infrastructure skills More ❯
Posted:

Site Reliability Engineer

colchester, east anglia, United Kingdom
DNSINFOLTD
with cloud platforms Primarily in AWS Cloud (e.g., AWS, GCP, Azure) and Container Orchestration (e.g., Kubernetes, Docker). Proficiency in Monitoring and Logging Tools: Datadog, Splunk, Dynatrace, AppDynamics, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Cloude Watch, Gremlin, Thousand Eyes. Terraform, Jenkins, GitLab CI, PostgreSQL, Redis, Kong API. Infrastructure skills More ❯
Posted:

Site Reliability Engineer

basildon, east anglia, United Kingdom
DNSINFOLTD
with cloud platforms Primarily in AWS Cloud (e.g., AWS, GCP, Azure) and Container Orchestration (e.g., Kubernetes, Docker). Proficiency in Monitoring and Logging Tools: Datadog, Splunk, Dynatrace, AppDynamics, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Cloude Watch, Gremlin, Thousand Eyes. Terraform, Jenkins, GitLab CI, PostgreSQL, Redis, Kong API. Infrastructure skills More ❯
Posted:

Site Reliability Engineer

colchester, east anglia, United Kingdom
Ranger Technical Resources
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Containerization (Docker, Kubernetes, Fargate) Incident Response IaC (Terraform, CloudFormation, Helm, CDK) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and More ❯
Posted:

Site Reliability Engineer

basildon, east anglia, United Kingdom
Ranger Technical Resources
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Containerization (Docker, Kubernetes, Fargate) Incident Response IaC (Terraform, CloudFormation, Helm, CDK) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and More ❯
Posted:

Site Reliability Engineer

chelmsford, east anglia, United Kingdom
Ranger Technical Resources
monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Containerization (Docker, Kubernetes, Fargate) Incident Response IaC (Terraform, CloudFormation, Helm, CDK) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and More ❯
Posted:

Cloud Platform Lead

basildon, east anglia, United Kingdom
SoCode Recruitment
Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar AI Coding Tools : GitHub Copilot Cursor Claude Code 📩 Interested More ❯
Posted:

Cloud Platform Lead

chelmsford, east anglia, United Kingdom
SoCode Recruitment
Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar AI Coding Tools : GitHub Copilot Cursor Claude Code 📩 Interested More ❯
Posted:

Cloud Platform Lead

colchester, east anglia, United Kingdom
SoCode Recruitment
Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar AI Coding Tools : GitHub Copilot Cursor Claude Code 📩 Interested More ❯
Posted:

Site Reliability Engineer

colchester, east anglia, United Kingdom
Hybrid / WFH Options
Halian
SOPs. Develop scripts and tools to automate infrastructure provisioning and application deployment. Implement best practices for observability and monitoring using tools like New Relic, DataDog, or Splunk. Influence design decisions to ensure scalable, secure architecture and high availability. Key Requirements: 5+ years in Site Reliability Engineering and/or DevOps More ❯
Posted:

Site Reliability Engineer

basildon, east anglia, United Kingdom
Hybrid / WFH Options
Halian
SOPs. Develop scripts and tools to automate infrastructure provisioning and application deployment. Implement best practices for observability and monitoring using tools like New Relic, DataDog, or Splunk. Influence design decisions to ensure scalable, secure architecture and high availability. Key Requirements: 5+ years in Site Reliability Engineering and/or DevOps More ❯
Posted:

Site Reliability Engineer

chelmsford, east anglia, United Kingdom
Hybrid / WFH Options
Halian
SOPs. Develop scripts and tools to automate infrastructure provisioning and application deployment. Implement best practices for observability and monitoring using tools like New Relic, DataDog, or Splunk. Influence design decisions to ensure scalable, secure architecture and high availability. Key Requirements: 5+ years in Site Reliability Engineering and/or DevOps More ❯
Posted:

DevOps Engineer

chelmsford, east anglia, United Kingdom
Hybrid / WFH Options
Prism Digital
define, version, and manage infrastructure as code across multiple environments. GitHub Actions & OIDC – build and maintain automated CI/CD pipelines with secure authentication. Datadog, Prometheus or similar – implement logging, metrics, and alerting for robust observability – the interim CTO is keen to hear your recommendation(s) on tooling and implementation More ❯
Posted: