Permanent 'Datadog' Job Vacancies

1 to 25 of 315 Permanent Datadog Jobs

Data DevOps Engineer

St Albans, England, United Kingdom
Addition+
Kubernetes; experienced in scalable, portable BI and data environments. Environment Management: Managed Dev/QA/UAT freshness, data synchronisation, and Jira-integrated release workflows. Observability & Monitoring: Implemented CloudWatch, Datadog, Prometheus, and Grafana for logging, metrics, and alerting. Troubleshooting & Problem Solving: Strong analytical and cross-functional collaboration skills; effective under pressure. Project Delivery: Managed multiple concurrent BI and data releases More ❯

Senior DevOps Engineer Belfast, Northern Ireland, United Kingdom

Belfast, United Kingdom
TRG Screen
pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins) Experience with configuration management tools such as Chef/Puppet Strong proficiency in scripting/programming (Python, Go, or similar) Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana) Knowledge of microservices architecture and service mesh technologies Understanding of security best practices and compliance frameworks Comfortable with asynchronous collaboration tools (Slack, Teams) Agile mindset More ❯
Employment Type: Permanent
Salary: GBP Annual

Software Engineer - Site Reliability Engineer (SRE)

Texas, United States
Lovelace Ai
Azure) and related services (e.g., EC2, S3, Lambda, Kubernetes). Experience with containerization and orchestration technologies like Docker and Kubernetes. Proficiency with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Dynatrace, ELK Stack). Strong understanding of networking fundamentals (DNS, HTTP, TCP/IP), load balancing, and CDNs. Experience with CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI) and More ❯
Employment Type: Permanent
Salary: USD Annual

DevOps Engineer

London Area, United Kingdom
Hybrid/Remote Options
Signify Technology
Strong scripting skills in Python, Bash, or similar. Familiarity with Linux administration, networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills: Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). Experience with serverless architectures More ❯

DevOps Engineer

City of London, London, United Kingdom
Hybrid/Remote Options
Signify Technology
Strong scripting skills in Python, Bash, or similar. Familiarity with Linux administration, networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills: Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). Experience with serverless architectures More ❯

Principal, DevOps Engineer

San Mateo, California, United States
Ptc
Operations: Manage and optimize cloud environments (AWS, Azure, GCP), ensuring high availability and cost efficiency. Monitoring & Observability: Implement and maintain monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK, Datadog). Security & Compliance: Enforce security best practices and ensure compliance with industry standards (e.g., SOC 2). Mentorship: Provide technical leadership and mentorship to DevOps engineers and other team members. More ❯
Employment Type: Permanent
Salary: USD Annual

Site Reliability Engineer

Atlanta, Georgia, United States
Robotics technology LLC
Kubernetes, Docker. Knowledge of networking fundamentals (TCP/IP, DNS, load balancing). Proficiency in Linux/Unix administration, scripting (Python, Bash, or similar). Experience with monitoring tools (Prometheus, Grafana, Datadog). Familiarity with containerization (Docker, Kubernetes) and cloud services. Experience with CI/CD systems (Jenkins, GitHub Actions, GitLab CI). Strong analytical and problem-solving skills. Knowledge of security practices (IAM More ❯
Employment Type: Any
Salary: USD Annual

Lead Engineer

City of London, London, United Kingdom
Hybrid/Remote Options
Sanderson
Lambda, DynamoDB). Drive automation across CI/CD pipelines using tools like GitHub Actions, Terraform, and Argo CD for seamless and secure deployments. Enhance observability using Prometheus, Grafana, Datadog, and CloudWatch, enabling proactive incident prevention. Own incident management and post-mortem practices, guiding the team through challenges calmly and driving meaningful improvement. Collaborate with global engineering and product teams More ❯

Lead Engineer

London Area, United Kingdom
Hybrid/Remote Options
Sanderson
Lambda, DynamoDB). Drive automation across CI/CD pipelines using tools like GitHub Actions, Terraform, and Argo CD for seamless and secure deployments. Enhance observability using Prometheus, Grafana, Datadog, and CloudWatch, enabling proactive incident prevention. Own incident management and post-mortem practices, guiding the team through challenges calmly and driving meaningful improvement. Collaborate with global engineering and product teams More ❯

Senior Software Engineer (Data & ML Platform)

Galway, Ireland
Datavant Corporation
testing, and incident management. Hands-on experience with Databricks, MLflow, or similar ML/ETL platforms is a plus. Bonus: Experience with container orchestration (Kubernetes) and observability tools like Datadog, Prometheus, or Grafana. Passion for building tools and platforms that empower teams and improve developer velocity. Excitement, passion and curiosity about our mission of connecting the world's health data More ❯
Employment Type: Permanent
Salary: EUR 125,000 - 150,000 Annual

Senior DevOps Engineer

United States
Par Technology
/TLS, VPNs, and Cloud Networking. • Experience implementing CI/CD pipelines (Jenkins, GitHub Actions, ArgoCD, GitOps). • Expertise in observability/monitoring tools (Prometheus, Grafana, ELK, New Relic, Datadog). • Security-focused mindset with IAM, secrets management (HashiCorp Vault, AWS Secrets Manager), compliance frameworks. • Leadership experience mentoring engineers and driving DevOps best practices. • Strong problem-solving skills with a More ❯
Employment Type: Permanent
Salary: USD Annual

Senior Site Reliability Engineer

United States
Hybrid/Remote Options
Euna Solutions
with strong debugging and code optimization skills. Hands-on experience with IaC tools - especially Terraform. Extensive CI/CD pipeline design & management experience. Familiarity with observability platforms (Prometheus, Coralogix, Datadog, etc.). Strong understanding of cloud platforms (AWS, Azure) and containerization (Docker, Kubernetes). Ability to troubleshoot complex issues across the full stack - from code to infrastructure. Excellent communication and More ❯
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Foster City, California, United States
Visa
understanding of Linux/Unix systems, networking protocols, certificate management, secret management, system design, cloud platforms (AWS, Azure, GCP), and containerization (Kubernetes, Docker). • Proficiency with monitoring tools (Prometheus, Grafana, Datadog, etc.), logging systems (ELK stack, Splunk), and tracing tools (Jaeger, Zipkin). • Proficiency in infrastructure-as-code tools such as Terraform and Ansible. • Hands-on experience with CI/CD More ❯
Employment Type: Permanent
Salary: USD Annual

DevOps Engineer

London Area, United Kingdom
Hybrid/Remote Options
Advanced Resource Managers
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯

DevOps Engineer

City of London, London, United Kingdom
Hybrid/Remote Options
Advanced Resource Managers
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯

Senior Software Engineer - Observo AI

United States
Sentinelone
analytics and anomaly detection systems using advanced machine learning techniques and large language models Architect cloud-native microservices and APIs that integrate seamlessly with major observability platforms (Splunk, Elastic, Datadog, New Relic) Implement robust monitoring, alerting, and observability solutions for distributed systems operating at enterprise scale Collaborate with Product and DevOps teams to translate customer requirements into technical solutions Optimize More ❯
Employment Type: Permanent
Salary: USD Annual

Senior Staff Software Engineer - Observo AI

United States
Sentinelone
analytics and anomaly detection systems using advanced machine learning techniques and large language models Design cloud-native microservices and APIs that integrate seamlessly with major observability platforms (Splunk, Elastic, Datadog, New Relic) Establish robust monitoring, alerting, and observability solutions for distributed systems operating at enterprise scale Lead cross-functional technical initiatives, collaborating with Product, Data Science, and DevOps teams to More ❯
Employment Type: Permanent
Salary: USD Annual

Senior Development Enablement Engineer

Edinburgh, Midlothian, United Kingdom
Hybrid/Remote Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate: The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security-first mindset. Strong programming skills in languages such as .NET, JavaScript, Python or similar. Experience with CI/CD More ❯
Employment Type: Permanent
Salary: GBP Annual

DevOps Engineer

United States
Hybrid/Remote Options
Canals Ai
GitHub Actions, CircleCI, etc.) and infrastructure as code (Terraform, CloudFormation). Proficiency in Docker and container orchestration (ECS or Kubernetes). Familiarity with monitoring, alerting, and logging (Prometheus, Grafana, Datadog, etc.). Experience securing systems and managing secrets, permissions, and network policies. Strong communication skills and comfort working remotely in a fast-moving, product-focused environment. Prior experience in high More ❯
Employment Type: Permanent
Salary: USD Annual

Senior DevOps Engineer

Palo Alto, California, United States
Clockwork.io
Terraform, Pulumi, or similar tools. Collaborate with engineering teams to improve deployment workflows, observability, and performance monitoring. Set up and manage logging, alerting, and monitoring frameworks (Prometheus, Grafana, ELK, Datadog, etc.). Champion security best practices, including secrets management and vulnerability assessments. Drive automation across environments to reduce manual effort and increase reliability. What We're Looking For 4+ years More ❯
Employment Type: Permanent
Salary: USD Annual

DevOps Engineer

Plano, Texas, United States
PROLIM Corporation
YAML, JSON Build Tools: Maven, Gradle, NPM, Bazel, Go Databases: RDS, SQL, MySQL, Postgres, Redshift, MongoDB, DynamoDB Security Scans: SAST, Secrets, Container, DAST, Xray, Prisma Cloud Logging and Monitoring: Datadog, Splunk, AppDynamics, ELK, Grafana About PROLIM Corporation: PROLIM is a leading provider of end-to-end IT, PLM and Engineering Services and Solutions for Global 1000 companies. They understand More ❯
Employment Type: Permanent
Salary: USD Annual

Staff DevOps Engineer

Denver, Colorado, United States
Hybrid/Remote Options
Cleerly
in AWS security, encryption, and backup practices, including compliance with frameworks such as SOC 2, HIPAA, and HITRUST. Manage monitoring and log analysis using tools like CloudWatch, CloudTrail, GuardDuty, Datadog, and Sentry. Collaborate with application teams to gather requirements and deliver secure, scalable migration paths using AWS services like CloudFront, ECS, EC2, EKS, ElastiCache, Aurora, DynamoDB, SQS, SNS, Step Functions More ❯
Employment Type: Permanent
Salary: USD Annual

Staff Site Reliability Engineer

San Francisco, California, United States
Altana
Site Reliability Engineering (SRE) principles, including SLOs, error budgets, toil reduction, and blameless culture. Expertise in designing, implementing, and managing observability platforms for cloud-native environments (e.g., Prometheus, Grafana, Datadog, ELK stack, OpenTelemetry, Jaeger). Proficiency in at least one programming/scripting language (e.g., Python, Go) for automation and tool development. Extensive hands-on experience with cloud platforms (AWS More ❯
Employment Type: Permanent
Salary: USD Annual

Head of Infrastructure

London Area, United Kingdom
Hybrid/Remote Options
Harnham
platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code: Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark. Experience with high-performance computing, distributed systems, and observability tools. Strong communication and executive presence, with the More ❯

Head of Infrastructure

City of London, London, United Kingdom
Hybrid/Remote Options
Harnham
platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code: Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark. Experience with high-performance computing, distributed systems, and observability tools. Strong communication and executive presence, with the More ❯

Datadog
10th Percentile: £40,650
25th Percentile: £62,500
Median: £75,000
75th Percentile: £93,750
90th Percentile: £106,500