Datadog Job Vacancies

1 to 25 of 1,040 Datadog Jobs

Senior Cloud Engineer with Security Clearance

Chantilly, Virginia, United States
Arion Systems, inc
etc.) for automation and management tasks. • Strong understanding of cloud networking, security (e.g., VPC, IAM, VPN, etc.), and monitoring tools. • Experience with logging and monitoring tools (e.g., CloudWatch, Prometheus, Datadog). • Knowledge of cloud-native technologies (e.g., Lambda, S3, EC2, GKE, App Services, etc.). • An existing DoD 8570 Baseline IAT Certification; higher levels preferred. • TS or TS/SCI … Microsoft Certified: Azure Solutions Architect Expert, Certified Kubernetes Administrator (CKA) • Experience with serverless computing (AWS Lambda, Azure Functions, etc.). • Familiarity with logging and monitoring platforms like CloudWatch, Prometheus, Datadog, or Splunk. • Experience with CI/CD tools like Jenkins, GitHub Actions, or GitLab CI. • An adjudicated Counterintelligence polygraph. Soft Skills: • Self-driven • Strong communication and interpersonal skills. • Ability to More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Cloud Engineer with Security Clearance

Chantilly, Virginia, United States
Arion Systems, inc
An existing DoD 8570 Baseline IAT Certification; higher levels preferred. • Experience with serverless computing (AWS Lambda, Azure Functions, etc.). • Familiarity with logging and monitoring platforms like CloudWatch, Prometheus, Datadog, or Splunk. • Experience with CI/CD tools like Jenkins, GitHub Actions, or GitLab CI. • An adjudicated Counterintelligence Polygraph. Soft Skills: • Self-driven • Strong communication and interpersonal skills. • Ability to More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Senior Machine Learning Ops Engineer

London, England, United Kingdom
DailyPay
integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration and Mentorship: Collaborate effectively with data scientists, engineers, and other stakeholders. Provide guidance and support to junior team members Performance Optimization: Continuously optimize … and implement efficient CI/CD pipelines Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills and the ability to work More ❯
Posted:

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer More ❯
Posted:

DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/ )

United Kingdom
Hybrid / WFH Options
Crane Venture Partners
Who we are We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Mid-Senior DevOps / Site Reliability Engineer (m/f/*)

London, England, United Kingdom
Hybrid / WFH Options
Quaisr Limited
DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/*) We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are You are a More ❯
Posted:

Senior Machine Learning Ops Engineer

Belfast, United Kingdom
DailyPay
integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration and Mentorship: Collaborate effectively with data scientists, engineers, and other stakeholders. Provide guidance and support to junior team members Performance Optimization: Continuously optimize … and implement efficient CI/CD pipelines Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills and the ability to work More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer - Monitoring & Observability

London, England, United Kingdom
Lumenalta
AWS CDK, or CloudFormation to automate cloud resource provisioning, enabling consistent and repeatable infrastructure deployments. Monitoring & Observability: Implement monitoring, logging, and alerting solutions using tools like Prometheus, Grafana, Loki, Datadog, or CloudWatch to ensure system health and performance. Security & Compliance: Implement security best practices for cloud infrastructure, including IAM policies, security groups, and VPC configurations, to ensure compliance and data More ❯
Posted:

Site Reliability Engineer

Southampton, Hampshire, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer More ❯
Employment Type: Permanent
Posted:

Site Reliability Engineer

Portsmouth, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer More ❯
Posted:

Site Reliability Engineer

Hampshire, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer More ❯
Posted:

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet, or Chef Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer More ❯
Posted:

Technical Account Manager - DevOps Specialist

London, England, United Kingdom
ITR Partners
the equivalent with Azure and GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations More ❯
Posted:

Site Reliability Engineer (SRE) - Weekend Coverage

London, England, United Kingdom
Hybrid / WFH Options
Elwood Technologies
closely with engineering teams to design and deploy scalable, fault-tolerant infrastructure solutions on AWS or GCP . Improve observability by utilizing monitoring, logging, and alerting systems (e.g., CloudWatch , Datadog ). Lead post-incident reviews , contribute to the continuous improvement of system reliability and follow up on strategic fixes. Develop and update runbooks, incident response playbooks, and documentation. Work closely … love it if you have experience of some or all of the following: Experience with client-impact triage , working cross-functionally with account managers or product teams. Proficiency with Datadog or similar observability platforms. Knowledge of serverless architectures (e.g., AWS Lambda, GCP Cloud Functions). Familiarity with RDBMS and NoSQL databases , such as RDS, CloudSQL, DynamoDB. Prior experience in fintech More ❯
Posted:

Junior DevOps Engineer

City of London, London, United Kingdom
Sparta Global
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

Junior DevOps Engineer

London Area, United Kingdom
Sparta Global
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

Junior DevOps Engineer

london, south east england, united kingdom
Sparta Global
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

Junior DevOps Engineer

london (city of london), south east england, united kingdom
Sparta Global
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

Junior DevOps Engineer

Slough, England, United Kingdom
JR United Kingdom
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

DevOps Engineer

United Kingdom
CareerUS Solutions
Implement and manage containerization and orchestration platforms (Docker, Kubernetes, ECS, etc.). Monitor system performance, availability, and security using monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Splunk, Datadog). Troubleshoot and resolve infrastructure and application issues in development, test, and production environments. Collaborate with development teams to ensure smooth code deployments and environment consistency. Maintain version control systems More ❯
Posted:

DevOps Engineer

London, England, United Kingdom
CareerUS Solutions
Implement and manage containerization and orchestration platforms (Docker, Kubernetes, ECS, etc.). Monitor system performance, availability, and security using monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Splunk, Datadog). Troubleshoot and resolve infrastructure and application issues in development, test, and production environments. Collaborate with development teams to ensure smooth code deployments and environment consistency. Maintain version control systems More ❯
Posted:

Junior DevOps Engineer

London, England, United Kingdom
Sparta Global
with cloud platforms such as AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately 18 months of experience in a More ❯
Posted:

DevOps Engineer

London, England, United Kingdom
Welfordsystems
compliance. Collaborate with development and operations teams to improve system performance and scalability. Maintain and improve logging, monitoring, and alerting systems using tools like Prometheus, Grafana, ELK Stack, or Datadog Support and optimize infrastructure for both Linux and Windows-based environments. Participate in incident management, problem resolution, and root cause analysis. Ensure documentation of infrastructure, processes, and best practices is More ❯
Posted:

Director, DevOps

London, England, United Kingdom
Choreograph
Actions, CircleCI ) and orchestration technologies (e.g., Kubernetes, Docker). Proficiency in scripting and programming languages (e.g., Python, Bash, Go). Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog). Solid understanding of security best practices, compliance standards, and DevSecOps . Proven ability to manage and deliver complex projects on time and within budget. Strong interpersonal, communication, and problem More ❯
Posted:

DevOps Engineer/Analyst

Colchester, Essex, United Kingdom
Enigen UK
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, or Datadog or NewRelic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Datadog
10th Percentile
£48,125
25th Percentile
£64,877
Median
£75,000
75th Percentile
£87,500
90th Percentile
£97,500