Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or More ❯
GCP. Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational More ❯
Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of More ❯
Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of More ❯
Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of More ❯
Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of More ❯
orchestration (Kubernetes). Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Strong problem-solving, communication skills, and ability to work independently or in teams. Additional notes We value diverse backgrounds and More ❯
orchestration (Kubernetes). Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Strong problem-solving, communication skills, and ability to work independently or in teams. Additional notes We value diverse backgrounds and More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Smart DCC
Develop automated test suites for data pipelines, ensuring data quality and transformation integrity. Monitoring & Performance Optimization: Monitor data pipelines with tools like Prometheus and Datadog to ensure optimal performance and health. Proactively implement anomaly detection and optimize system performance and resource allocation. Collaborate with cross-functional teams to align DataOps More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
Dublin, City of Dublin, Republic of Ireland Hybrid / WFH Options
The Recruitment Company
plus) Deep knowledge of Kubernetes, containers, and cloud-native architectures Proficient in scripting and automation (Python, Shell, Go) Comfortable with tools like Terraform, Jenkins, DataDog, Prometheus, Splunk Solid background in networking, Linux systems, and infrastructure as code If you’re passionate about cloud reliability, automation, and solving complex problems at More ❯
Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building More ❯
infrastructure-as-code (Terraform, etc.). Familiarity and hands-on experience with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog, etc.). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in More ❯
Love to automate manual work and try new modern technology/approaches Tech stack: AWS, Kubernetes, MongoDB, PostgreSQL, RabbitMQ, Redis, Ansible, Terraform, Grafana, Prometheus, Datadog, Sentry, Loki, Jenkins. What we Offer We expect excellence from our people — both on the road and in the office. In return, we offer flexible More ❯
AWS, GCP, Azure). Strong knowledge of CI/CD, containerization (Docker, Kubernetes), networking, distributed systems, and databases. Experience with monitoring and troubleshooting tools (DataDog, Prometheus, Grafana, ELK, Splunk, Humio). Excellent problem-solving, attention to detail, and communication skills. Desirable Experience with Azure, autonomous vehicles, or ML/AI More ❯
an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
and infrastructure-as-code (Terraform etc). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog etc) Experience in distributed systems and scaling Knowledge and hands-on experience with multiple datastores (both SQL and NoSQL) Desired experience in building agentic workflows More ❯
the pain points these companies face as they operate in or migrate to a cloud environment at scale as well as delivering the appropriate Datadog solution. At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We … multiple stakeholders and product suites Cross-sell and navigate throughout complex accounts Create, own, and grow your own accounts, demonstrating the value of the Datadog platform Develop a deep comprehension of customer's business Work cross-functionally with marketing and solutions engineering to drive coordinated efforts that support the outbound … up to 4 hours, traveling to and from client sites Able to travel via auto, train or air up to 70% of the time Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you More ❯
Herndon, Virginia, United States Hybrid / WFH Options
Marathon TS Inc
and Code Pipeline • Experience with Configuration as Code, including AWS SSM, Ansible, PowerShell, or Bash • Monitoring log and System performance using tools like Grafana, Datadog, and Prometheus. • Experience with multiple CI/CD and Agile Development tools, including GitLab, Atlassian, or Jenkins • Experience working within an Agile and version-controlled More ❯
of CI/CD processes, containerization (Docker, Kubernetes), and a deep understanding of networking, distributed systems, and databases. Expert with monitoring and troubleshooting utilities (DataDog, Prometheus, Grafana, ELK stack, Splunk, Humio, etc.). Exceptional problem-solving skills and a detail-oriented mindset, coupled with outstanding communication abilities. Desirable Experience with More ❯
pipelines, and be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape More ❯
london, south east england, united kingdom Hybrid / WFH Options
Noir
pipelines, and be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape More ❯