years of professional experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to get hands on More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
We're assembling a team of elite founding software engineers for a startup, building the future of e-commerce in MENA, bringing together community, shopping and entertainment. Location: Remote We are looking for engineers who are passionate about creating scalable More ❯
What You'll Do Build and maintain cloud infrastructure, helping transition fully to Google Cloud Platform (GCP) Use Terraform for Infrastructure as Code (IaC) Manage and monitor systems using Datadog, observability is key (no separate SRE team) Work with Kubernetes in production environments Collaborate closely with engineering and project management teams to deliver secure, scalable platforms Participate in a shared … field Strong cloud expertise, with AWS and/or GCP Proven skills in Terraform and infrastructure automation Experience with Kubernetes in production Familiarity with monitoring/observability tools - ideally Datadog Comfortable working cross-functionally and contributing to team-wide initiatives What You'll Get 4 day week Pension match Performance-based bonuses Private medical, dental and optical insurance Enhanced Parental More ❯
Burton-on-Trent, Staffordshire, England, United Kingdom Hybrid / WFH Options
Crimson
secure, scalable AWS and Azure infrastructure Build and maintain CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Set up monitoring, alerting, and logging with tools like Datadog, LogicMonitor, and SolarWinds. Strong grasp of DevOps principles; hands-on CI/CD experience. Microsoft Certified: DevOps Engineer Expert (AZ-400). Design and deploy containers on AKS/ More ❯
Bradford, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Anson Mccade
microservices. Experience with at least one cloud platform - Azure preferred, but AWS or GCP also welcome. Bonus Skills: Knowledge of database systems, test automation frameworks, and monitoring tools (e.g., DataDog, Prometheus). Passion for continuous learning and using open-source technologies. Why Join? No sales targets. No consulting fluff. Just great engineering. Paid overtime (up to 200 hours/year More ❯
infrastructure and deployment workflows - PowerShell is preferred Collaborate with engineering teams to support platform reliability and enable delivery Maintain visibility and awareness through monitoring and logging tools such as Datadog, Azure Monitor, App Insights etc. Support incident resolution and participate in an on-call rota to help maintain service uptime Qualifications The Requirements: Essential Experience: Proven experience in a Platform More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Experience in VoIP (SIP and RTP advantageous) Experience of TDD More ❯
CI/CD tools such as Git, GHA and test automation frameworks such as BDD Experienced in API testing using Postman Experience with observability and monitoring tools such as Datadog Hands-on experience in test artefact creation, Test management and Defect Management testing tools Specific training or certifications in Quality Assurance is preferred Experience validating AI/ML data pipelines More ❯
funding from top-tier investors, including NEA, Atlassian Ventures, WiL, PSP Growth, and existing investors such as Accel, Nvidia, Kleiner Perkins, GV and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook. The role As a Cloud FinOps Manager , you will play a pivotal role at the intersection of Finance, Engineering, and DevOps. Operating as a senior individual More ❯
Fort George G Meade, Maryland, United States Hybrid / WFH Options
August Schell
Confluent, Kubernetes operators. • Experience creating data partitioning strategies and monitoring topics for performance. • Experience deploying and upgrading Kafka clusters in high availability containerized environments. • Experience utilizing observability platforms (Elastic, Datadog, etc) to configure monitoring for data pipelines to ensure high availability and throughput, low latency, and alerting • Knowledge of stream processing pipelines and analytics. • Experience with Apache NiFi, multi-cluster More ❯
stage environments preferred. Nice to Have: Experience scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous More ❯
self-healing services that allow you to sleep through the night. Support the production systems on prem (legacy) and GCP which includes build, deployment (CI/CD), and monitoring (Datadog, PagerDuty). Deliver high quality code by following coding standards, performing code reviews and ensuring test coverage at all levels. Collaborate with engineers and other teams to elevate technology, follow More ❯
fault-tolerant APIs. Experience building high-performance, distributed systems at scale. A strong understanding of modern dev practices like 12 Factor, CI/CD, and observability tools such as Datadog or Prometheus. Exposure to GraphQL APIs and WebSockets for real-time interactions. As part of our commitment to information security, all employees are expected to adhere to company security policies More ❯
and aligned. The skills Proven experience as a DevOps Engineer, Site Reliability Engineer, Platform Engineer or similar role. Ideally in an enterprise-grade Experience with APM stacks such as Datadog, New Relic, Prometheus or similar. Experience with handling telemetry, tracing and logging data, at scale, in multiple different environments. Familiarity with low-level telemetry daemons and aggregators such as StatsD. More ❯
Kubernetes is a plus Knowledge of Redis and log queries is a plus Experience in automations/AI would be an advantage Experience administering multiple monitoring systems such as Datadog, New Relic, Kubernetes, Grafana and Elastic Cloud Experience with Cloud Computing, AWS, Microservices Architecture, Unix and Linux Systems Life @ Empowered to think big. Try new opportunities while working with a talented More ❯
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
CET Structures Limited
responsive, user-friendly interfaces and working with component libraries like Vuetify. Experience in writing unit and integration tests Experience working with the Azure stack is essential Experience working with DataDog or other observability platforms is desirable Interest in learning new technologies is desirable Additional Skills & Qualities Agile experience: Familiarity with Scrum, Kanban, or similar methodologies. A team player with strong More ❯
to have, but can be learned on the job: Experience with web and/or app Scraping TypeScript (just the ability to understand the logic, not necessarily write code) DataDog (just the ability to write queries) LaunchDarkly (just the ability to change feature flag rules manually or programmatically) Postman for testing API calls Most importantly , though, you will embody the More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Job Overview: The Engineering IT group provides the high-performance compute environment that fuels product and solutions development for Arm's engineering community. Whether it's high-performance compute (HPC) on Arm's on-prem infrastructure and/or in the More ❯
Frontend. Tech & Data Science stack: Kubernetes & Docker on Google Cloud Python 3: Pandas, RabbitMQ, Celery, Flask, SciPy, NumPy, Dash, Plotly, Matplotlib Javascript, React, Redux PostgreSQL, Redis Prometheus, Alert Manager, DataDog If you joined the company in a Data Science role you would be working on sophisticated pricing algorithms which would enable companies in the entertainment industry to significantly increase profit More ❯
are ideally looking for someone with: Strong experience supporting technical products in a customer facing capacity Deep understanding of cloud native technologies and modern observability stacks such as Grafana, DataDog, Splunk or similar A hands on mindset and the ability to work comfortably across Kubernetes, microservices, and comparable environments Beyond technical skills, they value clear communicators who are curious, adaptable More ❯
architectures and different data storage systems Nice to haves: Experience with ecommerce and marketplace systems in a B2C environment Proficiency with Infrastructure as Code tools like Terraform Experience with Datadog for monitoring and observability Track record of implementing company-wide technical initiatives More ❯
re a multi-cloud team - experience with AWS and CDK is a big plus. Experience implementing/maintaining cloud compliance standards (SOC2, ISO-27001) Familiarity with observability tools like Datadog Background in implementing security best practices in cloud infrastructure Why Prolific is a great place to work We've built a unique platform that connects researchers and companies with a More ❯