London, South East, England, United Kingdom Hybrid / WFH Options
Morela
and SRE teams to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM More ❯
london, south east england, united kingdom Hybrid / WFH Options
Sanderson Government & Defence
API development and integration (Go preferred but not essential) Familiarity with JavaScript front-end frameworks , HTML5, and CSS3 Understanding of asynchronous programming models Knowledge of monitoring tools (ELK, Nagios, DataDog, Splunk, New Relic) Experience with Atlassian Toolset (Jira, Confluence, etc.) Understanding of databases and query languages Awareness of identity and access management technologies (ADFS, OIDC, OAuth2, Keycloak, Red Hat SSO More ❯
South East London, London, United Kingdom Hybrid / WFH Options
Stepstone UK
Principles: TDD, Agile, Pair Programming - CI/CD: Git, Docker, Bamboo - Cloud: AWS, Lambda, ECS - Databases/Storage: Postgres , Dynamo, DocumentDb, OpenSearch/Elastic, Redis - Monitoring: Cloudwatch, Kibana, Grafana, DataDog Qualifications Experience of working with the following tech; C# .NET (8+), Terraform, Typescript, AWS, CI/CD Dedication to high quality, testable and maintainable code and super passionate about all More ❯
woburn, massachusetts, united states Hybrid / WFH Options
Knox Systems
no-code platform operations*. *Key ResponsibilitiesIncident Management & System Troubleshooting* * Perform advanced troubleshooting for infrastructure, OS, and application issues. * Analyze system logs, metrics, and telemetry from monitoring platforms (Grafana, Datadog, Wiz, CloudWatch). * Coordinate with Platform/DevOps Engineers on root cause analysis and long-term remediation. * Ensure timely resolution of escalated incidents in accordance with SLAs. *Cloud Administration & Maintenance More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
The role of a Platform Support Engineer involves providing excellent technical support and maintenance for platform solutions within the technology and telecoms industry. You will ensure the smooth operation of systems, troubleshoot issues, and deliver high-quality service to internal More ❯
projects on Google Cloud using Terraform . Develop and maintain backend services in Golang and Python . Set up CI/CD pipelines , observability, and automation with GitLab/DataDog . Partner with data engineers to deliver high-performance ingestion and transformation pipelines. 🧠 What You’ll Bring: Strong background in backend development (Golang/Python). Solid experience across Google More ❯
birmingham, midlands, united kingdom Hybrid / WFH Options
Isio
financial services, utilities. Experience working with internal software development teams Experience automating repetitive or complex manual activities to improve efficiency Experience implementing system monitoring tools, e.g. SolarWinds, New Relic, Datadog etc Microsoft Certifications in relevant fields, e.g. Microsoft MCSE/A Networking Certifications, e.g. Cisco CCNA/P or equivalent Experience of acquisitions and integration/standardisation of IT systems More ❯
City of London, London, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
and capacity planning for mission-critical systems Develop secure backup, recovery, and disaster recovery procedures Explore multi-tenant and sharded architectures to support growth Implement monitoring strategies using Grafana, Datadog, and CI/CD integrations Champion database best practices, mentor teams, and standardize tooling and automation What You’ll Bring Extensive experience managing cloud-hosted PostgreSQL at scale Proficiency in More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
and run GitOps for Kubernetes (AKS preferred), patterns and multi-environment promotions. Own platform observability: metrics, logs and traces using Azure Monitor/Log Analytics/Application Insights, plus Datadog/Grafana where appropriate. Embed security by design: Azure Policy, Defender for Cloud, secrets management with Key Vault, SBOM and image scanning, policy-as-code and least privilege IAM. Drive … RBAC and workload identity. Experience with GitOps, and container build pipelines (e.g., ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected Framework. Familiarity with DevSecOps practices: threat modelling, dependency and More ❯
Employment Type: Permanent, Part Time, Work From Home
systems/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Understanding Recruitment
systems/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting More ❯
london, south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
systems/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
systems/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Understanding Recruitment
systems/infrastructure engineering role Strong scripting skills in Python , Bash , or Ruby Familiarity with configuration management tools (Ansible, Puppet, or Chef) Interest or exposure to observability tools like Datadog , Prometheus , or Grafana A passion for learning and improving in high-performance environments This is a rare chance to learn from elite engineers and contribute directly to a platform supporting More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
Bring Strong experience with GCP , Terraform , and Infrastructure-as-Code Deep knowledge of cloud networking, security automation, and compliance standards Proficiency in CI/CD pipelines , monitoring tools (Grafana, Datadog), and scripting A collaborative mindset with excellent communication and mentoring skills Why Join? Shape a next-gen AI infrastructure with autonomy and purpose Hybrid working with regular meetups in our More ❯
woburn, massachusetts, united states Hybrid / WFH Options
Knox Systems
and cloud monitoring environments, while collaborating effectively with L2 and Security teams. *Key ResponsibilitiesMonitoring & Incident Response* * Monitor infrastructure, applications, and network health using tools such as Grafana, Wiz, CloudWatch, Datadog, and CrowdStrike Falcon. * Detect, triage, and escalate alerts based on severity and business impact. * Document incident timelines, actions, and resolutions in ticketing systems (ServiceNow, Jira Service Management). * Follow established More ❯
CMS (we use Sanity) Exposure to backend or full-stack development (Node.js, Express, etc.) Developing white-label applications Building internationalised applications Familiarity with frontend observability best practices (we use Datadog) Experience in agile software development methodologies Why Travelex? To remain the world’s leading foreign exchange specialist, we are focused on making our customers’ lives simpler, more engaging and hassle More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Travelex
CMS (we use Sanity) Exposure to backend or full-stack development (Node.js, Express, etc.) Developing white-label applications Building internationalised applications Familiarity with frontend observability best practices (we use Datadog) Experience in agile software development methodologies Why Travelex? To remain the world’s leading foreign exchange specialist, we are focused on making our customers’ lives simpler, more engaging and hassle More ❯
london, south east england, united kingdom Hybrid / WFH Options
Travelex
CMS (we use Sanity) Exposure to backend or full-stack development (Node.js, Express, etc.) Developing white-label applications Building internationalised applications Familiarity with frontend observability best practices (we use Datadog) Experience in agile software development methodologies Why Travelex? To remain the world’s leading foreign exchange specialist, we are focused on making our customers’ lives simpler, more engaging and hassle More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Travelex
CMS (we use Sanity) Exposure to backend or full-stack development (Node.js, Express, etc.) Developing white-label applications Building internationalised applications Familiarity with frontend observability best practices (we use Datadog) Experience in agile software development methodologies Why Travelex? To remain the world’s leading foreign exchange specialist, we are focused on making our customers’ lives simpler, more engaging and hassle More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Travelex
CMS (we use Sanity) Exposure to backend or full-stack development (Node.js, Express, etc.) Developing white-label applications Building internationalised applications Familiarity with frontend observability best practices (we use Datadog) Experience in agile software development methodologies Why Travelex? To remain the world’s leading foreign exchange specialist, we are focused on making our customers’ lives simpler, more engaging and hassle More ❯
Coventry, West Midlands, United Kingdom Hybrid / WFH Options
Stackstudio Digital Ltd
and Spectrum Scale . Experience with Cisco UCS & HP Blade infrastructure . Appreciation for backup software ( Commvault ), storage technologies (IBM, Block Storage, File Storage), and monitoring technologies ( SolarWinds, AppDynamics, Datadog ). Familiarity with cloud technologies (AWS/Azure) and associated features. Understanding of Linux command line technologies and AWS server administration . Experience working in an Agile delivery environment . More ❯
Crawley, Sussex, United Kingdom Hybrid / WFH Options
Rentokil Initial Group
with Google Cloud Platform (GCP) and related services. Familiarity with Java and database technologies. Experience with IAM tools (Auth0 preferred) and authentication/authorization protocols. Knowledge of monitoring tools (Datadog preferred) and performance optimization techniques. UML documentation Data Modelling/Design Architecture frameworks (e.g Togaf) Benefits Competitive salary and bonus scheme Hybrid working Rentokil Initial Reward Scheme 23 days holiday More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Synthesia
raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook. What you'll do at Synthesia: As a Research Engineer you will join a team of 40+ Researchers and Engineers within the R&D Department working More ❯
raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook. What you'll do at Synthesia: As a Research Engineer you will join a team of 40+ Researchers and Engineers within the R&D Department working More ❯