practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
london, south east england, united kingdom Hybrid / WFH Options
Experis
practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Experis
practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Experis
practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
team leverages a modern and scalable technology stack: Backend: Python (FastAPI), Node.js Frontend: React, TypeScript Database: PostgreSQL Infrastructure: AWS, Docker, Terraform CI/CD: GitHub Actions, Pulumi Monitoring & Observability: DataDog, Sentry Data & Analytics: dbt, Metabase Internal Tools: Retool Collaboration: Linear, Slack, Notion Candidates are not expected to have experience with every tool listed, but should be enthusiastic about learning and More ❯
Experience of using Git or similar to track changes • Experience of both the full .NET Framework and .NET Core • Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production • A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves: o Experience in VOIP, (SIP and RTP More ❯
SRE) Location - London (onsite full-time, 5 days a week) Salary - Perm up to 80K gross Minimum requirement: 12+ years of profile PFB updated JD Core Competencies/Responsibilities Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin, etc. Efficiency in creating dashboards for Infra/APM/E2E workflows. Monitoring, logging, alerting and error budgets (SLA metrics: 99.9, 99.99, 99.999 More ❯
Eccles, Manchester, United Kingdom Hybrid / WFH Options
Rebel Recruitment Limited
developers in the team, to automate their deployments. You ll have experience with DevOps tools, including Terraform for IaC, Docker for containerisation, Kubernetes (k8s) for orchestration, monitoring tools like DataDog, etc. understanding what it takes to make sure their systems, application, etc are secure, scalable, resilient, and with plenty of redundancy baked in, you ll lean on your networking, traditional More ❯
Eccles, City and Borough of Salford, Greater Manchester, United Kingdom Hybrid / WFH Options
Rebel Recruitment Limited
developers in the team, to automate their deployments. You’ll have experience with DevOps tools, including Terraform for IaC, Docker for containerisation, Kubernetes (k8s) for orchestration, monitoring tools like DataDog, etc. understanding what it takes to make sure their systems, application, etc are secure, scalable, resilient, and with plenty of redundancy baked in, you’ll lean on your networking, traditional More ❯
and SRE teams to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
and SRE teams to embed observability into the full delivery lifecycle Skills & Experience: Strong background in observability, monitoring, and event management Hands-on experience with platforms such as Dynatrace, Datadog, AppDynamics, Splunk, Prometheus, Grafana, New Relic, or Elastic Experience building integrations and automation using APIs, Python, Node.js, Go, or scripting Familiarity with AIOps platforms (BigPanda, Moogsoft, etc.) Knowledge of ITSM More ❯
london, south east england, united kingdom Hybrid / WFH Options
Prolific
Services (AWS) Programming Languages: Python, JavaScript, TypeScript Frameworks: Django Rest Framework, Serverless architectures, container-based services Databases: MongoDB, DynamoDB, Postgres DevOps & Monitoring Tools: CircleCI, GitHub Actions, Kubernetes, Celery, EventBridge, DataDog Join us at Prolific and play a critical role in shaping the human data infrastructure that is powering the next generation of AI innovation. Apply now and let's build More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
The role of a Platform Support Engineer involves providing excellent technical support and maintenance for platform solutions within the technology and telecoms industry. You will ensure the smooth operation of systems, troubleshoot issues, and deliver high-quality service to internal More ❯
Burton-on-Trent, Staffordshire, England, United Kingdom
Crimson
and manage secure, scalable AWS infrastructure. Build and maintain CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Set up monitoring, alerting, and logging with tools like Datadog, Logic Monitor, and Solarwinds. Strong grasp of DevOps principles; hands-on CI/CD experience. Microsoft Certified: DevOps Engineer Expert (AZ-400). Design and deploy containers on AKS/ More ❯
Burton-On-Trent, Staffordshire, Burton upon Trent, United Kingdom Hybrid / WFH Options
Crimson
and manage secure, scalable AWS infrastructure. Build and maintain CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Set up monitoring, alerting, and logging with tools like Datadog, Logic Monitor, and Solarwinds. Strong grasp of DevOps principles; hands-on CI/CD experience. Microsoft Certified: DevOps Engineer Expert (AZ-400). Design and deploy containers on AKS/ More ❯
Jersey City, New Jersey, United States Hybrid / WFH Options
ArborTekSystem
and Ontological processes). Technical Skills: Five or more years of experience with Python, SQL, and data visualization/exploration tools Full stack observability lead with Splunk (preferred)/Datadog, Infra monitoring, App onboarding and APM experience Proficiency in observability tools: They are familiar with tools for logging, metrics, and tracing, such as ELK Stack, Splunk and distributed tracing systems. More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Interquest
d also have the opportunity to mentor other team members an collaborate with product managers. Skills: TypeScript (Node, React) AWS (Lambda, Fargate, S3, Dynamo, Event Bridge etc.) Observability tools (Datadog, Dynatrace, Honeycomb, CloudWatch etc.) The money is good too – up to £70k plus benefits including hybrid working (2 days per week in Manchester) and a 2pm finish every Friday. If More ❯
Derby, Derbyshire, East Midlands, United Kingdom Hybrid / WFH Options
Experis
secure infrastructure using Terraform and other IaC tools. Own the CI/CD pipeline strategy using Azure DevOps, GitHub Actions, or similar. Set up monitoring, alerting, and logging frameworks (Datadog, LogicMonitor, SolarWinds). Collaborate closely with Cloud and FinOps teams to align infrastructure, cost optimisation, and delivery. Lead incident response, root cause analysis, and post-mortem processes. Mentor engineers and More ❯
Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. Demonstrated ability to build More ❯
AutoSys Experience using cloud infrastructures such as AWS or Azure Experience working in a secure, multi-data center environment Experience with other monitoring tools such as Dynatrace, Paessler, or Datadog Splunk Enterprise Certified Architect Understands how to resolve conflicting priorities and objectives with grace and professionalism Knows to look ahead and think of solutions that benefit the environment in the More ❯
Accreditation Council for Graduate Medical Education
technical, ambiguous domains. Strong knowledge of REST APIs, distributed system design, and performance optimization. Experience with both SQL and NoSQL data stores, caching layers, and observability tooling (e.g., Prometheus, Datadog). Nice To Have Experience deploying or integrating LLMs or NLP models in production systems. Comfortable balancing short-term execution with long-term architectural thinking. Passion for building highly-available More ❯
via Kafka (Confluent Cloud), infrastructure automation with Pulumi (Typescript), our infrastructure is hosted at AWS (most used: ECS, S3, DynamoDB, Aurora, OpenSearch), Github Actions for builds and workflow automation, DataDog for monitoring and alerts. About The Role The role of platform engineer in domain is to support the teams using the platform and to evolve it with growing business and More ❯
Create and maintain IaC solutions with tools like Terraform. Partner with development teams to enable scalable microservices, primarily using Python. Implement and oversee observability tools such as New Relic, DataDog, Splunk, and AWS CloudWatch. Configure and troubleshoot networking components including VPNs, load balancers (NLB/ALB), HTTPS, TLS, and CDNs. Ensure system reliability and performance within Unix/Linux environments. More ❯
mindset geared towards enabling internal engineering teams Platform Engineer nice to have Exposure to AWS and use of the Cloud Development Kit (CDK) Previous experience maintaining observability stacks (e.g. Datadog ) Background in applying security-first approaches to cloud architecture Aimtech Recruitment is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. More ❯