City of London, London, United Kingdom Hybrid / WFH Options
Experis
Kubernetes/EKS). · Skilled in scripting languages such as Python, Bash, PowerShell, or TypeScript. · Solid understanding of Linux, Windows, and storage systems. · Experience with monitoring and observability tools (Datadog, CloudWatch, Azure Monitor, Coralogix). · Familiarity with DevOps and Site Reliability Engineering (SRE) principles. · Knowledge of networking and cloud security best practices. · Experience with Databricks is a plus. · Excellent problem More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Experis
Kubernetes/EKS). · Skilled in scripting languages such as Python, Bash, PowerShell, or TypeScript. · Solid understanding of Linux, Windows, and storage systems. · Experience with monitoring and observability tools (Datadog, CloudWatch, Azure Monitor, Coralogix). · Familiarity with DevOps and Site Reliability Engineering (SRE) principles. · Knowledge of networking and cloud security best practices. · Experience with Databricks is a plus. · Excellent problem More ❯
deploying AI/ML workloads, particularly LLMs and vector-based apps. Comfortable with containerisation, Git workflows, and scripting (Bash, Python, etc.). Exposure to observability tools like Prometheus, Grafana, Datadog, or ELK. A proactive, detail-oriented approach with strong documentation and communication skills. Comfortable working with JIRA for tickets and sprint rituals. 🌟 Why Join Us? Join a team pushing the More ❯
cross-functional teams. Desirable Skills: Experience with AWS certifications (AWS Certified Solutions Architect, Developer, or DevOps Engineer). Experience with Monitoring and Logging solutions like CloudWatch , New Relic , or Datadog . More ❯
london (city of london), south east england, united kingdom
HCLTech
cross-functional teams. Desirable Skills: Experience with AWS certifications (AWS Certified Solutions Architect, Developer, or DevOps Engineer). Experience with Monitoring and Logging solutions like CloudWatch , New Relic , or Datadog . More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Experis
practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Experis
practices. Tech Stack Highlights AWS (ECS, S3, DynamoDB, Aurora, OpenSearch) Pulumi (TypeScript) for infrastructure as code Kafka (Confluent Cloud) for event-driven architecture GitHub Actions for CI/CD DataDog for observability Containerised microservices architecture What We’re Looking For Strong programming background (Java or TypeScript preferred) Experience designing scalable, resilient cloud infrastructure Familiarity with event-driven systems and Kafka More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Sanderson Government & Defence
API development and integration (Go preferred but not essential) Familiarity with JavaScript front-end frameworks , HTML5, and CSS3 Understanding of asynchronous programming models Knowledge of monitoring tools (ELK, Nagios, DataDog, Splunk, New Relic) Experience with Atlassian Toolset (Jira, Confluence, etc.) Understanding of databases and query languages Awareness of identity and access management technologies (ADFS, OIDC, OAuth2, Keycloak, Red Hat SSO More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Sanderson Government & Defence
API development and integration (Go preferred but not essential) Familiarity with JavaScript front-end frameworks , HTML5, and CSS3 Understanding of asynchronous programming models Knowledge of monitoring tools (ELK, Nagios, DataDog, Splunk, New Relic) Experience with Atlassian Toolset (Jira, Confluence, etc.) Understanding of databases and query languages Awareness of identity and access management technologies (ADFS, OIDC, OAuth2, Keycloak, Red Hat SSO More ❯
tuning. Lead technical triage and root cause analysis for infrastructure-related issues Develop and deploy applications using Docker and AWS FARGATE Use CloudWatch, CloudTrail, and third-party tools like Datadog for performance and cost efficiency Configure AWS networking (VPCs, TGWs), enforce governance via AWS Config and tagging policies Maintain architecture diagrams, SOPs, and collaborate across engineering and product teams Should More ❯
driven development and continuous integration/delivery Drive innovation by integrating AI and automation into platform solutions Maintain high standards of performance, scalability, security, and observability (using tools like DataDog) What We’re Looking For Significant experience in software engineering, ideally in a senior capacity Expertise in JavaScript/TypeScript , Node.js , React , GraphQL , and AWS (EKS, Lambda, RDS) Strong background More ❯
london (city of london), south east england, united kingdom
Wilson Brown
driven development and continuous integration/delivery Drive innovation by integrating AI and automation into platform solutions Maintain high standards of performance, scalability, security, and observability (using tools like DataDog) What We’re Looking For Significant experience in software engineering, ideally in a senior capacity Expertise in JavaScript/TypeScript , Node.js , React , GraphQL , and AWS (EKS, Lambda, RDS) Strong background More ❯
analysis Knowledge of AgentOps and OpenTelemetry Understanding of Network Security Concepts, Network Telemetry and Analytics Understanding of Cloud computing and Virtualization Exposure to APM/Observability tools (Dynatrace, AppDynamics, Datadog, Splunk etc) Exposure to onshore-offshore model working with professionals spread across the globe Participation in AI events, workshops and summits Personal Besides the professional qualifications, we respect and place More ❯
london (city of london), south east england, united kingdom
Infosys
analysis Knowledge of AgentOps and OpenTelemetry Understanding of Network Security Concepts, Network Telemetry and Analytics Understanding of Cloud computing and Virtualization Exposure to APM/Observability tools (Dynatrace, AppDynamics, Datadog, Splunk etc) Exposure to onshore-offshore model working with professionals spread across the globe Participation in AI events, workshops and summits Personal Besides the professional qualifications, we respect and place More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Metomic
service development Experience with distributed messaging systems like Kafka Knowledge of AWS, Terraform, and cloud infrastructure operations Familiarity with Temporal for orchestrating complex workflows Experience with observability tools like Datadog for monitoring systems at scale Experience operating data-intensive applications in production environments BENEFITS Remote-Friendly: Work remotely, or join us in our office in Hoxton, London Equity Options: Share More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Reward Gateway
or similar Wide-reaching SRE skills and a deep understanding of SRE practices A strong understanding of SQL, PHP, Kubernetes, CI/CD Observability product experience (eg, New Relic, Datadog) Managing infrastructures using SLI/SLO & Error Budgets Ability to work both independently and as part of a team Ability to work under pressure and be highly reliable Adaptability and More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Reward Gateway
or similar Wide-reaching SRE skills and a deep understanding of SRE practices A strong understanding of SQL, PHP, Kubernetes, CI/CD Observability product experience (eg, New Relic, Datadog) Managing infrastructures using SLI/SLO & Error Budgets Ability to work both independently and as part of a team Ability to work under pressure and be highly reliable Adaptability and More ❯
building and running cloud platforms and leading teams that sit at the intersection of infrastructure and product. Great Expertise in AWS best practices, infrastructure-as-code (Terraform), and monitoring (Datadog) Strong Experience in AWS utilizing Lambda, ECS, SQS, API Gateway etc. Any Programming Language experience such as Python, Golang, Typescript, Nodejs etc. If this sounds like an interesting opportunity to More ❯
london (city of london), south east england, united kingdom
Harvey Nash
building and running cloud platforms and leading teams that sit at the intersection of infrastructure and product. Great Expertise in AWS best practices, infrastructure-as-code (Terraform), and monitoring (Datadog) Strong Experience in AWS utilizing Lambda, ECS, SQS, API Gateway etc. Any Programming Language experience such as Python, Golang, Typescript, Nodejs etc. If this sounds like an interesting opportunity to More ❯
ClaimCenter and other systems, including PAS, document management systems, and external data providers. Platform Monitoring : Determine requirements for specific alerts, set up alerts for various events and thresholds, utilise Datadog logs and dashboards for error analysis, and track DXC downtime while communicating updates to users. Platform Updates : Conduct a 3-way merge of updated code, validate new versions, and implement More ❯
sharding technologies in regional deployments. Familiarity with SQL, PHP, and CI/CD processes related to database deployments. Experience with observability tools specific to database performance (e.g., New Relic, Datadog). Knowledge of managing database infrastructures using SLI/SLO & Error Budgets. Ability to work both independently and collaboratively within a team. Capacity to perform under pressure and maintain high More ❯
london (city of london), south east england, united kingdom
Reward Gateway
sharding technologies in regional deployments. Familiarity with SQL, PHP, and CI/CD processes related to database deployments. Experience with observability tools specific to database performance (e.g., New Relic, Datadog). Knowledge of managing database infrastructures using SLI/SLO & Error Budgets. Ability to work both independently and collaboratively within a team. Capacity to perform under pressure and maintain high More ❯
a plus. Hands-on debugging skills: reading logs, calling API endpoints, basic troubleshooting on Linux/CLI and with HTTP tools. Experience with dashboards/metrics (e.g., Grafana/Datadog/Looker) and clear, concise documentation. Strong communicator who thrives in fast feedback loops and enjoys sports. Based in London and available to work on-site in Marylebone. Nice to More ❯