or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit www.computerfutures.com Computer Futures More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit (url removed) Computer More ❯
Cambridge, Cambridgeshire, England, United Kingdom
Computer Futures
security principles , threat detection, or incident response. Strong problem-solving skills and willingness to learn. Nice to Have Exposure to containerization (Docker, Kubernetes). Knowledge of monitoring tools (Grafana, Datadog). Experience with SIEM/SOC tools or security automation. Cloud certifications or security training (AWS, GCP, Azure, or similar). To find out more about Computer Futures please visit More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mott MacDonald
region deployment. Strong proficiency and current experience in React, Typescript, Python and database systems (SQL + NoSQL). Experience with performance monitoring and logging tools, including CloudWatch, Sentry, or DataDog, to ensure application stability, performance optimisation, and effective issue resolution Experience managing or mentoring engineering teams, including cross-functional collaboration. Understanding of secure architecture, API design, and performance optimisation. Experience More ❯
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
deployments (Kubernetes, Docker). Hands-on experience with data and model pipelines (feature stores, registries, distributed training, inference scaling). Knowledge of observability and monitoring stacks (Prometheus, Grafana, ELK, Datadog) for ML system performance. Experience collaborating with cross-functional teams in regulated industries (finance, insurance, health) with compliance and governance needs. Exceptional communication and leadership skills, with the ability to More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A mindset that More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
KPIs and strategic goals Excellent communication and presentation skills. Ability to travel occasionally for customer meetings and events. Preferred Skills Experience with Dynatrace and similar platforms (e.g., New Relic, Datadog, AppDynamics). Certifications in cloud technologies or DevOps practices. Familiarity with CI/CD pipelines, Kubernetes, and infrastructure-as-code tools (Terraform, Ansible). What we offer DXC provide a More ❯
london, south east england, united kingdom Hybrid/Remote Options
Fresha
Cloudfront and MSK extensively Have an understanding of SLIs, SLOs & SLAs Knowledge of platform and ops concepts such as networking and Linux administration Experience with monitoring tools: we use Datadog, Grafana, ELK, Sentry and OpsGenie. £90,000 - £120,000 a year Inclusive workforce At Fresha, we are creating a culture where individuals of all backgrounds feel comfortable. We want all More ❯
about building resilient systems that empower product teams. Essential skill set: Experience leading cloud-native teams in AWS and serverless Lambda Strong grasp of infrastructure-as-code (Terraform), observability (Datadog), and serverless architecture Background in software engineering (JavaScript/TypeScript preferred) Proven leadership in growing and developing engineering teams Product mindset with stakeholder engagement and roadmap ownership FinTech or scale More ❯
or teach them new things Love to automate manual work and try new modern technology/approaches Tech stack: AWS, Kubernetes, MongoDB, PostgreSQL, RabbitMQ, Redis, Ansible, Terraform, Grafana, Prometheus, Datadog, Sentry, Loki, Jenkins. What we Offer We expect excellence from our people — both on the road and in the office. In return, we offer flexible working hours, stock options, and More ❯
london, south east england, united kingdom Hybrid/Remote Options
Entain
with AWS Cost Explorer, Budgets, Cost and Usage Reports, and AWS Cost Anomaly Detection. Proven cost optimisation across developer tools, such as GitHub Actions and Azure Pipelines. Experience with DataDog or similar SaaS based observability platforms to inform decision making. Strong background in Kubernetes, containerisation, and infrastructure automation. Extensive experience with Infrastructure as Code tools such as Terraform/OpenTofu. More ❯
Golang Developer - AWS - Financial Services Excellent opportunity opens for a talented Golang Developer with strong experience in AWS and Docker/Kubernetes to join a highly regarded Financial Services entity's London office. This role offers the possibility to play More ❯
Senior Site Reliability Engineer (SRE) Location - London (full onsite- 5 days every week) Perm up to 80K gross Minimum 12+ year profile are required PFB updated JD Core Competency, • Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin etc. • Efficiency in creating Dashboard for Infra/APM/E2E workflows. • Monitoring, logging, Alerting and Error budget , 99.99, % ) for software, Operations & Business. More ❯
best practices. Version control experience (e.g., Git). Strong troubleshooting and root cause analysis skills. Desirable Skills Experience with Kubernetes and/or other cloud platforms. Familiarity with Nagios, Datadog, or similar monitoring tools. Exposure to CI/CD systems such as TeamCity, AWS CodeBuild, AWS CodePipeline, or ArgoCD. Personal Attributes Proactive, curious, and process-driven. Enjoys collaboration and mentoring. More ❯
Solid technical grounding in Microsoft stack, AWS and the following technologies: C#, .NET Framework, .NET Core, Entity Framework, LINQ, MVC5, MVC Core, Angular framework, RestAPI CI/CD pipeline Datadog Microsoft SQL Server, NoSQL Terraform Experience implementing development tooling with AWS, JIRA, GIT Significant track record of delivering quality software using agile methodologies. Manage technical projects from inception to in More ❯
london, south east england, united kingdom Hybrid/Remote Options
Cogna
identity management (Entra ID), and network configurations. Support container orchestration, and workload deployment using Kubernetes and AKS. Improve observability by implementing logging, monitoring, and alerting systems (e.g. Azure Monitor, Datadog, etc.). Partner with internal teams to improve resilience, automate toil, and reduce lead time to deployment. Drive root cause analysis and reliability improvements from incidents. What we're looking More ❯
Beckwith, Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
. AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs, Firewalls, WAFs, route tables). More ❯
England, North Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
. AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs, Firewalls, WAFs, route tables). More ❯
leeds, west yorkshire, yorkshire and the humber, united kingdom
Entain
AWS Programming languages: JavaScript, Typescript, Python Frameworks: React Native Databases: NoSQL (DynamoDB), SQL AWS services: Lambda, S3, API Gateway, Step Functions, SQS, Athena DevOps and monitoring tools such as Datadog, New Relic, Grafana Desirable: Experience in mobile application development. Experience in sports betting, gaming, or related high-scale transactional domains. Previous experience leading organisational change or scaling teams. Additional Information More ❯