pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins) Experience withconfiguration managementtools such asChef/Puppet Strong proficiency in scripting/programming (Python, Go, or similar) Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana) Knowledge of microservices architecture and service mesh technologies Understanding of security best practices and compliance frameworks Comfortable with asynchronous collaboration tools (Slack, Teams) Agile mindset More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit www.computerfutures.com Computer Futures More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit (url removed) Computer More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD More ❯
Cambridge, Cambridgeshire, England, United Kingdom
Computer Futures
security principles , threat detection, or incident response. Strong problem-solving skills and willingness to learn. Nice to Have Exposure to containerization (Docker, Kubernetes). Knowledge of monitoring tools (Grafana, Datadog). Experience with SIEM/SOC tools or security automation. Cloud certifications or security training (AWS, GCP, Azure, or similar). To find out more about Computer Futures please visit More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mott MacDonald
region deployment. Strong proficiency and current experience in React, Typescript, Python and database systems (SQL + NoSQL). Experience with performance monitoring and logging tools, including CloudWatch, Sentry, or DataDog, to ensure application stability, performance optimisation, and effective issue resolution Experience managing or mentoring engineering teams, including cross-functional collaboration. Understanding of secure architecture, API design, and performance optimisation. Experience More ❯
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
KPIs (observability, alerting, SLAs) Hands on experience with CI/CD, containerization and orchestration tools (Docker, Kubernetes ) Knowledge of monitoring, logging, alerting and observability tools (Prometheus, Grafana, ELK Stack, Datadog ) Familiarity with infrastructure as code tools like Terraform or CloudFormation Proficiency in scripting languages (Python, Go, Bash ) and knowledge of software development best practices Strong understanding of networking, security, and More ❯
deployments (Kubernetes, Docker). Hands-on experience with data and model pipelines (feature stores, registries, distributed training, inference scaling). Knowledge of observability and monitoring stacks (Prometheus, Grafana, ELK, Datadog) for ML system performance. Experience collaborating with cross-functional teams in regulated industries (finance, insurance, health) with compliance and governance needs. Exceptional communication and leadership skills, with the ability to More ❯
complex issues to senior stakeholders and technical teams. Implementation of highly available and reliable systems, using multi-AZ and multiregional approaches Expertise with monitoring and observability tools (e.g. SolarWinds, Datadog, Azure/AWS native tools) Expertise with SLI/SLO management tools such as (ServiceNow) Expertise with Incident ticketing and change management systems such as (ServiceNow, Ivanti) Expertise with automated More ❯
years in SRE, Observability, or DevOps functions.Proven track record implementing observability solutions in cloud-native environments (AWS, Azure, or GCP). Hands-on proficiency with observability tools such as Datadog, Grafana, Prometheus, OpenTelemetry. Strong knowledge of distributed systems, microservices, and container orchestration (Kubernetes, Docker). Experience with automation and Infrastructure as Code (Terraform, Ansible) and CI/CD pipelines. Familiarity More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Sanderson Recruitment
years in SRE, Observability, or DevOps functions.Proven track record implementing observability solutions in cloud-native environments (AWS, Azure, or GCP). Hands-on proficiency with observability tools such as Datadog, Grafana, Prometheus, OpenTelemetry. Strong knowledge of distributed systems, microservices, and container orchestration (Kubernetes, Docker). Experience with automation and Infrastructure as Code (Terraform, Ansible) and CI/CD pipelines. Familiarity More ❯
and integration (preferably using Go but not essential). Knowledge of OpenShift Containerisation, RHEL 6,7,8, Docker and Kubernetes. Experience with monitoring systems e.g., ELK, Nagios, New Relic, DataDog, Splunk etc. Working knowledge of digital delivery processes and methodologies. Working knowledge of Atlassian Toolset. Knowledge of Javascript frontend frameworks. Understanding of front-end technologies, such as HTML5, and CSS3. More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
frontend architecture (e.g., Module Federation or Single-SPA). Experience with cloud-native DevOps tooling: Docker, Kubernetes, AWS/GCP deployments. Proficiency in analytics and observability tools like Sentry, Datadog, or LogRocket. Soft Skills Strategic thinker with strong problem-solving and decision-making skills. Ability to work in fast-paced, agile environments with cross-functional teams. Clear communication and documentation More ❯
KPIs and strategic goals Excellent communication and presentation skills. Ability to travel occasionally for customer meetings and events. Preferred Skills Experience with Dynatrace and similar platforms (e.g., New Relic, Datadog, AppDynamics). Certifications in cloud technologies or DevOps practices. Familiarity with CI/CD pipelines, Kubernetes, and infrastructure-as-code tools (Terraform, Ansible). What we offer DXC provide a More ❯
london, south east england, united kingdom Hybrid/Remote Options
Fresha
Cloudfront and MSK extensively Have an understanding of SLIs, SLOs & SLAs Knowledge of platform and ops concepts such as networking and Linux administration Experience with monitoring tools: we use Datadog, Grafana, ELK, Sentry and OpsGenie. £90,000 - £120,000 a year Inclusive workforce At Fresha, we are creating a culture where individuals of all backgrounds feel comfortable. We want all More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid/Remote Options
London Stock Exchange Group
as React or Angularo CI/CD processes using Gitlabo Terraform IaC Desirable: Some experience in one or more ofo Python, Javao Atlassian's tooling stack including JIRA & Confluenceo DataDog, BigPanda, Service Nowo Test Driven Development Demonstrable experience of building applications in Public Cloud - ideally Microsoft Azure & AWS Ability to design and explain solutions to complex problems Motivation, self-starting More ❯
sufficient to design, author, and maintain CI/CD codebases Experience designing and implementing observability components like alerts, dashboards, monitors, log-parsing pipelines, and automated remediation flows (experience with DataDog is a plus) Detail-oriented approach to writing crystal-clear technical documentation WHAT YOU'LL GET IN RETURN: Experience coordinating business-critical technical initiatives at a massive scale Hands on More ❯
with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes This position is open to Lead level Engineers, able to offer £80-90K, and operates a remote first model (with only More ❯
Code. Proven track record of delivering measurable cost savings. Ability to translate technical costs into business-friendly metrics. Collaborative mindset with strong influencing skills. Nice to Have Experience with DataDog or similar observability platforms. Familiarity with developer tools like GitHub Actions and Azure Pipelines . What’s in It for You Competitive salary and bonus scheme. Hybrid working model. Private More ❯