pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins) Experience withconfiguration managementtools such asChef/Puppet Strong proficiency in scripting/programming (Python, Go, or similar) Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana) Knowledge of microservices architecture and service mesh technologies Understanding of security best practices and compliance frameworks Comfortable with asynchronous collaboration tools (Slack, Teams) Agile mindset More ❯
testing, and incident management. Hands on experience with Databricks , MLflow , or similar ML/ETL platforms is a plus. Bonus: Experience with container orchestration (Kubernetes) and observability tools like Datadog, Prometheus, or Grafana. Passion for building tools and platforms that empower teams and improve developer velocity. Excitement, passion and curiosity about our mission of connecting the world's health data More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit www.computerfutures.com Computer Futures More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit (url removed) Computer More ❯
Kubernetes, Docker Knowledge of networking fundamentals (TCP/IP, DNS, load balancing Proficiency in Linux/Unix administration, scripting (Python, Bash, or similar Experience with monitoring tools (Prometheus, Grafana, DataDog Familiarity with containerization (Docker, Kubernetes) and cloud services. Experience with CI/CD systems (Jenkins, GitHub Actions, GitLab CI Strong analytical and problem-solving skills. Knowledge of security practices (IAM More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD More ❯
Cambridge, Cambridgeshire, England, United Kingdom
Computer Futures
security principles , threat detection, or incident response. Strong problem-solving skills and willingness to learn. Nice to Have Exposure to containerization (Docker, Kubernetes). Knowledge of monitoring tools (Grafana, Datadog). Experience with SIEM/SOC tools or security automation. Cloud certifications or security training (AWS, GCP, Azure, or similar). To find out more about Computer Futures please visit More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
KPIs (observability, alerting, SLAs) Hands on experience with CI/CD, containerization and orchestration tools (Docker, Kubernetes ) Knowledge of monitoring, logging, alerting and observability tools (Prometheus, Grafana, ELK Stack, Datadog ) Familiarity with infrastructure as code tools like Terraform or CloudFormation Proficiency in scripting languages (Python, Go, Bash ) and knowledge of software development best practices Strong understanding of networking, security, and More ❯
complex issues to senior stakeholders and technical teams. Implementation of highly available and reliable systems, using multi-AZ and multiregional approaches Expertise with monitoring and observability tools (e.g. SolarWinds, Datadog, Azure/AWS native tools) Expertise with SLI/SLO management tools such as (ServiceNow) Expertise with Incident ticketing and change management systems such as (ServiceNow, Ivanti) Expertise with automated More ❯
platforms) that supports the different platform services. Develop comprehensive monitoring solutions to provide full visibility into different platform components using tools and services such as Kubernetes, Prometheus, Grafana, ELK, Datadog, New Relic and other similar tools. Identify and troubleshoot any bottlenecks, availability and performance issues at multiple layers of deployment, from hardware, operating environment, software, network, and application. Evaluate performance More ❯
in the knowledge of programming languages, relational databases, and NoSQL databases Experience building infrastructure as code using AWS CloudFormation or similar scripting techniques Familiarity with monitoring tool suites like DataDog, SumoLogic, NewRelic, and Nagios Strong practical Linux based systems administration skills and scripting experience in a Cloud based environment Experience with Agile Scrum practice a plus Desired Skills More ❯
and integration (preferably using Go but not essential). Knowledge of OpenShift Containerisation, RHEL 6,7,8, Docker and Kubernetes. Experience with monitoring systems e.g., ELK, Nagios, New Relic, DataDog, Splunk etc. Working knowledge of digital delivery processes and methodologies. Working knowledge of Atlassian Toolset. Knowledge of Javascript frontend frameworks. Understanding of front-end technologies, such as HTML5, and CSS3. More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
about building resilient systems that empower product teams. Essential skill set: Experience leading cloud-native teams in AWS and serverless Lambda Strong grasp of infrastructure-as-code (Terraform), observability (Datadog), and serverless architecture Background in software engineering (JavaScript/TypeScript preferred) Proven leadership in growing and developing engineering teams Product mindset with stakeholder engagement and roadmap ownership FinTech or scale More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance- Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
frontend architecture (e.g., Module Federation or Single-SPA). Experience with cloud-native DevOps tooling: Docker, Kubernetes, AWS/GCP deployments. Proficiency in analytics and observability tools like Sentry, Datadog, or LogRocket. Soft Skills Strategic thinker with strong problem-solving and decision-making skills. Ability to work in fast-paced, agile environments with cross-functional teams. Clear communication and documentation More ❯
KPIs and strategic goals Excellent communication and presentation skills. Ability to travel occasionally for customer meetings and events. Preferred Skills Experience with Dynatrace and similar platforms (e.g., New Relic, Datadog, AppDynamics). Certifications in cloud technologies or DevOps practices. Familiarity with CI/CD pipelines, Kubernetes, and infrastructure-as-code tools (Terraform, Ansible). What we offer DXC provide a More ❯
london, south east england, united kingdom Hybrid/Remote Options
Fresha
Cloudfront and MSK extensively Have an understanding of SLIs, SLOs & SLAs Knowledge of platform and ops concepts such as networking and Linux administration Experience with monitoring tools: we use Datadog, Grafana, ELK, Sentry and OpsGenie. £90,000 - £120,000 a year Inclusive workforce At Fresha, we are creating a culture where individuals of all backgrounds feel comfortable. We want all More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid/Remote Options
London Stock Exchange Group
as React or Angularo CI/CD processes using Gitlabo Terraform IaC Desirable: Some experience in one or more ofo Python, Javao Atlassian's tooling stack including JIRA & Confluenceo DataDog, BigPanda, Service Nowo Test Driven Development Demonstrable experience of building applications in Public Cloud - ideally Microsoft Azure & AWS Ability to design and explain solutions to complex problems Motivation, self-starting More ❯
Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV quoting More ❯
london, south east england, united kingdom Hybrid/Remote Options
Entain
with AWS Cost Explorer, Budgets, Cost and Usage Reports, and AWS Cost Anomaly Detection. Proven cost optimisation across developer tools, such as GitHub Actions and Azure Pipelines. Experience with DataDog or similar SaaS based observability platforms to inform decision making. Strong background in Kubernetes, containerisation, and infrastructure automation. Extensive experience with Infrastructure as Code tools such as Terraform/OpenTofu. More ❯
of code quality - writing, maintaining, and automating your own tests to ensure reliability and resilience. Observability, Reliability & Performance Implement and maintain monitoring, logging, and tracing using tools such as Datadog, AppDynamics, or New Relic. Drive operational excellence by identifying and resolving system bottlenecks and improving fault tolerance. Ensure high availability, resilience, and performance of production systems. Automation, DevOps & AI Tools … Familiarity with Infrastructure as Code tools such as AWS CDK, Terraform, or CloudFormation. Deep understanding of distributed systems, event driven architecture, and microservices. Experience with observability platforms such as Datadog, AppDynamics, or New Relic. Proven experience using AI coding tools (e.g., GitHub Copilot, Cursor, Claude Code) and understanding their role in accelerating delivery responsibly. Demonstrated ownership and accountability in delivering More ❯