London, South East England, United Kingdom Hybrid/Remote Options
Cogna
identity management (Entra ID), and network configurations. Support container orchestration and workload deployment using Kubernetes and AKS. Improve observability by implementing logging, monitoring, and alerting systems (e.g. Azure Monitor, Datadog). Partner with internal teams to improve resilience, automate toil, and reduce lead time to deployment. Drive root cause analysis and reliability improvements from incidents. What we're looking More ❯
This role demands deep expertise in Java or Golang programming, alongside a comprehensive understanding of observability tools and practices. A proven ability to implement and leverage solutions such as Datadog, Elasticsearch, Prometheus, and Grafana is crucial for success. The ideal candidate will excel at instrumenting applications, gathering and interpreting telemetry data, and translating these insights into tangible system improvements. Experience More ❯
Beckwith, Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
. AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs, Firewalls, WAFs, route tables). More ❯
Harrogate, North Yorkshire, Yorkshire, United Kingdom Hybrid/Remote Options
Morson Edge
. AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs, Firewalls, WAFs, route tables). More ❯
England, Beckwith, North Yorkshire, United Kingdom Hybrid/Remote Options
The Bridge IT Recruitment
. AZ-104 and AZ-305 desirable. Skilled in scripting and automation (PowerShell required; Bicep/Terraform desirable). Experience with Azure monitoring and observability (Azure Monitor, Log Analytics, Datadog). Familiar with backup, disaster recovery, and business continuity tooling (Azure Backup, RSV, ASR). Strong working knowledge of networking concepts (VNets, VPNs, ExpressRoute, NSGs, Firewalls, WAFs, route tables). More ❯
Leeds, West Yorkshire, Yorkshire and the Humber, United Kingdom
Entain
AWS Programming languages: JavaScript, TypeScript, Python Frameworks: React Native Databases: NoSQL (DynamoDB), SQL AWS services: Lambda, S3, API Gateway, Step Functions, SQS, Athena DevOps and monitoring tools such as Datadog, New Relic, Grafana Desirable: Experience in mobile application development. Experience in sports betting, gaming, or related high-scale transactional domains. Previous experience leading organisational change or scaling teams. Additional Information More ❯
by the boy scout and embrace the DevOps mindset: you build it, you ship it, you run it! Excerpt from our tech-stack, sorted by priority: Python, AWS, Docker, Datadog More ❯
Support: Work hand-in-hand with cross-functional teams and help develop the junior SRE through mentoring and knowledge sharing. Monitor & Troubleshoot: Strengthen monitoring systems (moving from Nagios to Datadog) and take ownership of incident management. What You'll Bring Solid experience in SRE or DevOps roles within cloud environments (AWS preferred). Confidence with infrastructure-as-code and configuration More ❯
by the boy scout and embrace the DevOps mindset: you build it, you ship it, you run it! Excerpt from our tech stack, sorted by priority: Python, AWS, Docker, Datadog Are you a good fit for us? We look forward to your application (including salary requirements and start date) - We are also open to applicants with severe disabilities. Shaping an More ❯
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid/Remote Options
Asset Resourcing Limited
Bonus Points If You Have BSc or MSc in Computer Science or related field Mobile development experience for Android and iOS Experience with monitoring tools like Application Insights or Datadog Previous experience with security compliance or vendor security assessments Why Join? 25 days holiday, your birthday off and the Scottish bank holidays Share options - they want everyone to be part More ❯
Bethesda, Maryland, United States Hybrid/Remote Options
ALTA IT Services
maintain multiple cloud systems for the client. RESPONSIBILITIES: • Primarily responsible for the client's cloud infrastructure architecture and associated observability/instrumentation of various services using tools such as Datadog, Dynatrace, or similar APMs. • Develop a cloud services delivery and operational model, keeping track of cloud activities, developing and moving applications to the cloud, and specifying computing demands. • Provide advice More ❯
database performance and query efficiency. Experience with data security best practices and automation using scripting languages like Python or Bash. Familiarity with database monitoring tools such as AWS CloudWatch, Datadog, or Prometheus. Proven experience in managing cloud-based and NoSQL databases in enterprise environments. Excellent verbal and written communication skills, with the ability to collaborate effectively with cross-functional teams More ❯
AWS European Sovereign Cloud Development Center GmbH
Able to lead the creation, revision, and/or improvement of standard operational procedures (SOPs) and drive operational best practices PREFERRED QUALIFICATIONS Experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar). Experience actively mentoring junior engineers and working cross-organizationally and leading strategic team efforts requiring work from multiple team members Experience supporting service in AWS More ❯
Security Expertise in Palo Alto Firewalls including policy configuration, threat prevention Network segmentation, zero-trust frameworks, and IAM integration Cloud native Web Application Firewalls Tools and Monitoring Monitoring solutions: Datadog, Stackdriver, PA Panorama, or equivalent Has strong practical experience with DevOps tools and methods, like CI/CD, Git, IaC (Terraform) Working and collaborating with Agile Teams (Squad) Good understanding More ❯
London, South East England, United Kingdom Hybrid/Remote Options
Black Pen Recruitment
of automation tools such as Terraform and Ansible, alongside programming in Python. Their environments are entirely based on Ubuntu Linux. Experience with server monitoring software (e.g. Prometheus, Grafana, Zabbix, Datadog) and a solid understanding of security principles and best practices (including hardening, access control, auditing, and incident response) is highly valued. This is a remote-first role, and they are … Configuration management with Ansible Cloud platforms (AWS, Azure) Containerization (LXC, LXD, Docker, Kubernetes) CI/CD tooling (TeamCity, Jenkins, GitHub Actions) Server monitoring and alerting systems (Prometheus, Grafana, Zabbix, Datadog) Strong Python programming skills Solid Linux administration and general networking knowledge Understanding of infrastructure security best practices, including secure configuration, identity and access management, and compliance controls Experience with monitoring More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Hargreaves Lansdown
and run GitOps for Kubernetes (AKS preferred), patterns and multi-environment promotions. Own platform observability: metrics, logs and traces using Azure Monitor/Log Analytics/Application Insights, plus Datadog/Grafana where appropriate. Embed security by design: Azure Policy, Defender for Cloud, secrets management with Key Vault, SBOM and image scanning, policy-as-code and least privilege IAM. Drive … RBAC and workload identity. Experience with GitOps, and container build pipelines (e.g., ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected Framework. Familiarity with DevSecOps practices: threat modelling, dependency and More ❯
Employment Type: Permanent, Part Time, Work From Home
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid/Remote Options
Asset Resourcing
Bonus Points If You Have • BSc or MSc in Computer Science or related field • Mobile development for Android and iOS • Experience with monitoring tools such as Application Insights or Datadog • Experience with security compliance or vendor security assessments Why Join? • 25 days holiday, birthday off, and Scottish bank holidays • Share options • Monthly social budget • Autonomy and real responsibility • Career progression More ❯
Plotly, Tableau, Looker, Grafana, Power BI) Able to write concise reports with actionable insights - weekly summaries, defect overviews, quality scorecards, etc. Familiar with log analysis tools (e.g., Sumo Logic, Splunk, Datadog, Kibana, Elasticsearch) Comfortable discussing and designing instrumentation/logging with engineers Familiarity with QA concepts, release validation, and production monitoring Strong communication skills; can adapt output to technical and non More ❯
user interfaces in ReactJS, TypeScript, MobX or closely related technologies to meet your team's objectives. Deploying, monitoring and maintaining a set of critical services in our infrastructure using Datadog and FireHydrant Signals. Building user experiences that adhere to a high level of accessibility standards. Applying analytical problem solving, cross-functional communication, and systems thinking to ensure smooth execution of More ❯