or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit www.computerfutures.com Computer Futures More ❯
or incident response. Knowledge of networking fundamentals and APIs. Excellent problem-solving and communication skills. Nice to Have Experience with containerization (Docker, Kubernetes). Exposure to monitoring tools (Grafana, Datadog). Cloud certifications or security accreditations. Understanding of Agile methodologies. Interest in automation, security testing, or threat detection. To find out more about Computer Futures please visit (url removed) Computer More ❯
Kubernetes, Docker Knowledge of networking fundamentals (TCP/IP, DNS, load balancing Proficiency in Linux/Unix administration, scripting (Python, Bash, or similar Experience with monitoring tools (Prometheus, Grafana, DataDog Familiarity with containerization (Docker, Kubernetes) and cloud services. Experience with CI/CD systems (Jenkins, GitHub Actions, GitLab CI Strong analytical and problem-solving skills. Knowledge of security practices (IAM More ❯
testing, and incident management. Hands on experience with Databricks , MLflow , or similar ML/ETL platforms is a plus. Bonus: Experience with container orchestration (Kubernetes) and observability tools like Datadog, Prometheus, or Grafana. Passion for building tools and platforms that empower teams and improve developer velocity. Excitement, passion and curiosity about our mission of connecting the world's health data More ❯
secrets management, encryption) • Support compliance initiatives (ISO 27001, NIST, GDPR, MCERTS, etc.) • Manage network configuration, firewalls, and secure endpoints Monitoring & Reliability • Set up observability and monitoring tools (Prometheus, Grafana, Datadog, or CloudWatch) • Ensure high availability, scalability, and cost efficiency of cloud services • Define SLIs, SLOs, and SLAs for platform components • Troubleshoot production issues and coordinate incident response Collaboration • Work with More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD More ❯
YAML, JSON Build Tools: Maven, Gradle, NPM, Bazel, Go Databases: RDS, SQL, MySQL, Postgres, RedShift, MongoDB, DynamoDB Security Scans: SAST, Secrets, Container, DAST, Xray, Prisma Cloud Logging and Monitoring: DataDog, Splunk, App Dynamics, ELK, Grafana About PROLIM Corporation PROLIM is a leading provider of end-to-end IT, PLM and Engineering Services and Solutions for Global 1000 companies. They understand More ❯
Cambridge, Cambridgeshire, England, United Kingdom
Computer Futures
security principles , threat detection, or incident response. Strong problem-solving skills and willingness to learn. Nice to Have Exposure to containerization (Docker, Kubernetes). Knowledge of monitoring tools (Grafana, Datadog). Experience with SIEM/SOC tools or security automation. Cloud certifications or security training (AWS, GCP, Azure, or similar). To find out more about Computer Futures please visit More ❯
london, south east england, united kingdom Hybrid/Remote Options
Mott MacDonald
region deployment. Strong proficiency and current experience in React, Typescript, Python and database systems (SQL + NoSQL). Experience with performance monitoring and logging tools, including CloudWatch, Sentry, or DataDog, to ensure application stability, performance optimisation, and effective issue resolution Experience managing or mentoring engineering teams, including cross-functional collaboration. Understanding of secure architecture, API design, and performance optimisation. Experience More ❯
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom Hybrid/Remote Options
UK Health Security Agency
in programming/scripting languages such as Python, PowerShell or Bash Understanding of Linux/Unix & Windows systems, networking, and distributed systems Experience with observability tools (e.g., Prometheus, Grafana, Datadog) and alerting systems Understanding of infrastructure automation (e.g., Terraform, Ansible, PowerShell, Helm) Excellent communication and collaboration skills Experience with security best practices Possesses problem solving skills and the ability to More ❯
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
platforms) that supports the different platform services. Develop comprehensive monitoring solutions to provide full visibility into different platform components using tools and services such as Kubernetes, Prometheus, Grafana, ELK, Datadog, New Relic and other similar tools. Identify and troubleshoot any bottlenecks, availability and performance issues at multiple layers of deployment, from hardware, operating environment, software, network, and application. Evaluate performance More ❯
in the knowledge of programming languages, relational databases, and NoSQL databases Experience building infrastructure as code using AWS CloudFormation or similar scripting techniques Familiarity with monitoring tool suites like DataDog, SumoLogic, NewRelic, and Nagios Strong practical Linux based systems administration skills and scripting experience in a Cloud based environment Experience with Agile Scrum practice a plus Desired Skills More ❯
Strong expertise in implementing Site Reliability Engineering (SRE) principles. Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills). Proficiency in automation & scripting using Python & Ansible (primary skills). Strong experience with cloud platforms AWS & Azure (primary skills). Solid understanding of containerization and orchestration tools like Docker and Kubernetes . Proficiency in cloud native distributed systems & microservices More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
york, new york, united states Hybrid/Remote Options
Menusifu, Inc
and Postgres DB * Proficient in CI/CD workflows with TeamCity and Bitbucket Pipelines. * Skilled in Linux, Docker, and scripting languages (Bash, Python, Node.js). * Monitoring experience with CloudWatch, Datadog, and Uptime Kuma. * Infrastructure-as-Code knowledge using Terraform or CloudFormation. * Experience managing TLS certificates, DNS, and secure network routing. * Strong documentation and collaboration skills across distributed teams. xfsbrmf * Ability More ❯
new york city, new york, united states Hybrid/Remote Options
Menusifu, Inc
and Postgres DB * Proficient in CI/CD workflows with TeamCity and Bitbucket Pipelines. * Skilled in Linux, Docker, and scripting languages (Bash, Python, Node.js). * Monitoring experience with CloudWatch, Datadog, and Uptime Kuma. * Infrastructure-as-Code knowledge using Terraform or CloudFormation. * Experience managing TLS certificates, DNS, and secure network routing. * Strong documentation and collaboration skills across distributed teams. xfsbrmf * Ability More ❯
KPIs and strategic goals Excellent communication and presentation skills. Ability to travel occasionally for customer meetings and events. Preferred Skills Experience with Dynatrace and similar platforms (e.g., New Relic, Datadog, AppDynamics). Certifications in cloud technologies or DevOps practices. Familiarity with CI/CD pipelines, Kubernetes, and infrastructure-as-code tools (Terraform, Ansible). What we offer DXC provide a More ❯
london, south east england, united kingdom Hybrid/Remote Options
Fresha
Cloudfront and MSK extensively Have an understanding of SLIs, SLOs & SLAs Knowledge of platform and ops concepts such as networking and Linux administration Experience with monitoring tools: we use Datadog, Grafana, ELK, Sentry and OpsGenie. £90,000 - £120,000 a year Inclusive workforce At Fresha, we are creating a culture where individuals of all backgrounds feel comfortable. We want all More ❯
or teach them new things Love to automate manual work and try new modern technology/approaches Tech stack: AWS, Kubernetes, MongoDB, PostgreSQL, RabbitMQ, Redis, Ansible, Terraform, Grafana, Prometheus, Datadog, Sentry, Loki, Jenkins. What we Offer We expect excellence from our people — both on the road and in the office. In return, we offer flexible working hours, stock options, and More ❯
JavaScript, React/Angular/Vue, Node.js, REST APIs, microservices, and responsive design. Experience with performance analysis tools such as Lighthouse, WebPageTest, Chrome DevTools, GTmetrix, New Relic, AppDynamics, Dynatrace, Datadog, and Splunk. Expertise in performance testing tools including JMeter, LoadRunner, K6, or similar. Deep knowledge of web performance metrics such as Core Web Vitals, TTFB, FCP, LCP, CLS, TTI, TBT More ❯