London, South East, England, United Kingdom Hybrid / WFH Options
Become
Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong scripting skills (e.g., Bash, Python) and More ❯
Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high-performance and More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
Liquibase) and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated More ❯
tools and container orchestration (Docker, ECS, or Kubernetes) Solid understanding of system/network security, IAM, VPC, and secure cloud configurations Familiarity with monitoring and logging tools (e.g., CloudWatch, Datadog, Prometheus, Sentry) Experience with Postgres, Redis, and scalable backend systems Bonus: Exposure to fintech or regulated environments, GDPR/data compliance, or SOC2 setup A little about us Our founders More ❯
Proficiency with cloud providers like Azure, AWS, or GCP. Experience with Infrastructure as Code tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring and logging tools like DataDog, Prometheus, Grafana, or similar. A track record of maintaining highly available and performant production environments. Ability to develop effective mitigation strategies and operational playbooks. Useful/Bonus Skills: Experience with More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Epsilon Inc
Kubernetes), and cloud-native development practices Advanced knowledge of configuration management tools (Ansible, Puppet, Chef), version control systems (Git), and infrastructure automation frameworks Experience with monitoring and logging tools (DataDog, Splunk, ELK Stack), application performance monitoring solutions, and security scanning tools for vulnerability management Proficiency in programming and scripting languages including Java, Python, PowerShell, Bash, and experience with API development More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
/CD tools such as GitlabCI, CircleCI, Github Actions, and GitOps using ArgoCD, FluxCD Troubleshooting and debugging applications using Observability tooling across microservices and serverless applications such as Splunk, DataDog Managing ephemeral secrets and credentials using Hashicorp Vault Managing least privileged access to cloud resources using TPAM solutions such as Hashicorp Boundary Bonus Points for experience with: Production experience architecting More ❯
Rust, with cloud platform experience (AWS, GCP, Azure). Strong knowledge of CI/CD, containerization (Docker, Kubernetes), networking, distributed systems, and databases. Experience with monitoring and troubleshooting tools (DataDog, Prometheus, Grafana, ELK, Splunk, Humio). Excellent problem-solving, attention to detail, and communication skills. Desirable Experience with Azure, autonomous vehicles, or ML/AI projects. This is a full More ❯
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Uniting Ambition
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
Watford, Hertfordshire, United Kingdom Hybrid / WFH Options
Wickes
You'll have a deep understanding of modern cloud ecosystems, with extensive hands-on experience in Amazon Web Services (AWS). Familiarity with modern observability concepts and tools, including Datadog, and proven experience with the "platform as a product" model and driving adoption of internal tools. Strong familiarity with CI/CD principles and pipelines (e.g., Jenkins, GitLab CI, CircleCI More ❯
PowerShell with other scripting languages like Python or Bash a bonus Awareness of configuration tools like Flux and Terraform Experience monitoring large distributed systems using technologies such as ELK, Datadog, Prometheus and tooling provided by cloud platform vendors Awareness and interest in technology trends to adopt new cutting-edge tools Building, managing, and securing C# ASP.Net web applications Excellent communication More ❯
and optimize CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Automate everything with Terraform, Bicep, and scripting (PowerShell, Bash, Python). Drive observability with tools like Datadog, LogicMonitor, CloudWatch, and Grafana. Champion cloud security, IAM, RBAC, and compliance best practices. Collaborate across teams, mentor peers, and contribute to a culture of continuous improvement. What You Bring: Proven More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Principality Building Society
on-premise infrastructure models. Working knowledge of secure SDLC practices and non-functional testing requirements (e.g. resilience, availability, performance, security). Experience with monitoring, logging, and observability tooling (e.g. Datadog, App Insights). Knowledge of Agile principles and DevOps practices. Experience working in platform or enablement teams and using flow metrics to improve delivery. What You'll Bring: A strong More ❯
roads to help teams get their apps up and running quickly in a consistent manner Event-Driven: We share data through an event-driven system powered by MSK Observability: Datadog is used for comprehensive logging and monitoring Databases: We use a combination of MongoDB and AWS Relational Databases Automation and CICD: Deployments are highly automated using Jenkins pipelines and Github More ❯
TypeScript for Frontend. Our backend services are written in TypeScript and Kotlin. Frameworks and Libraries: We use React/Redux and WebAssembly. Monitoring and Logging: We are currently using Datadog for monitoring and logging. Metrics are collected across our agents, taken from the logs using metric filters, and updated directly from lambda function or the application. Infrastructure-as-Code: Most More ❯
of the React Framework, relative patterns and best practices. Good understanding of UI/UX best practices and considerations. Understanding of front-end observability with tools like Sentry, LogRocket, Datadog, or New Relic. Experience with CI/CD pipelines, like Github Actions, ArgoCD. Awareness of common front-end security risks (e.g., XSS, CSRF). Passion for writing clean, modular, scalable More ❯
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
years of professional experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to get hands on More ❯