London, England, United Kingdom Hybrid / WFH Options
Ten Lifestyle Group
with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building agentic workflows (e.g., autonomous systems More ❯
London, England, United Kingdom Hybrid / WFH Options
Elliptic
/CD tools such as GitlabCI, CircleCI, Github Actions, and GitOps using ArgoCD, FluxCD Troubleshooting and debugging applications using Observability tooling across microservices and serverless applications such as Splunk, DataDog Managing ephemeral secrets and credentials using Hashicorp Vault Managing least privileged access to cloud resources using TPAM solutions such as Hashicorp Boundary Bonus Points for experience with: Production experience architecting More ❯
Newcastle upon Tyne, England, United Kingdom Hybrid / WFH Options
Byggfakta UK Group
Strong stakeholder communication and vendor management experience (D) Demonstrable experience in leading cross-functional teams and fostering DevOps/FinOps culture (D) Familiarity with observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack) and incident management processes. (D) MISSION & VISION Mission; By using our unique data, insights and software solutions, our customers in the construction industry will sell more, improve efficiency More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Group
. Hands-on experience with IaC tools like Terraform, CloudFormation, or Ansible. Strong knowledge of cloud-native technologies (Kubernetes, Docker, Serverless, etc.). Experience with monitoring tools (e.g., Prometheus, Datadog, New Relic). Familiarity with networking concepts, security protocols, and identity management. General Experience building SaaS applications. Strong problem-solving and analytical skills. Excellent communication and leadership abilities. Ability to More ❯
London, England, United Kingdom Hybrid / WFH Options
NinjaOne
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
communication skills. Some of the technologies we use: Programming Languages - Python, Ruby, Rust Infrastructure as Code - Terraform, AWS CDK Source Code Management and CI/CD - GitLab, Snyk Observability - DataDog, Airbrake Cloud-native infrastructure in AWS - ECS, Lambda, Step Functions, SNS/SQS, Transit Gateway, Aurora, DynamoDB, CloudFront, S3, AppSync, API Gateway, and many more. Responsibilities: Work with other Site More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
KnowBe4, Inc
communication skills. Some of the technologies we use: Programming Languages - Python, Ruby, Rust Infrastructure as Code - Terraform, AWS CDK Source Code Management and CI/CD - GitLab, Snyk Observability - DataDog, Airbrake Cloud-native infrastructure in AWS - ECS, Lambda, Step Functions, SNS/SQS, Transit Gateway, Aurora, DynamoDB, CloudFront, S3, AppSync, API Gateway, and many more. Responsibilities: Work with other Site More ❯
London, England, United Kingdom Hybrid / WFH Options
Canada Life
Azure certifications are a plus Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of security and scalability across all More ❯
London, England, United Kingdom Hybrid / WFH Options
Take-Two Interactive
Strong proficiency in AWS, Azure, or GCP, with hands-on experience with Terraform. Experience with configuration management tools like Ansible or Puppet, and observability tools like Prometheus, Grafana, and Datadog, etc. Design, develop, automate testing, and deploy custom tools using languages like Python or C#. Great to Have: Experience with database administration and performance tuning. Experience in optimizing cloud costs More ❯
London, England, United Kingdom Hybrid / WFH Options
London Stock Exchange Group
activities for new tools and technologies. Key Tools and Technologies DevOps Tools: GitLab, Jenkins, Ansible, Terraform, Cloud Technologies: AWS Containerization and Orchestration: Docker, Kubernetes, EKS, AKS Monitoring and Logging: DataDog Scripting Languages: Python, Bash, Shell Personal Attributes Proactive and dedicated with a passion for continuous improvement. Diligent with a focus on delivering high-quality work. Ability to work under pressure More ❯
Seattle, Washington, United States Hybrid / WFH Options
Georgia IT Inc
Event Hub, NATS or any messaging broker. JAVA Based microservices, responsible for deployment, scripting language is pytho n. Should have an understanding around terraform. Emphasis on Logs and Monitoring (datadog and splunk) Summary of Experience Requires 7+ years experience in the IT industry Requires 5+ years of software and DevOps development engineering Experience in working with cloud environment Azure preferred. … or any messaging broker. Experience with Cassandra, PostgresSQL, Mongo, Elastic Search, Cosmos DB Experience on Azure DevOps, Jenkins/Python/Terraform/Ansible Experience with Databricks Experience with DataDog, Splunk or other logging and APM tools. Experience in working with Linux environment. In-depth understanding of Computer Science fundamentals in object-oriented design, data structures, algorithms, and problem solving More ❯
Salt Lake City, Utah, United States Hybrid / WFH Options
CyberArk
management tools in both Windows and Linux - Terraform, Ansible, CloudFormation Ensuring cloud-based architecture meets availability and recoverability requirements Architecture and implementation of cloud-based monitoring, alerting and reporting - Datadog, CloudWatch, ELK, Grafana Develop tools to enable teams for greater output and reliability. Qualifications B.S. in Computer Science or equivalent experience Minimum 2 years of experience managing AWS infrastructure Minimum … years of experience with systems engineering and software development Solid understanding/experience of containerization services such as Docker Working knowledge of tools and technologies such as Terraform, Datadog, Logstash, Elasticsearch, Ansible Solid understanding/experience of web services, databases and relating infrastructure/architectures Solid understanding of backup/restore best practices Strong level of expertise programming Python/ More ❯
Salt Lake City, Utah, United States Hybrid / WFH Options
CyberArk
in both Windows and Linux - CloudFormation, Helm, Terraform, Salt, Ansible Ensuring cloud-based architectures meet availability and recoverability requirements Architecture and implementation of cloud-based monitoring, alerting and reporting - Datadog, Logz.io, CloudWatch, Catchpoint, ELK, Support and guidance on tooling that helps to enable teams for greater output and reliability. Deep understanding of the latest tech solutions, trends, and ability to … development A deep understanding of Site Reliability, infrastructure and Cloud Platform Expert understanding/experience of containerization services such as Docker/Kubernetes Expert in observability tooling such as Datadog, NewRelic, Logstash, Elasticsearch Solid understanding/experience of web services, databases and relating infrastructure/architectures Solid understanding of backup/restore best practices Strong level of expertise programming writing More ❯
Chicago, Illinois, United States Hybrid / WFH Options
Peapod Digital Labs
/SLIs and operational health indicators for business-critical services. Automate infrastructure provisioning and application deployment workflows using Terraform, GitHub Actions, and ArgoCD. Drive observability and telemetry adoption using Datadog, including dashboards, alerts, custom metrics, and distributed tracing. Act as incident commander during critical production issues; conduct blameless postmortems and guide root cause remediation. Lead cross-team efforts in reducing … ArgoCD for orchestration and GitOps deployment workflows. Proficiency in Python, Java, Bash, or Go for automation, scripting, and infrastructure tooling. Proven ability to implement observability platforms and practices using Datadog (metrics, logs, traces, dashboards, alerts). Advanced experience working with CI/CD pipelines using GitHub and GitHub Actions. Deep understanding of networking, Linux (especially Ubuntu), distributed systems, and container More ❯
Virginia Beach, Virginia, United States Hybrid / WFH Options
CrowdStrike Holdings, Inc
Virtual Desktop Infrastructure (VDI) solutions to ensure seamless user experience, performance, and security. Observability, Incident Response & Performance Optimization Deploy and manage monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Datadog, Splunk, ELK). Implement automated self-healing mechanisms and proactive monitoring solutions. Lead incident response, postmortems, and root cause analysis (RCA) to prevent future system disruptions. Ensure 24/… AWS GovCloud (Azure/GCP is a plus). Strong expertise in Infrastructure as Code (Terraform, CloudFormation). Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK, Datadog, Splunk). Expertise in IAM and PAM solutions such as Okta, CyberArk, or AWS IAM. Strong scripting and automation skills (Python, Bash, PowerShell). Experience with CI/CD pipelines More ❯
London, England, United Kingdom Hybrid / WFH Options
So Energy
across the stack: Frontend conversations: Vue.js, modern component-driven design, API design for seamless integration. Infrastructure: GCP stack, Terraform, Kubernetes, Docker, CI/CD pipelines (GitHub Actions, SonarCloud), observability (Datadog, Grafana). Data: BigQuery, SQL/NoSQL, event-driven architecture, data pipelines. Bring holistic thinking to system design, including scalability, latency, operational excellence, and future-proofing. This role will be … and event-driven architectures. Experience with cloud-native development (GCP preferred; AWS experience relevant). Infrastructure-as-code expertise: Terraform, Kubernetes. Database mastery: PostgreSQL, BigQuery, NoSQL. Observability and monitoring: Datadog, Grafana, logging pipelines. Security best practices: OAuth, SSO, data protection, and secure coding principles. Familiarity with frontend frameworks (React, Vue) and mobile technologies (Ionic, Swift, Android) a plus. Hands-on More ❯
VPC, etc. IaC: AWS CDK, Terraform, or CloudFormation CI/CD pipelines + scripting (Python, Bash, PowerShell) Containerized applications (Docker + ECS) Observability tooling like New Relic, CloudWatch, Prometheus, Datadog Who we’re looking for: Proven SRE or platform engineering experience in a high-availability environment Passion for reliability, automation, and system performance Strong problem-solving mindset and solid communication More ❯
London, England, United Kingdom Hybrid / WFH Options
Your Next Hire
VPC, etc. IaC: AWS CDK, Terraform, or CloudFormation CI/CD pipelines + scripting (Python, Bash, PowerShell) Containerized applications (Docker + ECS) Observability tooling like New Relic, CloudWatch, Prometheus, Datadog Who we’re looking for: Proven SRE or platform engineering experience in a high-availability environment Passion for reliability, automation, and system performance Strong problem-solving mindset and solid communication More ❯
London, England, United Kingdom Hybrid / WFH Options
Causaly Inc
with Terraform and infrastructure-as-code principles. • Experience managing Kubernetes clusters and containerized applications. • Familiarity with CI/CD pipelines and modern DevOps tooling. • Experience with observability tools (e.g. Datadog, Prometheus, Grafana). • Strong scripting skills (e.g. Bash, Python). • Ability to contribute to architectural planning and system design. • Excellent problem-solving and analytical skills, especially in complex or ambiguous More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native technologies: Experience in deploying to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity: A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native technologies: Experience in deploying to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity: A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down More ❯
London, England, United Kingdom Hybrid / WFH Options
Zettafleet
native technologies: Experience in deploying to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity: A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
East Tawakoni, Texas, United States Hybrid / WFH Options
Fisher Investments
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯