/CD tools such as GitlabCI, CircleCI, Github Actions, and GitOps using ArgoCD, FluxCD Troubleshooting and debugging applications using Observability tooling across microservices and serverless applications such as Splunk, DataDog Managing ephemeral secrets and credentials using Hashicorp Vault Managing least privileged access to cloud resources using TPAM solutions such as Hashicorp Boundary Bonus Points for experience with: Production experience architecting More ❯
CircleCI also welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability , and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and tools like Helm Why Join H&B Tech? Be part of a fast-moving More ❯
containerization (Docker, Kubernetes), and CI/CD practices. Familiarity with Guidewire Cloud architecture models, deployment automation, and support practices. Experience integrating cloud infrastructure with DevOps, Monitoring (e.g., CloudWatch, Prometheus, Datadog), and Logging tools (ELK, Splunk). Solid understanding of cloud security, compliance (including regulatory needs in insurance), and networking. Knowledge of data migration, analytics integration, and insurance data models is More ❯
Our stack AWS as our cloud compute platform Kubernetes (EKS) for container runtime and orchestration RDS (PostgreSQL, MySQL), Kafka, Redis Terraform for infrastructure as code Lambda and Step Functions Datadog for Observability Github actions for CICD Frontend is React Backend services are developed in NodeJS (TypeScript) As we are an international team, please submit your application and CV in English. More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
scalable, secure infrastructure in AWS and Azure Build and maintain CI/CD pipelines using tools such as Azure DevOps Implement and manage monitoring, alerting and logging systems (e.g. Datadog, Logic Monitor, SolarWinds) Automate infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform Ensure compliance with security policies; manage IAM, PIM and RBAC access controls Respond to incidents More ❯
software applications and optimizing fleet utilization - Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding) and experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) - Experience scripting operating system tasks in Bash, Python, etc. and with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar) - Experience operating services More ❯
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Uniting Ambition
skills — and a passion for building better together Nice to Have (We’ll Support Learning Too) Frontend development experience (especially with Angular) Experience with Kubernetes, Docker, GitHub Actions, or Datadog Familiarity with BDD (Gherkin, SpecFlow), observability tooling, and secure development practices Experience working in highly regulated or enterprise-scale environments What’s In It for You Be at the forefront More ❯
in Computer Science, Management Information Systems, or related fields is desirable but not essential. Nice to have but not essential: Service monitoring and graphing tools (Prometheus + Grafana, Nagios, Datadog) Elastic Stack Repository solutions (JFrog Artifactory, JFrog Bintray) OpenVPN SQL Databases (MongoDB, PostgreSQL, MySQL) Our Values: We work together We believe in people We won't accept the "way it More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive More ❯
are JVM based with the majority running on Java 21. We're in the process of moving our backend services to Spring Boot. We've invested heavily in our DataDog integration to bring world class observability and monitoring to our systems. We've recently moved to Gitlab and are currently building out our next generation of automated deployment pipelines. We More ❯
expand documentation for system behavior, runbooks, and escalation flows. Tech Stack & Tooling Languages: Python (primary), Bash, T-SQL OS/Infrastructure: Linux, Windows, Docker, AWS Cloud services Monitoring & Alerting: DataDog, Grafana, custom tooling Automation/CI/CD: Git, TeamCity, Ansible, Terraform (optional) Databases: MS SQL Server, Snowflake General Any other duties commensurate with the post holder's position and More ❯
engineering (SRE), or a similar role. Proficiency in cloud platforms (AWS, Azure, GCP) and associated reliability tools. Hands-on experience with monitoring and logging tools such as Prometheus, Grafana, Datadog, Splunk, or ELK stack. Proficiency in scripting languages like Python, Bash, or Go for automation. Familiarity with containerization and orchestration tools (Docker, Kubernetes). Strong understanding of distributed systems, fault More ❯
frontend architecture (e.g., Module Federation or Single-SPA). Experience with cloud-native DevOps tooling: Docker, Kubernetes, AWS/GCP deployments. Proficiency in analytics and observability tools like Sentry, Datadog, or LogRocket. Soft Skills Strategic thinker with strong problem-solving and decision-making skills. Ability to work in fast-paced, agile environments with cross-functional teams. Clear communication and documentation More ❯
factor principles and fit into our microservices architecture Cloud-related tools, services, and distributed system observability to support these applications, such as Docker, Kubernetes, ElasticSearch, log management systems, and Datadog APM, to name but a few API specifications, conforming to the OpenAPI (Swagger) standard, provide a clean boundary both externally between our customers and our product, and internally between our More ❯
or similar GitHub Actions, CircleCI) Understands the importance of monitoring and proactive in resolving critical issues. Fluent in testing frameworks Junit , RestAssured Desirable: Exposure with monitoring and alerting platforms. Datadog , PagerDuty, Graphana, Prometheus Exposure in Python Scripting Exposure in deployment platforms like Kubernetes and tools like Helm. Ready to shape the future of health and wellness through tech? Apply now More ❯
PowerShell with other scripting languages like Python or Bash a bonus Awareness of configuration tools like Flux and Terraform Experience monitoring large distributed systems using technologies such as ELK, Datadog, Prometheus and tooling provided by cloud platform vendors Awareness and interest in technology trends to adopt new cutting-edge tools Building, managing, and securing C# ASP.Net web applications Excellent communication More ❯
Desirable Technical Skills Operating Systems: Ubuntu (18-22) Middleware: Apache Tomca Databases: Microsoft SQL Server (T-SQL) Scripting: Bash Cloud Platforms: Amazon Web Services (AWS) Containers: Docker Monitoring & Logging: Datadog General Skills & Attributes Strong problem-solving abilities with a strategic mindset Self-starter who works independently with minimal guidance Effective communicator able to simplify complex information for diverse audiences Proven More ❯
building robust and efficient backend solutions. Strong hands-on experience with Terraform for infrastructure as code, enabling scalable and reliable systems. Experience with monitoring and observability tools, such as Datadog or Prometheus. Familiarity with event-driven systems, particularly Kafka and/or RabbitMQ. Deep understanding of messaging and queuing systems, including design patterns for reliability, retries, and scaling. Strong understanding More ❯
building robust and efficient backend solutions. Strong hands-on experience with Terraform for infrastructure as code, enabling scalable and reliable systems. Experience with monitoring and observability tools, such as Datadog or Prometheus. Familiarity with event-driven systems, particularly Kafka and/or RabbitMQ. Deep understanding of messaging and queuing systems, including design patterns for reliability, retries, and scaling. Strong understanding More ❯
and optimize CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins. Automate everything with Terraform, Bicep, and scripting (PowerShell, Bash, Python). Drive observability with tools like Datadog, LogicMonitor, CloudWatch, and Grafana. Champion cloud security, IAM, RBAC, and compliance best practices. Collaborate across teams, mentor peers, and contribute to a culture of continuous improvement. What You Bring: Proven More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Principality Building Society
on-premise infrastructure models. Working knowledge of secure SDLC practices and non-functional testing requirements (e.g. resilience, availability, performance, security). Experience with monitoring, logging, and observability tooling (e.g. Datadog, App Insights). Knowledge of Agile principles and DevOps practices. Experience working in platform or enablement teams and using flow metrics to improve delivery. What You'll Bring: A strong More ❯
of resource allocation, network and/or internals. Experience working with cloud solutions (GCP or AWS). Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, Telegraf Experience with infrastructure as code tools. Experience with complex Terraform deployments is a plus. Solid background with configuration management tools. Experience with Saltstack is a plus. Experience with More ❯