. Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
or AWS CodePipeline Support and train technical staff in upskilling necessary for ongoing operations Monitor and ensure system reliability, availability, and performance using tools likeCloudWatch, Prometheus, Icinga2, Grafana, and Datadog Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes Desirable skills Travis CI Monitoring – Grafana, Icinga Prometheus Rabbit MQ/AMQP Working knowledge of security best practices More ❯
South West, England, United Kingdom Hybrid / WFH Options
Interquest
design, automation, deployments. Containers (Docker, Kubernetes); GitOps tools (ArgoCD, Flux) a plus. Infrastructure drift detection and IAC best practices. Monitoring & Observability, tools such as Prometheus, Grafana, ELK/EFK, Datadog, CloudWatch. Building dashboards, alerting systems, SLA/SLO/SLI monitoring, alert tuning. Proven track record leading and mentoring engineering teams. Experience working in multi-disciplinary environments with multiple stakeholders. More ❯
Experience with Infrastructure as Code tools (Terraform, CloudFormation, etc.) Proficiency in scripting languages (Python, Bash, Go) Strong understanding of version control systems (Git) Experience with monitoring tools (Prometheus, Grafana, DataDog, or similar) Our cash compensation range for this role is $180,000 - $230,000. Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from More ❯
as GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness More ❯
Stoke-On-Trent, Staffordshire, West Midlands, United Kingdom
Evolution Funding Limited
AWS CDK, Serverless Framework, CloudFormation). Knowledge of microservices and event-driven architectures. Exposure to container technologies (Docker, ECS, EKS, Kubernetes). Experience with monitoring and observability tools (CloudWatch, Datadog, OpenTelemetry). More ❯
including custom applications, integrations, AI & flows. Develop applications and integrations across platforms such as ITSM, ITOM, PA, CSM, SPM, CSDM, CMDB, Employee Centre, Integration Hub, and observability tools (e.g., Datadog, Splunk, AWS CloudWatch, Prometheus, etc.). Ensure seamless interoperability between service operations tooling and cloudnative environments. Technical Leadership & Collaboration: Serve as a technical lead, providing guidance & best practices across service More ❯
bradford, yorkshire and the humber, united kingdom
Mastek
including custom applications, integrations, AI & flows. Develop applications and integrations across platforms such as ITSM, ITOM, PA, CSM, SPM, CSDM, CMDB, Employee Centre, Integration Hub, and observability tools (e.g., Datadog, Splunk, AWS CloudWatch, Prometheus, etc.). Ensure seamless interoperability between service operations tooling and cloudnative environments. Technical Leadership & Collaboration: Serve as a technical lead, providing guidance & best practices across service More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. ABOUT BUSINESS UNIT IBM Consulting is IBM's consulting and global professional services business, with market leading capabilities in business and technology More ❯
codebase, currently in Java (11+), and ideally Spring Boot. You will be working with SQL and large SQL databases, Docker, Kubernetes, OpenAPI specifications, and distributed system observability tooling (e.g., Datadog APM). Infrastructure automation is primarily owned by the infrastructure team, but you will be a consumer of their work; familiarity with AWS, Terraform and Docker is beneficial. Testing approaches More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive More ❯
experience in various libraries Experience with AWS Lambda functions and serverless architectures Knowledge of REST APIs, JSON/XML, and web services integration Familiarity With Cribl, Grafana, Logic Monitor, Datadog, Newrelic or comparable monitoring & APM solutions is a plus. Exposure to SIEM and Service Management toolsets like ServiceNow would be advantageous. Nice to have UNIX/RHEL/Ubuntu with More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Noir
and maintaining CI/CD pipelines, and be confident scripting in Python, C# or similar scripting languages. You'll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you'll have worked in a fast-moving SaaS or product-led business before. Bonus points if you've helped shape DevOps roadmaps, mentored others, or More ❯
are JVM based with the majority running on Java 21. We're in the process of moving our backend services to Spring Boot. We've invested heavily in our DataDog integration to bring world class observability and monitoring to our systems. We've recently moved to Gitlab and are currently building out our next generation of automated deployment pipelines. We More ❯
Mesh (ie. Istio) and GitOps (ie. ArgoCD), with a focus on streamlined deployments and managing complex service-oriented architectures. Experienced in leveraging observability tools, such as Honeycomb (OpenTelemetry) and DataDog, to support data-driven decisions across the wider engineering team. Comprehensive understanding of networking in cloud environments, including VPN solutions, efficient network configuration, load balancing, and troubleshooting. Extensive experience designing More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
London Stock Exchange Group
as React or Angularo CI/CD processes using Gitlabo Terraform IaC Desirable: Some experience in one or more ofo Python, Javao Atlassian's tooling stack including JIRA & Confluenceo DataDog, BigPanda, Service Nowo Test Driven Development Demonstrable experience of building applications in Public Cloud - ideally Microsoft Azure & AWS Ability to design and explain solutions to complex problems Motivation, self-starting More ❯
building robust and efficient backend solutions. Strong hands-on experience with Terraform for infrastructure as code, enabling scalable and reliable systems. Experience with monitoring and observability tools, such as Datadog or Prometheus. Familiarity with event-driven systems, particularly Kafka and/or RabbitMQ. Deep understanding of messaging and queuing systems, including design patterns for reliability, retries, and scaling. Strong understanding More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
developers and SREs to solve complex problems What we're looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines and automation More ❯
Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves: o Experience in VOIP, (SIP and RTP More ❯
Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves: o Experience in VOIP, (SIP and RTP More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
role, ideally as a Site Reliability Engineer Ability to work cross-functionally, take ownership of tasks, and prioritize effectively Excellent communication and collaboration skills Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics) Programming/scripting skills for automation (favoring PowerShell, but also comfortable with Bash, C#, Ruby, or Python) Experience with web-based applications Desirable qualifications Familiarity More ❯
re looking for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating cloud More ❯
tuning. Lead technical triage and root cause analysis for infrastructure-related issues Develop and deploy applications using Docker and AWS FARGATE Use CloudWatch, CloudTrail, and third-party tools like Datadog for performance and cost efficiency Configure AWS networking (VPCs, TGWs), enforce governance via AWS Config and tagging policies Maintain architecture diagrams, SOPs, and collaborate across engineering and product teams Should More ❯