London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
or AWS CodePipeline Support and train technical staff in upskilling necessary for ongoing operations Monitor and ensure system reliability, availability, and performance using tools likeCloudWatch, Prometheus, Icinga2, Grafana, and Datadog Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes Desirable skills Travis CI Monitoring – Grafana, Icinga Prometheus Rabbit MQ/AMQP Working knowledge of security best practices More ❯
Experience with Infrastructure as Code tools (Terraform, CloudFormation, etc.) Proficiency in scripting languages (Python, Bash, Go) Strong understanding of version control systems (Git) Experience with monitoring tools (Prometheus, Grafana, DataDog, or similar) Our cash compensation range for this role is $180,000 - $230,000. Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from More ❯
as GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness More ❯
Stoke-On-Trent, Staffordshire, West Midlands, United Kingdom
Evolution Funding Limited
AWS CDK, Serverless Framework, CloudFormation). Knowledge of microservices and event-driven architectures. Exposure to container technologies (Docker, ECS, EKS, Kubernetes). Experience with monitoring and observability tools (CloudWatch, Datadog, OpenTelemetry). More ❯
including custom applications, integrations, AI & flows. Develop applications and integrations across platforms such as ITSM, ITOM, PA, CSM, SPM, CSDM, CMDB, Employee Centre, Integration Hub, and observability tools (e.g., Datadog, Splunk, AWS CloudWatch, Prometheus, etc.). Ensure seamless interoperability between service operations tooling and cloudnative environments. Technical Leadership & Collaboration: Serve as a technical lead, providing guidance & best practices across service More ❯
bradford, yorkshire and the humber, united kingdom
Mastek
including custom applications, integrations, AI & flows. Develop applications and integrations across platforms such as ITSM, ITOM, PA, CSM, SPM, CSDM, CMDB, Employee Centre, Integration Hub, and observability tools (e.g., Datadog, Splunk, AWS CloudWatch, Prometheus, etc.). Ensure seamless interoperability between service operations tooling and cloudnative environments. Technical Leadership & Collaboration: Serve as a technical lead, providing guidance & best practices across service More ❯
orchestration (ECS, EKS, or Kubernetes) Experience setting up CI/CD pipelines using GitHub Actions or similar tools Familiarity with monitoring and alerting tools (e.g. Prometheus, Grafana, CloudWatch, Sentry, DataDog) A security-first mindset when designing and managing infrastructure Nice to Haves Experience working in regulated or high-trust environments Knowledge of zero-downtime deployment patterns and rollback strategies Exposure More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. ABOUT BUSINESS UNIT IBM Consulting is IBM's consulting and global professional services business, with market leading capabilities in business and technology More ❯
codebase, currently in Java (11+), and ideally Spring Boot. You will be working with SQL and large SQL databases, Docker, Kubernetes, OpenAPI specifications, and distributed system observability tooling (e.g., Datadog APM). Infrastructure automation is primarily owned by the infrastructure team, but you will be a consumer of their work; familiarity with AWS, Terraform and Docker is beneficial. Testing approaches More ❯
configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development. IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive More ❯
experience in various libraries Experience with AWS Lambda functions and serverless architectures Knowledge of REST APIs, JSON/XML, and web services integration Familiarity With Cribl, Grafana, Logic Monitor, Datadog, Newrelic or comparable monitoring & APM solutions is a plus. Exposure to SIEM and Service Management toolsets like ServiceNow would be advantageous. Nice to have UNIX/RHEL/Ubuntu with More ❯
improvement in automation, monitoring, and deployment processes. What we're looking for Experience with AWS services (ECS, S3, RDS, Lambda, CloudFront, etc.). Skilled in monitoring tools such as DataDog , CloudWatch , and Grafana. Familiarity with Docker , ECS , Kubernetes , or similar containerisation tech. Competence in scripting or coding with Bash, Python, or Node.js. Experience with Infrastructure as Code (Terraform, Pulumi, etc. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Noir
and maintaining CI/CD pipelines, and be confident scripting in Python, C# or similar scripting languages. You'll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you'll have worked in a fast-moving SaaS or product-led business before. Bonus points if you've helped shape DevOps roadmaps, mentored others, or More ❯
are JVM based with the majority running on Java 21. We're in the process of moving our backend services to Spring Boot. We've invested heavily in our DataDog integration to bring world class observability and monitoring to our systems. We've recently moved to Gitlab and are currently building out our next generation of automated deployment pipelines. We More ❯
Lisburn, County Antrim, United Kingdom Hybrid / WFH Options
Camlin
e.g., Docker, Kubernetes, Terraform, Ansible, Helm, etc). Familiarity with continuous integration and deployment tools (e.g., GitLab CI, Argo Workflow, ArgoCD). Experience with monitoring/logging solutions (e.g., DataDog, ELK, Prometheus). Good understanding of concepts related to computer architecture, data structures and programming practices. Solid understanding of networking, databases, and security principles. Our Values We work together We More ❯
KPIs and strategic goals Excellent communication and presentation skills. Ability to travel occasionally for customer meetings and events. Preferred Skills Experience with Dynatrace and similar platforms (e.g., New Relic, Datadog, AppDynamics). Certifications in cloud technologies or DevOps practices. Familiarity with CI/CD pipelines, Kubernetes, and infrastructure-as-code tools (Terraform, Ansible). What we offer DXC provide a More ❯
Mesh (ie. Istio) and GitOps (ie. ArgoCD), with a focus on streamlined deployments and managing complex service-oriented architectures. Experienced in leveraging observability tools, such as Honeycomb (OpenTelemetry) and DataDog, to support data-driven decisions across the wider engineering team. Comprehensive understanding of networking in cloud environments, including VPN solutions, efficient network configuration, load balancing, and troubleshooting. Extensive experience designing More ❯
building robust and efficient backend solutions. Strong hands-on experience with Terraform for infrastructure as code, enabling scalable and reliable systems. Experience with monitoring and observability tools, such as Datadog or Prometheus. Familiarity with event-driven systems, particularly Kafka and/or RabbitMQ. Deep understanding of messaging and queuing systems, including design patterns for reliability, retries, and scaling. Strong understanding More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
developers and SREs to solve complex problems What we're looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines and automation More ❯
Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves: o Experience in VOIP, (SIP and RTP More ❯
Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves: o Experience in VOIP, (SIP and RTP More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
role, ideally as a Site Reliability Engineer Ability to work cross-functionally, take ownership of tasks, and prioritize effectively Excellent communication and collaboration skills Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics) Programming/scripting skills for automation (favoring PowerShell, but also comfortable with Bash, C#, Ruby, or Python) Experience with web-based applications Desirable qualifications Familiarity More ❯
re looking for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating cloud More ❯
tuning. Lead technical triage and root cause analysis for infrastructure-related issues Develop and deploy applications using Docker and AWS FARGATE Use CloudWatch, CloudTrail, and third-party tools like Datadog for performance and cost efficiency Configure AWS networking (VPCs, TGWs), enforce governance via AWS Config and tagging policies Maintain architecture diagrams, SOPs, and collaborate across engineering and product teams Should More ❯