London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
or AWS CodePipeline Support and train technical staff in upskilling necessary for ongoing operations Monitor and ensure system reliability, availability, and performance using tools likeCloudWatch, Prometheus, Icinga2, Grafana, and Datadog Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes Desirable skills Travis CI Monitoring – Grafana, Icinga Prometheus Rabbit MQ/AMQP Working knowledge of security best practices More ❯
concepts. Some experience with Linux systems and basic scripting (Bash, Python, or similar). Interest in CI/CD tools and processes. Eagerness to learn observability tools (Prometheus, Grafana, Datadog, etc.). Problem-solving mindset and willingness to troubleshoot with guidance. Strong written and verbal communication skills, with the ability to clearly articulate technical concepts to both technical and non More ❯
backend and IoT systems, ensuring a seamless, fully automated workflow that reliably delivers high-quality code to production with speed and confidence. Elevating Monitoring: Enhance our monitoring capabilities with Datadog, gaining deeper insights and proactively ensuring system health. Scaling for Impact: Contribute to scaling our systems to efficiently manage and operate thousands of parking lots. Championing Engineering Excellence: Play a More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Noir
and maintaining CI/CD pipelines, and be confident scripting in Python, C# or similar scripting languages. You'll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you'll have worked in a fast-moving SaaS or product-led business before. Bonus points if you've helped shape DevOps roadmaps, mentored others, or More ❯
as needed. Experience with relational and non-relational databases. Experience delivering high levels of observability and proficiency in improving early warning systems, for example: has worked with Grafana/DataDog/Prometheus. Collaborating with internal/external teams/engineers and fostering an inclusive environment, where all points of view are welcomed and encouraged. Own and lead multiple domains of More ❯
Lisburn, County Antrim, United Kingdom Hybrid / WFH Options
Camlin
e.g., Docker, Kubernetes, Terraform, Ansible, Helm, etc). Familiarity with continuous integration and deployment tools (e.g., GitLab CI, Argo Workflow, ArgoCD). Experience with monitoring/logging solutions (e.g., DataDog, ELK, Prometheus). Good understanding of concepts related to computer architecture, data structures and programming practices. Solid understanding of networking, databases, and security principles. Our Values We work together We More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
developers and SREs to solve complex problems What we're looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines and automation More ❯
. Utilize tools such as Wiz , GCP Security Command Center , and IAM to enforce governance. Monitoring & Reliability Implement robust monitoring and alerting using Cloud Monitoring , Cloud Logging , Prometheus , or Datadog . Troubleshoot incidents to ensure optimal system performance and uptime. What You Bring Must-Have Skills Proven experience with GCP services (Compute Engine, Cloud Run, BigQuery, etc.). Proficient in More ❯
Saint Paul, Minnesota, United States Hybrid / WFH Options
Genesis10
Architect, and/or Google Cloud Professional Architect Experience solutioning hybrid cloud or multi-cloud environments. AWS and GCP preferred. Experience with cloud monitoring and logging tools such as Datadog, Splunk, or Dynatrace and integrating these tools into IT Service Management platforms Proficiency with containerization tools like Docker and Kubernetes Familiarity with cloud architectures in Pharmacy Benefit Management (PBM) or More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
role, ideally as a Site Reliability Engineer Ability to work cross-functionally, take ownership of tasks, and prioritize effectively Excellent communication and collaboration skills Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics) Programming/scripting skills for automation (favoring PowerShell, but also comfortable with Bash, C#, Ruby, or Python) Experience with web-based applications Desirable qualifications Familiarity More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
/NSG policies and ensure compliance with standards (e.g. GDPR, ISO 27001, SOC 2) across cloud environments. • Monitoring, Optimization & Cost Governance Continuously monitor workloads using tools like CloudWatch, Prometheus, Datadog; drive performance tuning and cost optimisation (rightsizing, reserved instances, auto scaling). • Disaster Recovery & Business Continuity Planning Develop and test backup/DR strategies, restore drills, and self healing infrastructure … CI/CD, GitHub Actions or Azure DevOps. • Security & Compliance: Implement and monitor IAM, encryption, audit logging, network isolation, and compliance frameworks. • Monitoring & Optimization Tools: Familiarity with CloudWatch, Grafana, Datadog, Prometheus, ELK or similar The position requires team members to work from client-site to ensure the reliability and availability of critical systems. Together, as owners, let’s turn meaningful More ❯
North West London, London, United Kingdom Hybrid / WFH Options
ByteHire
of infrastructure setup and management Exposure to designing or building distributed systems, preferably in a cloud environment Company Tech Stack PHP, Laravel, ReactJS, TypeScript, Inertia, WordPress MySQL, Redis, ElasticSearch, DataDog, AWS, Terraform, Docker Benefits Hybrid working 1-2 days per week in the London office. Collaborate directly with the founding team and take ownership of product features. Be part of More ❯
Bristol, Somerset, United Kingdom Hybrid / WFH Options
Just Eat
by your peers. What will you bring to the table? Interest and understanding in Cloud engineering. Familiarity with Infrastructure Automation concepts. Understanding of monitoring and logging tools such as Datadog, Prometheus, Graphite, Vector and Grafana. Interest in Platform Infrastructure (e.g. DNS, DR, SSL, Load balancing). Hands-on familiarity with Linux services and/or Windows services. Interest and understanding More ❯
as-code: Terraform, Pulumi Data Management and Orchestration: Airflow, dbt Databases and Data Warehouses: SQL Server, PostgreSQL, MongoDB, Qdrant, Pinecone GenAI: OpenAI APIs, HuggingFace, LangChain, Talk-to-data Monitoring: Datadog About You We are looking for someone who can wear two hats - the data architect and the strategic business consultant - so you'll need to show both advanced technical acumen More ❯
Lincolnshire, England, United Kingdom Hybrid / WFH Options
Akkodis
be helping to design and manage ETL/ELT pipelines, making sure data flows smoothly and reliably across the business. You'll also get hands-on with tools like Datadog or CloudWatch to monitor performance and keep things secure and efficient. If you enjoy writing clean Python code, working with SQL, and collaborating with analysts and engineers, this could be More ❯
East Midlands, United Kingdom Hybrid / WFH Options
Akkodis
be helping to design and manage ETL/ELT pipelines, making sure data flows smoothly and reliably across the business. You'll also get hands-on with tools like Datadog or CloudWatch to monitor performance and keep things secure and efficient. If you enjoy writing clean Python code, working with SQL, and collaborating with analysts and engineers, this could be More ❯
yourself on consistent high levels of test coverage, strong technical documentation and effective monitoring Preferably exposure to technologies such as Kafka, PostgreSQL, Redis We use Kotlin, PostgreSQL, Kafka, Redis, Datadog, Amplitude, Grafana, BigQuery, ApacheSpark and more A passion for crypto and the transformations it enables COMPENSATION & PERKS Full-time salary based on experience and meaningful equity in an industry-leading More ❯
invites you to shape the future of parking systems, where you'll be at the forefront of developing a cutting-edge backend, optimizing CI pipelines, and enhancing monitoring with Datadog, all while ensuring the seamless operation of thousands of parking lots. If you're a passionate engineer with a hunger for continuous improvement and proficiency in a variety of programming … Experience with Terraform or similar tools for infrastructure automation. High-Throughput Systems: Strong experience in real production projects handling large-scale data flows. Monitoring & Observability: Proficiency in tools like Datadog, Prometheus, and Grafana. Security & Networking: Solid understanding of networking principles, security best practices, and cloud security. Agile & Fast-Paced Environments: Experience in agile teams, working in dynamic, fast-growing tech More ❯
Glasgow, City of Glasgow, United Kingdom Hybrid / WFH Options
Cathcart Technology
issues and improve the platform What You'll Need Strong SQL or Transact SQL skills Good technical awareness across CPU, memory, and infrastructure basics Experience with monitoring tools (e.g. DataDog or similar) Great communication skills, comfortable dealing with clients and internal teams SaaS background would be ideal, but not essential Kubernetes knowledge is a bonus, but not a requirement What More ❯
Glasgow, Lanarkshire, Scotland, United Kingdom Hybrid / WFH Options
Cathcart Technology
issues and improve the platform What You'll Need Strong SQL or Transact SQL skills Good technical awareness across CPU, memory, and infrastructure basics Experience with monitoring tools (e.g. DataDog or similar) Great communication skills, comfortable dealing with clients and internal teams SaaS background would be ideal, but not essential Kubernetes knowledge is a bonus, but not a requirement What More ❯
North London, London, United Kingdom Hybrid / WFH Options
VERTECH GROUP (UK) LTD
API security), cloud security (AWS), and CI/CD pipeline hardening Familiar with SAST/DAST tools, vulnerability scanners, penetration testing frameworks, and monitoring platforms (e.g. Splunk, ELK, Datadog) Understanding of GDPR and data privacy best practices Tremendous opportunity offering plenty of scope for career progression in a friendly, innovative environment where you'll be able to bring ideas and More ❯
API security), cloud security (AWS), and CI/CD pipeline hardening Familiar with SAST/DAST tools, vulnerability scanners, penetration testing frameworks, and monitoring platforms (e.g. Splunk, ELK, Datadog) Understanding of GDPR and data privacy best practices Tremendous opportunity offering plenty of scope for career progression in a friendly, innovative environment where you'll be able to bring ideas and More ❯
raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook. What you'll do at Synthesia: You will work end-to-end, contributing to our client application written in React and our monolithic backend written in More ❯