able to build new DevOps pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Kubernetes Helm Kops Ingress/Egress Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat Skills working in a secure environment and ability to adhere to security principles Experience in support organisation DevOps Engineer - HLC DV UK wide (Manchester
recovery procedures to ensure system availability and data integrity. o Monitor and troubleshoot system resources in the AWS environment, ensuring modern Site Reliability Engineering best practices and client approved observability tools, such as OpenTelemetry, Dynatrace, Elastic, etc. • Collaboration and Security: o Work closely with development, operations, and security teams to ensure cloud solutions align with organizational goals and security requirements.
small team Data Engineers. Define and deliver the data engineering roadmap aligned with business priorities. Own and evolve the data platform architecture from ingestion and transformation through to governance, observability, and real time processing Drive adoption of best practices (CI/CD, testing, infra as code) and introduce new technologies where appropriate. Collaborate with stakeholders across Engineering, Product, and Analytics
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such
City of London, London, United Kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such
london, south east england, united kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such
slough, south east england, united kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such
london (city of london), south east england, united kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such
West London, London, United Kingdom Hybrid / WFH Options
Young's Employment Services Ltd
Fabric, leveraging expertise in Azure Data Factory, Databricks, and other Azure services. Advocate for engineering best practices and ensure long-term sustainability of systems. Integrate principles of data quality, observability, and governance throughout all processes. Participate in recruiting, mentoring, and developing a high-performing data organization. Demonstrate pragmatic leadership by aligning multiple product workstreams to achieve a unified, robust, and
a commitment to continuous improvement Bonus points if you have: Azure certifications (e.g., Developing or Architecting Microsoft Azure Solutions) Experience with GraphQL (e.g., HotChocolate), Kafka, Docker, Azure DevOps, or observability tools Knowledge of identity management, housing systems, or Gitflow What you'll get in return The chance to build a development function from scratch and leave a lasting legacy A
architectures Knowledge of infrastructure as code (IaC) (Terraform, CloudFormation, AWS CDK) Understanding of CI/CD pipelines and DevOps practices Experience in serverless application development Proficient with monitoring and observability tools Why Join - Growth & Opportunity: Be part of a thriving company with a culture built on innovation and collaboration. - Flexibility: Enjoy the freedom of remote-first work with regular in
models Backend Development & APIs: Develop high-performance APIs using FastAPI for agent interaction and monitoring Implement real-time streaming capabilities for agent responses and status updates Build monitoring and observability systems for agent performance tracking Create robust authentication and authorization systems for agent access Data & Knowledge Management: Design and implement RAG (Retrieval-Augmented Generation) systems Optimize vector embeddings and similarity
and maintain containerized applications using Docker, and develop CI/CD pipelines to automate testing, deployment, and delivery processes for scalable and reliable software releases. Implement and maintain robust observability practices, including logging, monitoring, and alerting systems, to ensure real-time visibility into application performance, system health, and efficient troubleshooting. What we offer: The opportunity to be part of something
related field, including 6+ years proven experience as a technology architect and 3+ years managing technology vendors Subject-matter expertise in:- Delivery infrastructure build out (e.g. CI, deployment orchestration, observability, and A/B test infrastructure)- Modern security practices- Modern API platform design- Modern data architectures (e.g. event-driven architectures, stream processing, and integrating real-time analytics into customer applications
Code principles Design an agile release engineering strategy that delivers value incrementally and continuously Support a highly-available live production system, respond to alerts, diagnose problems using logs and observability tooling, triage and resolve incidents What we offer We make sure our team is well looked after with generous salaries and a great benefits package which includes: Enhanced pension with
scalability and reduce manual intervention. Operational Security, SRE & Assurance: Ensure security platforms are resilient, continuously monitored, and designed for 24x7 support and incident response readiness. Embed security telemetry and observability to enable proactive threat detection and automated response. Apply SRE principles to improve reliability, performance, and maintainability of security services. Define service level objectives (SLOs) and key performance indicators (KPIs
orchestration and infrastructure-as-code. * Solid understanding of cloud networking and architecture (AWS, Azure, or GCP). * Experience with CI/CD systems and automated deployment workflows. * Familiarity with observability and performance monitoring tools. * Experience with data pipelines and workflow orchestration. * Excellent communication and documentation skills. * Alignment with SRE principles and a passion for automation and reliability. * Security-first approach … cloud infrastructure to support scalable and secure application deployments. * Develop and maintain CI/CD pipelines to streamline development and release processes. * Monitor and optimize system performance using modern observability tools. * Support and enhance data processing workflows using event-driven orchestration. * Troubleshoot production issues and implement solutions to ensure system stability. * Document infrastructure and promote best practices across teams. * Embed … Workflows, Prometheus, Grafana, Sentry, Python, Java, Next.js, Infrastructure as Code, Monitoring, Logging, Security, SRE, Remote DevOps, UK Tech Jobs, STEM, ISO 27001, SOC2, HIPAA, GDPR, Git, Cloud Security, Automation, Observability, Event-driven Architecture
or strong interest in learning) cloud-native tooling: AWS (especially CloudWatch) Artifact Management (e.g., Artifactory, CodeArtifact) Infrastructure as Code with Terraform Monitor test metrics, troubleshoot failures, and improve system observability and debuggability.
Milton Keynes, Buckinghamshire, South East, United Kingdom
Interact Consulting Limited
or strong interest in learning) cloud-native tooling: AWS (especially CloudWatch) Artifact Management (e.g., Artifactory, CodeArtifact) Infrastructure as Code with Terraform Monitor test metrics, troubleshoot failures, and improve system observability and debuggability.
a focus on security, data protection, and performance optimization. Experience managing transport and change governance, incident triage, and root cause analysis. Skilled in monitoring tools like SAP Cloud ALM, observability platforms, and incident management platforms such as Jira or Azure DevOps. Adept at documentation using Confluence and following agile methodologies like Scrum and Kanban. Exceptional stakeholder management and communication skills
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Tria
within enterprise systems. Strong understanding of cloud platforms (Azure preferred). Knowledge of Infrastructure-as-Code (IaC), APIs, and automation tools. Familiarity with CI/CD pipelines, monitoring, and observability tools. Knowledge of ITSM, Agile, DevOps, and service-level objectives (SLOs) and indicators (SLIs). Excellent problem-solving skills and ability to work in complex, multi-supplier environments. Desirable: Bachelor
Python Experience with IaC principles and automation tools such as Ansible, Puppet and SaltStack General HPC technical knowledge regarding compute, network, memory, and storage components Experience with monitoring and observability tools such as Grafana Clearance: TS/SCI clearance with polygraph is required. Total Compensation Package We offer a comprehensive compensation package designed to support your well-being and professional
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
StepChange Debt Charity
and governance controls Automation & Orchestration (Essential): Building Infrastructure as Code (IaC) using Terraform. Designing CI/CD pipelines for repeatable, automated deployments Driving operational excellence with monitoring, logging, and observability tools such as CloudWatch and AWS Config. Monitoring (Desirable) - Grafana Strong troubleshooting skills and diagnostic abilities for BAU escalations An aptitude for Security and a keen eye for detail. Ideally
APIs Experience of writing performance critical code Experience of using Git or similar to track changes Experience of both the full .NET Framework and .NET Core Experience of using observability systems such as Elastic APM or DataDog to track and diagnose issues in production A solid understanding of security principles and secure coding including OWASP Top 10 Nice to haves