automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
london (city of london), south east england, united kingdom
Damia Group
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
/year + Benefits Your Role: Design and implement Kubernetes (EKS) platforms on AWS Maintain and optimize CI/CD pipelines using GitHub Actions and Harness.io Implement and manage observability with Datadog across platforms Support and enhance AWS cloud infrastructure Review, audit, optimize, and document deployment processes Adhere to change management processes aligned with ISO27001 and PCI-DSS Enable self … costs across infrastructure Your Experience & Qualifications: 5+ years’ experience in a senior Platform/DevOps role Strong Kubernetes experience, particularly with AWS ECS Expertise in AWS EKS and Datadog observability Solid AWS knowledge across compute, database, and security services Proficiency in Infrastructure as Code using Terraform A solid understanding of container security and best practices Strong scripting skills (Python, Bash More ❯
to deployment and maintenance. • Ensure on-time delivery by identifying and mitigating risks early. • Champion CI/CD practices and ensure smooth, automated deployment pipelines. 5. Reliability, Security, and Observability • Own the uptime, latency, and performance SLAs of financial APIs and services. • Proactively monitor risk vectors and enforce observability via metrics, logging, and alerting. • Work with DevSecOps to embed security More ❯
Edinburgh & Lothians, Scotland, United Kingdom Hybrid / WFH Options
Neogen Recruitment Solutions Ltd
will own full-stack features end-to-end: design and implement backend services in Go, build responsive React frontends, integrate third-party systems, and help maintain CI/CD, observability and security. Youll work closely with product, design and customer success to deliver production-grade software. Key responsibilities - Design, implement and maintain backend services and APIs in Go (Golang). … with CI/CD tooling (GitHub Actions, GitLab CI, CircleCI, etc.). - Automated testing experience (unit/integration/e2e). - Good engineering practices: code reviews, TDD/BDD, observability (metrics/logs/tracing). - Strong communication skills and ability to work asynchronously in a remote team. Nice-to-have - Experience with procurement, finance or ERP integrations (SAP, Oracle More ❯
Monitor, troubleshoot, and optimize data pipelines to ensure performance and cost efficiency. Implement data governance, access controls, and security measures in line with best practices and regulatory standards. Develop observability and anomaly detection tools to support Tier 1 systems. Work with engineers and business teams to gather requirements and translate them into technical solutions. Maintain documentation, follow coding standards, and … to work across technical and non-technical teams. Additional Strengths Experience with orchestration tools like Apache Airflow. Knowledge of real-time data processing and event-driven architectures. Familiarity with observability tools and anomaly detection for production systems. Exposure to data visualization platforms such as Tableau or Looker. Relevant cloud or data engineering certifications. What we offer: A collaborative and transparent … ELT workflows with Apache Airflow (or similar) and integrating them into containerised CI/CD pipelines (Docker, GitHub Actions, Jenkins, etc.)? Select Which option best describes your experience building observability and automated anomaly detection tooling for data pipelines? Select What best describes your current location and working rights status? Select By submitting your application, you confirm that you have read More ❯
lead the global Cloud & Automation team. As a key member of the Engineering leadership team, you will drive the modernization of our SaaS platform, delivering intelligent automation, AI-powered observability, and highly scalable infrastructure across AWS and GCP. You will lead the transformation of legacy monolithic systems into modern, service-based architectures while building a high-performing, diverse engineering team. … architectures. Hands-on with Terraform, AWS CloudFormation to manage and version infrastructure. Ability to design modular, reusable infrastructure components across environments. Deep understanding of cloud-native practices, security, and observability tooling. Experience with relational databases: SQL Server, Oracle, MySQL. Strong time management, communication, and cross-functional leadership skills. Master's degree in computer science, Engineering, or equivalent experience. Culture and More ❯
and automation. Collaborate closely with Product Managers and GenAI/Data Science leaders to translate business vision into technical delivery. Scalability & Reliability Implement frameworks for CI/CD, testing, observability, and MLOps integration. Ensure systems meet performance, uptime, and data security standards required by global CPG enterprises. Plan for multi-market, multi-category deployments. Long-term Productization Create a … intensive applications (streaming, APIs, ETL, data warehouses, event-driven systems). Experience in enterprise SaaS, AI/ML integration, or analytics platforms. Deep knowledge of DevOps, CI/CD, observability, and security best practices. Degree in Computer Science, Engineering, or related field (MS preferred). Benefits Lead engineering for a high-growth AI-first company disrupting the global CPG industry. More ❯
or integrate with existing runtimes (e.g., containerd, CRI-O). Implement cluster networking, load balancing, and service discovery mechanisms. Ensure robust authentication, RBAC, auditing, and security policy enforcement. Integrate observability tooling for metrics, logging, and tracing. Work with cross-functional teams to ensure the platform supports internal use cases at scale. Write clean, maintainable, well-tested code and documentation. Stay More ❯
a collaborative, low-ego environment to maintain and improve a market-leading platform that drives revenue and builds stronger audience relationships. As an Application Support Engineer, you will: Ensure observability tools are configured to give full visibility of system health. Respond to incidents and urgent issues in a timely and effective manner. Investigate and triage operational issues in collaboration with More ❯
Databricks, M365, SharePoint, Salesforce, ServiceNow). Guide engineering teams through reference implementations and scalable architecture models. Partner with platform teams on enabling LLMOps/AgentOps-covering prompt management, evaluation, observability, and governance. Collaborate with business stakeholders to shape AI use cases-automated summarisation, triage, classification, and decision support. Ensure secure, policy-compliant access to enterprise data in AI pipelines. What More ❯
hosting Ability to design data pipelines and feedback loops for improving AI-driven features Awareness of emerging AI areas such as multimodal, edge AI, or AI in DevOps/observability Why Join? A leadership role with real ownership and autonomy A chance to shape the future of the engineering function while remaining close to the code Competitive salary (£90k-£120k More ❯
Amsterdam, North Holland, Netherlands Hybrid / WFH Options
Highfield Professional Solutions Ltd
solutions (InfiniBand HDR/NDR, RoCEv2) at rack, POD, and DC scale. Data & Storage: Partner with storage teams to optimise training data access, checkpointing, and high-performance throughput. Reliability & Observability: Translate signals from monitoring and telemetry systems into design improvements and reliability gains. Cross-Functional Collaboration: Work closely with reliability, networking, storage, and data centre engineering teams to deliver designs More ❯
role you'll need: Practical experience and execution of data governance & data management practices and tools such as data cataloguing, data discovery, lineage mapping, data dictionaries, data quality/observability and data security, data compliance The ability to align data initiatives with business goals and drive data maturity across the organisation. To be able to Identify opportunities where data can More ❯
work with others. Required Skills and Experience Deep domain knowledge of Services offerings and technical solutions in Application Management Services covering but not limited to Application Modernization & Migration, Application Observability, Application Maintenance & Support, application Engineering including Development, Testing & Release Management Demonstrated experience translating distinctive technical knowledge into actionable customer insights and solutions Prior consultative selling experience Externally recognized as an More ❯
Channel Security Solutions Engineer Apply () Location:London, United Kingdom Alternate LocationLuxembourg, Belgium, Area of InterestEngineer - Pre Sales and Product Management Job TypeProfessional Technology InterestSecurity and Observability Job Id What You'll Do This role requires a growth mindset with an extensive skill set - the combination of creative vision, cross-functional collaboration, sales experience, strong execution, Service Provider, and global service More ❯
We're Hiring: Mid-Level Site Reliability Engineer (SRE) This role would be Fully Remote, Permanent position Are you passionate about automation, observability, and scaling systems to support millions of users? Join ourclients SRE teamwithin thePlatform Engineeringorganization and help us build resilient, secure, and high-performing infrastructure. What Youll Do: Diagnose and resolve complex infrastructure and application issues Participate in … 24x7 on-call rotation, SCRUM, and deployment planning Perform Root Cause Analysis and guide application teams Improve system availability using industry-leading observability tools Influence architecture and design decisions with a security-first mindset Build tools and automation to streamline delivery and operations Tech Stack Highlights: Java, Kotlin, C++, Postgres AWS (EC2, ECS, Fargate, Route53, ALB/NLB) Observability: New More ❯
Senior AWS Platform/DevOps Engineer to design, deploy, and operate the backbone of our new streaming platform. You will own our AWS environment end-to-end: infrastructure, automation, observability, and security. As our first DevOps hire, you'll work directly with the Head of Engineering (ex-Amazon, ex-Prime Video) and a small, senior development team. This is a … AWS CodePipeline). Security & Compliance Configure IAM roles, policies, and least-privilege access. Implement and monitor CloudTrail, GuardDuty, Security Hub, WAF. Enforce tagging, cost controls, and guardrails across environments. Observability & Reliability Set up CloudWatch metrics, dashboards, and alarms. Implement distributed tracing (AWS X-Ray) and log analytics (OpenSearch or equivalent). Create synthetic canaries to validate key user flows (login … acceptable with willingness to switch) Skilled in CI/CD pipelines (GitHub Actions, CodePipeline, or Jenkins) Strong grasp of IAM, VPC networking, and AWS security best practices Experience with observability: CloudWatch, X-Ray, logging pipelines, and alerting Proficiency in scripting languages (Python, Bash) Strong communication and documentation skills; able to work independently in a small, fast-moving team Nice to More ❯
cloud infrastructure while championing automation through Infrastructure as Code solutions such as Terraform. Your day-to-day activities will involve collaborating with SRE and engineering teams to enhance system observability, proactively managing operational risks, maintaining high standards of security compliance, and ensuring robust disaster recovery capabilities. You will be responsible for documenting risks, tracking remediation actions, implementing process improvements, and … to cloud-hosted services to ensure seamless business operations.* Maintain the reliability and security of cloud environments by implementing robust monitoring tools and adhering to industry best practices.* Enhance observability and telemetry within cloud-hosted environments using SRE methodologies to deliver on Service Level Agreements (SLAs), Objectives (SLOs), and Indicators (SLIs).* Document and regularly review operational risks within the More ❯
and postmortem processes, driving root cause analysis and long-term fixes. Automation & Tooling Champion automation to reduce toil and improve system reliability. Oversee the development and maintenance of internal observability, tools and platforms. Collaborate with engineering and DevOps teams to embed reliability into the software development lifecycle. Collaboration & Strategy Partner with product, engineering, DevOps and Customer Support teams to align … on priorities and roadmaps. Contribute to the strategic direction of infrastructure and reliability initiatives. Advocate for best practices in observability, CI/CD, and infrastructure as code. What You Will Bring: Proven experience managing or leading SRE, DevOps, or infrastructure teams. Strong background in systems engineering, cloud platforms (AWS, Azure), and container orchestration (Kubernetes). Proficiency in monitoring, alerting, and More ❯
CI/CD pipelineswithGitLab CI or Jenkinsto enable fast, secure, and reliable software delivery. o Champion Kubernetes-based platformsusingAmazon EKSandIstio Service Meshto build scalable, service-oriented architectures. o Drive observability and reliability engineeringthrough proactive monitoring, alerting, and incident response strategies. o Mentor and guide DevOps engineers, fostering a culture of continuous improvement, automation, and operational excellence. o Collaborate cross-functionallywith … We're looking for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
VML Enterprise Solutions
mindset to your work Communicate effectively with excellent written and verbal skills. Familiarity with Diagrams-as-Code for documenting infrastructure architecture. Designing solutions observing cross-cutting concerns such as observability and system security Taking ownership of deployments in a true devops model MINIMUM QUALIFICATIONS/SKILLS Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work … skills. Familiarity with Diagrams-as-Code for documenting infrastructure architecture is a plus. Understanding of modern authentication protocols such as OAuth2 and OIDC. Consideration of cross-cutting concerns like observability and security in infrastructure design. Contributions to Open Source projects are a plus. As an equal opportunity employer we welcome applications that reflect the diversity of our wider community. Please More ❯