Snowflake (via ODI) . DR and CRM data migration to Snowflake (to avoid legacy reporting) . File loading and file creation frameworks . Storage space and cost optimization . Observability with Datadog dashboards . Minimize Azure SQL DB dependency by moving data to Snoflake. . User access management and auditing . Maintenance and monitoring of database health - daily reports . More ❯
Snowflake (via ODI) . DR and CRM data migration to Snowflake (to avoid legacy reporting) . File loading and file creation frameworks . Storage space and cost optimization . Observability with Datadog dashboards . Minimize Azure SQL DB dependency by moving data to Snoflake. . User access management and auditing . Maintenance and monitoring of database health - daily reports . More ❯
Belfast Metropolitan Area, United Kingdom Hybrid/Remote Options
Realtime Recruitment
is a hands-on, highly cross-functional role that bridges engineering, product, and customer success. You'll work on cutting edge API integrations, helping customers embed security , automation, and observability into their environments at scale. What you'll do: Design and develop integrations with enterprise systems - MDMS, SIEMs, SOAR. Build and maintain APIs, SDKs, and automation frameworks. Collaborate with enterprise More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Stax - Deeptech Talent
supports embedded dashboards, cross-source joins, and both human and AI-native querying (e.g. Text-to-SQL). Leverage in-app instrumentation (session replays, logs, engagement events) to enable observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions Lead the development of agentic capabilities—such as natural language querying, AI-generated dashboards, and real-time recommendations—integrating structured More ❯
supports embedded dashboards, cross-source joins, and both human and AI-native querying (e.g. Text-to-SQL). Leverage in-app instrumentation (session replays, logs, engagement events) to enable observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions Lead the development of agentic capabilities—such as natural language querying, AI-generated dashboards, and real-time recommendations—integrating structured More ❯
the team. Solve challenging problems: Proactively identify root causes, implement durable solutions, and share learnings that help others solve problems more effectively. Be data-driven: Use product metrics and observability tools to guide decisions during discovery and development, and deliver work that demonstrably improves product or business metrics. Deliver at pace: Improve team velocity by unblocking others, streamlining workflows, and More ❯
across the organization. What you’ll be doing: Building and maintaining a Kubernetes-hosted AI platform (AKS) Deploying and managing LLMOps tools such as LiteLLM, Langflow, and Langfuse Implementing observability with Prometheus, Grafana, and Loki Managing infrastructure through Terraform, ArgoCD, and GitHub Actions Supporting internal AI applications including RAG, document processing, and internal AI assistants What you’ll need … years in Platform or DevOps Engineering (Azure preferred) Strong experience with Kubernetes, Docker, and Terraform Programming or scripting skills in Python or Go Familiarity with GitOps, Helm, and observability tools A learning mindset and interest in LLM operations More ❯
AKS), CI/CD automation with GitHub and Terraform, and Linux administration to create robust deployment pipelines and cloud infrastructure that supports multiple product lines. You'll focus on observability solutions, including LogStash integration , while collaborating with cross-functional teams to streamline software delivery, enhance system reliability, and empower developers with automated, secure infrastructure solutions. Key Responsibilities: Build and optimise … Actions and Azure DevOps with automated testing and deployment strategies Develop cloud infrastructure using Terraform/Bicep for AKS, container registries, application services, and networking Implement monitoring, alerting, and observability frameworks with SLOs and diagnostic tools Enforce security through access controls, secrets management, vulnerability scanning, and policy-as-code Manage environment consistency and optimise cloud costs through performance monitoring and … on Kubernetes and container orchestration knowledge Proficient in Infrastructure as Code with Terraform or Bicep Extensive CI/CD pipeline experience with GitHub Actions or Azure Pipelines Background in observability and telemetry solutions Understanding of cloud security, identity management, and compliance frameworks Networking fundamentals and familiarity with Azure databases and messaging services Scripting skills (Bash/PowerShell/Python) and More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Computappoint
AKS), CI/CD automation with GitHub and Terraform, and Linux administration to create robust deployment pipelines and cloud infrastructure that supports multiple product lines. You'll focus on observability solutions, including LogStash integration , while collaborating with cross-functional teams to streamline software delivery, enhance system reliability, and empower developers with automated, secure infrastructure solutions. Key Responsibilities: Build and optimise … Actions and Azure DevOps with automated testing and deployment strategies Develop cloud infrastructure using Terraform/Bicep for AKS, container registries, application services, and networking Implement monitoring, alerting, and observability frameworks with SLOs and diagnostic tools Enforce security through access controls, secrets management, vulnerability scanning, and policy-as-code Manage environment consistency and optimise cloud costs through performance monitoring and … on Kubernetes and container orchestration knowledge Proficient in Infrastructure as Code with Terraform or Bicep Extensive CI/CD pipeline experience with GitHub Actions or Azure Pipelines Background in observability and telemetry solutions Understanding of cloud security, identity management, and compliance frameworks Networking fundamentals and familiarity with Azure databases and messaging services Scripting skills (Bash/PowerShell/Python) and More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Computappoint
AKS), CI/CD automation with GitHub and Terraform, and Linux administration to create robust deployment pipelines and cloud infrastructure that supports multiple product lines. You'll focus on observability solutions, including LogStash integration , while collaborating with cross-functional teams to streamline software delivery, enhance system reliability, and empower developers with automated, secure infrastructure solutions. Key Responsibilities: Build and optimise … Actions and Azure DevOps with automated testing and deployment strategies Develop cloud infrastructure using Terraform/Bicep for AKS, container registries, application services, and networking Implement monitoring, alerting, and observability frameworks with SLOs and diagnostic tools Enforce security through access controls, secrets management, vulnerability scanning, and policy-as-code Manage environment consistency and optimise cloud costs through performance monitoring and … on Kubernetes and container orchestration knowledge Proficient in Infrastructure as Code with Terraform or Bicep Extensive CI/CD pipeline experience with GitHub Actions or Azure Pipelines Background in observability and telemetry solutions Understanding of cloud security, identity management, and compliance frameworks Networking fundamentals and familiarity with Azure databases and messaging services Scripting skills (Bash/PowerShell/Python) and More ❯
Building backend services and APIs that give secure, governed access to LLM capabilities Developing Python-based GenAI components including prompt orchestration and evaluation tooling Integrating LLMs with enterprise systems, observability layers, and security frameworks Designing and maintaining CI/CD pipelines using Azure DevOps Working closely with platform leads, architects, and SRE teams to ensure stable, scalable operations Supporting benchmarking … on exposure to GenAI technologies and Large Language Models Practical understanding of LLM evaluation, prompt handling, and operational complexities A DevOps-first approach with experience in CI/CD, observability, and automation (Azure DevOps preferred) Confidence working in regulated enterprise environments with tight security controls Experience integrating AI or ML services into real-world applications Knowledge of authentication, secret management … Exposure to RAG patterns and vector search technologies What Success Looks Like: Secure, reusable GenAI components running smoothly in production Faster engineering delivery through automation and DevOps maturity High observability and strong evaluation built in from day one Platform teams and application teams genuinely enjoying what you build Clean documentation and an onboarding flow that makes adoption effortless If you More ❯
Building backend services and APIs that give secure, governed access to LLM capabilities Developing Python-based GenAI components including prompt orchestration and evaluation tooling Integrating LLMs with enterprise systems, observability layers, and security frameworks Designing and maintaining CI/CD pipelines using Azure DevOps Working closely with platform leads, architects, and SRE teams to ensure stable, scalable operations Supporting benchmarking … on exposure to GenAI technologies and Large Language Models Practical understanding of LLM evaluation, prompt handling, and operational complexities A DevOps-first approach with experience in CI/CD, observability, and automation (Azure DevOps preferred) Confidence working in regulated enterprise environments with tight security controls Experience integrating AI or ML services into real-world applications Knowledge of authentication, secret management … Exposure to RAG patterns and vector search technologies What Success Looks Like: Secure, reusable GenAI components running smoothly in production Faster engineering delivery through automation and DevOps maturity High observability and strong evaluation built in from day one Platform teams and application teams genuinely enjoying what you build Clean documentation and an onboarding flow that makes adoption effortless If you More ❯
Building backend services and APIs that give secure, governed access to LLM capabilities Developing Python-based GenAI components, including prompt orchestration and evaluation tooling Integrating LLMs with enterprise systems, observability layers, and security frameworks Designing and maintaining CI/CD pipelines using Azure DevOps Working closely with platform leads, architects, and SRE teams to ensure stable, scalable operations Supporting benchmarking … on exposure to GenAI technologies and Large Language Models Practical understanding of LLM evaluation, prompt handling, and operational complexities A DevOps-first approach with experience in CI/CD, observability, and automation (Azure DevOps preferred) Confidence working in regulated enterprise environments with tight security controls Experience integrating AI or ML services into real-world applications Knowledge of authentication, secret management … Exposure to RAG patterns and vector search technologies What Success Looks Like: Secure, reusable GenAI components running smoothly in production Faster engineering delivery through automation and DevOps maturity High observability and strong evaluation built in from day one Platform teams and application teams genuinely enjoying what you build Clean documentation and an onboarding flow that makes adoption effortless Role Details More ❯
City of London, London, United Kingdom Hybrid/Remote Options
EMBS Technology
Building backend services and APIs that give secure, governed access to LLM capabilities Developing Python-based GenAI components, including prompt orchestration and evaluation tooling Integrating LLMs with enterprise systems, observability layers, and security frameworks Designing and maintaining CI/CD pipelines using Azure DevOps Working closely with platform leads, architects, and SRE teams to ensure stable, scalable operations Supporting benchmarking … on exposure to GenAI technologies and Large Language Models Practical understanding of LLM evaluation, prompt handling, and operational complexities A DevOps-first approach with experience in CI/CD, observability, and automation (Azure DevOps preferred) Confidence working in regulated enterprise environments with tight security controls Experience integrating AI or ML services into real-world applications Knowledge of authentication, secret management … Exposure to RAG patterns and vector search technologies What Success Looks Like: Secure, reusable GenAI components running smoothly in production Faster engineering delivery through automation and DevOps maturity High observability and strong evaluation built in from day one Platform teams and application teams genuinely enjoying what you build Clean documentation and an onboarding flow that makes adoption effortless Role Details More ❯
Proven DevOps experience in Azure environments. Strong knowledge of Azure architecture (App Services, Functions, Key Vault, etc.). Familiarity with Azure Monitor, Application Insights, Log Analytics, and KQL for observability and diagnostics. Experience with IaC, monitoring tools (App Insights, Log Analytics), and automation. Familiarity with .NET Core, React, or modern software delivery pipelines. Passion for reliability, continuous improvement, and secure … Terraform for Azure resource management. Practical knowledge of containerisation and orchestration (Docker, Azure Container Apps, or Kubernetes). Familiarity with Azure Monitor, Application Insights, Log Analytics, and KQL for observability and diagnostics. Strong focus on automation, continuous improvement, and service reliability. Understanding of DevSecOps principles and secure cloud deployment practices. If you have the relevant skills for this role and More ❯
using Terraform and integrate them into CI/CD pipelines. Drive continuous improvement in platform reliability, scalability, and cost optimisation. Collaborate with SRE, Security, and Engineering teams to strengthen observability, monitoring, and alerting using tools like Prometheus, Grafana, and CloudWatch. What You’ll Bring Proven experience designing and automating AWS infrastructure . Strong experience building IaC pipelines with Terraform and …/CD tools Deep understanding of Kubernetes operations on AWS , including scaling, deployment automation, and monitoring. Solid background in Linux administration , networking, and cloud security. Hands-on experience with observability stacks (Prometheus, Grafana, Loki). Knowledge of database reliability . Strong scripting skills. A collaborative approach with a passion for improving systems through automation and consistency. The role: Pay More ❯
Platform Engineer, you'll be creating, iterating, and solving genuine engineering challenges. You'll collaborate closely with product and engineering teams across the stack to improve developer experience, strengthen observability, and troubleshoot complex distributed systems. A focus on clean, maintainable code, cloud infrastructure, and strong security practices is essential. Senior Platform Engineer - Key Skills & Experience: Solid background in software engineering …/CD pipelines and modern deployment practices Familiarity with infrastructure-as-code tools such as Terraform Strong understanding of security best practices in application and infrastructure design Exposure to observability tools (e.g. Prometheus, Grafana, structured logging) Confident debugging and resolving issues in complex distributed systems Product-oriented mindset with a collaborative approach to improving developer experience Bonus: experience with Kafka More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Fruition Group
Platform Engineer, you’ll be creating, iterating, and solving genuine engineering challenges. You’ll collaborate closely with product and engineering teams across the stack to improve developer experience, strengthen observability, and troubleshoot complex distributed systems. A focus on clean, maintainable code, cloud infrastructure, and strong security practices is essential. Senior Platform Engineer – Key Skills & Experience: Solid background in software engineering …/CD pipelines and modern deployment practices Familiarity with infrastructure-as-code tools such as Terraform Strong understanding of security best practices in application and infrastructure design Exposure to observability tools (e.g. Prometheus, Grafana, structured logging) Confident debugging and resolving issues in complex distributed systems Product-oriented mindset with a collaborative approach to improving developer experience Bonus: experience with Kafka More ❯
Knutsford, Cheshire, England, United Kingdom Hybrid/Remote Options
Tenth Revolution Group
and driven Security Engineer to join our small, focused team building a telemetry pipeline MVP. You'll play a key role in designing and securing our containerized environments, ensuring observability tools and infrastructure are built with security at their core. This role blends deep technical expertise with a hands-on, collaborative approach ideal for someone who thrives in fast-moving … documentation and response playbooks What You Bring Hands-on experience with Kubernetes, OpenShift, and secure production systems Strong GitLab and CI/CD security expertise Familiarity with telemetry and observability stacks Solid grasp of networking, firewalls, and core security principles Knowledge of container security tools (Aqua, Twistlock, Trivy) Understanding of frameworks like NIST or ISO 27001 Excellent analytical and communication More ❯
using modern JavaScript frameworks or C# and Blazor Lead technical decision-making within your squad, balancing innovation with pragmatic delivery Drive best practices in code quality, testing, security, and observability Quality & Continuous Improvement Write clean, maintainable, well-tested code with comprehensive documentation Configure and maintain CI/CD pipelines with automated testing and deployment Skills Full stack software engineering experience … C# .NET for back-end services RESTful API design and implementation HTML5, CSS3, and responsive design principles Cloud platform experience with Azure, AWS, or GCP Production systems mindset including observability, testing, security, and reliability Agile delivery experience in fast-paced, iterative environments Strong collaboration and communication skills working effectively with technical and product stakeholders Architectural thinking - ability to design scalable More ❯
Belfast, City of Belfast, County Antrim, United Kingdom
InterCity Partners Ltd
using modern JavaScript frameworks or C# and Blazor Lead technical decision-making within your squad, balancing innovation with pragmatic delivery Drive best practices in code quality, testing, security, and observability Quality & Continuous Improvement Write clean, maintainable, well-tested code with comprehensive documentation Configure and maintain CI/CD pipelines with automated testing and deployment Skills Full stack software engineering experience … C# .NET for back-end services RESTful API design and implementation HTML5, CSS3, and responsive design principles Cloud platform experience with Azure, AWS, or GCP Production systems mindset including observability, testing, security, and reliability Agile delivery experience in fast-paced, iterative environments Strong collaboration and communication skills working effectively with technical and product stakeholders Architectural thinking - ability to design scalable More ❯
Nottingham, England, United Kingdom Hybrid/Remote Options
KDR Talent Solutions
importantly, drive the blameless post-mortem process to find the root cause and engineer a permanent fix. Partner with development teams to consult on new features, ensuring reliability and observability are designed in from day one. What You'll Need: Deep experience in the Microsoft Azure ecosystem (especially PaaS, App Services). Strong commercial experience with Infrastructure as Code (especially … proven background in an SRE, DevOps, or Software Engineering (with an operations focus) role. Solid scripting/programming skills for automation (e.g., PowerShell, Python, Bash). Expertise with modern observability tools (e.g., Datadog, Application Insights, Log Analytics, Grafana). A collaborative mindset with a strong sense of ownership and a passion for engineering reliability. What's In It For You More ❯
and data workloads, governing platform standards, documentation and best practices across multiple engineering teams. Your role will also encompass leading the definition and enforcement of infrastructure as code (IaC), observability and automation frameworks and integration of streaming and messaging solutions such as Kafka, SQS or SNS for data ingestion and event-driven processing. You'll collaborate with development, DevOps and … experience of designing and managing containerised environments You have experience with CI/CD pipeline design using GitHub Actions and a good working knowledge of Infrastructure-as-Code (Terraform), observability and monitoring tools You have a good knowledge of security architecture, identity management and network segmentation in cloud environments You have a good understanding of data flows, APIs and asynchronous More ❯
secure, compliant, and high-availability environments for our AI workloads. You’ll lead a small cross-functional team, manage deployment pipelines, automate processes, and enforce defence-grade reliability and observability standards. Key Responsibilities Lead and manage a small team (developers + DevOps engineers) delivering secure, scalable, and high-availability AI systems. Own AWS-based infrastructure : Lambda, EC2, S3, CloudFormation/… Python , JSON , and serverless architectures . Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4–8 years in software development and/or DevOps , including 2+ More ❯
years of commercial experience working primarily in an AWS Cloud environment using approaches/tooling like ours (see technical skills), delivering scalable, performant, reliable solutions. Strong data reliability/observability, data governance and information security credentials. What technical skills are required? ETL/ELT & Data Transformation: Amazon Redshift (query tuning, distribution/sort keys, workload management) Data modelling (normalisation, dimensional … data processing, automation, testing) SQL (advanced query writing and optimization) Data Engineering Best Practices: CI/CD for data pipelines (Git, GitHub Actions, etc.) Data quality checks, monitoring, and observability Infrastructure as Code (Terraform etc.) Other Tools & Ecosystem: Experience with logging/monitoring Exposure to data governance, cataloguing, and lineage tools Ability to work with a range of structured, semi More ❯