Job Title: DevOps Specialist & Data Engineer Location: Remote Type: Full-time Experience Level: Senior Industry: Generative AI/Artificial Intelligence/Machine Learning Reports To: Head of Engineering/CTO About Us Ready to join a cutting-edge AI company? We’re on a mission to become the OpenAI of the spicy content industry, building a full-spectrum ecosystem … images, with a roadmap that includes agentic superModels, API integrations, and video capabilities. Role Overview We are looking for a Senior DevOps Specialist with a strong Python and data engineering background to support our R&D and tech teams by designing, building, and maintaining robust infrastructure and data pipelines across AWS and GCP. You will be instrumental in ensuring … proactively resolve infrastructure issues. Manage CI/CD pipelines for rapid, reliable deployment of services and models. Ensure high availability, disaster recovery, and robust security practices across environments. Data Engineering Build and maintain data processing pipelines for model training, experimentation, and analytics. Work closely with machine learning engineers and researchers to understand data requirements and workflows. Design and implement …
Lead Site Reliability Engineer Central London (Hybrid) Up to £95k + Car Allowance & Bonus TRIA are working with a leading hospitality client to hire a Lead SRE; the client is investing heavily in the performance, stability, and reliability of its digital platforms. This is a hands-on leadership role - you won’t just guide others, you’ll be … uptime The stack includes Kubernetes, Terraform, AWS, Python, and modern CI/CD tools, and it's evolving. If you're confident in a crisis, understand what good SRE practice looks like, and want to leave systems in a better place than you found them, please apply to be considered and learn more! What you’ll bring: Experience in … high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front
is redefining how digital assets are secured and managed. As part of their expansion, they’re looking to bring on a Senior Production Engineer to lead the charge on reliability, resilience, and operational excellence within a complex, high-uptime platform environment. What You’ll Get A superb opportunity to join an institutionally backed, cutting-edge Crypto Fintech at the … salary and annual discretionary bonus. Pension contributions, in addition to Health Insurance, Life Assurance. 25 days’ annual leave. What You’ll Be Doing This is a hands-on and strategic engineering role where you’ll be responsible for ensuring production stability across a highly dynamic microservices architecture hosted in Azure. You’ll have end-to-end ownership over reliability … and monitoring across distributed systems. Collaborating with cross-functional teams to align platform strategy and reliability goals. What You’ll Bring: 5+ years in software engineering or SRE/production infrastructure roles. Strong experience with Java (Spring) and cloud platforms (ideally Azure). Proven track record in building and maintaining mission-critical systems. Deep understanding of Kubernetes, observability …
South East London, England, United Kingdom Hybrid / WFH Options
SiSU Health UK
Help build and maintain the infrastructure powering a leading digital health platform. SiSU Health is seeking an experienced DevOps Engineer to ensure the reliability, scalability, and security of our AWS infrastructure and IoT-connected health stations. This is a critical role supporting the operational excellence and future readiness of our infrastructure as we scale across the UK and internationally. … The DevOps Engineer maintains and optimises our cloud platform and health station infrastructure and ensures platform reliability, availability, scalability, recoverability, security and compliance to relevant standards. In this role, you'll: Own and optimise our AWS stack and CI/CD pipelines Maintain our Health Station platform, networks and infrastructure Improve observability, resilience, compliance, security and uptime Collaborate across … and ensure platform stability Be part of the team designing our cloud platform Work at the intersection of healthcare, infrastructure, and innovation Must haves 4+ years in DevOps/SRE Strong AWS experience and familiarity with Well Architected frameworks Familiarity with Information Security management, ISO 27001 and digital healthcare technology, compliance, standards and risk management Familiarity with Azure IoT and …
South East London, England, United Kingdom Hybrid / WFH Options
Client Server
WFH: You can work from home most of the time, meeting up with colleagues in the London office once a week. About you: You have experience in similar DevOps, SRE or Infrastructure engineering positions You have expertise with Kubernetes and Helm, having built mission critical systems in production You have strong IaC, Terraform experience You have strong CI/…
Site Reliability Engineer (SRE) – Market-leading company - Milton Keynes (Tech stack: .NET, C#, ASP.NET Core, SQL Server, PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes) Help shape the tech future of a UK market leader! Backed by a major financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. … We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You’ll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and resilience. 🛠️ Tech stack you’ll work with: .NET, C#, ASP.NET Core, SQL Server … PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes We want to hear from you if: ✅ As a Site Reliability Engineer (SRE) you've delivered scalable systems using .NET, C#, and ASP.NET Core, with real-world experience managing production workloads ✅ You’ve automated operations using PowerShell, Azure CLI, and Bash to reduce toil and boost efficiency
South East London, England, United Kingdom Hybrid / WFH Options
Unitary
SRE (Unitary AI) Description The company We are a rapidly growing startup developing solutions that blend human expertise and AI agents to handle manual customer and marketplace operations tasks. Our unique approach combines the strengths of human expertise (high accuracy and nuanced decision-making) with the advantages of AI automation (speed and cost efficiency). This cutting-edge technology helps … the beginning of our journey - and we are very excited about our plans for growth over the coming year and beyond! The role We are now looking for a Site Reliability Engineer to ensure our systems run smoothly and reliably at scale. Your expertise in monitoring, observability, and system automation will help maintain the high availability and performance … such as Terraform for scalable system deployment Are familiar with MLOps practices and tools, and monitoring machine learning systems in production This role will report to the VP of Engineering and can be based anywhere within a 3-hour time zone of the UK. Benefits About us The team Unitary is a remote-first team of c. 20 people
the real estate space. Backed by a major financial institution and with a brand-new, tech-committed CEO at the helm, this is a rare opportunity to lead platform reliability across a business that touches millions. This is not just a hands-on role, it’s a leadership opportunity at the centre of a £multi-million transformation programme. You … ll shape and grow a Site Reliability function from the ground up, beginning with owning the Azure-based App Platform and evolving it into a modern, scalable engineering hub for over 400 IT professionals and 100 software engineers. Our client defines this role as ‘site reliability engineering’ but is understanding and open to you … you do not have to have had a previous leadership/management position. You will, however, need the gravitas, hunger and ability to lead and grow an SRE team. What You’ll Do: Own the operational reliability of a large-scale Azure cloud platform. Drive an automation-first culture using Terraform, Azure CLI, PowerShell and more. Lead incident …
Lead Cloud Engineer As a Lead Cloud Engineer, you’ll have SME level knowledge across AWS and Platform Engineering disciplines. You’ll build AWS Cloud Solutions to ensure the organisation can take full advantage of Cloud based technologies. You’ll contribute to standards, guardrails and best practices, and implement improvements to processes and tooling to ensure engineering excellence. … You’ll have a strong understanding of operational requirements, and ensure Scalability, Resiliency, Observability, Security, Cost and Maintainability are at the forefront of all engineering activities. This specific project will involve the Real Time Payments value stream, Form 3 gateway set-up and setting up the infrastructure for connectivity. What you’ll bring SME Level knowledge in AWS and Platform … and secure code delivery (i.e. SCA, SAST, DAST) Networks/Security/Middleware & Apps Scripting/Coding (Bash, Python) End to End Observability solutions (logging, monitoring, alerting) Knowledge of SRE principles and practices
and tooling (e.g., Terraform) as data sources for observability. Solid programming ability in Golang (preferred) or Python for automation and integration. Strong collaboration skills to work with cross-functional engineering teams. Experience working in Linux-based environments. Bonus/Nice … to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems.
and optimizing cloud-modernization solutions that drive our business forward Extensive experience in modernizing Java-based monolith applications to a microservices-based architecture on Azure Extensive experience in DevSecOps and SRE Primary Responsibilities: Lead the design and implementation of microservices architecture on Azure. Architect and deploy scalable, reliable, and secure solutions using AKS. Design and manage PostgreSQL databases in a cloud … Conduct assessments of existing applications and recommend modernization strategies. Develop and maintain architectural documentation and guidelines. Ensure compliance with security and governance policies. Provide technical leadership and mentorship to engineering teams. Stay updated with the latest industry trends and technologies. Implement Infrastructure as Code (IaC) using tools like Terraform or ARM templates. Oversee DevOps practices and CI/CD …
engineering? Join a small, high-performing engineering team on a project at a top-tier quant fund, helping to build out observability capabilities. This is not an SRE or sysadmin role – it would suit a software engineer who cares about clean, testable code and good software practices, but prefers working in the infra/tooling space. What you … monitoring pipelines in Grafana and Prometheus Developing infrastructure-as-code tooling (Terraform, Ansible) Designing well-structured, testable software that improves system visibility What they’re looking for: Strong software engineering skills (Go or Python preferred) Experience working in … or alongside platform engineering teams Familiarity with modern observability tools (Grafana, Prometheus, etc.) Comfort working across both code and infrastructure – but this is not a pure ops/SRE role If you've worked in finance, that would be great but is not mandatory. The role offers an initial 6-month contract, Inside IR35, up to £750/day
Document configurations, workflows, and best practices for team and organizational use. Requirements: Must-Have: Bachelor’s degree in Computer Science, Engineering, or a related field. Experience in DevOps, SRE, or platform/infrastructure roles. Hands-on experience with Kong Mesh (or other service mesh solutions like Istio, Linkerd, Kuma). Solid understanding of Kubernetes, Envoy proxy, and container orchestration … . Familiarity with CI/CD pipelines, Git, and Infrastructure as Code tools like Terraform or Helm. Proficient in scripting languages (e.g., Bash, Python, or Go). Join our engineering team to build and scale service mesh architecture using Kong Mesh, Envoy, and Kubernetes. We're looking for someone with deep experience in service discovery, zero-trust networking …
Vacancy for Snr Site Reliability Engineer (SRE) at Preservica Abingdon/Remote, UK About You You have a proven track record in DevOps and software development, with a passion for creating reliable solutions to deploy software at scale and speed. You are eager to challenge the status quo, learn, and adopt new technologies. Excellent communication skills across all … Our team is small but growing, so self-motivation, organization, and the ability to multitask and prioritize are crucial. The Role Serve as a primary visionary for DevOps/Site Reliability Engineering across the entire technology organization. Eliminate process bottlenecks to enable frictionless, reliable, and high-velocity feature development through automation of Build, Test, Deploy, and Operate …
Description The Role We are looking for a Lead Engineer to join our growing engineering organisation developing a wide range of market-leading InsurTech solutions. You will be working in flexible agile squads delivering value on multiple greenfield workstreams in the delivery family to deliver core foundational functionality that will be used by multiple SaaS product offerings across the … key role in designing and creating new features and enhancing existing code whilst ensuring the multiple microservices that the team is responsible for continue to meet high levels of reliability, maintainability, usability, and performance. The Responsibilities •Work with the Lead System Architect to define the architecture the individual scrum teams will implement •Contribute to the design of SaaS software … Experience with software development ecosystem (IDEs, version control, test automation/CI, etc.). •Strong appreciation of building flexible cross-functional full-stack squads with shift-left DevOps, SRE and QA culture. Other highly desirable, but not essential skills are: •Appreciation of current and emerging technologies along with their potential benefits and limitations •General knowledge of the Insurance Industry
across on-premise environments, from large data centers to edge devices—is seeking a Senior Infrastructure Engineer with deep expertise in Ceph. This individual will enhance the scale, reliability, and performance of ruggedized Kubernetes offerings operating under complex and novel constraints. Ideal candidates are passionate about infrastructure at scale, adept in Ceph, and eager to contribute … Insight: Understanding of network architectures and experience with CNIs or cloud networking solutions. Data Center Experience: Hands-on experience managing on-premise hardware or serving as a sysadmin/Site Reliability Engineer in production environments. Minimum Requirements 4+ years of software development focused on infrastructure and operational excellence 2+ years of system design experience, particularly in scaling and … reliability 1+ year managing production-grade Ceph clusters Bachelor’s degree in Computer Science or equivalent experience
the team both as a hands-on developer, coach and mentor. Improving the engineering practices in the squad including code quality, testing and CI/CD Working with SRE engineers to build distributed systems with a focus on high availability, low latency and scalability (you build it, you run it) Senior member of esure’s engineering community, engaging …
for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-prem & cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Experience working in a fast-paced, early-stage environment. Someone who is currently hands-on (not someone …
and operational excellence, supporting a global client base of financial institutions. You’ll define and implement a modern operations roadmap—driving automation, CI/CD, managed services, and platform reliability—while enabling high-performing engineering and delivery teams. Key Responsibilities Lead DevOps and infrastructure strategy across cloud/on-prem environments Oversee CI/CD, automation, and platform … reliability Align operations with business goals, client delivery … and engineering standards Support ISO27001/SOC2 compliance and secure operational models Drive continuous improvement through KPIs and operational metrics Build and lead a multidisciplinary operations team (DevOps, SRE, Infra) Working predominantly with AWS Requirements Proven experience in Ops/Platform/DevOps leadership within tech or software Deep knowledge of DevOps tools, infrastructure-as-code, and cloud architecture
with SQL and Python Data Visualisation skills with Power BI; other automation and metrics knowledge is handy. Proficiency with tools like Jira, Confluence, Excel, and SharePoint Familiarity with Agile, DevOps, and Site Reliability Engineering Excellent communication and stakeholder management skills
Description The Role We are looking for a Lead Cloud Developer to join our growing engineering organisation developing a wide range of market-leading InsurTech solutions. You will be working in flexible agile squads delivering value on multiple greenfield workstreams in the delivery family to deliver core foundational functionality that will be used by multiple SaaS product offerings across … key role in designing and creating new features and enhancing existing code whilst ensuring the multiple microservices that the team is responsible for continue to meet high levels of reliability, maintainability, usability, and performance. Although experience in Angular is not mandatory, a willingness to upskill and play a full stack role in the team is also required. The Responsibilities … Experience with software development ecosystem (IDEs, version control, test automation/CI, etc.). •Strong appreciation of building flexible cross-functional full-stack squads with shift-left DevOps, SRE and QA culture. Other highly desirable, but not essential skills are: •Strong appreciation of DevOps principles, with the ability to create automated processes to continuously deliver SaaS products on a …
working with cloud technologies, specifically the Azure stack. Strong appreciation of DevOps principles, with the ability to create automated processes for continuous delivery of SaaS products. Experience with software engineering best practices: object-oriented design … patterns, SOLID principles. Experience with software development ecosystems (IDEs, version control, test automation/CI, etc.). Strong appreciation of building flexible, cross-functional squads with shift-left DevOps, SRE, and QA culture. Other highly desirable skills (not essential): Appreciation of current and emerging technologies and their benefits/limitations. General knowledge of the Insurance Industry. An appreciation of UX …