Distributed Systems Engineer - Data Platform - Logs and Audit Logs Hybrid About Us At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare … their time at Cloudflare. Come join us! Locations Available London (UK), Lisbon (Portugal), Austin (US), Denver (US), Atlanta (US) About Role We are actively seeking experienced and highly motivated Distributed Systems Engineers to join Cloudflare's dynamic DATA Organisation. This is a pivotal opportunity to contribute to the future of data at Cloudflare, working on systems that … global operations and customer insights. Our organisation is responsible for the entire data lifecycle, encompassing everything from initial ingestion and sophisticated processing to robust storage and efficient retrieval. These systems are the backbone that powers critical logs and analytics, providing our customers with real-time, actionable visibility into the health, performance, and security of their online properties. Our overarching …
to help design and evolve internal compute frameworks that underpin their trading and research platforms. This is a high-impact role, working at the intersection of software engineering and distributed systems, with the opportunity to build scalable tools and frameworks used across the business. What you'll do: Design, develop, and maintain performant, reliable frameworks and services in Python … Build productivity tools and platforms that streamline workflows across investment and technology teams Contribute to code reviews and best practices, improving overall development quality Work across distributed systems, containers, and automation pipelines to deliver scalable solutions What we're looking for: 6+ years' professional software development experience Strong proficiency in high-performance Python (deep ecosystem knowledge and best practices … Experience with at least one JVM language (Java, Kotlin, Scala) is an advantage Background in distributed systems and large-scale compute frameworks Familiarity with Docker, Kubernetes, Linux environments, and CI/CD automation This role offers the chance to work on complex technical challenges at scale, with direct impact on high-performance computing platforms. If you're a skilled …
Employment Type: Permanent
Salary: £170000 - £200000/annum plus Bonus & Package
Guildford, Surrey, England, United Kingdom Hybrid/Remote Options
Jonothan Bosworth
Senior Python/C++ Distributed Systems Engineer Location: Hybrid Salary: £60,000 – £70,000 Type: Permanent About the Role: Our employer-partner is looking for a Senior Python/C++ Engineer with strong experience in distributed systems, VoIP, audio/video processing, and cloud-native architectures. This role suits someone who enjoys building performance-critical tools … working across the full stack, and contributing to scalable real-time communication products. Key Responsibilities: Develop backend services and distributed components using Python and C++. Work with media processing technologies (FFMPEG, audio mixing, streaming pipelines). Engineer VoIP, DECT, and SIP-based communication software, including real-time call handling. Build and maintain REST APIs (Flask, FastAPI, Django) and … control-plane systems. Desirable: FFMPEG, PyQt, NumPy, SQLAlchemy experience. Understanding of secure communications (SSL/TLS, JWT). Passion for scalable, highly available architecture (real-time or mission-critical systems). Why Apply? Join a highly technical engineering group solving complex real-time communication challenges. Opportunity to work on both cloud and on-premise distributed systems. A role …
St. Albans, Hertfordshire, England, United Kingdom
Method Resourcing
business for nearly 30 years, and after a very successful period, they have acquired several competitors. After a period of consolidation, they are now looking to unify all their systems into a single source of truth. They are also looking to scale massively over the next 5 years as they enter new markets, and as a result, are looking … several million events per day, before scaling up to 100+ million events per day. You'll work at the top of the engineering track, designing and delivering high-performance, distributed systems while guiding others through implementation and problem-solving. This is a deep technical role, ideal for someone who thrives on code, architecture, and tangible impact. What you … ll do Design and build scalable, distributed systems that support critical environments. Lead technical decision-making and resolve engineering challenges across domains. Own the delivery of complex features, ensuring performance, resilience, and maintainability. Collaborate with Engineering Leads, Architects, and Product to translate roadmap goals into reality. Contribute to a 5-year architectural refresh, evolving systems to event …
tools like Apache Spark, Databricks, Apache Pulsar, Apache Airflow, Temporal, and Apache Flink, sharing knowledge and suggesting improvements. Documentation: Contribute to clear and concise documentation for software, processes, and systems to ensure team alignment and knowledge sharing. Your Qualifications Experience: Professional experience in Python development or related software engineering roles. Python Proficiency: Strong knowledge of Python, including experience with … web frameworks like Django, Flask, or FastAPI. Database Management: Solid experience with relational databases like PostgreSQL or MySQL and familiarity with NoSQL databases like Redis. Distributed Systems: A basic understanding of distributed systems and microservices architecture, as well as cloud-based infrastructure (AWS, GCP, Azure), is a plus. Version Control: Experience with Git and familiarity with … like AWS, GCP, or Azure. DevOps Tools: Familiarity with containerization (Docker) and infrastructure automation tools like Terraform or Ansible. Real-time Data Streaming: Experience with Apache Pulsar or similar systems for real-time messaging and stream processing is a plus. Data Engineering: Experience with Apache Spark, Databricks, or similar big data platforms for processing large datasets, building data pipelines …
software that empowers businesses to reach new heights. The Opportunity As a DevOps Engineer at Bright, you'll be enabling our engineering teams to build, deploy, and operate production systems at scale. You'll support the infrastructure and deployment pipelines for our API platform and product development teams, ensuring they can ship fast, safely, and reliably to tens of … of customers across the UK and Ireland. Working across our Azure-based Kubernetes infrastructure, you'll build and maintain the CI/CD pipelines, infrastructure as code, and observability systems that enable small, autonomous squads to deploy with confidence. You'll be instrumental in creating the foundations that allow our teams to move at startup pace while maintaining production … scanning, and quality gates Create deployment strategies (blue/green, canary, rolling updates) Support teams with deployment tooling and best practices Observability & Reliability Implement comprehensive monitoring, logging, and alerting systems Build dashboards and metrics for system health and performance Design and implement incident response procedures Conduct post-mortems and drive continuous improvement Optimize system performance and resource utilization Security …
Oxford District, South East England, United Kingdom
Recursion
full model lifecycle from development through deployment Design and implement scalable LLM serving infrastructure to support high-throughput inference and real-time model serving capabilities Architect infrastructure for agentic systems, including multi-agent coordination, workflow orchestration, and autonomous decision-making pipelines Partner with ML teams actively developing models to onboard them to platform capabilities Continuously optimize the platform to … support our emerging machine learning research capabilities and evolving use case needs Build and maintain orchestration frameworks for complex ML workflows, including agent-based systems and multi-modal model pipelines Inspire your team and stakeholders alike to find the best outcome by facilitating constructive dialogue, reconciling perspectives into a unified view Create tools and experiences that support the development … organization by offering low barrier-to-entry solutions for building intelligent, self-managing machine learning pipelines from development through deployment. In a rapidly evolving landscape of MLOps tooling, agentic systems, and large language models, we aim to foster a culture of continuous learning and growth to stay informed of emerging technologies, and prudently incorporate them into the platform as …
backend services and microservices using Java. Collaborate broadly: Work with cross-functional teams to deliver scalable telecom solutions. Optimise performance: Integrate databases, APIs, and ensure efficiency across systems. Harden systems: Implement redundancy, security, and performance tuning for telecom operations. Problem-solve: Troubleshoot challenges in distributed systems and live client environments. Contribute to Agile: Participate in code reviews … ensure best practices across the lifecycle. What we’re looking for 4+ years’ hands-on backend Java development experience. Strong expertise in RESTful APIs and microservice architectures. Background in distributed systems, OOP, and networking fundamentals. Cloud environment experience (AWS, GCP, etc.), plus Docker/Kubernetes and CI/CD. Proven track record optimising apps for performance, memory, and … scalability. SQL/NoSQL database experience, including deployment and integration. Knowledge of messaging systems (Kafka, RabbitMQ, Pub/Sub). Excellent communication and analytical skills. Nice to have: Telecom-specific protocols (SMPP, SIP), OSS/BSS integrations, or network APIs. Event-driven systems, CQRS, or high-redundancy architectures. Security scanning, testing, Git, and Agile/Scrum experience. Interest …
role will be hands-on as well as leading the team to shape the future backend systems. As a Staff Software Engineer, you'll lead backend architecture, scaling secure distributed systems for a growing customer base. Set engineering standards, mentor engineers, and collaborate across teams to deliver scalable features. Tackle challenges in performance, fault tolerance, and data-heavy … workloads while influencing product strategy. Requirements: Strong foundations in algorithms, data structures, and distributed systems Experience building and operating large-scale backend systems Expertise in system and API design, scalability, and performance tuning Proficiency in a modern backend language (Java preferred) Knowledge of cloud-native architectures, containers, and CI/CD Proven leadership in technical strategy and mentoring …
Systems Software Engineer (Python) Are you a top-tier Software Engineer with a passion for high-performance systems and a knack for solving complex infrastructure challenges? We are seeking an elite developer to join a rapidly growing, highly respected HFT company that operates at the cutting edge of its industry. This isn't a typical infrastructure support role. … who thrive on tackling difficult OS-level problems. The role You will be a core member of an infrastructure engineering team, focusing on developing, maintaining, and scaling the critical systems that power operations. Your primary tool will be Python, used to build robust, high-availability solutions that support their technology ecosystem. What We're Looking For: We are specifically … targeting engineers who possess a world-class blend of software development and systems knowledge: Python is essential, including building production-grade, highly reliable systems. A comprehensive understanding of the Linux operating system, including its internals, performance tuning, and low-level functionalities. Strong knowledge of networking concepts (TCP/IP, routing, load balancing, etc.) and experience troubleshooting and optimising network …
East London, London, United Kingdom Hybrid/Remote Options
Client Server
quality, high-impact releases. You'll be working with a modern, cloud native tech stack using Java, Spring Boot, AWS, Kafka and CI/CD to build highly scalable, distributed systems with 24/7 availability. Location/WFH: There's a hybrid model with two days a week work from home, when you are in the office … range of facilities including roof terrace, restaurant and break out areas. About you: You have strong technical and people leadership skills You have advanced experience of building cloud-native, distributed systems using Java and Spring Boot You have a strong knowledge of AWS including Amazon EKS You have experience with Kafka and event driven architectures You're collaborative …
Worcester, Worcestershire, West Midlands, United Kingdom
University of Worcester
located around 40 minutes south of Birmingham, the UK's second city. The Department of Computing undertakes research across a range of themes, including Machine Learning, Data Science, Intelligent Interactive Systems, Computing in Education, and Cyber Security. It is also home to the Digital Innovation and Intelligent Systems Research Group. Depending on qualifications and experience, this role may be … expected to have contemporary subject knowledge and/or experience of professional practice in several of the following areas: - Computer Science - Mathematics for Computing - Data Structures and Algorithms - Operating Systems - Distributed Systems - Object Oriented Programming - Robotics - Internet of Things - Machine Learning - Artificial Intelligence - Natural Language Processing We would especially welcome applicants who have experience with physical computing …
research leader, where you'll architect and optimise the platforms that deliver large-scale language models to production. You'll be working on some of the hardest challenges in distributed AI systems: building ultra-reliable, ultra-scalable environments for inference and deployment. What you'll be doing Designing cloud-native architectures to run large language models on serverless … frameworks (e.g. Kubernetes, Knative, or custom-built FaaS). Developing approaches to minimise cold-start latency through advanced container snapshotting, weight pre-loading, and graph partitioning. Building distributed inference pipelines with tensor parallelism, model sharding, and efficient memory scheduling to serve LLMs at scale. Experimenting with quantisation, pruning, and KV-cache management to squeeze maximum throughput from GPU … accelerator clusters. Working closely with applied researchers to turn state-of-the-art methods into robust, production-grade systems. What you'll bring Deep understanding of large-scale ML systems engineering, with direct experience in deploying or optimising LLMs. Hands-on expertise in C Rust/Go for systems programming, plus Python for model integration. Strong knowledge of …
Swindon, Wiltshire, South West England, United Kingdom Hybrid/Remote Options
Humana
solutions. We foster a culture of innovation, agility, and ownership—empowering engineers to solve complex problems, drive strategic initiatives, and shape the future of how we monitor and operate distributed systems. If you're passionate about building tools engineers love, solving unscoped challenges, and leading with technical excellence—this is the team for you. About the Role We're … for a Lead Software Engineer with deep expertise in logging and observability engineering. You should be fluent in the principles of open telemetry, log ingestion, and event correlation across distributed systems. While familiarity with platforms like Splunk or Dynatrace is a plus, we value platform-agnostic thinking and the ability to design resilient, scalable logging solutions using the best … you will drive the design, implementation, and evolution of our observability and logging platforms. You'll lead enterprise-scale initiatives, mentor engineers, and collaborate across disciplines to ensure our systems are reliable, scalable, and performant, applying deep technical expertise to solve real-world challenges. You'll work closely with SRE, Security, Networking, Platform Engineering, and Data Science teams to …
Liverpool, Merseyside, England, United Kingdom Hybrid/Remote Options
Acorn Insurance
a hybrid working basis Salary: £65,000 - £75,000 depending on experience We're looking for a passionate Senior Backend Developer who thrives on solving complex problems and building systems that scale. You'll be at the forefront of creating next-generation backend services that genuinely make a difference for our customers and business. In this role, you'll … that values openness and collaboration. Bonus: Clean Architecture Understanding Clean Architecture and Mediator Pattern is a huge plus! If you have it, you'll help us build maintainable, testable systems that follow industry best practices. Our Tech Stack: Backend: C#, .NET 8.0, ASP.NET Core Database: SQL Server, Entity Framework Infrastructure: Docker, Azure Tools: Unit and Integration Testing, Git, Agile … building robust, scalable systems. Understanding Clean Architecture and Mediator Pattern is desirable but not mandatory. Bonus Points For: Understanding Clean Architecture, Mediator Pattern Solid grasp of asynchronous communication in distributed systems Experience with high-throughput, data-intensive systems Contributed towards systems utilising Event-Driven Architecture Why Join Us? Modern tech stack and a strong culture of …
Requirements You are a builder and innovator who is passionate about creating sophisticated AI systems that bring synthetic populations to life. You are someone who can translate complex behavioral requirements into robust technical architectures, building the systems that allow thousands of AI agents to interact, remember, and behave consistently. You have hands-on experience applying large language models … to solve complex, open-ended problems. You excel at designing scalable agent architectures, implementing advanced LLM techniques, and building production-ready systems that can handle the computational demands of large-scale agent simulations. MS in Computer Science, Machine Learning, AI, or related field, or equivalent practical experience. At least 2 years of industry experience in a commercial, non-research … environment working with AI/ML systems. Proven track record of building and deploying LLM-powered applications in production. Deep experience applying large language models to complex, real-world problems. Strong proficiency in Python and ML frameworks (PyTorch, JAX, TensorFlow, Hugging Face Transformers). Hands-on experience with advanced LLM techniques: RAG systems, chain-of-thought prompting, agentic tool …
City of London, London, United Kingdom Hybrid/Remote Options
83zero Limited
looking for a DevOps Lead to join their journey. This is a hands-on leadership role where you'll own the technical direction of a serverless, event-driven, globally distributed platform built on AWS. You'll lead a distributed engineering team, shape high-performance APIs, and build systems that scale hard and run clean. What's On … you'll: Architect and optimise a high-scale, cloud-native PaaS. Champion CI/CD, automation, observability, and reliability. Design and maintain secure, performant public APIs. Build real-time, distributed systems on AWS (Lambda, DynamoDB, Kinesis, ECS/EKS). Mentor a cross-functional team and elevate engineering standards. Prototype and ship features that drive commercial impact - including … console or dashboard backend. Deep expertise in serverless/event-driven architecture on AWS. Mastery of CI/CD, Agile delivery, and containerisation (Docker, Kubernetes). Strong leadership of distributed engineering teams. Desirable: Experience in social media, advertising, eCommerce, or news. Proficiency in Node.js, Go, Python, Java, and modern front-end frameworks. AWS certifications, microservices/service mesh, and …
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid/Remote Options
Cathcart Technology
Software Engineering Manager required to lead a team of Java engineers in Edinburgh, shaping the development of next-generation, large-scale systems that solve complex technical challenges in a collaborative environment. The Opportunity This is a chance to lead a team of talented engineers building sophisticated, large-scale backend systems that deliver real-time, mission-critical functionality to … challenges in a regulated, high-availability environment, driving innovation while ensuring reliability, quality, and customer trust. You'll lead an agile team who will be working on Java-based distributed systems and cloud platforms, whilst guiding delivery across the full software development lifecycle. You'll empower your team to take ownership, innovate, and deliver high-impact features with … of software innovation. They have built a deeply technical, forward-thinking engineering culture where collaboration, experimentation, and operational excellence are core capabilities. Teams work with large-scale datasets, complex systems, and distributed architectures to deliver scalable, resilient, and high-quality software, supported by cutting-edge infrastructure and cloud technologies. Why this role? Lead a talented team of engineers …
Aker Systems was founded in 2017 by a team of experienced technology professionals who recognised an opportunity to provide highly secure enterprise data platforms to large organisations. We build and operate ground-breaking, ultra-secure, high performance, cloud-based data infrastructure for the enterprise. Our proprietary technology solutions drive performance and reduce costs while helping our clients to improve … the management and sharing of data across their organisations. In 2024, Aker Systems won the Breakthrough Culture Awards highlighting growth companies putting culture first. In 2020 Aker Systems was recognised as a One to Watch on the Sunday Times Tech Track. The Company was also recognised at the Thames Valley Tech Awards 2020; winning the Thames Valley Tech … role. If you don't hold SC clearance, we will support you to apply assuming you have lived and worked in the UK for a minimum of 5 years. Aker Systems deliver secure cloud hosted solutions into the public and private sectors. We are rapidly expanding and are looking to hire a Principal Data Architect to support the delivery of …