maintaining large-scale datasets and complex data pipelines in cloud environments. Proven capability in data architecture design, including the development and optimisation of end-to-end data pipelines for performance, reliability, and scalability. Expert-level knowledge of Databricks, including hands-on implementation, cluster management, performancetuning, and (ideally) relevant Databricks certifications. Hands-on experience with SQL and More ❯
Cheltenham, Gloucestershire, England, United Kingdom
Oscar Technology
the role, an active UK*C DV Clearance is required for eligibility. Your day-to-day will consist of: Gathering and analysing statistics from operating systems and applications regarding performancetuning and error searching. Troubleshooting and providing solutions for technical issues across the stack. Be a part of system design consultation, platform management and capacity planning. Using well More ❯
Cheltenham, Gloucestershire, UK Hybrid/Remote Options
TechNET IT Recruitment Ltd
large-scale production environments (5,000+ hosts) built on technologies such as Kafka, Redis, Kubernetes and MySQL. This is a hands-on, systems-level position focused on reliability, scalability, performance and troubleshooting. You'll work alongside experienced engineers, operating with a high degree of autonomy to keep critical systems healthy, resilient, and observable. Key Responsibilities Manage, configure and maintain … systems including Kafka, Redis, Kubernetes and MySQL Support day-to-day operations in data centre/large-scale infrastructure environments (5,000+ hosts) Contribute to system reliability, scalability and performance improvements across the platform Participate in an on-call rotation (one week every 4–5 weeks) to ensure 24x7 availability of critical systems Collaborate with internal teams to improve … Skills 5+ years' experience in Linux system administration, SRE, Infrastructure or Platform Engineering roles Proven experience operating large-scale infrastructure (thousands of hosts/distributed systems) Strong troubleshooting and performancetuning skills at the infrastructure and OS level Solid understanding of MySQL operations, including replication concepts Hands-on experience with Kafka and/or other distributed messaging systems More ❯