Senior Infrastructure Engineer
Senior Infrastructure Engineer
If you’re passionate about infrastructure at scale, have deep expertise in Ceph, and enjoy contributing to open-source communities, this is an opportunity to make a significant impact.
We’re looking for a Senior Infrastructure Engineer to design, automate, and operate large-scale storage systems using Ceph in modern Kubernetes environments. You’ll work across diverse hardware deployments and play a hands-on role in shaping tools and infrastructure used across the broader Ceph and CNCF ecosystems.
Key Responsibilities
- Manage Ceph at Scale: Design, deploy, and maintain Ceph clusters across heterogeneous on-prem hardware, ensuring high availability, resilience, and performance.
- Automate Infrastructure: Create and maintain automation strategies and tooling to orchestrate hundreds of Ceph deployments using best-in-class platforms and infrastructure-as-code practices.
- Contribute Upstream: Drive adoption of cutting-edge tooling and contribute upstream to Ceph, Rook, and CNCF projects as needed.
- Engage in Open Source: Collaborate with open-source communities, including Ceph developers and CNCF maintainers, to exchange knowledge and shape the future of infrastructure tooling.
- Architect for the Future: Help build scalable, stable, and secure infrastructure that powers mission-critical systems across a wide range of workloads.
What You Bring
- Deep Ceph & Rook Knowledge: Hands-on experience managing and operating Ceph clusters, ideally via Rook in Kubernetes-based environments.
- Automation & Tooling: Strong proficiency with infrastructure automation (e.g., Terraform, Kubernetes Operators), and scripting or coding in Go, Java, or similar languages.
- Systems Programming: Proficient in low-level systems programming in Go, Rust, C/C++, or equivalent.
- Hardware & OS Proficiency: Comfortable navigating hardware diagnostics, system tuning, and Linux internals in production environments.
- Networking Fundamentals: Solid understanding of networking, with bonus points for experience in CNIs or cloud/hybrid networking.
- On-Premise Experience: Familiarity with running infrastructure in physical data centers, including sysadmin or SRE responsibilities for bare metal.
Requirements
- 4+ years of experience in software development, with a focus on infrastructure or SRE roles.
- 2+ years of system design or architectural experience building reliable, scalable infrastructure.
- 1+ year of production responsibility for Ceph clusters, including operations and maintenance.
- Bachelor’s degree in Computer Science or equivalent practical experience.
- Company
- Andiamo
- Location
- City of London, Greater London, UK
- Posted
- Company
- Andiamo
- Location
- City of London, Greater London, UK
- Posted