System Specialist (Storage)
Job Summary
Squarepoint is looking for a Platform Storage Specialist to join our growing global team. The candidate will work alongside our team to design, build, and maintain enterprise-grade storage services that are continuously scaling to support the growth of the company. This role requires in-depth knowledge of distributed storage systems, storage architectures, and storage protocols, including performance optimization, capacity planning, and data lifecycle management. The candidate will ideally have experience with on-premises storage infrastructure, network-attached storage, parallel file systems, object storage, and cloud-based storage platforms. Your day-to-day could consist of storage infrastructure design and deployment, performance tuning and troubleshooting, monitoring and alerting improvements, capacity management, automation and tool development, and much more.
Main Duties & Responsibilities
- Design, deploy, and manage distributed storage systems efficiently at scale through standardization, automation, testing, and in-depth monitoring
- Architect and maintain high-performance storage solutions including NFS, GPFS, WEKA, and object storage platforms
- Perform storage capacity planning, forecasting, and lifecycle management to ensure optimal utilization and cost efficiency
- Monitor storage performance metrics, identify bottlenecks, and implement tuning and optimization strategies
- Develop and maintain automation tooling for storage provisioning, configuration, and health monitoring
- Collaborate with compute, networking, and application teams to ensure storage solutions meet performance and reliability requirements
- Implement and improve storage-related observability, alerting, and incident response processes
- Evaluate and integrate new storage technologies, including cloud-based storage services (AWS S3, EBS, EFS, GCP Cloud Storage, Filestore, etc.)
- Ensure data integrity, availability, and disaster recovery readiness across storage platforms
- Document storage architecture, runbooks, and operational procedures
Required Qualifications
- 5+ years of experience in a technology infrastructure role with a strong focus on storage administration and Linux administration
- Deep expertise with distributed storage systems and storage protocols (NFS, GPFS, WEKA, Lustre, Ceph, or similar)
- Experience with storage performance analysis, benchmarking, and optimization
- Experience working with configuration management driven administration using tools such as Chef or Ansible
- Proficient in Python, or other high-level language (Go, Rust, etc.) for automation and tooling development
- Experience with modern software development practices: version control, agile development, CI/CD
- Experience with observability in distributed systems (e.g., Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana)
- Experience working with cloud storage services across various cloud providers (AWS and GCP)
- Bachelor's Degree in an Engineering discipline or related area
- Excellent written and verbal communication skills
Preferred Qualifications
- Experience with object storage systems (Cloudian, MinIO, S3-compatible platforms)
- Familiarity with storage hardware (disk arrays, JBODs, NVMe/SSD technologies, HBAs, and storage networking)
- Experience with data protection strategies including backup, replication, and disaster recovery
- Understanding of storage tiering and data lifecycle policies
- Knowledge of high-performance computing (HPC) storage workloads and patterns
- Eagerness to continuously learn new storage technologies and develop with us
- Ability to work well in a fast-paced, adaptable, and pressured environment