Junior Data Infrastructure Engineer
At Cision, we believe in empowering every individual to make an impact. Here, your voice is heard, your ideas are valued, and your unique perspective fuels our collective success. As part of our global team, you'll thrive in an environment that champions curiosity, collaboration, and innovation, all while making meaningful contributions to the brands we accelerate. Join us in shaping the future of communication and building authentic connections that matter. Whether you're solving complex problems or driving bold innovations, your growth is our success, and together, we’ll create the conversations of tomorrow. Empower your impact at Cision. Be seen, be understood, be you. As part of the Data Infrastructure team you will be supporting mission critical big data platforms, to ensure they are fully performant, reliable, available and secure. We call it data infrastructure engineering, also known as DataOps or Database Administration or SRE. The role is a mixture of development of tooling and operational support for our platforms, two aspects that go hand-in-hand. It requires attention to detail and curiousity about how the systems work under the hood, and gives you a wide base of skills from low level system tuning to general coding. We Manage Four Main Storage Platforms, Namely
- Apache Solr (~2.2 PB)
- Apache HBase (~450 TB)
- PostgreSQL (~15 TB)
- Kafka (~60 TB)
- Coding mapreduce jobs to validate data between two multi-TB+ hbase clusters (to run in hours rather than months)
- Building out a python code stack to transparently migrate production clients between clusters (without losing any data or requiring any downtime)
- Building and testing new HBase builds and deploying them live (without putting everyone in Brandwatch out of a job)
- Creating a platform for rolling out our data management systems using Apache Airflow (to stop people re-writing custom job management systems)
- Rolling out a DR solution for hadoop on AWS (while keeping the system performant and not bankrupting the company)
- An interest in how computer infrastructure actually works, and a passion for learning
- Interest, and ideally production experience, running storage systems, eg. as part of a selfhosted service, a home lab or as part of academic studies
- Experience with Linux systems administration, including experience of trouble shooting
- Fluency with one or more scripting languages, ideally Bash or Python
- Experience helping your peers
- Pride in the quality of your work
- Experience with any of the following will be beneficial:
- Development experience, in Python, Java, Rust, C/C++ or Golang.
- Upstream open-source contributions
- Academic research in scalability, distributed systems or storage infrastructure
- Kubernetes experience, ideally running or writing Operators Experience with Docker and with CI/CD pipelines