Senior Site Reliability Engineer, Platform Engineering
See yourself at Dataminr As a Senior Site Reliability Engineer, you will ensure the high-quality delivery of our software by building and maintaining tools used by software engineers and data scientists to deploy, and monitor their code. In this role, you will be a champion for best practices and a quality mentor to the rest of the engineering organization. This is a remote role based in the UK or Ireland. AI Innovation at Dataminr Working at Dataminr you’ll have the opportunity to tackle the most exciting trends in AI on a daily basis to power a revolutionary product that uncovers critical events around the world as they unfold. : our AI technology, ReGenAI, is a new form of generative AI that automatically regenerates real-time Live Event Briefs as events unfold. Learn more here . we recently launched our Agentic AI capability, what we’re calling our Intel Agents, that autonomously generates critical context for our clients on real-time events, threats, and risks allowing them to see the clearest, most accurate view of what’s happening on the ground. Learn more here Multimodal AI: our platform detects events from many different types of data (images, video, sensor data, audio, and text in over 150 languages). Learn more here . The opportunity
- Work on our self service internal developer platform used by engineering teams to deploy containers, serverless functions and cloud resources
- Maintain and improve our observability stack
- Drive improvements in security, reliability, cost efficiency and performance
- Troubleshoot large-scale distributed systems
- Work closely with product engineering teams to enable efficient project delivery
- Support our production environment as part of an on call rota, help with triage and resolution when issues arise
- Experience managing Kubernetes clusters at scale (CKA a bonus)
- Maintaining and hardening AWS infrastructure using Terraform
- Development skills in Python or Go
- Linux systems administration and TCP/IP networking
- Experience maintaining observability tooling e.g. LGTM stack, OpenSearch