Site Reliability Engineer
Staff Site Reliability Engineer – Data & Trading Platforms
London, Barcelona or Madrid | Full-time | Hybrid
The mission
Join a fast‐growing B2B data platform used by commodity traders to replace spreadsheets and fragmented tools with a single real‐time view of the market. The product ingests live data streams, powers complex analytics and visualisations, and is becoming the operating system for trading teams globally.
The role
You'll be the reliability specialist embedded in a product engineering team rather than sitting in a separate ops silo, owning how distributed systems are designed, run and scaled. Expect real‐time data, high‐throughput pipelines and a strong "you build it, you run it" culture, with plenty of scope to shape infrastructure direction and mentor others.
What you'll do
- Lead the design and evolution of reliable, scalable and observable infrastructure for real‐time and analytical data systems.
- Embed SRE practices (SLIs/SLOs, error budgets, operational readiness) across product teams.
- Optimise performance, availability and cost across cloud environments while maintaining strong security and resilience.
- Build and enhance observability stacks for metrics, logging, tracing and alerting, and guide incident response.
- Mentor engineers in distributed systems, reliability engineering and infrastructure design, acting as a technical reference point.
What you bring
- 7+ years in software engineering, SRE, DevOps or infrastructure roles, with a solid software engineering foundation.
- Experience operating cloud‐hosted systems at scale, plus strong skills in automation, CI/CD and cloud engineering.
- Strong understanding of distributed systems, performance and failure modes, and experience with observability tooling.
- Comfortable with several languages (e.g. Python, Java, Kotlin, TypeScript) and technologies such as Kafka, Flink, Redis and clustered Postgres.
- Familiarity with networking concepts (routing, load balancing, service‐to‐service communication). Bonus points for experience leading teams and managing Kubernetes‐based infrastructure.
Why you might care
- Work on a mission‐critical platform where reliability directly affects how traders see and act on live markets.
- High‐ownership, low‐bureaucracy environment backed by fresh Series B funding, with the chance to influence both architecture and culture as the company scales.