Strategic Relationship Manager
Site Reliability Engineer (SRE) - Market Risk Platform
London (5 days onsite) | Contract | Banking/Finance/Trading
£450 day
Overview
We are hiring experienced Site Reliability Engineers (SREs) to support a Market Risk platform within a leading financial services environment.
This is an engineering-led transformation role, focused on automation, reliability, and AI-driven operational improvement rather than BAU support.
Success is measured by:
- Reduced operational toil
- Faster recovery (MTTR reduction)
- Safer, faster change delivery
- Increased automation and self-service
- Improved platform reliability
Key Responsibilities
Automation Engineering (Core)
- Build production-grade Python automation for operational workflows
- Automate environment checks, dependency validation, reruns, restarts, and drift remediation
- Deliver self-service tools with proper audit, rollback, and safety controls (idempotency, dry-run, approvals)
Process Re-engineering (Core)
- Redesign incident, change, release, and recovery processes
- Convert runbooks into automated workflows
- Remove manual handoffs and operational friction
- Define KPIs: toil, MTTR, alert volume, change failure rate
Agentic AI (Core)
- Build agentic workflows for diagnostics, remediation, and orchestration
- Implement guardrails, human-in-the-loop controls, and evaluation frameworks
- Productionise AI automation with monitoring and feedback loops
Observability
- Improve monitoring, logging, and system visibility to enable automation at scale
Required Skills
- 8+ years SRE/production engineering experience
- Strong Python (automation/tooling focus)
- Experience with distributed systems in production environments
- Strong Linux troubleshooting (app/system/network layers)
- Hybrid infrastructure exposure (on-prem + cloud)
- Kubernetes experience (ops/monitoring/reruns)
- Strong background in automation and process optimisation
- Athena ecosystems
Agentic AI (Essential)
- Proven experience with agentic AI or intelligent automation systems
- Tool integration, guardrails, evaluation, and measurable production impact (toil/MTTR reduction)
Desirable
- Banking/Finance/Market Risk experience
- Familiarity with Athena ecosystem or similar (SecDB, Quartz)
- Exposure to trading, risk, or regulatory platforms
About the Role
A high-impact SRE role in a Market Risk trading environment, focused on eliminating operational toil through automation, AI, and reliability engineering at scale.