Senior Site Reliability Engineer
Senior Site Reliability Engineer (SRE) - Data & AI Platforms
Location: Glasgow, Scotland (Hybrid - 3 days onsite, non-negotiable)
Duration: 6 months with extensions likely
Day rate: PAYE circa £430
We are seeking an experienced Senior Site Reliability Engineer to support a strategic, data- and AI-driven platform delivering operational intelligence at scale. This role sits at the intersection of software engineering, platform operations, and data infrastructure, with a strong focus on reliability, automation, and continuous improvement.
Key Responsibilities:
- Own reliability, availability, and performance across DEV, QA, and PROD environments
- Apply SRE principles (SLIs/SLOs) to improve resilience and reduce operational toil
- Build automation tooling (primarily Python) to streamline workflows
- Develop and maintain CI/CD pipelines for data and platform services
- Lead incident response, root cause analysis, and preventative improvements
- Implement monitoring, alerting, and observability across systems
- Support cloud data platforms (including Snowflake) for scalability and cost efficiency
- Enable reliable deployment of AI-driven services and data pipelines
- Collaborate with engineering, data, and product teams in Agile environments
Required Skills & Experience:
- Extensive experience in SRE, platform engineering, or similar roles
- Strong Python Scripting and automation experience
- Proven experience managing production systems at scale
- CI/CD and modern software delivery practices
- Cloud data platforms and relational databases knowledge
- Strong troubleshooting and communication skills
Desirable:
- Experience with AI/LLM platforms, BI tools, and semantic data models
- Background in large-scale or regulated environments
Please apply today to be considered
McGregor Boyall is an equal opportunity employer and do not discriminate on any grounds.