Site Reliability Engineer

Job Description


We are seeking an experienced Site Reliability Engineer (SRE) to join our Group Technology Team in Milton Keynes.

ConnellsX is Connells Group Technology's internal developer platform, built on Microsoft Azure. It simplifies cloud hosting, embeds security and compliance by default, and enables a frictionless developer experience. As part of the team building and operating this platform, you will play a hands-on role in ensuring it is reliable, scalable, and observable.

You will help establish and mature SRE practices, focusing on:
Monitoring and observability
Incident response
Post-incident review
Reliability testing and capacity planning
Toil reduction
Enabling development velocity

We offer a hybrid working arrangement with one day per week in our Milton Keynes office.

Key Responsibilities:
Support teams using ConnellsX and respond to incidents in a structured, blameless way
Investigate root causes and drive post-incident actions to completion
Define SLIs, contribute to SLOs, and monitor error budgets
Build dashboards, alerts, and runbooks to improve visibility
Automate repetitive tasks to reduce operational toil
Collaborate with cross-functional teams to enhance reliability and observability
Support performance testing and capacity planning
Proactively identify and prioritise reliability improvements

Experience & Skills Required:
Hands-on experience with Azure Monitoring (Application Insights, Alerts, Action Groups)
Strong knowledge of OpenTelemetry (including Kubernetes)
Scripting/automation using PowerShell and/or Azure CLI
Experience with Terraform and GitHub Actions
Ability to define SLIs/SLOs and manage error budgets
Incident response and post-incident review experience
Familiarity with Docker and Kubernetes
Strong communication and documentation skills

Desirable:
Working knowledge of .NET/C# and React/NextJS
Experience with cloud cost optimisation
Knowledge of Azure networking (DNS, VNets, Firewalls)
Understanding of security frameworks (e.g. ISO 27002, NIST CSF)
Azure certifications

About You:
You may come from an SRE, DevOps, platform engineering, or operations background. What matters is hands-on experience running production systems, managing incidents, creating runbooks, and automating repetitive work. The focus is on identifying root causes and systemic issues, reducing manual toil through automation, and maintaining reliability by applying SRE principles and using data-driven metrics (SLIs/SLOs).

You understand reliability is about balance, not perfection, and can make data-driven trade-offs between stability and delivery. You are curious, collaborative, and take shared responsibility for system reliability.

Please note that we are unable to provide visa sponsorship. Applicants must have the right to work in the UK.

Connells Group UK is an equal opportunities employer and positively encourages applications from suitably qualified and eligible candidates regardless of sex, race, disability, age, sexual orientation, transgender status, religion or belief, marital status, or pregnancy and maternity.

Job Details

Company
Connells Limited
Location
Milton Keynes, Buckinghamshire, UK
Hybrid / Remote Options
Employment Type
Full-time
Posted