Operations Engineer
£55,000 - £60,000 base + benefits
The Operations Engineer plays a key role in ensuring the resilience, performance, and ongoing enhancement of technologies and platforms.
As the operational foundation for commercial digital services, this role ensures that applications, integrations, enterprise platforms, customer-facing technologies, and supporting infrastructure are effectively monitored, maintained, and supported. The focus is on proactively identifying issues, improving reliability, minimising service disruption, and maximising platform availability.
The Operations Engineer will work collaboratively with Engineering, Platform, Integration, Product, Cybersecurity, and Enterprise IT teams to support operational excellence through incident management, governance activities, release coordination, platform optimisation, and service continuity.
Key Responsibilities
- Provide operational support for enterprise platforms, applications, integrations, and associated technologies.
- Monitor system health, availability, and performance using monitoring, alerting, and observability tools.
- Analyse, troubleshoot, and resolve incidents affecting services and platforms.
- Perform root cause analysis and help implement permanent solutions that prevent issues from recurring.
- Coordinate major incident activities across internal technology teams and external suppliers where required.
- Support production deployments, release activities, and environment readiness checks.
- Collaborate with Engineering and Platform teams to ensure new solutions and releases are operationally stable and supportable.
- Identify opportunities to automate operational processes, reporting, alerting, and maintenance activities.
- Ensure operational practices align with governance, cybersecurity, compliance, and change management requirements.
- Create and maintain operational documentation, support guides, and technical runbooks.
- Drive continuous improvement initiatives to enhance platform stability, reliability, and operational efficiency.
- Work closely with Product, Engineering, Platform, Integration, and Service Management teams to deliver strong operational outcomes.
Skills and Experience Required
- Proven experience supporting enterprise applications, digital platforms, or cloud-based technologies.
- Strong technical troubleshooting, analytical thinking, and problem-solving abilities.
- Experience working within IT operations, support engineering, or service management environments.
- Experience supporting business-critical production services and operational platforms.
- Knowledge of monitoring, logging, alerting, and observability practices.
- Experience working with incident, problem, change, and release management processes.
- Excellent communication skills with the ability to collaborate effectively across multiple technical and business stakeholders.
Desirable Skills and Experience
- Experience working within automotive, manufacturing, retail, or enterprise digital technology environments.
- Exposure to cloud platforms, CI/CD tooling, DevOps practices, and infrastructure automation.
- Experience supporting Salesforce, APIs, integrations, digital platforms, or enterprise SaaS solutions.
- Understanding of Site Reliability Engineering (SRE), operational engineering principles, or reliability-focused practices.