Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
production estate from both a technical and process perspective. Provide a consistent smooth operation of live systems and drive all on-call support issues. Design and operate a new incident tracking process to ensure root causes are found and remediated in a timely fashion by the development team. Create and maintain high end monitoring and automation tooling. Drive automation … and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incidentmanagement, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud More ❯
Peterborough, Cambridgeshire, UK Hybrid / WFH Options
Experis
IncidentManagement Engineer Location: Remote Working hours: Sunday to Thursday (7:30am to 4pm) Salary : £28,000 We have an opportunity for an IncidentManagement Engineer to join Experis on a permanent basis. You will be working as part of our Employed Consultant team, on site with a multi-national technology company based in Reading. This … multiple industries; our approach is a very personal one, with both our clients and our own employees. We are passionate about training, technology and career development. Role Purpose: An Incident Communications and Coordination Engineer is required to work as part of a collaborative team that serve as an incidentmanagement and communications support operation for both external More ❯
Cambridge, Cambridgeshire, UK Hybrid / WFH Options
Experis
IncidentManagement Engineer Location: Remote Working hours: Sunday to Thursday (7:30am to 4pm) Salary : £28,000 We have an opportunity for an IncidentManagement Engineer to join Experis on a permanent basis. You will be working as part of our Employed Consultant team, on site with a multi-national technology company based in Reading. This … multiple industries; our approach is a very personal one, with both our clients and our own employees. We are passionate about training, technology and career development. Role Purpose: An Incident Communications and Coordination Engineer is required to work as part of a collaborative team that serve as an incidentmanagement and communications support operation for both external More ❯
Constantly improving all our processes and procedures, we believe there is nothing we cannot improve - Assisting and managing relationships with external vendors and contractors - Liaising with internal teams and management groups - Creating and maintaining metrics on all aspects of our Data Centers and utilising those metrics to drive positive changes - Assisting in implementing service methodologies including incidentmanagement, problem management, change management, capacity management, etc About the team About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative … critical Data Centre Operations including systems such as feeders, Transformers, Generators, Switch gear, UPS systems, ATS units, PDU units, chillers, pumps, Air Handling units - Proven track record of people management and developing teams and in particular ensuring staff are ready for any and conditions through skill and process development - Ability to solve problems at their root, stepping back to More ❯