Letchworth Garden City, Hertfordshire, England, United Kingdom Hybrid / WFH Options
Willmott Dixon
handled efficiently and in a timely manner. Collaborate: Work closely with the IT teams to deliver outstanding service, sharing information and best practices. Handle incidents in line with the IncidentManagement process, keeping all parties informed with clear and accurate updates. Participate in key tasks and contribute to the smooth operation of IT services, including Request Fulfilment, Change … Requests, Major IncidentManagement, and proactive Problem Management. Act as a liaison between the Service Desk and other IT teams to ensure effective communication and service delivery. Performance Metrics: Manage and monitor key performance indicators (KPIs) related to the role, constantly identifying areas for improvement to exceed service expectations. Maintain high standards in documenting processes, creating clear and … concise knowledge articles to assist in service delivery. Provide regular reports to management and stakeholders on process performance, compliance, and exceptions. Essential and Desirable Criteria Essential: Experience in a customer service-focused role, with strong written and verbal communication skills. Ability to manage and prioritize incidents effectively, using available resources to solve problems. Self-directed with initiative, capable of More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
production estate from both a technical and process perspective. Provide a consistent smooth operation of live systems and drive all on-call support issues. Design and operate a new incident tracking process to ensure root causes are found and remediated in a timely fashion by the development team. Create and maintain high end monitoring and automation tooling. Drive automation … and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incidentmanagement, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud More ❯
Peterborough, Cambridgeshire, United Kingdom Hybrid / WFH Options
TXP
our clients IT infrastructure through the timely application of software updates (patches). ROLE RESPONSIBILITIES Facilitate and execute the installation of agreed patches and the maintenance of customer AV management software. Responding to BAU Security and Patch Management incidents logged on the incidentmanagement system to ensure incidents are resolved within SLA, whilst providing help and … implemented solutions are backed up with documentation, training and expertise within the Service Desk and Support areas. Perform daily, weekly and monthly scheduled tasks relating to Patching and AV management when required. Escalate potential service issues to line management and ITIL Services Raise potential problem issues with Problem and Incident Management. SKILLS AND EXPERIENCE 3+ years of … knowledge of IT platforms, equipment and applications. Proven Customer service skills. Industry recognised qualifications in relevant areas. Knowledge/experience in the following technologies. - Windows Server/Workstation - Patch Management Software - SEPM and other mainstream AV solutions (e.g. Sophos, Kaspersky, etc.) - Experience with troubleshooting (Event Viewer, Log Files, Command Prompt) - Basic networking If you're proactive, detail-oriented, and More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Google Cloud, including multi-cluster and multi-regional Kubernetes environments. Create and maintain automation scripts (Python, Bash, Shell) and tools (GitLab, Terraform, Vault, Ansible) to streamline deployment, monitoring, and management processes using Infrastructure as Code (IaC). Implement and integrate monitoring and observability solutions, like AIOps, for proactive system issue detection and response. Participate in on-call rotations to … Confluence. Nice to Have Skills: CKA or CKAD certifications, and advanced certifications in AWS, GCP, or Azure. Experience working in a globally distributed team. Knowledge of ITIL practices and incidentmanagement frameworks. Additional Information: If you require accommodations during the recruitment process, please email . We respect confidentiality and are committed to providing necessary support to all candidates. More ❯
Luton, Bedfordshire, South East, United Kingdom Hybrid / WFH Options
Lean People Services Limited
distributed systems in the cloud Optimize web applications to ensure performance and scalability Develop automated tests to ensure business needs are met, and allow for regression testing Cloud Infrastructure Management: Contribute to infrastructure improvement initiatives. You will assist with monitoring and Optimisation of Cloud resources. Application Development: Ensure smooth deployment and integration of features. Operations and Maintenance: Resolve operational … issues and minimize downtime when needed. Respond to customer requests this may include direct contact with customers You may need to cover incidentmanagement when one of your colleagues is on leave Key Technologies : Cloud based systems, Azure & AWS Full stack Octopus Deploy VueJS, React, Angular Mobile development in Android Studio Who you will be: Passionate about software … to using automated testing platforms and unit tests to deliver readable, maintainable code. Have an great understanding of deployment technologies and concepts Have a good working knowledge of issue management, code versioning, continuous integration and deployment tools. Programming languages that we use are c# full stack, Vue JS You are comfortable with making decisions with general concepts including dependency More ❯