Monitoring Tools Support, Assoc Manager / Specialist
Location: UK (London - must be willing to travel to client sites throughout the UK on an ad hoc basis)
Salary: Competitive salary and package (depending on level of experience)
Accenture are partnering with scaled UK AI compute pioneers to lead the charge on next-generation infrastructure for sovereign AI. To support this endeavour, we’re building a high-performance compute operations team in London.
Our work will be sensitive, secure, 24x7 and on the most up-to-date high-density compute stacks available. Shift teams will be set up to operate 24x7, and successful candidates working on shift will be paid a shift premium for the non-standard, unsociable hours that form part of that rota. We anticipate this to commence within the first 3-6 months of joining. Our opportunity is foundational to sovereign AI and national interest. We are recruiting for all levels across the TechOps stack and seek candidates who can be cleared to SC, are willing to work shifts, have experience of technical operations work, and have either high-performance compute skills or a proven track record in adopting new skillsets.
Any offer of employment is subject to satisfactory BPSS and SC security clearance, which requires 5 years' continuous UK address history (typically including no periods of 30 consecutive days or more spent outside of the UK) at the point of application.
Key Responsibilities:
•Manage monitoring configurations and be familiar with tools such as Dynatrace, NetBox, and Vertiv.
•Possess an in-depth understanding of the Dynatrace license model and its application.
•Execute hands-on implementation and deployment of Dynatrace across customer environments.
•Configure and customise Dynatrace as per organisational and project-specific requirements.
•Manage complete Dynatrace deployment lifecycles in complex customer settings.
•Integrate Dynatrace with various ITSM, CI/CD, and QA/performance testing tools.
•Maintain detailed documentation for resolved issues, workflows, and Dynatrace configurations.
•Share knowledge, mentor peers, and develop Dynatrace usage guidelines and best practices.
•Troubleshoot incidents related to the monitoring platform.
•Produce high-quality implementation scripts, test plans, and change controls for all monitoring changes.
•Maintain Observability Frameworks: Integrate monitoring, logging, and tracing for cloud- and on-premises systems.
•Maintain Data Integration & Ingestion: Manage scalable systems to collect and ingest data from multiple sources via APIs and SNMP traps.
•Custom Solutions: Develop dashboards and reporting views that provide clear, actionable insights.
•Monitor, analyse, and fine-tune monitoring metrics to address bottlenecks or inefficiencies.
Required Skills:
•Tooling Expertise: Proficiency in Prometheus, Grafana, and Dynatrace for data collection, visualisation, and deep-dive analysis.
•Strong knowledge of API design, data modelling, and pipelines.
•Experience with monitoring platforms, scripting languages (e.g., Python, PowerShell, Bash), and automation tools (e.g., Terraform and Ansible) to enhance operational efficiency.
•Strong communication and collaboration skills, with a track record of working effectively across technical and non-technical teams.