System Center Operations Manager (SCOM) Engineer
Job Description
The System Center Operations Manager (SCOM) Subject Matter Expert (SME) will be responsible for managing, maintaining and improving the SCOM platform. The SCOM platform is the “single pane of glass” for monitoring all technology systems and applications across the estate. This position is part of a team that uses SCOM to track system availability and performance and to provide comprehensive monitoring of the supported applications and infrastructure.
The candidate needs a deep understanding of SCOM administration and reporting, and experience with management pack development for both standard and non-standard technologies. They will also need a broad understanding of the technologies SCOM will be monitoring, such as AD, DNS, DHCP, IIS, certificates, physical hardware (iLO, iDRAC), network devices and Linux operating systems.
The SCOM SME will be responsible for collaborating with Windows, DBA and Linux systems engineers, application technical support analysts and developers, both internal and in the customer/development teams, to determine requirements for filters, events, alerts, dashboards and reporting in various monitoring tools. The role also requires a good understanding of event correlation and the ability to query the data warehouse through SSRS to produce reports that help determine root cause.
Key Responsibilities:
- Perform administrative tasks on the SCOM platform (e.g. managing alerts, user permissions)
- Perform changes to the SCOM platform (e.g. management pack imports, configuring overrides, applying patches)
- Diagnose problems within the SCOM platform, provide solutions, create documentation and maintain the existing system life cycle.
- Maintain, review and improve end-user access to and integration with the SCOM platform (e.g. dashboards, reports)
- Develop the existing toolset to improve supportability of the environment, identifying opportunities for improvement such as automated deployment of management packs and regular baseline checking against configuration.
- Develop custom SCOM management packs and improve existing custom management packs.
- Produce daily, weekly, monthly and ad hoc reports using SQL Server Reporting Services (SSRS).
- Utilize verbal and written communication skills to maintain accurate system documentation and provide training as required
- Manage customer satisfaction through effectively communicating and managing customer expectations
- Communicate how to leverage technology to create business value and become more effective and efficient
- Work with project managers and peers to understand and solve challenging technical problems, produce effort estimates, improve system functionality and reliability, and reduce costs.
- Work closely with Business Technology peers and management, emphasizing collaboration and joint effort, to ensure that the needs and expectations of organizational stakeholders, business partners, external customers and third-party organizations are met or exceeded.
- Transfer knowledge via documentation and training to other project and support teams/resources.
Basic Qualifications:
- BSc degree in computer science, information technology or a computer-related discipline, or 5+ years of IT work experience in a global information technology infrastructure environment.
Preferred Skills/Experience:
- Experience with the installation and administration of Microsoft System Center Operations Manager 1807 and 2022
- Experience in server/application/network performance, capacity and availability monitoring
- Experience in developing custom SCOM management packs for Windows, Linux and network platforms using Microsoft Visual Studio Authoring Extensions (VSAE).
- Experience using Microsoft Windows PowerShell.
- Experience with SQL Server Reporting Services (SSRS)
- Experience with version control platforms such as GitHub.
- Experience using ServiceNow ITSM.
- Experience with core Microsoft technologies including PC and server operating systems
- Experience with core Linux technologies such as Red Hat Enterprise Linux
- Knowledge of server virtualization & cloud technologies, such as Hyper-V and AWS
- ITIL Certification is a plus.
Professional Skills/Experience:
- Minimum of 5-7 years of IT experience managing deployment of technology components in a specific technical discipline (clients, servers, storage, web platforms, database platforms, security, networks or communications), ideally with a mix of skills that includes technical architecture, engineering and operations experience
- A career track record of engineering, developing, deploying and maintaining business-critical systems in the area of technical expertise, including hands-on solution development and implementation experience
- Ability to impact and influence stakeholders and drive alignment on standard global technology solutions.
- Team player with proven communication, organizational and interpersonal skills. The role requires significant interaction with many different teams across a global company.
- Self-motivated, with keen attention to detail and excellent judgment skills
- Ability to establish new standards for quality, performance and productivity
- Excellent written and verbal communication skills, with the ability to maintain open communication with internal employees, contractors, managers, third parties and customers as needed
- Able to integrate and apply feedback in a professional manner
- Able to prioritize and drive to results with a high emphasis on quality