Science, or a related technical field 8+ years of experience in enterprise technology roles, with 3-5 years focused on platform operations or service management Hands-on experience with managing GenAI/ML platforms and LLM-based services (e.g., OpenAI, Anthropic, Azure OpenAI, Hugging Face) Proven track record in … implementing and scaling MLOps or LLMOps practices in a production environment Certifications in cloud platforms (e.g., Azure, AWS, GCP) and/or ITIL Service Management preferred Advanced coursework or certifications in AI/ML, MLOps, or LLMOps is a strong plus Ongoing learning and participation in GenAI or platform … e.g., Prometheus, Grafana, Azure Monitor) Exceptional communication and stakeholder engagement skills to partner with business, technical, and governance teams Experience managing platform SLAs, incident management, and continuous improvement cycles in high-availability environments Ability to balance strategic thinking with hands-on execution in a fast-paced, evolving landscape Professional More ❯
Ability to collaborate with global teams Flexibility to adapt to changing priorities and ambiguity Experience working with stakeholders at various levels Vendor and resource management skills Strong time management, multi-tasking, and prioritization skills Familiarity with DevOps and ITIL practices What would be your key responsibilities? Manage business … for proper prioritization Identify automation opportunities in business and internal processes Propose and develop improvements for Central Finance and PaPM applications as part of ProblemManagement Ensure third-party adherence to IT service management best practices Assist with change management activities, including releases and production slots More ❯
platforms on an as needed basis. You will provide second level support of break/fix, service requests, observability improvement, documentation/reporting, capacity management, and provide critical feedback to Engineering and Architecture teams. You will collaborate closely with Engineering, Architecture, Infosec and AppDev teams to drive performance optimization … and escalation for Tier 1 Perform T2 operational tasks such as Incident response, triage, troubleshooting, resolution and process standard service requests Identification and proactive problemmanagement of assigned WIFI platforms Resolve internal customer issues and work closely with customers and vendors until resolution Perform level 2 network support … configuring of WIFI systems, switches, routers, firewalls, load balancers, and other network devices Perform advanced troubleshooting and health checks, configuration modification, and upgrades Vendor management, cooperation and escalation for ISP's and OEM's to support the infrastructure Ensure network configuration and design principles are adhered to Support processes More ❯