s in Computer Science, Engineering, Data Science, or related field 8+ years in enterprise technology, with 3-5 years in platform operations or service management Experience managing GenAI/ML platforms and LLM-based services (e.g., OpenAI, Anthropic, Azure OpenAI, Hugging Face) Proven in scaling MLOps or LLMOps practices … cross-functional teams in complex ecosystems Familiarity with monitoring tools like Prometheus, Grafana, Azure Monitor Excellent communication and stakeholder engagement skills Experience managing SLAs, incidentmanagement, continuous improvement Strategic and hands-on in fast-paced environments Professional English proficiency What will be your key responsibilities? In this role … eXperiences (MAX) Platform, ensuring high availability, scalability, and performance. LLMOps Implementation: Develop and operationalize LLMOps practices, including deployment, monitoring, versioning, and performance tuning. Service Management & Support: Establish incidentmanagement, SLAs, change management, and continuous improvement processes. Governance & Compliance: Ensure adherence to Responsible AI, data privacy, and More ❯
Science, or a related technical field 8+ years of experience in enterprise technology roles, with 3–5 years focused on platform operations or service management Hands-on experience with managing GenAI/ML platforms and LLM-based services (e.g., OpenAI, Anthropic, Azure OpenAI, Hugging Face) Proven track record in … implementing and scaling MLOps or LLMOps practices in a production environment Certifications in cloud platforms (e.g., Azure, AWS, GCP) and/or ITIL Service Management preferred Advanced coursework or certifications in AI/ML, MLOps, or LLMOps is a strong plus Ongoing learning and participation in GenAI or platform … tools (e.g., Prometheus, Grafana, Azure Monitor) Exceptional communication and stakeholder engagement skills to partner with business, technical, and governance teams Experience managing platform SLAs, incidentmanagement, and continuous improvement cycles in high-availability environments Ability to balance strategic thinking with hands-on execution in a fast-paced, evolving More ❯
Science, Engineering, Data Science, or a relatedtechnical field 8+ years of experience inenterprise technology roles, with 3–5 years focused on platformoperations or service management Hands-onexperience with managing GenAI/ML platforms and LLM-based services(e.g., OpenAI, Anthropic, Azure OpenAI, HuggingFace) Proven track record in implementing andscaling … and platform telemetry tools (e.g., Prometheus,Grafana, Azure Monitor) Exceptionalcommunication and stakeholder engagement skills to partner withbusiness, technical, and governanceteams Experience managing platform SLAs,incidentmanagement, and continuous improvement cycles inhigh-availability environments Ability tobalance strategic thinking with hands-on execution in a fast-paced,evolving landscape Professional … tuning at scale. Ensureefficient management of AI models to maximize their effectivenessand businessimpact. ServiceManagement & Support Establish andmanage robust service management processes, including incidentmanagement, service-level agreements (SLAs), change management, andcontinuous service improvement for GenAIplatforms. Drive excellence in service deliverythrough proactive support and managementstrategies. Governance& Compliance Alignment Ensureplatform More ❯
expertise is VIRTUS' greatest strength. Job Summary Reporting to Senior Director of Operations, the primary function of the role is to provide leadership, direction, management and oversight on the day-to-day operations of the VIRTUS Data Centres facilities operations. This includes ensuring the safety, reliability, efficiency, and operational … Continuous Improvement: Champion the implementation of innovative practices, processes, and technologies to drive improvements in service delivery, operational efficiency, and customer satisfaction. Strategic Financial Management: You will have comprehensive control over the operational and capital expenditure, managing a significant budget with a keen eye on cost optimisation without compromising … Drive initiatives aimed at enhancing operational capabilities, streamlining processes, and implementing state-of-the-art solutions to bolster efficiency and service quality. Performance & SLA Management: Ensure that all data centres facilities operations consistently meet or exceed their 100% uptime SLA and operations goals. Address performance and goals challenges promptly More ❯
role that requires commitment to providing a high-quality service to meet customer demand. The role is required to deliver IMAC, iMACD, and Breakfix IncidentManagement Services. The duties range from small alterations such as patching and fault finding to performing large-scale equipment and device relocations including … mechanical and electrical provisioning - Technology component swaps. WTS Control functions Technology room inspections Alarm investigation assistance (CMS/BMS/Peregrine) Identification to DCO management of redundant technology infrastructure Escort and supervision of activities within technology rooms i.e. 3rd party and other bank group activities. Power circuit support (proprietary … Technology component swaps Third party supervision (escort and supervision) WTS Cabling Carry out cable installation services in accordance with STS cabling/patching schedules Management of patching within equipment cabinet Labelling of cabling in accordance with STS standards and requirements Supervision and QA for cabling installs. Recovery of unused More ❯
Work closely with SAP RISE support teams to manage and optimize cloud-hosted SAP systems. Coordinate with SAP for Service Requests, Change Requests, and IncidentManagement in the RISE framework. Integrate SAP Analytics cloud with S/4HANA Collaborate with application, development, and infrastructure teams for seamless SAP … BTP integration, and hybrid SAP architectures. Hands-on experience in system migrations, upgrades, and cloud-based SAP operations. Understanding of networking, security, and identity management in cloud-hosted SAP landscapes. Ability to coordinate with SAP, cloud providers, and internal IT teams for issue resolution. Experience with disaster recovery and More ❯