Reinforcement Learning Jobs in Watford

2 of 2 Reinforcement Learning Jobs in Watford

AI Scientist

Watford, Hertfordshire, United Kingdom
Moorepay Limited
translate into business value. We embrace a fast-paced, entrepreneurial mindset, enabling us to iterate rapidly and refine our AI strategies based on continuous learning and real-world feedback. Key Responsibilities AI Research and Model Development Conducting research in AI, using a full range of machine learning and … optimising AI that enhances automation and decision-making. Ensuring AI models are scalable and efficient for real-world enterprise deployment. Experimenting with different machine learning and GenAI techniques, including prompt engineering, RAG (Retrieval Augmented Generation), fine-tuning of LLMs, RLHF (reinforcement learning with human feedback), and adversarial … friendly manner. Leading AI experimentation initiatives and contributing to internal strategy discussions. Engaging with customers to understand AI needs and create practical solutions. Continuous learning and innovation. Staying up-to-date with AI and ML research relevant to HR and workforce management. Exploring new techniques in deep learning More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Scientist

Watford, Hertfordshire, South East, United Kingdom
Zellis
translate into business value. We embrace a fast-paced, entrepreneurial mindset, enabling us to iterate rapidly and refine our AI strategies based on continuous learning and real-world feedback. In this role your key responsibilities will include: AI Research and Model Development Conducting research in AI, using a full … range of machine learning and GenAI techniques to develop solutions across the entire HR lifecycle. Designing and optimising AI that enhances automation and decision-making. Ensuring AI models are scalable and efficient for real-world enterprise deployment. Experimenting with different machine learning and GenAI techniques, including prompt engineering … RAG (Retrieval Augmented Generation), fine-tuning of LLMs, RLHF (reinforcement learning with human feedback), and adversarial techniques. Evaluating AI model performance using statistical and business-driven metrics. Working on natural language to SQL AI transformations to extract data value. Working on natural language to other meta-language translation More ❯
Employment Type: Permanent
Posted: