Write For Us

Pushing the capabilities of Gemma 3 via distillation and RL fine-tuning

E-Commerce Solutions SEO Solutions Marketing Solutions
24 Views
Published
Specialized capabilities (e.g. math abilities, coding, multilinguality, tool use...) are key areas of improvement in post-training. In this talk we explore a novel strategy involving large-scale distillation and RL finetuning to push specialized capabilities in LMs while still improving their generality.

Subscribe to Google for Developers → https://goo.gle/developers

Speakers: Johan Ferret
Products Mentioned: Gemma
Category
Project
Tags
Google, developers, pr_pr: Gemma;
Show more
Sign in or sign up to post comments.
Be the first to comment