OpenAI's NEW Fine-Tuning Method Changes EVERYTHING
Louis-Fran?ois Bouchard
Making AI accessible. ?? What's AI on YouTube. Co-founder at Towards AI. ex-PhD Student.
Good morning!
Have you ever wanted to take a language model and make it answer the way you want without needing a mountain of data?
Well, OpenAI’s got something for us: Reinforcement Fine-Tuning, or RFT, and it changes how we customize AI models. Instead of retraining it with feeding examples of what we want and hoping it learns in the classical way, we actually teach it by rewarding correct answers and penalizing wrong ones, just like training a dog?—?but, you know, with fewer treats and more math.
Let’s break down reinforcement fine-tuning compared to supervised fine-tuning!
Both essentially have their use that we can discuss in one line:
I’ve already covered fine-tuning on the channel if you are interested in that. Today, let’s get into how RFT actually works! Read the article here or watch the video:
And that's it for this iteration! I'm incredibly grateful that?the What's AI newsletter?is now read by over 20,000 incredible human beings. Click here to share this iteration with a friend if you learned something new!
Looking for more cool AI stuff? ??
Want to share a product, event or course with my AI community? Reply directly to this email, or visit my Passionfroot profile to see my offers.
Thank you for reading, and I wish you a fantastic week! Be sure to have?enough sleep and physical activities next week!
Louis-Fran?ois Bouchard