The Story of RLHF: Origins, Motivations…

Nov 13, 2023

How learning from human feedback revolutionized generative language models...

5 Comments

Nov 13, 2023

This is phenomenally detailed. Going to come back to this in a while to absorb all the information. Great work, Cameron. This must have taken a lot of effort to put together

Expand full comment

Reply (1)

Cameron R. Wolfe, Ph.D.

Nov 13, 2023

Thanks! I made the decision to switch to writing posts every 2 weeks again (instead of every week). I felt like the more frequent posts were lacking in quality a bit, and my best posts are those that I spend time refining, making comprehensive, etc. So, I'm glad that you can tell a quality difference! I'll try to keep the detailed posts coming :)

Expand full comment

Reply (1)

Sairam Sundaresan

Nov 13, 2023

You already set a high bar to begin with. This just elevates it. Awesome work.

Expand full comment

Finn

Nov 13, 2023

Thanks Cameron!

Expand full comment

Ziniu Li

Nov 30, 2023

Thank Cameron for the great blog!!! You may be interested in ReMax, a more efficient reinforcement learning method than PPO when used in RLHF. In addition, ReMax is very simple with just 6 lines of code to implement. ReMax's paper discusses interesting properties of RLHF that may also be insightful for designing better RLHF algorithms.

ReMax's paper link: https://arxiv.org/abs/2310.10505

Expand full comment

Deep (Learning) Focus

The Story of RLHF: Origins, Motivations…