Reinforcement Learning from Human Feedback

rlhfbook.com

95 points by onurkanbkrc 9 hours ago


https://arxiv.org/abs/2504.12501

dang - 4 hours ago

Related. Others?

RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)

verdverm - 8 hours ago

Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

klelatti - 9 hours ago

Web version with links, etc:

https://rlhfbook.com/

iisweetheartii - 8 hours ago

[dead]