Reinforcement Learning from Human Feedback rlhfbook.com 95 points by onurkanbkrc 9 hours ago https://arxiv.org/abs/2504.12501
dang 4 hours ago Related. Others?RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)
verdverm 8 hours ago Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials leggerss 6 hours ago You could say he's also learning from human feedback
klelatti 9 hours ago Web version with links, etc:https://rlhfbook.com/ dang 4 hours ago Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
dang 4 hours ago Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
Related. Others?
RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)
Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
You could say he's also learning from human feedback
Web version with links, etc:
https://rlhfbook.com/
Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
[dead]