John
Lambert
Toggle navigation
about
publications
code
teaching
news
Announcement_4
December 31, 2025
2025
Our new work on multi-turn reward modeling is now on
[arXiv]
.