The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback

Oct 31, 2023

|

Nathan Lambert, Roberto Calandra

Tags:

Reinforcement Learning

Download the paper!Read on ArXiv!Run the code!Video available!

ABSTRACT:

hide & show ↓↑

What you need to know:

Citation

‍

Links

Download the paper!

Video

←All Research Google Scholar

Tags

Reinforcement Learning

Multi-Agent Systems