Learning to summarize with human feedback

5 years ago 9
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.
Read Entire Article