Training Language Models to Follow Instructions with Human Feedback

Paper summary of Training language models to follow instructions with human feedback from March 2022.

Intro

Blah