Training Language Models to Follow Instructions with Human Feedback

arXiv V1: Training language models to follow instructions with human feedback