Jeffrey Wu

2 posts

Fine-Tuning GPT-2 from Human Preferences

Better Language Models and Their Implications

We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension, machine translation, question answering, and summarization.


24-minute read