Daniel Ziegler

1 post

Fine-Tuning GPT-2 from Human Preferences