Tom Brown

3 posts

Fine-Tuning GPT-2 from Human Preferences

Testing Robustness Against Unforeseen Adversaries

Gathering Human Feedback