GPT-4V(ision) technical work and authors
This document acknowledges the contributors and technical work done as part of the GPT-4V project. GPT-4V refers to the technology that enables the integration of multimodal vision capabilities with GPT-4. Our current body of work consists of multiple resources:
The “GPT-4 Technical Report” covers the GPT-4 system generally as well as quantitative evaluations of GPT-4V in academic evals and exams.
The “GPT-4V System Card” covers the safety considerations involved in deploying our work.
The blog post “ChatGPT Can Now See, Hear and Speak” demonstrates the user interface of the realized GPT-4V system as deployed in ChatGPT.
“The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)”, by our colleagues at Microsoft, covers a plethora of practical observations and strategies for using GPT-4V.
This collection of works and the following credits reflect the multidisciplinary expertise involved in creating, building, and safely deploying multimodal AI while empowering users and educating the public.
Authorship, credit attribution, and acknowledgments
When citing GPT-4V, please cite this work as “OpenAI (2023)”. Contributions are sorted alphabetically and assembled by Raul Puri.
Research contributions
Jamie Kiros Deployment research & evals lead
Daniel Levy Optimization lead
Hyeonwoo Noh Pretraining research lead
Long Ouyang Alignment data lead
Raul Puri Research engineering lead
Architecture research
Mark Chen, Casey Chu, Jamie Kiros, Christine McLeavey, Hyeonwoo Noh, Raul Puri, Alec Radford, Aditya Ramesh
Distributed training infrastructure
Trevor Cai, Yunxing Dai, Chris Hesse, Brandon Houghton, Yongjik Kim, Łukasz Kondraciuk, Hyeonwoo Noh, Mikhail Pavlov, Raul Puri, Nikolas Tezak, Amin Tootoonchian, Tianhao Zheng
Data
Alex Karpenko, Jong Wook Kim, David Mélý, Reiichiro Nakano, Hyeonwoo Noh, Long Ouyang, Raul Puri, Alec Radford, Pranav Shyam, Tao Xu
Evaluation data
Sandhini Agarwal, Madelaine Boyd, Shengli Hu, Andrew Kondrich, Todor Markov, David Mélý, Hyeonwoo Noh, Reiichiro Nakano, Long Ouyang, Cameron Raymond, Filippo Raso, Chelsea Voss, Lilian Weng, Chong Zhang, Rowan Zellers, Nicholas Turley
Alignment data
Stephanie Lin, Long Ouyang, Chong Zhang
Deployment, alignment & post-training research
Ilge Akkaya, Diogo Moitinho de Almeida, Mark Chen, Liam Fedus, Yuchen He, Alex Karpenko, Jamie Kiros, Andrew Kondrich, Rachel Lim, Randall Lin, Stephanie Lin, Ryan Lowe, Luke Metz, Reiichiro Nakano, Long Ouyang, Raul Puri, Jiayi Weng, Barret Zoph
Compute cluster scaling
Andrew Cann, Rory Carmichael, Christian Gibson, Henri Roussez, Akila Weliwinda
Hardware correctness
Oleg Boiko, Trevor Cai, Michael Petrov, Alethea Power
Training run babysitting
Trevor Cai, Kyle Kosic, Daniel Levy, David Mélý, Reiichiro Nakano, Hyeonwoo Noh, Mikhail Pavlov, Raul Puri, Amin Tootoonchian
Safety contributions
Sandhini Agarwal Policy research lead
Lama Ahmad Red teaming lead
Chong Zhang Safety systems research lead
Red teaming leaders
Lama Ahmad, Rosie Campbell, Ashyana-Jasmine Kachra
Safety systems research
Florencia Leoni Aleman, Madelaine Boyd, Yuchen He, Andrew Kondrich, Todor Markov, Raul Puri, Cameron Raymond, Andrea Vallone, CJ Weinmann, Lilian Weng, Mehmet Yatbaz, Chong Zhang
Policy research
Sandhini Agarwal, Lama Ahmad, Miles Brundage, Rosie Campbell, Michael Kolhede, Michael Lampe
Deployment contributions
Madelaine Boyd Trust & safety engineering lead
Raul Puri Inference infrastructure lead
Jordan Sitkin Deployment platform lead
Isaac Wolkerstorfer ChatGPT engineering lead
Benjamin Zweig Design lead
Deployment engineering
Valerie Balcom, Jason Chen, Dave Cummings, Bogo Giertler, Joshua Gross, Eric Horacek, Mark Hudnall, Tomer Kaftan, Rachel Lim, Lien Mamitsuka, Rajeev Nayak, Henrique Ponde de Oliveira Pinto, Adam Perelman, Raul Puri, David Schnurr, Eric Sigler, Jordan Sitkin, Javier Soto, Heather Schmidt, Felipe Such, Anton Tananaev, Sherwin Wu, Isaac Wolkerstorfer
ChatGPT client engineering
Valerie Balcom, Bogo Giertler, Eric Horacek, Lien Mamitsuka, Rajeev Nayak, Raul Puri, David Schnurr, Javier Soto, Anton Tananaev
ChatGPT backend engineering
Jason Chen, Joshua Gross, Mark Hudnall, Alex Karpenko, Raul Puri, Eric Sigler, Jordan Sitkin, Isaac Wolkerstorfer, Chong Zhang, Dave Cummings
Deployment platform
Madelaine Boyd, Olivier Godement, Mark Hudnall, Rachel Lim, Raul Puri, Jordan Sitkin, Isaac Wolkerstorfer, Sherwin Wu
Inference infrastructure
Greg Brockman, Tomer Kaftan, Rachel Lim, Raul Puri, Heather Schmidt, Jordan Sitkin, Felipe Such
Trust & safety engineering
Madelaine Boyd
Design
Maddie Simens, Benjamin Zweig
Launch partners, product, and deployment management
Olivier Godement, Joanne Jang, Angela Jiang, Raul Puri, Jessica Shieh, Natalie Staudacher, Nicholas Turley
Additional contributions
Greg Brockman, Peter Deng, Jason Kwon, Bob McGrew, Mira Murati, Srinivas Narayanan, Peter Welinder, Hannah Wong
Communications
Eric Antonow, Ryan Biddy, Ruby Chen, Thomas Degry, Niko Felix, Elie Georges, Kendra Rimbach, Natalie Summers, Justin Jay Wang
Deployment security
Tiffany Citra, Jake McNeil, Karthik Rangarajan
User support
Jeremiah Currier
Legal
Ashley Pantuliano, Filippo Raso, Thomas Stasi
Acknowledgments
We are grateful to the following expert adversarial testers and red teamers, who helped test our models at early stages of development and informed our risk assessments as well as the System Card output: Sally Applin, Gerardo Adesso, Rubaid Ashfaq, Max Bai, Matthew Brammer, Ethan Fecht, Andrew Goodman, Shelby Grossman, Matthew Groh, Hannah Rose Kirk, Seva Gunitsky, Yixing Huang, Lauren Kahn, Sangeet Kumar, Dani Madrid-Morales, Fabio Motoki, Aviv Ovadya, Uwe Peters, Maureen Robinson, Paul Röttger, Herman Wasserman, Alexa Wehsener, Leah Walker, Bertram Vidgen, Jianlong Zhu. Participation in this red teaming process is not an endorsement of the deployment plans of OpenAI or OpenAI’s policies.
We thank Microsoft for their partnership, especially Microsoft Azure for supporting model training with infrastructure design and management, and the Microsoft Bing team and Microsoft’s safety teams for their partnership on safe deployment and safety research. We also thank the Microsoft Research team for their exploratory work cataloguing use of GPT-4V: Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang.
Lastly, we thank our deployment partner Be My Eyes for their support and feedback in deploying this technology to the blind and low-vision community.