Kutambulisha utafiti wa kina
Wakala anayetumia mantiki kuunganisha kiasi kikubwa cha habari mtandaoni na kukamilisha shughuli za utafiti wa hatua nyingi kwa ajili yako. Inapatikana kwa watumiaji wa Pro leo, Plus na Team inayofuata.
Sasisho la Februari 10, 2026: Sasa unaweza kuunganisha utafiti wa kina na MCP au programu yoyote na kuzuia utafutaji wa wavuti kwenye tovuti zinazoaminika, ili uweze kuzingatia vyanzo vilivyothibitishwa, vya kiwango cha sekta. Pia sasa unaweza kufuatilia maendeleo kwa wakati halisi na kukatiza ili uboreshe kwa madokezo ya ufuatiliaji au vyanzo vipya. Tumesasisha uzoefu wa kuona ili iwe rahisi kuanza, kufuatilia, na kukagua utafiti wako kuanzia mwanzo hadi mwisho.
Sasisho la tarehe 17 Julai 2025: Utafiti wa kina sasa unaweza kwenda zaidi na kwa upana zaidi kwa ufikiaji wa kivinjari cha kuona kama sehemu ya wakala wa ChatGPT. Ili kufikia uwezo huu uliosasishwa, chagua tu "hali ya wakala" kutoka kwenye menyu kunjuzi ya mtunzi na uweke swali lako moja kwa moja. Utendaji asili wa utafiti wa kina bado unapatikana kupitia chaguo la "utafiti wa kina" kwenye menyu ya zana.
Sasisho la tarehe 24 Aprili 2025: Tunaongeza kwa kiasi kikubwa mara ngapi unaweza kutumia utafiti wa kina—Watumiaji wa Plus, Team, Enterprise, na Edu sasa wanapata maswali 25 kila mwezi, watumiaji wa Pro wanapata 250, na watumiaji wa Free wanapata 5. Hili linawezekana kupitia toleo jipya la utafiti wa kina linaloendeshwa na toleo la o4-mini, lililoundwa ili kuwa na gharama nafuu zaidi huku likihifadhi ubora wa juu. Ukifikia kikomo chako cha toleo kamili, maswali yako yatabadilika kiotomatiki hadi toleo jepesi.
Sasisho la Februari 25, 2025: Watumiaji wote wa Plus sasa wanaweza kutumia utafiti wa kina.
Sasisho la tarehe 5 Februari 2025: Utafiti wa kina sasa unapatikana kwa watumiaji wa Pro nchini Uingereza, Uswizi na Eneo la Kiuchumi la Ulaya.
Leo tunazindua utafiti wa kina katika ChatGPT, uwezo mpya wa wakala ambao unafanya utafiti wa hatua nyingi kwenye mtandao kwa shughuli ngumu. Inatimiza kwa dakika chache kile ambacho kingemchukua mwanadamu masaa mengi.
Utafiti wa kina ni wakala inayofuata wa OpenAI ambaye anaweza kukufanyia kazi kwa kujitegemea—unampa dokezo, na ChatGPT itapata, kuchambua, na kuunganisha mamia ya vyanzo vya mtandaoni ili kuunda ripoti ya kina katika kiwango cha mchambuzi wa utafiti. Ikiendeshwa na toleo lijalo la muundo wa OpenAI o3 ambalo limeboreshwa kwa ajili ya kuvinjari wavuti na uchanganuzi wa data, linatumia mantiki kutafuta, kutafsiri, na kuchanganua kiasi kikubwa cha maandishi, picha na PDF kwenye mtandao, likibadilika inavyohitajika ili kuitikia taarifa inayokutana nayo.
Uwezo wa kuunganisha maarifa ni sharti la unda maarifa mapya. Kwa sababu hii, utafiti wa kina unaashiria hatua muhimu kuelekea lengo letu pana la kuendeleza AGI, ambalo tumekuwa tukiliona kwa muda mrefu kama lenye uwezo wa kutoa utafiti mpya wa kisayansi.
Deep research is built for people who do intensive knowledge work in areas like finance, science, policy, and engineering and need thorough, precise, and reliable research. It can be equally useful for discerning shoppers looking for hyper-personalized recommendations on purchases that typically require careful research, like cars, appliances, and furniture. Every output is fully documented, with clear citations and a summary of its thinking, making it easy to reference and verify the information. It is particularly effective at finding niche, non-intuitive information that would require browsing numerous websites. Deep research frees up valuable time by allowing you to offload and expedite complex, time-intensive web research with just one query.
Deep research independently discovers, reasons about, and consolidates insights from across the web. To accomplish this, it was trained on real-world tasks requiring browser and Python tool use, using the same reinforcement learning methods behind OpenAI o1, our first reasoning model. While o1 demonstrates impressive capabilities in coding, math, and other technical domains, many real-world challenges demand extensive context and information gathering from diverse online sources. Deep research builds on these reasoning capabilities to bridge that gap, allowing it to take on the types of problems people face in work and everyday life.
In ChatGPT, select ‘deep research’ in the message composer and enter your query. Tell ChatGPT what you need—whether it’s a competitive analysis on streaming platforms or a personalized report on the best commuter bike. You can attach files or spreadsheets to add context to your question. Once it starts running, a sidebar appears with a summary of the steps taken and sources used.
Deep research may take anywhere from 5 to 30 minutes to complete its work, taking the time needed to dive deep into the web. In the meantime, you can step away or work on other tasks—you’ll get a notification once the research is complete. The final output arrives as a report within the chat – in the next few weeks, we will also be adding embedded images, data visualizations, and other analytic outputs in these reports for additional clarity and context.
Compared to deep research, GPT‑4o is ideal for real-time, multimodal conversations. For multi-faceted, domain-specific inquiries where depth and detail are critical, deep research’s ability to conduct extensive exploration and cite each claim is the difference between a quick summary and a well-documented, verified answer that can be usable as a work product.
GPT-4o
Deep research
Deep research responds to the prompt in a highly detailed manner, providing side-by-side country-specific data for both top 10 developed and top 10 developing countries for easy reference and comparison. It uses that information to offer detailed market-entry recommendations that are informed and usable.
Utafiti wa kina ulifundishwa kwa kutumia mafunzo ya uimarishaji wa mwisho hadi mwisho kwenye shughuli ngumu za kuvinjari na kufikiria katika vikoa mbalimbali. Kupitia mafunzo hayo, ilijifunza mpango na kutekeleza mchakato wa hatua nyingi ili kupata data inayohitaji, kurudi nyuma na kujibu taarifa za wakati halisi inapohitajika. Muundo huu pia unaweza kuvinjari faili zilizopakuliwa na mtumiaji, kuchora na kurudia kwenye grafu kwa kutumia zana ya Python, kupachika grafu zilizozalishwa na picha kutoka kwenye tovuti katika majibu yake na kunukuu sentensi au vifungu maalum kutoka kwenye vyanzo vyake. Kutokana na mafunzo haya, inafikia viwango vipya vya juu katika tathmini kadhaa za umma zinazolenga matatizo ya ulimwengu halisi.
Kwenye Mtihani wa Mwisho wa Ubinadamu(fungua katika dirisha jipya), tathmini iliyotolewa hivi karibuni ambayo hujaribu AI katika masomo mengi kwa maswali ya kiwango cha kitaalamu, muundo unaoendesha utafiti wa kina ulipata alama mpya ya juu ya usahihi wa 26.6%. Jaribio hili lina zaidi ya maswali 3,000 ya chaguo nyingi na majibu mafupi katika zaidi ya masomo 100 kutoka isimu hadi sayansi ya roketi, classics hadi ikolojia. Ikilinganishwa na OpenAI o1, mafanikio makubwa zaidi yalionekana katika kemia, masomo ya kibinadamu na sayansi ya jamii na hisabati. Muundo unaoendesha utafiti wa kina ulionyesha mbinu inayofanana na ya kibinadamu kwa kutafuta kwa ufanisi taarifa maalum inapohitajika.
| Muundo | Usahihi (%) |
|---|---|
| GPT-4o | 3.3 |
| Grok-2 | 3.8 |
| Claude 3.5 Sonnet | 4.3 |
| Gemini Thinking | 6.2 |
| OpenAI o1 | 9.1 |
| DeepSeek-R1* | 9.4 |
| OpenAI o3-mini (medium)* | 10.5 |
| OpenAI o3-mini (high)* | 13.0 |
| Utafiti wa kina wa OpenAI** | 26.6 |
Kwenye GAIA(fungua katika dirisha jipya)1, kipimo cha umma kinachotathmini AI kwenye maswali ya ulimwengu halisi, muundo unaoendesha utafiti wa kina unafikia hali mpya ya sanaa (SOTA), ukiongoza kwenye ubao wa wanaoongoza(fungua katika dirisha jipya) wa nje. Kujumuisha maswali katika viwango vitatu vya ugumu, kukamilisha kwa mafanikio shughuli hizi kunahitaji uwezo ikiwa ni pamoja na hoja, ufasaha wa njia nyingi, kuvinjari wavuti na ustadi wa matumizi ya zana.
| GAIA | ||||
|---|---|---|---|---|
| Kiwango cha 1 | Kiwango cha 2 | Kiwango cha 3 | Wastani. | |
| SOTA ya awali(fungua katika dirisha jipya) | 67.92 | 67.44 | 42.31 | 63.64 |
| Utafiti wa kina (pass@1) | 74.29 | 69.06 | 47.6 | 67.36 |
| Utafiti wa kina (cons@64) | 78.66 | 73.21 | 58.03 | 72.57 |
Mifano ya shughuli za GAIA
Katika tathmini ya ndani ya shughuli za kiwango cha mtaalamu katika maeneo mbalimbali, utafiti wa kina ulitathminiwa na wataalamu wa kikoa kuwa umefanya otomatiki saa nyingi za uchunguzi mgumu wa mikono.
Kiwango cha Kupita dhidi ya Wito wa Zana wa Juu Zaidi
Mifano ya shughuli za kiwango cha mtaalamu
Kiwango cha Kufaulu katika Shughuli za Kiwango cha Utaalamu kwa Kadirio la Thamani ya Kiuchumi
Kiwango cha Kufaulu katika Shughuli za Kiwango cha Mtaalam kwa Masaa Yanayokadiriwa
Deep research unlocks significant new capabilities, but it’s still early and has limitations. It can sometimes hallucinate facts in responses or make incorrect inferences, though at a notably lower rate than existing ChatGPT models, according to internal evaluations. It may struggle with distinguishing authoritative information from rumors, and currently shows weakness in confidence calibration, often failing to convey uncertainty accurately. At launch, there may be minor formatting errors in reports and citations, and tasks may take longer to kick off. We expect all these issues to quickly improve with more usage and time.
Deep research in ChatGPT is currently very compute intensive. The longer it takes to research a query, the more inference compute is required. We are starting with a version optimized for Pro users today, with up to 100 queries per month. Plus and Team users will get access next, followed by Enterprise. We are still working on bringing access to users in the United Kingdom, Switzerland, and the European Economic Area.
All paid users will soon get significantly higher rate limits when we release a faster, more cost-effective version of deep research powered by a smaller model that still provides high quality results.
In the coming weeks and months, we’ll be working on the technical infrastructure, closely monitoring the current release, and conducting even more rigorous testing. This aligns with our principle of iterative deployment. If all safety checks continue to meet our release standards, we anticipate releasing deep research to Plus users in about a month.
Utafiti wa kina unapatikana leo kwenye tovuti ya ChatGPT, na utasambazwa kwa programu za simu na kompyuta za mezani ndani ya mwezi huu. Hivi sasa, utafiti wa kina unaweza kufikia wavuti wazi na faili zozote zilizopakizwa. Katika siku zijazo, utaweza kuunganisha kwenye vyanzo maalum zaidi vya data—kupanua ufikiaji wake wa rasilimali zinazotegemea usajili au za ndani—ili kufanya matokeo yake kuwa thabiti zaidi na ya kibinafsi.
Tukiangalia mbele zaidi, tunatarajia uzoefu wa kiwakala kuungana katika ChatGPT kwa ajili ya utafiti na utekelezaji usiolingana wa ulimwengu halisi. Mchanganyiko wa utafiti wa kina, ambao unaweza kufanya uchunguzi wa mtandaoni usiolingana, na Operator, ambaye anaweza kitendo katika ulimwengu halisi, utawasha ChatGPT kutekeleza shughuli zinazozidi kuwa za kisasa kwa ajili yako.
Nyongeza ya tarehe 3 Februari 2025: Tulifanya majaribio makali ya usalama, tathmini za kujiandaa, na hakiki za utawala kwenye toleo la awali la o3 ambalo linaendesha utafiti wa kina, na kulibainisha kuwa hatari ya Wastani(fungua katika dirisha jipya). Pia tulifanya majaribio ya ziada ya usalama ili kuelewa vyema hatari zinazoongezeka zinazohusiana na uwezo wa utafiti wa kina wa kuvinjari wavuti, na tumeongeza urekebishaji mpya. Tutaendelea kujaribu kwa kina na kufuatilia kwa karibu toleo lenye kikomo la sasa. Tutashiriki maarifa yetu ya usalama na hatua za ulinzi kwa ajili ya utafiti wa kina katika kadi ya mfumo tunapopanua ufikiaji kwa watumiaji wa Plus.
Footnotes
- 1
We found that the ground-truth answers for this dataset were widely leaked online and have blocked several websites or URLs accordingly to ensure a fair evaluation of the model.
Authors
Research Leads
Isa Fulford, Zhiqing Sun
Foundational Contributors
Alex Tachard Passos, Alexandra Barr, Allison Tam, Charlotte Cole, Hyung Won Chung, Jason Wei, Jon Blackman, Scott Mayer McKinney na Valerie Qi
Core Contributors
Research
Elaine Ya Le, Eric Mitchell, Eric Wallace, Hyung Won Chung, Ignasi Clavera, Leo Liu, Lorenz Kuhn, Louis Feuvrier, Max Schwarzer, Saachi Jain, Scottie Yan, Shunyu Yao, Vitchyr Pong
Deployment
Carpus Chang, Harry Zhao, Joseph Trasatti, Joshua Dickens, Matt Kaufer, Mike Trpcic, Minnia Feng, Neel Ajjarapu, Peter Vidani, Sean Fitzgerald
Contributors
Research
Ahmed El-Kishky, AJ Ostrow, Alexander Wei, Andrei Gheorghe, Andrew Kondrich, Andrey Mishchenko, Anuj Nair, Behrooz Ghorbani, Brydon Eastman, Chak Li, Foivos Tsimpourlas, Francis Song, Giambattista Parascandolo,Gildas Chabot, Hessam Bagherinezhad, Haitang Hu, Hongyu Ren, Henry Aspegren, Hunter Lightman, Ilya Kostrikov, Ilge Akkaya, James Lennon, Jean Harb, Jonathan Ward, Kai Chen, Katy Shi, Kevin Liu, Kevin Yu, Manuka Stratta, Marvin Zhang, Mengyuan Yan, Mostafa Rohaninejad, Noam Brown, Phoebe Thacker, Raz Goan, Reah Miyara, Spencer Papay, Taylor Gordon, Wenda Zhou, Wenlei Xie, Yash Patil, Yann Dubois, Youlong Cheng, Yushi Wang, Wyatt Thompson
+ all the contributors to o3.
Safety Systems
Adam Kalai, Alex Beutel, Andrea Vallone, Andy Applebaum, David Robinson, Elizabeth Proehl, Evan Mays, Grace Zhao, Irina Kofman, Jason Phang, Joaquin Quinonero Candela, Joel Parish, Kevin Liu, Kristen Ying, Lama Ahmad, Leon Maksin, Leyton Ho, Meghan Shah, Michele Wang, Miles Wang, Phillip Guo, Olivia Watkins, Owen Campbell-Moore, Patrick Chao, Sam Toizer, Samuel Miserendino, Sandhini Agarwal, Tejal Patwardhan, Tina Sriskandarajah, Troy Peterson, Yaodong Yu, Yunyun Wang
Deployment
Adam Koppel, Adam Wells, Adele Li, Andy Applebaum, Andrey Malevich, Andrew Duberstein, Andrew Howell, Anton Tananaev, Ashley Tyra, Brandon Walkin, Bryan Ashley, Cary Bassin, Cary Hudson, Cory Decareaux, Cristina Scheau, Derek Chen, Dibya Bhattacharjee, Drea Lopez, Eric Antonow, Eric Burke, Filippo Raso, Fotis Chantzis, Freddie Sulit, Harris Cohen, Heather Whitney, Jay Dixit, Jeffrey Han, Jen Robinson, Jessica Shieh, Joel Parish, Kan Wu, Kevin Gladstone, Kshitij Wadhwa, Leo Vandriel, Leyton Ho, Liang Chen, Madeline Christian, Mamie Rheingold, Matt Jones, Michelle Fradin, Mike McClay, Mingxuan Wang, Nacho Soto, Niko Felix, Patrick Delaney, Paul McMillan, Philip Pronin, Rodrigo Riaza Perez, Samuel Miserendino, Scott Ethersmith, Steven Baldwin, Thomas Dimson, Tomo Hiratsuka, Yaming Lin, Yara Khakbaz, Yining Chen
Leadership
Akshay Nathan, Greg Brockman, Hannah Wong, Jakub Pachocki, Jerry Tworek, Johannes Heidecke, Josh Tobin, Liam Fedus, Mark Chen, Mia Glaese, Nick Turley, Sam Altman na Wojciech Zaremba