OpenAI Research | Publications

Research

Publication

August 1, 2026

Ten advances in mathematics and theoretical computer science

OpenAI shares new results on long-standing open problems in mathematics and theoretical computer science, including advances in geometry, cryptography, and complexity.

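Research

July 29, 2026

Enabling two settings triples ARC-AGI-3 benchmark scores

How two API settings improve GPT-5.6's score and efficiency on ARC-AGI-3 by preserving reasoning and enabling compaction.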

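Publication

July 28, 2026

Scientific computing in the age of agentic AI

A new field report describes how scientists use AI coding agents to modernize scientific computing, accelerating software development and scientific discovery in genomics and other fields.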

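Safety

July 15, 2026

GPT-Red: Unlocking self-improvement for greater robustness

Explore GPT-Red, OpenAI's automated red-teaming system, which uses self-play to improve AI safety, alignment, and resistance to prompt injection attacks.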

Safety

July 9, 2026

GPT‑5.6 System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards for this launch, our most robust yet, are built to deliver these models safely and at scale around the world.

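Research

July 8, 2026

Separating signal from noise in coding evaluations

A new OpenAI analysis reveals problems with the popular coding benchmark SWE-Bench Pro, raising concerns about the reliability and accuracy of AI model evaluations.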

Safety

July 8, 2026

GPT‑Live System Card

GPT-Live-1 and GPT-Live-1 mini are a new generation of voice models designed to make conversations with AI feel more natural and intelligent.

Research

June 30, 2026

Introducing GeneBench-Pro

GeneBench-Pro is a new benchmark that uses complex real-world datasets to test AI performance in genomics, biology, and scientific research.

Safety

June 26, 2026

GPT‑5.6 Preview System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards for this launch, our most robust yet, are built to deliver these models safely and at scale around the world.