OpenAI 研究

研究

切换卡片以显示媒体

切换卡片以隐藏媒体

安全

2026年7月15日

GPT-Red：解锁稳健性自我优化能力

了解 GPT-Red：OpenAI 的自动化红队测试系统，通过自我博弈提升 AI 安全性、对齐能力和提示注入稳健性。

产品

2026年7月9日

GPT-5.6：随宏大目标灵活扩展的前沿智能

每个 Token 都蕴含更多智能，更优的每美元性能，并可按需为最具挑战的任务提供更强能力

安全

2026年7月9日

GPT‑5.6 System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards we have built for this launch—our most robust yet—are built to deliver these models safely and at scale, around the world.

研究

2026年7月8日

剥离编程评估中的噪音，提取真实信号

OpenAI 的一项最新分析揭示了流行编程基准测试 SWE-Bench Pro 中存在的问题，引发了对 AI 模型评估可靠性与准确性的担忧。

产品

2026年7月8日

GPT-Live 发布

新一代语音模型，支持自然的人类-AI 交互，现已驱动 ChatGPT 语音。

安全

2026年7月8日

GPT‑Live System Card

GPT-Live-1 and GPT-Live-1 mini are a new generation of voice models designed to make conversations with AI feel more natural and intelligent.

研究

2026年6月30日

推出 GeneBench-Pro

推出 GeneBench-Pro：一项全新的基准测试，旨在利用复杂、真实的现实世界数据集，评估 AI 在基因组学、生物学及科学研究领域的性能表现。

产品

2026年6月26日

预览 GPT-5.6 Sol：新一代模型

OpenAI 预览 GPT-5.6 Sol：新一代模型，在编码、科学和网络安全方面能力更强，并配备其最先进的安全栈。

安全

2026年6月26日

GPT‑5.6 Preview System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards we have built for this launch – our most robust yet – are built to deliver these models safely and at scale, around the world.