OpenAI Research | Publications

Research

Safety

July 15, 2026

GPT-Red: Strengthening Robustness Through Self-Improvement

Introducing GPT-Red, OpenAI's automated red-teaming system that uses self-play to improve AI safety, alignment, and robustness to prompt injection.

Safety

July 9, 2026

GPT‑5.6 System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards we have built for this launch—our most robust yet—are built to deliver these models safely and at scale, around the world.

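Research

July 8, 2026

Separating Meaningful Signal from Noise in Coding Evaluations

A new OpenAI analysis identifies several problems in SWE-Bench Pro, a widely used coding benchmark, raising concerns about the reliability and accuracy of AI model evaluations.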

Safety

July 8, 2026

GPT‑Live System Card

GPT-Live-1 and GPT-Live-1 mini are a new generation of voice models designed to make conversations with AI feel more natural and intelligent.

Research

June 30, 2026

Introducing GeneBench-Pro

GeneBench-Pro is a new benchmark that uses complex data from real research settings to evaluate AI performance in genomics, biology, and scientific research.

Safety

June 26, 2026

GPT‑5.6 Preview System Card

GPT-5.6 is a new family of three models: Sol, our new flagship model; Terra, a capable lower-cost option; and Luna, our fastest and most cost-efficient model. The safeguards we have built for this launch – our most robust yet – are built to deliver these models safely and at scale, around the world.

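Research

June 17, 2026

A Semi-Autonomous AI Chemist Improves Challenging Reactions in Medicinal Chemistry

OpenAI and Molecule.one present a case in which a semi-autonomous AI chemist built on GPT-5.4 advanced medicinal chemistry research by improving reactions important to drug discovery.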

Research

June 17, 2026

Introducing LifeSciBench

LifeSciBench is an expert-written, expert-reviewed benchmark for evaluating how well AI systems perform real-world life science research tasks and decision-making.

Safety

May 5, 2026

GPT-5.5 Instant System Card