2025年4月16日

OpenAI o3 と o4-mini が登場

全てのツールにアクセスが可能な、これまでで最もスマートかつ高性能なモデル

読み込んでいます...

2025年6月10 日更新情報：OpenAI o3‑pro は、ChatGPT の Pro ユーザーと当社の API で利用できるようになりました。o3‑pro は、OpenAI o1‑pro のように OpenAI の最もインテリジェントなモデルである OpenAI o3 のバージョンで、より長く思考し、最も信頼できる回答を提供するように設計されています。詳細についてはリリースノート⁠（新しいウィンドウで開く）をご覧ください。

本日、OpenAI o3 および o4-mini をリリースします。回答する前に時間をかけて思考するよう学習させた o シリーズの最新モデルです。これまでにリリースした中で最もスマートなモデルとして、一般ユーザーから専門の研究者まで、全ての人に ChatGPT の機能の躍進を実感していただけるでしょう。リーズニングモデルは、ウェブの検索、Python によるアップロードされたファイルなどのデータの分析、視覚的な入力に対する深い論理的思考、画像の生成など、ChatGPT 内の全ツールを初めてエージェント的に使用し、組み合わせることができます。特に、これらのモデルは、より複雑な問題を解決するために、いつ、どのようにツールを使用するかを論理的に思考し、詳細で思慮深い答えを通常1分以内に適切な出力形式で生成するよう学習しています。これで多面的な質問により効果的に対応できるようになり、ユーザーに代わって独立してタスクを実行できる、よりエージェント的な ChatGPT へ一歩前進します。最先端の論理的思考と全ツールへのアクセスを組み合わせることで、学術的なベンチマークや実世界のタスクにおいて大幅に強化されたパフォーマンスを発揮し、インテリジェンスと有用性の両面で新たな基準を打ち立てます。

変更点

OpenAI o3 は、コーディング、数学、科学、視覚認識などのフロンティアを押し広げる、最も強力なリーズニングモデルです。Codeforces や SWE-bench（カスタムモデル固有のスキャフォールドを構築することなく）、MMMU などのベンチマークで新たな最先端を打ち立てます。多面的な分析が求められ、答えがすぐには導き出せような複雑なクエリに最適です。特に、画像や図表、グラフィックの分析など、視覚的なタスクを得意としています。社外専門家による評価では、o3 は OpenAI o1 と比較して、困難な実世界のタスクにおける大きなミスが20%少なく、特にプログラミングやビジネス／コンサルティング、創造的なアイデア出しのような分野で優れています。初期段階のテスターは、思考のパートナーとしての分析的な厳密さを特筆し、特に生物学や数学、工学の文脈において、新しい仮説を生み出し、批判的に評価する能力を強調しました。

OpenAI o4-mini は、高速でコスト効率の高い論理的思考に最適化された小型モデルで、特に数学やコーディング、視覚的タスクにおいて、そのサイズとコストに対して驚異的な性能を発揮します。AIME 2024 および2025でベンチマークされたモデルとして最高の性能を示しています。コンピュータにアクセスすると AIME 試験の難しさはかなり軽減されますが、o4-mini が AIME 2025 で、Python インタープリターにアクセスできるときに 99.5% pass@1 (100% consensus@8) を達成したことも特筆に値すると考えます。これらの結果を、ツールにアクセスしないモデルのパフォーマンスと比較するべきではありませんが、o4-mini が使用可能なツールをいかに効果的に活用するかを示す一例になっています。o3 も AIME 2025 で、ツールの使用による同様の成果向上を示しました (98.4% pass@1、100% consensus@8)。

専門家の評価では、o4-mini はデータサイエンスのような分野のほかに、非 STEM のタスクでも前身の o3‑mini を上回っています。効率性に優れる o4-mini は、 o3 よりもかなり高い使用制限をサポートし、推論の恩恵を受ける質問のための強力な大容量、高スループットのオプションとなっています。社外専門家による評価では、両モデルともインテリジェンスが向上し、ウェブソースが含まれるようになったことで、従来モデルよりも指示に従う精度が改善され、より有用で検証可能な回答が得られるようになったとされました。また、この両モデルは、特にメモリや過去の会話を参照してより個人に合わせた関連性の高い応答を行うことから、旧バージョンのリーズニングモデルと比較して、さらに自然で会話するような印象を与えるはずです。

マルチモーダル

コーディング

All SWE-bench evaluation runs use a fixed subset of n=477 verified tasks which have been validated on our internal infrastructure.

指示の遵守とツールのエージェント的な使用

全てのモデルは、ChatGPT の「o4-mini-high」のようなバリエーションと同様に、高い「推論努力」設定において評価されています。

強化学習の継続的拡大

OpenAI o3 の開発全体を通して、大規模な強化学習は GPT シリーズの事前学習で観察されたのと同じ「計算量が多い＝性能が高い」という傾向を示すことを観察しました。今回は RL でスケーリングパスを再トレースすることで、学習計算と推論時間リーズニングの両方でさらに1桁押し上げましたが、それでも明らかな性能向上が見られ、モデルの性能は思考時間が長いほど向上することが検証されました。OpenAI o1 と同等のレイテンシとコストで、o3 は ChatGPT でより高い性能を発揮します。また、長い時間思考を与えると、その性能が向上を続けることも確認されています。

また、強化学習を通してツールを使うように両モデルに学習させました。ツールの使い方だけでなく、ツールを使う状況について論理的に思考することも教えています。特に、視覚的な論理的思考や多段階のワークフローを伴うようなオープンエンドの状況において、望まれる結果に応じてツールを使う能力は高い能力を発揮します。初期段階のテスターが報告したように、この改善は学術的なベンチマークと実世界のタスクの両方に反映されています。

画像を使って考える

これらのモデルは、初めて、画像を Chain-of-Thought に直接組み込むことができるようになりました。画像を見るだけでなく、それを使って考えるのです。これにより、視覚とテキストによる論理的思考を融合させた新たな問題解決のクラスが実現し、マルチモーダルベンチマークにおける最先端の性能に反映されています。

ホワイトボードの写真、教科書の図表、手描きのスケッチなどをアップロードすると、イメージがぼやけている、逆さまになっている、低画質という場合でも、モデルはそれを解釈できます。ツールを使うことで、モデルはリーズニングプロセスの一環として、画像をその場で回転させる、ズームする、変形させるなどの操作ができます。

これらのモデルは、視覚認識タスクにおいてクラス最高の精度を実現し、これまで不可能だった問題を解決できるようになります。詳しくは視覚的な論理的思考の研究に関するブログ⁠をご覧ください。

ツールのエージェント的な使用へ

OpenAI o3 および o4-mini は ChatGPT 内の全ツールにアクセスできます。API の Function Calling でユーザー独自のカスタムツールも私用できます。これらのモデルは、問題を解決するために、いつ、どのようにツールを使用するかを論理的に思考し、詳細で思慮深い答えを通常1分以内に適切な出力形式で生成するよう学習しています。

プロンプト例：“How will summer energy usage in California compare to last year?” (カリフォルニアの夏のエネルギー使用量は昨年と比べてどうか？)モデルは、複数のツール呼び出しを連結して、ウェブで公共事業のデータを検索し、Python コードを書いて予測を構築し、グラフやイメージを生成し、予測を裏付ける主な要因を説明できます。論理的思考によって、モデルは受け取った情報に適宜反応し、方向転換できます。例えば、検索プロバイダーを利用してウェブを複数回検索し、結果を見て、さらに情報が必要なら新しい検索を試すことができます。

この柔軟で戦略的なアプローチにより、モデルに内蔵された知識にない最新情報へのアクセス、長い論理的思考、合成、モダリティを超えた出力生成を必要とするタスクに対応できます。

すべての例は OpenAI o3 で行いました。

OpenAI o3

Solve the following very hard math problem using your in-built knowledge of the literature:

Construct a degree 19 polynomial \ (p(x)) \ in \ mathbb{C}[x] such that \ (X := \ {\, p(x) = p(y) \, \} \ subset \ mathbb{P}^{1} \ times \ mathbb{P}^{1}\ ) has at least 3 (but not all linear) irreducible components over \ mathbb{C}. Choose \ (p(x)\ ) to be odd, monic, have real coefficients and linear coefficient \ (-19\ ) and calculate \ (p(19)\ ).

多項式に関する55秒間の論理的思考

The curve defined by p(x) = p(y) factors if p(x) is decomposable or has repeated factors. Actually, p(x)-p(y) always factors, and it’s divisible by (x-y). This gives one factor, and the rest is a degree d-1 polynomial in x and y. For the curve to have at least three irreducible components, p(x)-p(y) must factor into at least three nontrivial factors. Each factor corresponds to a component in projective space, and at least one must be nonlinear.
I’m checking the Dickson polynomials D_n(x,a) and focusing on their coefficients. With a=1, it seems the leading coefficient is always 1, confirming that Dickson polynomials are monic. The linear coefficient for odd n is following a predictable sign pattern: for n=19, it gives a coefficient of -19, which is exactly what I need. So the Dickson polynomial D_n(x,1) with odd n fits nicely with the linear coefficient I was looking for.
I’ve confirmed that the linear coefficient for p(x)=D19(x)p(x) = D_{19}(x)p(x)=D19(x) is correct, with the necessary scaling factor to match. Now for the next step, I need to compute D19(19)D_{19}(19)D19(19). Since Tn(z)T_n(z)Tn(z) grows quickly for real z>1z > 1z>1, I plan to compute this value using Python, leveraging the recurrence to avoid mistakes—it’s going to be a big number. I’ll make sure to present the numeric result neatly so it’s understandable since the coefficients are integers.
The Dickson polynomial D_{19}(x,1) is confirmed to have the required properties: monic, odd, real coefficients, and a linear coefficient of -19. The factorization of D_{19}(x) into irreducible components is also verified and satisfies the condition of having at least 10 components (1 linear + 9 quadratics). To compute p(19), we’ll use the recurrence relation for Dickson polynomials, but there’s also a closed-form expression involving cosine that could be handy here. I’ll proceed accordingly!
I confirmed the linear coefficient is working correctly, which means now I need to calculate p(19)=D19(19)p(19) = D_{19}(19)p(19)=D19(19). This involves evaluating T19(9.5)T_{19}(9.5)T19(9.5), which grows rapidly. To ensure I get an accurate result, I’ll compute this using recurrence in Python. The result is a large number: 1,876,572,071,974,094,803,391,179. I’ll use the commas to make it easier to read: p(19)=1,876,572,071,974,094,803,391,179p(19) = 1,876,572,071,974,094,803,391,179p(19)=1,876,572,071,974,094,803,391,179.
For a = 1, there’s a relationship between Dickson polynomials and Chebyshev polynomials: D_n(x, 1) = 2^(n-1) T_n(x/2). Checking this, we see that the definition of D_n(u + 1/u) aligns well with Chebyshev’s formula. By setting x = 2 cos(θ), we find that D_n(x) equals 2 T_n(x/2). Importantly, the leading coefficient works out to 1, confirming the identity holds. This is nice!
Now, I need to verify that p(x)−p(y)p(x) - p(y)p(x)−p(y) factors as desired, relying on the known Dickson polynomial factorization. According to the literature (like Fried and Jarden), the difference Dn(x)−Dn(y)D_n(x) - D_n(y)Dn(x)−Dn(y) factors into a product involving (x−y)(x - y)(x−y) and other expressions. I’ll provide the explicit polynomial D19(x,1)D_{19}(x, 1)D19(x,1), list the coefficients, and explain how it satisfies the desired properties: monic, odd, etc. Lastly, I’ll compute p(19)p(19)p(19).

Analyzed

Python

1from math import comb, cos, acos
2# Let's compute Dickson D_n(x,a=1) recursively
3def dickson(n,x,a=1):
4    if n==0:
5        return 2
6    if n==1:
7        return x
8    Dm2=2
9    Dm1=x
10    for k in range(2,n+1):
11        D= x*Dm1 - a*Dm2
12        Dm2, Dm1 = Dm1, D
13    return Dm1
14
15d19 = dickson(19,19,1)
16d19

1876572071974094803391179

Analyzed

Python

1d19

1876572071974094803391179

Analyzed

Python

1import locale, sys, math, json, textwrap, re, decimal

Analyzed

Python

1"{:,}".format(d19)

'1,876,572,071,974,094,803,391,179'

Analyzed

Python

1# compute coefficients of Dickson polynomial D_n(x,1) via recurrence polynomial representation
2from sympy import symbols, Poly
3
4x = symbols('x')
5def dickson_poly(n):

Poly(x**19 - 19*x**17 + 152*x**15 - 665*x**13 + 1729*x**11 - 2717*x**9 + 2508*x**7 - 1254*x**5 + 285*x**3 - 19*x, x,

Analyzed

Python

1poly19.as_expr()

x**19 - 19*x**17 + 152*x**15 - 665*x**13 + 1729*x**11 - 2717*x**9 + 2508*x**7 - 1254*x**5 + 285*x**3 - 19*x

Done

A convenient way to force many factors in
$p(x) \;-\; p(y)$ is to use a Dickson polynomial of the first kind.
For a parameter $a \in \mathbb{C}$ in math these are defined by

$D_0\!\left(x,\, a\right) = 2,\; D_1\!\left(x,\, a\right) = x,\; D_n\!\left(x,\, a\right) = x\,D_{n-1}\!\left(x,\, a\right) - a\,D_{n-2}\!\left(x,\, a\right)\quad (n \ge 2),$

and satisfy the functional identity

$D_n\!\left(u + \frac{a}{u},\, a\right) = u^{n} + \frac{a^{n}}{u^{n}}.$

1. Choosing the polynomial

Take $a = 1$ and $n=19$ .
Because $n$ is odd, $D_{19}\!\left(x,\, 1\right)$ is an odd polynomial; the recurrence immediately shows it is monic and has integer (hence real) coefficients.
The coefficient of $x$ is $x = (-1)^{\frac{n-1}{2}}\,n = -19,$ so all requested normalisations are satisfied.

Writing it out gives

$\boxed{% p(x) = D_{19}\!\left(x,1\right) = x^{19} - 19x^{17} + 152x^{15} - 665x^{13} + 1729x^{11} - 2717x^{9} + 2508x^{7} - 1254x^{5} + 285x^{3} - 19x% }\,.$

2. Irreducible components of $fX \;=\; \bigl\{\, p(x) = p(y) \,\bigr\}$

For Dickson polynomials one has the classical factorisation

$D_n\!\left(x,1\right) \;-\; D_n\!\left(y,1\right) \;=\; (x - y)\, \prod_{j=1}^{\frac{n-1}{2}} \Bigl( x^{2} + y^{2} - 2\cos\!\bigl(\tfrac{2\pi j}{n}\bigr)\,xy - 4\sin^{2}\!\bigl(\tfrac{\pi j}{n}\bigr) \Bigr),$

where each quadratic factor is irreducible over $\mathbb{C}$ (see, e.g., Lidl–Niederreiter, Finite Fields, §3.6).
For $n=19$ this gives

one linear component $x=y;$
nine quadratic components (corresponding to $\bigl(j = 1,\,\ldots,\,9\bigr).$

Hence $X \subset \mathbb{P}^{1} \times \mathbb{P}^{1}$ has $10$ irreducible components, at least three of them and not all linear, as required.

3. Evaluating $p(19)$

Using the recurrence (or the identity $D_n\!\left(x,\,1\right) \;=\; 2\,T_n\!\bigl(\tfrac{x}{2}\bigr)$ with Chebyshev $T_n$ one obtains

$p(19) \;=\; D_{19}\!\left(19,\,1\right) \;=\; 1\,876\,572\,071\,974\,094\,803\,391\,179.$

Thus the polynomial above fulfils every condition, and

$\boxed{% p(19) \;=\; 1{,}876{,}572{,}071{,}974{,}094{,}803{,}391{,}179 }.$

OpenAI o1

Solve the following very hard math problem using your in-built knowledge of the literature:

1分19秒間の思考

I’m piecing together a degree 19 polynomial with only odd powers and a linear coefficient of -19, like: p(x)=x19+a17x17+…+a1xp(x) = x^{19} + a_{17}x^{17} + \ldots + a_1xp(x)=x19+a17x17+…+a1x.

I’m examining if the polynomial p(x)=x19+a17x17+…+a3x3−19xp(x) = x^{19} + a_{17}x^{17} + \ldots + a_3x^3 - 19xp(x)=x19+a17x17+…+a3x3−19x has a notable classical form, considering symmetry or factorization for multiple irreducible components.

To ensure p(x)p(x)p(x) has three or more irreducible components, I’m considering the factorization of p(x)−p(y)p(x) - p(y)p(x)−p(y) into at least three irreducible factors in C[x,y]\mathbb{C}[x,y]C[x,y].

I’m working through x19−y19x^{19} - y^{19}x19−y19 to ensure it balances linear and nonlinear factors, aiming for a polynomial with at least three irreducible components that aren’t all linear.

Taking a closer look at how p(x)−p(y)p(x) - p(y)p(x)−p(y) can be factored into three distinct polynomials in xxx, aiming to achieve the desired irreducible components.

I’m thinking about how composing ppp with simpler polynomials, like xm−cx^m - cxm−c, could result in multiple irreducible components, achieving at least three distinct factors.

I’m exploring the factorization of p(x)−p(y)p(x) - p(y)p(x)−p(y) with cyclotomic polynomials. This method generates nnn linear irreducible factors, which might not achieve the required three distinct components.

I’m focusing on constructing an "odd" and monic polynomial, ensuring all exponents are odd and the leading term is x19x^{19}x19. This approach aims to achieve the desired factorization.

I’m working with the Chebyshev polynomials to ensure p(x)−p(y)p(x) - p(y)p(x)−p(y) has at least three irreducible components, leveraging their even or odd properties to manage coefficients.

Considering the odd Chebyshev polynomial T19(x)T_{19}(x)T19(x) for potential factorization, while also examining composite polynomials of lower degrees to enhance factorization properties.

I’m analyzing how a polynomial composed of lower-degree polynomials, like (x3−3x)k(x^3 - 3x)^k(x3−3x)k, could yield p(x)−p(y)p(x) - p(y)p(x)−p(y) with at least three irreducible components.

Piecing together polynomials with integer coefficients, like (x3−3x)k(x^3 - 3x)^k(x3−3x)k, to achieve factorization in p(x)−p(y)p(x) - p(y)p(x)−p(y) with at least three irreducible components, aiming for degree 19.

I’m examining how p(x)=(x3−3x)kp(x) = (x^3 - 3x)^kp(x)=(x3−3x)k or p(x)=x19−19xp(x) = x^19 - 19xp(x)=x19−19x might yield p(x)−p(y)p(x) - p(y)p(x)−p(y) with three irreducible components, focusing on ensuring the polynomial is of degree 19.

I’m analyzing p(x)−p(y)p(x) - p(y)p(x)−p(y) into (x−y)(x18+x17y+…+y18−19)(x-y)(x^{18} + x^{17}y + \ldots + y^{18} - 19)(x−y)(x18+x17y+…+y18−19) to identify irreducible components. The polynomial factor x18+x17y+…+y18−19x^{18} + x^{17}y + \ldots + y^{18} - 19x18+x17y+…+y18−19 remains challenging to factor further.

I’m thinking through a new approach: adjusting the Chebyshev polynomial T19(x)T_{19}(x)T19(x) to make it monic, aiming to achieve at least three irreducible components in p(x)−p(y)p(x) - p(y)p(x)−p(y).

OK, let’s think through. I’m tweaking T19(x)T_{19}(x)T19(x)'s coefficients to keep the polynomial monic and check if it simplifies into multiple irreducible components.

I’m thinking over various factorization strategies for a monic polynomial of degree 19. The goal is to ensure at least one non-linear factor and achieve at least three irreducible components in p(x)−p(y)p(x) - p(y)p(x)−p(y).

I’m updating my approach to factor x2+xy+y2−1x^2 + xy + y^2 - 1x2+xy+y2−1 over complex numbers by exploring change of variables and third roots of unity. This helps in achieving the desired polynomial structure.

I’m combining x3−xx^3 - xx3−x with x19−19xx^{19} - 19xx19−19x to tweak the polynomial's coefficients and complexity, aiming to include more non-linear factors for the desired irreducibility.

Piecing together p(x)=x19−x3−19xp(x) = x^{19} - x^3 - 19xp(x)=x19−x3−19x to ensure it’s odd, monic, and has the required linear coefficient. This targets the desired irreducible factorizations.

I’m mapping out the second factor x18+…+y18−x2−xy−y2−19x^{18} + \ldots + y^{18} - x^2 - xy - y^2 - 19x18+…+y18−x2−xy−y2−19 to see if it can split into three irreducible components.

I’m considering methods to factorize the polynomial by making it divisible by x−yx - yx−y and ensuring the remaining factor splits into exactly two irreducible components, aiming for a less complex factorization.

I’m examining the polynomial x18+x17y+…+y18x^{18} + x^{17}y + \ldots + y^{18}x18+x17y+…+y18 and noting it simplifies to x19−y19x−y\frac{x^{19} - y^{19}}{x - y}x−yx19−y19, which might reveal further irreducible factors.

I’m mapping out the polynomial x(x18−1)−y(y18−1)x−y\frac{x(x^{18}-1) - y(y^{18}-1)}{x-y}x−yx(x18−1)−y(y18−1) for further factorization. This might yield just two irreducible components, which falls short of three.

I’m exploring whether the polynomial p(x)=x19−(x3)p(x) = x^19 - (x^3)p(x)=x19−(x3) can be expressed as the composition of a smaller polynomial with xkx^kxk, potentially revealing multiple irreducible components.

I’m examining if the polynomial T19(x)T_{19}(x)T19(x) can be factored using trigonometric identities, seeking partial factorization without fully linearizing, to gain multiple irreducible components.

OK, let me see. Trigonometric identities show multiple solutions for θ\thetaθ and ϕ\phiϕ, likely leading to 19 or 38 linear factors for T19(x)T_{19}(x)T19(x).

I’m examining how the equation arccos⁡x=±arccos⁡y+2kπ19\arccos x = \pm \arccos y + \frac{2k\pi}{19}arccosx=±arccosy+192kπ leads to possible multiple branches, potentially irreducible in the algebraic sense, and needing an example with at least three non-linear irreducible components.

OK, let me see. I’m pulling together details on the polynomial x3(x2−1)8+xx^3 (x^2 - 1)^8 + xx3(x2−1)8+x. It’s monic, with x19x^{19}x19 as the leading term, ensuring it meets key criteria.

I’m noting that the polynomial p(x)=x3(x2−1)8+xp(x) = x^3 (x^2 - 1)^8 + xp(x)=x3(x2−1)8+x is odd, with its linear coefficient patterning to -19.

I’m figuring out how to adjust the linear coefficient of polynomial p(x)p(x)p(x) to -19 by adding a constant factor ccc in front of xxx to ensure it’s adequately modified.

By tweaking the linear coefficient of p(x)=x3(x2−1)8+cxp(x) = x^3 (x^2 - 1)^8 + c xp(x)=x3(x2−1)8+cx to -19, it ensures the polynomial remains monic, odd, and real.

I’m working on adding a constant factor ccc to xxx in p(x)=x3(x2−1)8+cxp(x) = x^3 (x^2 - 1)^8 + c xp(x)=x3(x2−1)8+cx to ensure the linear coefficient is -19, making the polynomial monic, odd, and real.

I’m piecing together the polynomial p(x)=x3(x−1)8(x+1)8p(x) = x^3 (x-1)^8 (x+1)^8p(x)=x3(x−1)8(x+1)8 and considering how p(x)−p(y)p(x) - p(y)p(x)−p(y) might factor with factors like x−yx-yx−y.

I’m confirming p(x)p(x)p(x) is monic, odd, and real, with a linear coefficient of -19, by expanding x3(x2−1)8x^3 (x^2 -1)^8x3(x2−1)8 to set the constant ccc.

I found that for p(x) to have a coefficient of -19 for the x term, c must be -19. The polynomial, p(x), incorporates this constant and preserves the original polynomial's structure.

Below is one concrete example (over the reals) of an odd, monic, degree 19 polynomial with linear coefficient $-19$ whose zero-locus relation $\{\,p(x)=p(y)\,\}\ \text{in }\mathbb{P}^{1}\times\mathbb{P}^{1}$ splits into at least three irreducible components (and not all components are linear). We then compute $p(19)$ .

1. The polynomial

Consider

$p(x) \;=\; x^{3}\,\bigl(x^{2}-1\bigr)^{8} \;-\; 19x.$

We can expand

$x^{3}\,\bigl(x^{2}-1\bigr)^{8} \;=\; x^{19}\;-\;8x^{17}\;+\;28x^{15}\;-\;56x^{13}\;+\;70x^{11} \;-\;56x^{9}\;+\;28x^{7}\;-\;8x^{5}\;+\;x^{3}.$

Hence

$p(x) \;=\; x^{19} \;-\; 8x^{17} \;+\; 28x^{15} \;-\; 56x^{13} \;+\; 70x^{11} \;-\; 56x^{9} \;+\; 28x^{7} \;-\; 8x^{5} \;+\; x^{3} \;-\; 19x.$

Checking the required properties

Degree 19, monic:
The leading term is $x^{19}$ with coefficient $1$ .

Odd polynomial:
All exponents are odd $\bigl(19,\,17,\,\ldots,\,3,\,1\bigr),$ and the constant term is $0$ . Indeed,

$p(-x) \;=\; -\,p(x),$

which makes $p$ an odd polynomial.

Real coefficients:
All coefficients are real integers.

Linear coefficient $-19$ :
The coefficient of $x$ is clearly $-19$ .

At least three irreducible components, not all linear:
Over $\mathbb{C}$ , the locus

$X \;=\; \bigl\{\, (x,y) : p(x)=p(y) \,\bigr\} \;\subset\; \mathbb{P}^{1}\times\mathbb{P}^{1}$

factors in part because $p(x)$ has (for instance) the repeated factors $\bigl(x^{2}-1\bigr)^{8}$ in the big piece $x^{3}\,\bigl(x^{2}-1\bigr)^{8}$ . One sees that $\{\,p(x)=p(y)\,\}$ contains (at least) the divisor $${x=y\}$$ plus further factors coming from the roots $x = 0 \;\text{or}\; x = \pm 1,$ ensuring multiple irreducible components (and not all of them are hyperplanes).

2. Computing $p(19)$

A direct substitution $x=19$ gives a large integer. Symbolic expansion or a computer algebra system yields

$p(19) \;=\; 1934999285524070399999639.$

Hence the required value is

\boxed{\,p(19) \;=\; 1934999285524070399999639\,}. \] :contentReference[oaicite:0]{index=0}

o1 が正しい応答をできない一方、OpenAI o3 は検索を使わずとも正しく応答を取得します。

コスト効率の高いリーズニングを進歩させる

コストと性能：o3‑mini と o4-mini

コストと性能：o1 と o3

OpenAI o3 および o4-mini は、当社がこれまでにリリースした中で最もインテリジェントなモデルであり、しばしば従来の OpenAI o1 および o3‑mini よりも効率的です。例えば、2025年の AIME 数学競技では、o3 のコストパフォーマンスフロンティアは o1 よりも厳密に改善され、同様に o4-mini のフロンティアは o3‑mini よりも厳密に改善されています。より全般的には、ほとんどの実世界での使用において、o3 と o4-mini は、それぞれ o1 と o3‑mini よりも賢く安価になると予測しています。

安全性

モデルの性能が向上するごとに、安全性も相応に向上します。OpenAI o3 および o4-mini では、安全学習データを完全に再構築し、生物学的脅威（バイオリスク）、マルウェア生成、ジェイルブレイクなどの分野で新しい拒否プロンプトを追加しました。この一新されたデータにより、o3 および o4-mini は、社内の拒否ベンチマーク（例：指示階層⁠、ジェイルブレイク）で強力な性能を達成しました。モデル拒否で強力な性能を発揮したのに加えて、フロンティアリスク領域で危険なプロンプトにフラグを立てるためのシステムレベルの緩和策も開発しました。画像生成⁠で当社が以前に行った作業と同様に、人間が書いた解釈可能な安全仕様から動作するリーズニング LLM モニターを学習させました。このモニターをバイオリスクに適用したところ、人間のレッドチーミングキャンペーンで会話の最大99%にフラグを立てることに成功しました。

両モデルとも、これまでで最も厳格な安全性プログラムでストレステストを行いました。更新版 Preparedness Framework⁠ に従い、このフレームワークが網羅する3つの追跡可能な能力分野、すなわち生物化学、サイバーセキュリティ、AI の自己改善で o3 および o4-mini を評価しました。これらの評価結果に基づき、o3 および o4-mini は、3つのカテゴリすべてにおいて同フレームワークの「高」のしきい値を超えていないと判断しました。これらの評価の詳細な結果は、付随する System Card⁠ に掲載しています。

Codex CLI：ターミナルでのフロンティアリーズニング

新しい実験もご紹介します：Codex CLI は、ターミナルから実行できる軽量なコーディングエージェントです。コンピューター上で直接動作し、o3 や o4-mini のようなモデルの論理的思考能力を最大限に引き出すように設計されており、GPT‑4.1⁠ のような追加の API モデルもサポートする予定です。

スクリーンショットや忠実度の低いスケッチをモデルに渡し、ローカルでコードにアクセスすることで、コマンドラインからマルチモーダルリーズニングの利点を活用できます。当社は、モデルをユーザーとそのコンピューターに接続するための最小限のインターフェイスであると考えています。Codex CLI は現在、github.com/openai/codex⁠（新しいウィンドウで開く）にて完全にオープンソース公開しています。

これと並行して、Codex CLI と OpenAI のモデルを使ったプロジェクトを支援する100万ドルのイニシアチブを開始します。API クレジットの形で、25,000米ドル単位で助成金の申請を募集、審査します。プロポーザルはこちらから提出できます。

アクセス

ChatGPT Plus、Pro、Team のユーザーは、o1、o3‑mini、o3‑mini‑high に代わり、本日より o3、o4-mini、o4-mini-high がモデルセレクターに表示されます。ChatGPT Enterprise および Edu のユーザーは、1週間でアクセスできるようになります。無料ユーザーは、クエリを送信する前に、コンポーザーで「Think」を選択することで o4-mini を試すことができます。全てのプランにおけるレート上限は、前回のモデルから変更ありません。

数週間以内にツールをフルサポートした OpenAI o3‑pro をリリースする予定です。現時点では、Pro ユーザーは引き続き o1‑pro にアクセスできます。

o3 および o4-mini は、いずれも Chat Completions API と Responses API を介して、本日から開発者向けに利用可能です（一部の開発者は、これらのモデルにアクセスするために組織を確認⁠（新しいウィンドウで開く）していただく必要があります）。Responses API は論理的思考のサマリー（性能を向上させるために関数呼び出しのリーズニングトークンを保持する機能）をサポートし、モデルの論理的思考内でウェブ検索、ファイル検索、Code Interpreter といった内蔵ツールをまもなくサポートする予定です。まずはドキュメント⁠（新しいウィンドウで開く）をご覧のうえ、続報をお待ちください。

今後の展望

本日のアップデートは、当社のモデルの方向性を反映しています。o シリーズの専門的な論理的思考能力を、GPT シリーズの自然な会話能力とツールの使用に収束させていきます。これらの強みを統合することで、将来のモデルは、積極的なツールの使用や高度な問題解決とともに、シームレスで自然な会話をサポートしていきます。

4月16日更新：Charxiv-r と Mathvista での o3 の結果が更新され、当初の評価にはなかったシステムプロンプトの変更が反映されました。

ライブストリームのリプレイ

著者

OpenAI

脚注

* tau-bench 評価の数値は、分散を抑えるために5回の実行で平均化され、カスタムツールやプロンプトなしで実行されます。tau-bench の小売のロールアウトは、ユーザーモデルのエラーがより発生しやすいことを発見しました。丸括弧内の数値は、GPT-4o よりも指示の追従性が格段に優れているため、GPT-4.1 をユーザーモデルとして実行したものです。

* SWE-bench では、256kの最大コンテキスト長を使用しており、o4-mini の正答率が約3%向上し、o3 の正答率への影響は<1%です。また、当社内のインフラで実行できない23のサンプルを除外しています。

** ブラウジングを有効にすると、モデルは、例えばデータセットからのサンプル問題のあるブログ記事を読むことによって、オンラインで正確な解答を見つけることができます。当社は、下記2つの戦略でブラウジング中のモデルによる不正行為の懸念を軽減します。

過去にモデルの不正行為を観測したドメインをブロックしました。
リーズニングモデルをモニターとして使用し、不審な動作を特定するために、各試行で全てのトークンを検査しました。疑わしい行動とは、次のように定義します。「特定の問題に対する正確な解答を提供することを主な目的としたページやファイル、スニペット。例えば、公式の採点キー、流出した『解答』の要点、完成した解答をそのまま引用したディスカッションなど。」良性の行動とは、次のように定義します。「勤勉な人間が参照する可能性のある権威あるリソース（文書、マニュアル、学術論文、評判の高い記事）で、それが偶然に正答を含んでいても構わない。」モニターがロールアウトを疑わしいと判断した試行は、不正確としてカウントされます。このチェックで不合格になったサンプルのほとんどは、正確な解が HLE とは関係のない複数のインターネットソースで入手可能な問題でした。

最後に、ChatGPT と OpenAI API では検索エンジンのバックエンドが異なるため、ブラウジングでの評価は OpenAI API では完全に再現できない可能性があります。これらの結果は、ChatGPT のユーザー体験を代表するためのものですが、需要に応じて、検索構成は将来的に変更される可能性があります。

Contributors

Aaditya Singh, Aaron Schlesinger, Adam Fry, Adam Lerer, Adam Perelman, Adam Walker, Ahmed El-Kishky, Aidan Clark, Aidan McLaughlin, Aiden Low, Akila Welihinda, Akshay Nathan, Aleksander Madry, Aleksandra Spyra, Alex Karpenko, Alex Neitz, Alex Tachard Passos, Alex Wei, Alexander Prokofiev, Alexander Zielenski, Alexandra Barr, Alexey Ivanov, Alexi Christakis, Alfred Xue, Allison Tam, Ally Bennett, Ally Bennett , Amelia Liu, Amy McDonald Sandjideh, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Chen, Andrew Duberstein, Andrew Gibiansky, Andrew Kondrich, Andrew Tulloch, Andrey Mishchenko, Andy Applebaum, Andy Wang, Angela Baek, Annie Wei, Anting Shen, Antoine Pelisse, Anuj Saharan, Arun Vijayvergiya, Ashley Tyra, Ashvin Nair, Avi Nayak, Avital Oliver, Behrooz Ghorbani, Belinda Truong, Ben Sokolowsky, Beth Hoover, Bo Xu, Boaz Barak, Bohan Zhang, Borys Minaiev, Botao Hao, Bowen Baker, Bowen Cheng, Brandon McKinzie, Brandon Wang, Brian Hsu, Brian Yang, Brian Yu, Brian Zhang, Camillo Lugaresi, Carolina Paz, Carpus Chang, Cary Bassin , Cary Hudson, Casey Chu, Chak Li, Charles Zhao, Charlie Jatt, Charlotte Cole, Chelsea Voss, Chen Shen, Chengxu Zhuang, Chris Colby, Chris Hallacy , Chris Koch, Christina Kaplan, Christina Kim, Colin Reid, Colin Wei, Cristina Scheau, D. Sculley, Damien Deville, Dan Roberts, Dana Palmie, Dane Stuckey, Daniel Levine, David Hu, David Martin, David Robinson, David Sasaki, Davis Wu, Derek Chen, Dibya Bhattacharjee, Dimitris Tsipras, Dinghua Li, DJ Strouse, dmed Medina, Drew Hintz, Eddie Zhang, Edmund Wong, Elaine Ya Le, Eli Yani , Elizabeth Proehl, Emily Sokolova, Enoch Cheung, Eri Schwartz, Eric Mitchell, Eric Ning, Eric Sigler, Eric Wallace, Eugenio Panero, Evan Mays, Evgenii Nikishin, Fan Wang, Fangyuan Li, Filippo Raso, Foivos Tsimpourlas, Fouad Matin, Francis Song, Francis Zhang, Gary Yang, Gene Oden, Giambattista Parascandolo, Gildas Chabot, Grace Kim, Grace Zhao, Greg Brockman, Gregory Valiant, Guillaume Leclerc, Hadi Salman, Haitang Hu, Hannah Sheahan, Hao Sheng, Haoyu Wang, Henrique Ponde de Oliveira Pinto, Henry Aspegren, Heqing Yan, Hessam Bagherinezhad, Hongyu Ren, Hunter Lightman, Hyeonwoo Noh, Ian Kivlichan, Ian Sohl, Ignasi Clavera, Ikai Lan, Ilge Akkaya, Ilya Kostrikov, Irina Kofman, Isa Fulford, Jake Brill, Jakub Pachocki, James Betker, James Lee, James Qin, Jamie Kiros, Jason Ai, Jay Wang, Jean Harb, Jeff Mickey, Jeffrey Han, Jeffrey Wang, Jeremy Chen, Jerry Tworek, Jessica Liang, Jessica Shieh, Ji Lin, Jiahui Yu, Jianfeng Wang, Jie Tang, Jihan Yin, Jing Li, Joanne Jang, Joel Morris, Johannes Ferstad, Johannes Heidecke, John Fishbein, Jon Okun, Jonathan Gordon, Joost Huizinga, Jos Kraaijeveld, Joseph Mo, Josh Lawson , Josh Tobin, Junhua Mao, Kai Chen, Kai Hayashi, Karan Singhal, Karina Nguyen, Katy Shi, Kelly Stirman, Kenji Hata, Kenny Nguyen, Keren Gu-Lemberg, Kevin Gladstone, Kevin King, Kevin Liu, Kevin Lu, Kevin Park, Kevin Stone, Kevin Weil, Kevin Whinnery, Kevin Yu, Kote Mushegiani, Kristen Ying, Kristian Georgiev, Kshitij Gupta, Kyle Kosic, Lama Ahmad, Larry Lv, Lauren Itow, Lauren Yang, Lee Byron, Leo Chen, Leo Liu, Leon Maksin, Leyton Ho, Li Jing, Liang Xiong, Lin Yang, Linden Li, Lorenz Kuhn, Louis Feuvrier, Lu Zhang, Łukasz Kaiser, Mahmoud Eariby, Maja Trębacz, Manas Joglekar, Manoli Liodakis, Manuka Stratta, Mark Chen, Mark Hudnall, Mark Sun, Mark Wang, Martin Li, Marvin Zhang, Mateusz Litwin, Matt Jones, Matt Lim, Max Johnson, Max Schwarzer, Mayank Gupta, Meghan Shah, Mengqing Wang, Mengyuan Yan, Mia Glaese, Michael Bolin, Michael Lampe, Michael Malek, Michael Sharman, Michael Zhang, Michele Wang, Michelle Pokrass, Miguel Oom Temudo de Castro, Mihai Florian, Mike McClay, Mike Trpcic, Miki Habryn, Miles Wang, Ming Chen, Mingxuan Wang, Minnia Feng, Mitchell Gordon, Mo Bavarian, Mostafa Rohaninejad, Nacho Soto, Nakul Khanna, Nat McAleese, Natalie Staudacher, Natan LaFontaine, Neel Ajjarapu, Nick Felt, Nick Turley, Nikil Pancha, Nikita Mikhaylin, Niko Felix, Nikunj Handa, Ning Liu, Nishant Rai, Noah Jorgensen, Noam Brown, Oleg Boiko, Oleg Murk, Olivia Watkins, Olivier Godement, Oona Gleeson, Paul Ashbourne, Pavel Belov, Peter Flockhart, Peter Hoeschele, Peter Zhokhov, Philip Pronin, Phillip Guo, Phoebe Thacker, Prafulla Dhariwal, Prashanth R, Rachel Dias, Rahul Arora, Rajkumar Samuel, Rasmus Rygaard, Ravi Teja Mullapudi, Raymond Li, Raz Gaon, Reah Miyara, Reiichiro Nakano, Reimar Leike, Rennie Song, Rhythm Garg, RJ Marsan, Robert Xiong, Robin Brown, Roman Tsiupa, Rui Shu, Ruslan Nigmatullin, Saachi Jain, Saagar Patel, Sam Altman, Sam Toizer, Sam Toyer, Samir Ahmed, Samuel Miserendino, Samuel Wolrich , Sandhini Agarwal, Santiago Hernández, Sarah Dong, Savannah Heon, Scott Ethersmith, Scott Mayer McKinney, Sean Fitzgerald, Sever Banesiu, Shamez Hemani, Shengjia Zhao, Shengli Hu, Shibani Santurkar, Shreyas Krishnaswamy, Shuchao Bi, Shunyu Yao, Shuyuan Zhang, Simón Posada Fishman, Spencer Papay, Spug Golden, Srinivas Narayanan, Stanley Hsieh, Stephen Logsdon, Sundeep Tirumalareddy, Tal Stramer, Tao Wang, Tao Xin, Taylor Gordon, Tejal Patwardhan, Thibault Sottiaux, Tina Sriskandarajah, Tony Casparro, Tony Zhao, Trapit Bansal, Trevor Creech, Uzair Navid Iftikhar, Valerie Qi, Vineet Kosaraju, Vishal Kuo, Vitchyr Pong, Vivek Verma, Vlad Petrov, Wenda Zhou, Wenlei Xie, Wenting Zhan, Will DePue, Will Ellsworth, William Sheu, Wyatt Thompson, Yaming Lin, Yann Dubois, Yaodong Yu, Yara Khakbaz, Yash Patil, Yifan Wu, Yilong Qin, Yining Chen, Yirui Zhang, Yo Shavit , Young Cha, Yunyun Wang, Yushi Wang, Zack Sultan, Zehao Dou, Zewei Chu, Zheng Shao, Zhigang Wang, Zhishuai Zhang, Zihao Zhang