April 16, 2025

Introducing OpenAI o3 and o4-mini


Update on June 10, 2025: OpenAI o3‑pro is now available to Pro users in ChatGPT, as well as in our API. Like OpenAI o1‑pro, o3‑pro is a version of our most intelligent model, OpenAI o3, designed to think longer and provide the most reliable responses. Full details can be found in our release notes.

Today, we're releasing OpenAI o3 and o4-mini, the latest in our o-series of models trained to think for longer before responding. These are the smartest models we've released to date, representing a step change in ChatGPT's capabilities for everyone from curious users to advanced researchers. For the first time, our reasoning models can agentically use and combine every tool within ChatGPT: this includes searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images. Critically, these models are trained to reason about when and how to use tools to produce detailed and thoughtful answers in the right output formats, typically in under a minute, to solve more complex problems. This allows them to tackle multi-faceted questions more effectively, a step toward a more agentic ChatGPT that can independently execute tasks on your behalf. The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness.

What’s changed

OpenAI o3 is our most powerful reasoning model that pushes the frontier across coding, math, science, visual perception, and more. It sets a new SOTA on benchmarks including Codeforces, SWE-bench (without building a custom model-specific scaffold), and MMMU. It’s ideal for complex queries requiring multi-faceted analysis and whose answers may not be immediately obvious. It performs especially strongly at visual tasks like analyzing images, charts, and graphics. In evaluations by external experts, o3 makes 20 percent fewer major errors than OpenAI o1 on difficult, real-world tasks—especially excelling in areas like programming, business/consulting, and creative ideation. Early testers highlighted its analytical rigor as a thought partner and emphasized its ability to generate and critically evaluate novel hypotheses—particularly within biology, math, and engineering contexts.

OpenAI o4-mini is a smaller model optimized for fast, cost-efficient reasoning—it achieves remarkable performance for its size and cost, particularly in math, coding, and visual tasks. It is the best-performing benchmarked model on AIME 2024 and 2025. Although access to a computer meaningfully reduces the difficulty of the AIME exam, we also found it notable that o4-mini achieves 99.5% pass@1 (100% consensus@8) on AIME 2025 when given access to a Python interpreter. While these results should not be compared to the performance of models without tool access, they are one example of how effectively o4-mini leverages available tools; o3 shows similar improvements on AIME 2025 from tool use (98.4% pass@1, 100% consensus@8).
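The post does not define its two AIME metrics, so the sketch below assumes the usual readings: pass@1 as the average accuracy of individual samples, and consensus@8 as a majority vote over 8 samples. The sampled answers and the reference answer are made up for illustration.

```python
from collections import Counter

def pass_at_1(samples, correct):
    """Average accuracy of individual samples (assumed meaning of pass@1)."""
    return sum(s == correct for s in samples) / len(samples)

def consensus(samples, correct):
    """1.0 if the most common sampled answer is correct, else 0.0.

    Ties are broken by first occurrence, which is how Counter.most_common orders them.
    """
    top_answer, _ = Counter(samples).most_common(1)[0]
    return float(top_answer == correct)

# Hypothetical data: 8 sampled answers to one question whose reference answer is 204.
samples = [204, 204, 204, 117, 204, 204, 204, 204]
print(pass_at_1(samples, 204))  # 0.875
print(consensus(samples, 204))  # 1.0
```

Under these assumed definitions, one wrong sample lowers pass@1 but leaves the majority vote intact, which is why a consensus score can reach 100% while pass@1 stays slightly below it.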

In expert evaluations, o4-mini also outperforms its predecessor, o3‑mini, on non-STEM tasks as well as domains like data science. Thanks to its efficiency, o4-mini supports significantly higher usage limits than o3, making it a strong high-volume, high-throughput option for questions that benefit from reasoning. External expert evaluators rated both models as demonstrating improved instruction following and more useful, verifiable responses than their predecessors, thanks to improved intelligence and the inclusion of web sources. Compared to previous iterations of our reasoning models, these two models should also feel more natural and conversational, especially as they reference memory and past conversations to make responses more personalized and relevant.

Multimodal

Coding

All SWE-bench evaluation runs use a fixed subset of n=477 verified tasks which have been validated on our internal infrastructure.

Instruction following and agentic tool use

All models are evaluated at high reasoning effort settings, similar to variants like o4-mini-high in ChatGPT.

Continuing to scale reinforcement learning

Throughout the development of OpenAI o3, we’ve observed that large-scale reinforcement learning exhibits the same “more compute = better performance” trend observed in GPT‑series pretraining. By retracing the scaling path—this time in RL—we’ve pushed an additional order of magnitude in both training compute and inference-time reasoning, yet still see clear performance gains, validating that the models’ performance continues to improve the more they’re allowed to think. At equal latency and cost with OpenAI o1, o3 delivers higher performance in ChatGPT—and we’ve validated that if we let it think longer, its performance keeps climbing.

We also trained both models to use tools through reinforcement learning—teaching them not just how to use tools, but to reason about when to use them. Their ability to deploy tools based on desired outcomes makes them more capable in open-ended situations—particularly those involving visual reasoning and multi-step workflows. This improvement is reflected both in academic benchmarks and real-world tasks, as reported by early testers.

Thinking with images

For the first time, these models can integrate images directly into their chain of thought. They don't just see an image; they think with it. This unlocks a new class of problem-solving that blends visual and textual reasoning, reflected in their state-of-the-art performance across multimodal benchmarks.

People can upload a photo of a whiteboard, a textbook diagram, or a hand-drawn sketch, and the model can interpret it, even if the image is blurry, reversed, or low quality. With tool use, the models can manipulate images on the fly, rotating, zooming, or transforming them as part of their reasoning process.

These models deliver best-in-class accuracy on visual perception tasks, enabling them to solve questions that were previously out of reach. More details are available in our visual reasoning research blog.

Toward agentic tool use

OpenAI o3 and o4-mini have full access to tools within ChatGPT, as well as your own custom tools via function calling in the API. These models are trained to reason about how to solve problems, choosing when and how to use tools to produce detailed and thoughtful answers in the right output formats quickly—typically in under a minute.

For example, a user might ask: “How will summer energy usage in California compare to last year?” The model can search the web for public utility data, write Python code to build a forecast, generate a graph or image, and explain the key factors behind the prediction, chaining together multiple tool calls. Reasoning allows the models to react and pivot as needed to the information they encounter. For example, they can search the web multiple times with the help of search providers, look at results, and try new searches if they need more info.
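The search-inspect-retry behavior described above can be pictured as a small control loop. This is only an illustrative sketch: `run_search_loop`, `fake_search`, `has_results`, `add_detail`, and the toy index are hypothetical names and placeholder data, not part of ChatGPT or the OpenAI API, and the real models decide these steps through learned reasoning rather than fixed code.

```python
def run_search_loop(query, search, is_sufficient, refine, max_rounds=3):
    """Search, inspect the results, and retry with a refined query if needed."""
    history = []
    for _ in range(max_rounds):
        results = search(query)
        history.append((query, results))
        if is_sufficient(results):
            break
        query = refine(query, results)
    return history

# Hypothetical stand-ins for a search tool and the model's judgment.
FAKE_INDEX = {
    "california summer energy usage": [],
    "california summer energy usage utility data": ["placeholder result"],
}

def fake_search(query):
    return FAKE_INDEX.get(query, [])

def has_results(results):
    return len(results) > 0

def add_detail(query, results):
    return query + " utility data"

history = run_search_loop("california summer energy usage", fake_search, has_results, add_detail)
for query, results in history:
    print(query, "->", results)
```

Here the first query returns nothing, so the loop refines the query once and stops as soon as results appear.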

This flexible, strategic approach allows the models to tackle tasks that require access to up-to-date information beyond the model’s built-in knowledge, extended reasoning, synthesis, and output generation across modalities.

All examples were completed with OpenAI o3.

OpenAI o3

Solve the following very hard math problem using your in-built knowledge of the literature:

Construct a degree 19 polynomial \(p(x) \in \mathbb{C}[x]\) such that \(X := \{\, p(x) = p(y) \,\} \subset \mathbb{P}^{1} \times \mathbb{P}^{1}\) has at least 3 (but not all linear) irreducible components over \(\mathbb{C}\). Choose \(p(x)\) to be odd, monic, have real coefficients and linear coefficient \(-19\) and calculate \(p(19)\).

Reasoned about polynomial construction for 55 seconds

The curve defined by p(x) = p(y) factors if p(x) is decomposable or has repeated factors. Actually, p(x)-p(y) always factors, and it’s divisible by (x-y). This gives one factor, and the rest is a degree d-1 polynomial in x and y. For the curve to have at least three irreducible components, p(x)-p(y) must factor into at least three nontrivial factors. Each factor corresponds to a component in projective space, and at least one must be nonlinear.
I’m checking the Dickson polynomials D_n(x,a) and focusing on their coefficients. With a=1, it seems the leading coefficient is always 1, confirming that Dickson polynomials are monic. The linear coefficient for odd n is following a predictable sign pattern: for n=19, it gives a coefficient of -19, which is exactly what I need. So the Dickson polynomial D_n(x,1) with odd n fits nicely with the linear coefficient I was looking for.
I’ve confirmed that the linear coefficient for $p(x) = D_{19}(x)$ is correct, with the necessary scaling factor to match. Now for the next step, I need to compute $D_{19}(19)$. Since $T_n(z)$ grows quickly for real $z > 1$, I plan to compute this value using Python, leveraging the recurrence to avoid mistakes—it’s going to be a big number. I’ll make sure to present the numeric result neatly so it’s understandable since the coefficients are integers.
The Dickson polynomial D_{19}(x,1) is confirmed to have the required properties: monic, odd, real coefficients, and a linear coefficient of -19. The factorization of D_{19}(x) into irreducible components is also verified and satisfies the condition of having at least 10 components (1 linear + 9 quadratics). To compute p(19), we’ll use the recurrence relation for Dickson polynomials, but there’s also a closed-form expression involving cosine that could be handy here. I’ll proceed accordingly!
I confirmed the linear coefficient is working correctly, which means now I need to calculate $p(19) = D_{19}(19)$. This involves evaluating $T_{19}(9.5)$, which grows rapidly. To ensure I get an accurate result, I’ll compute this using recurrence in Python. The result is a large number: 1,876,572,071,974,094,803,391,179. I’ll use the commas to make it easier to read: $p(19) = 1,876,572,071,974,094,803,391,179$.
For a = 1, there’s a relationship between Dickson polynomials and Chebyshev polynomials: D_n(x, 1) = 2^(n-1) T_n(x/2). Checking this, we see that the definition of D_n(u + 1/u) aligns well with Chebyshev’s formula. By setting x = 2 cos(θ), we find that D_n(x) equals 2 T_n(x/2). Importantly, the leading coefficient works out to 1, confirming the identity holds. This is nice!
Now, I need to verify that $p(x) - p(y)$ factors as desired, relying on the known Dickson polynomial factorization. According to the literature (like Fried and Jarden), the difference $D_n(x) - D_n(y)$ factors into a product involving $(x - y)$ and other expressions. I’ll provide the explicit polynomial $D_{19}(x, 1)$, list the coefficients, and explain how it satisfies the desired properties: monic, odd, etc. Lastly, I’ll compute $p(19)$.

Analyzed

Python

from math import comb, cos, acos
# Let's compute Dickson D_n(x,a=1) recursively
def dickson(n,x,a=1):
    if n==0:
        return 2
    if n==1:
        return x
    Dm2=2
    Dm1=x
    for k in range(2,n+1):
        D= x*Dm1 - a*Dm2
        Dm2, Dm1 = Dm1, D
    return Dm1

d19 = dickson(19,19,1)
d19

1876572071974094803391179

Analyzed

Python

d19

1876572071974094803391179

Analyzed

Python

import locale, sys, math, json, textwrap, re, decimal

Analyzed

Python

"{:,}".format(d19)

'1,876,572,071,974,094,803,391,179'

Analyzed

Python

# compute coefficients of Dickson polynomial D_n(x,1) via recurrence polynomial representation
from sympy import symbols, Poly

x = symbols('x')
def dickson_poly(n):

Poly(x**19 - 19*x**17 + 152*x**15 - 665*x**13 + 1729*x**11 - 2717*x**9 + 2508*x**7 - 1254*x**5 + 285*x**3 - 19*x, x,

Analyzed

Python

poly19.as_expr()

x**19 - 19*x**17 + 152*x**15 - 665*x**13 + 1729*x**11 - 2717*x**9 + 2508*x**7 - 1254*x**5 + 285*x**3 - 19*x

Done

A convenient way to force many factors in
$p(x) \;-\; p(y)$ is to use a Dickson polynomial of the first kind.
For a parameter $a \in \mathbb{C}$ these are defined by

$D_0\!\left(x,\, a\right) = 2,\; D_1\!\left(x,\, a\right) = x,\; D_n\!\left(x,\, a\right) = x\,D_{n-1}\!\left(x,\, a\right) - a\,D_{n-2}\!\left(x,\, a\right)\quad (n \ge 2),$

and satisfy the functional identity

$D_n\!\left(u + \frac{a}{u},\, a\right) = u^{n} + \frac{a^{n}}{u^{n}}.$
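The functional identity can be spot-checked with exact rational arithmetic. The sketch below evaluates the recurrence at one arbitrarily chosen point; the values of `u`, `a`, and `n` are illustrative, and a single check is of course not a proof.

```python
from fractions import Fraction

def dickson(n, x, a=1):
    """Evaluate D_n(x, a) via D_0 = 2, D_1 = x, D_n = x*D_{n-1} - a*D_{n-2}."""
    prev, cur = 2, x
    if n == 0:
        return prev
    for _ in range(2, n + 1):
        prev, cur = cur, x * cur - a * prev
    return cur

# Arbitrary test point, chosen only for illustration.
u, a, n = Fraction(3, 2), Fraction(5), 7
lhs = dickson(n, u + a / u, a)
rhs = u**n + a**n / u**n
print(lhs == rhs)  # True
```

Using `Fraction` keeps both sides exact, so the comparison does not depend on floating-point tolerance.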

1. Choosing the polynomial

Take $a = 1$ and $n=19$ .
Because $n$ is odd, $D_{19}\!\left(x,\, 1\right)$ is an odd polynomial; the recurrence immediately shows it is monic and has integer (hence real) coefficients.
The coefficient of $x$ is $(-1)^{\frac{n-1}{2}}\,n = -19$, so all requested normalisations are satisfied.

Writing it out gives

$\boxed{p(x) = D_{19}\!\left(x,1\right) = x^{19} - 19x^{17} + 152x^{15} - 665x^{13} + 1729x^{11} - 2717x^{9} + 2508x^{7} - 1254x^{5} + 285x^{3} - 19x}\,.$
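The expanded polynomial can be reproduced without a computer algebra system by running the same recurrence on coefficient lists. This is a minimal standard-library sketch; `dickson_coeffs` is a helper name introduced here, not something from the transcript.

```python
def dickson_coeffs(n, a=1):
    """Coefficients of D_n(x, a), lowest degree first, from the three-term recurrence."""
    prev, cur = [2], [0, 1]            # D_0 = 2, D_1 = x
    if n == 0:
        return prev
    for _ in range(2, n + 1):
        nxt = [0] + cur                # multiply D_{k-1} by x
        for i, c in enumerate(prev):   # subtract a * D_{k-2}
            nxt[i] -= a * c
        prev, cur = cur, nxt
    return cur

c = dickson_coeffs(19)
print(c[19], c[1])                              # 1 -19
print(all(c[i] == 0 for i in range(0, 20, 2)))  # True: only odd powers appear
```

The three printed facts correspond to the requested normalisations: monic, linear coefficient $-19$, and odd.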

2. Irreducible components of $X \;=\; \bigl\{\, p(x) = p(y) \,\bigr\}$

For Dickson polynomials one has the classical factorisation

$D_n\!\left(x,1\right) \;-\; D_n\!\left(y,1\right) \;=\; (x - y)\, \prod_{j=1}^{\frac{n-1}{2}} \Bigl( x^{2} + y^{2} - 2\cos\!\bigl(\tfrac{2\pi j}{n}\bigr)\,xy - 4\sin^{2}\!\bigl(\tfrac{2\pi j}{n}\bigr) \Bigr),$

where each quadratic factor is irreducible over $\mathbb{C}$ (see, e.g., Lidl–Niederreiter, Finite Fields, §3.6).
For $n=19$ this gives

one linear component $x=y$;
nine quadratic components (corresponding to $j = 1,\,\ldots,\,9$).

Hence $X \subset \mathbb{P}^{1} \times \mathbb{P}^{1}$ has $10$ irreducible components: at least three, and not all linear, as required.
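A numerical spot check of the factorisation is sketched below. It writes each quadratic factor through the root of unity $\zeta = e^{2\pi i/n}$, as $x^{2} + y^{2} - (\zeta^{j} + \zeta^{-j})\,xy + (\zeta^{j} - \zeta^{-j})^{2}$, and compares the product with $D_n(x,1) - D_n(y,1)$ at one sample point. The sample point and tolerance are arbitrary choices, and agreement at a point does not establish irreducibility of the factors.

```python
import cmath

def dickson(n, x, a=1):
    """Evaluate D_n(x, a) via the three-term recurrence."""
    prev, cur = 2, x
    if n == 0:
        return prev
    for _ in range(2, n + 1):
        prev, cur = cur, x * cur - a * prev
    return cur

def factored_difference(n, x, y):
    """(x - y) times the (n-1)/2 quadratic factors, for odd n and a = 1."""
    zeta = cmath.exp(2j * cmath.pi / n)
    product = x - y
    for j in range(1, (n - 1) // 2 + 1):
        alpha = zeta**j + zeta**(-j)   # equals 2*cos(2*pi*j/n)
        beta = zeta**j - zeta**(-j)    # equals 2i*sin(2*pi*j/n)
        product *= x * x + y * y - alpha * x * y + beta * beta
    return product

n, x, y = 19, 0.7, -0.3
direct = dickson(n, x) - dickson(n, y)
print(abs(direct - factored_difference(n, x, y)) < 1e-9)
```

For $n = 3$ the same function reduces to $(x - y)(x^{2} + xy + y^{2} - 3)$, which matches $D_3(x,1) - D_3(y,1)$ with $D_3(x,1) = x^{3} - 3x$.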

3. Evaluating $p(19)$

Using the recurrence (or the identity $D_n\!\left(x,\,1\right) \;=\; 2\,T_n\!\bigl(\tfrac{x}{2}\bigr)$ with Chebyshev $T_n$) one obtains

$p(19) \;=\; D_{19}\!\left(19,\,1\right) \;=\; 1\,876\,572\,071\,974\,094\,803\,391\,179.$

Thus the polynomial above fulfils every condition, and

$\boxed{p(19) \;=\; 1{,}876{,}572{,}071{,}974{,}094{,}803{,}391{,}179}.$
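The Chebyshev identity mentioned in step 3 gives an independent cross-check of this value. The sketch below evaluates $T_{19}$ at $19/2$ with the standard recurrence $T_0 = 1$, $T_1 = z$, $T_n = 2z\,T_{n-1} - T_{n-2}$ in exact rational arithmetic; `chebyshev_t` is a helper written for this example.

```python
from fractions import Fraction

def chebyshev_t(n, z):
    """Evaluate the Chebyshev polynomial of the first kind T_n(z) by recurrence."""
    prev, cur = 1, z
    if n == 0:
        return prev
    for _ in range(2, n + 1):
        prev, cur = cur, 2 * z * cur - prev
    return cur

value = 2 * chebyshev_t(19, Fraction(19, 2))
print(value == 1876572071974094803391179)  # True
print(f"{int(value):,}")                   # 1,876,572,071,974,094,803,391,179
```

Because the evaluation point is the exact fraction 19/2, the result is an exact integer and no rounding is involved.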

OpenAI o1

Solve the following very hard math problem using your in-built knowledge of the literature:

Thought for 1m 19s

I’m piecing together a degree 19 polynomial with only odd powers and a linear coefficient of -19, like: $p(x) = x^{19} + a_{17}x^{17} + \ldots + a_1x$.

I’m examining if the polynomial $p(x) = x^{19} + a_{17}x^{17} + \ldots + a_3x^3 - 19x$ has a notable classical form, considering symmetry or factorization for multiple irreducible components.

To ensure $p(x)$ has three or more irreducible components, I’m considering the factorization of $p(x) - p(y)$ into at least three irreducible factors in $\mathbb{C}[x,y]$.

I’m working through $x^{19} - y^{19}$ to ensure it balances linear and nonlinear factors, aiming for a polynomial with at least three irreducible components that aren’t all linear.

Taking a closer look at how $p(x) - p(y)$ can be factored into three distinct polynomials in $x$, aiming to achieve the desired irreducible components.

I’m thinking about how composing $p$ with simpler polynomials, like $x^m - c$, could result in multiple irreducible components, achieving at least three distinct factors.

I’m exploring the factorization of $p(x) - p(y)$ with cyclotomic polynomials. This method generates $n$ linear irreducible factors, which might not achieve the required three distinct components.

I’m focusing on constructing an "odd" and monic polynomial, ensuring all exponents are odd and the leading term is $x^{19}$. This approach aims to achieve the desired factorization.

I’m working with the Chebyshev polynomials to ensure $p(x) - p(y)$ has at least three irreducible components, leveraging their even or odd properties to manage coefficients.

Considering the odd Chebyshev polynomial $T_{19}(x)$ for potential factorization, while also examining composite polynomials of lower degrees to enhance factorization properties.

I’m analyzing how a polynomial composed of lower-degree polynomials, like $(x^3 - 3x)^k$, could yield $p(x) - p(y)$ with at least three irreducible components.

Piecing together polynomials with integer coefficients, like $(x^3 - 3x)^k$, to achieve factorization in $p(x) - p(y)$ with at least three irreducible components, aiming for degree 19.

I’m examining how $p(x) = (x^3 - 3x)^k$ or $p(x) = x^{19} - 19x$ might yield $p(x) - p(y)$ with three irreducible components, focusing on ensuring the polynomial is of degree 19.

I’m analyzing $p(x) - p(y)$ into $(x-y)(x^{18} + x^{17}y + \ldots + y^{18} - 19)$ to identify irreducible components. The polynomial factor $x^{18} + x^{17}y + \ldots + y^{18} - 19$ remains challenging to factor further.

I’m thinking through a new approach: adjusting the Chebyshev polynomial $T_{19}(x)$ to make it monic, aiming to achieve at least three irreducible components in $p(x) - p(y)$.

OK, let’s think through. I’m tweaking $T_{19}(x)$'s coefficients to keep the polynomial monic and check if it simplifies into multiple irreducible components.

I’m thinking over various factorization strategies for a monic polynomial of degree 19. The goal is to ensure at least one non-linear factor and achieve at least three irreducible components in $p(x) - p(y)$.

I’m updating my approach to factor $x^2 + xy + y^2 - 1$ over complex numbers by exploring change of variables and third roots of unity. This helps in achieving the desired polynomial structure.

I’m combining $x^3 - x$ with $x^{19} - 19x$ to tweak the polynomial's coefficients and complexity, aiming to include more non-linear factors for the desired irreducibility.

Piecing together $p(x) = x^{19} - x^3 - 19x$ to ensure it’s odd, monic, and has the required linear coefficient. This targets the desired irreducible factorizations.

I’m mapping out the second factor $x^{18} + \ldots + y^{18} - x^2 - xy - y^2 - 19$ to see if it can split into three irreducible components.

I’m considering methods to factorize the polynomial by making it divisible by $x - y$ and ensuring the remaining factor splits into exactly two irreducible components, aiming for a less complex factorization.

I’m examining the polynomial $x^{18} + x^{17}y + \ldots + y^{18}$ and noting it simplifies to $\frac{x^{19} - y^{19}}{x - y}$, which might reveal further irreducible factors.

I’m mapping out the polynomial $\frac{x(x^{18}-1) - y(y^{18}-1)}{x-y}$ for further factorization. This might yield just two irreducible components, which falls short of three.

I’m exploring whether the polynomial $p(x) = x^{19} - (x^3)$ can be expressed as the composition of a smaller polynomial with $x^k$, potentially revealing multiple irreducible components.

I’m examining if the polynomial $T_{19}(x)$ can be factored using trigonometric identities, seeking partial factorization without fully linearizing, to gain multiple irreducible components.

OK, let me see. Trigonometric identities show multiple solutions for $\theta$ and $\phi$, likely leading to 19 or 38 linear factors for $T_{19}(x)$.

I’m examining how the equation $\arccos x = \pm \arccos y + \frac{2k\pi}{19}$ leads to possible multiple branches, potentially irreducible in the algebraic sense, and needing an example with at least three non-linear irreducible components.

OK, let me see. I’m pulling together details on the polynomial $x^3 (x^2 - 1)^8 + x$. It’s monic, with $x^{19}$ as the leading term, ensuring it meets key criteria.

I’m noting that the polynomial $p(x) = x^3 (x^2 - 1)^8 + x$ is odd, with its linear coefficient patterning to -19.

I’m figuring out how to adjust the linear coefficient of polynomial $p(x)$ to -19 by adding a constant factor $c$ in front of $x$ to ensure it’s adequately modified.

By tweaking the linear coefficient of $p(x) = x^3 (x^2 - 1)^8 + c x$ to -19, it ensures the polynomial remains monic, odd, and real.

I’m working on adding a constant factor $c$ to $x$ in $p(x) = x^3 (x^2 - 1)^8 + c x$ to ensure the linear coefficient is -19, making the polynomial monic, odd, and real.

I’m piecing together the polynomial $p(x) = x^3 (x-1)^8 (x+1)^8$ and considering how $p(x) - p(y)$ might factor with factors like $x-y$.

I’m confirming $p(x)$ is monic, odd, and real, with a linear coefficient of -19, by expanding $x^3 (x^2 -1)^8$ to set the constant $c$.

I found that for p(x) to have a coefficient of -19 for the x term, c must be -19. The polynomial, p(x), incorporates this constant and preserves the original polynomial's structure.

Below is one concrete example (over the reals) of an odd, monic, degree 19 polynomial with linear coefficient $-19$ whose zero-locus relation $\{\,p(x)=p(y)\,\}\ \text{in }\mathbb{P}^{1}\times\mathbb{P}^{1}$ splits into at least three irreducible components (and not all components are linear). We then compute $p(19)$ .

1. The polynomial

Consider

$p(x) \;=\; x^{3}\,\bigl(x^{2}-1\bigr)^{8} \;-\; 19x.$

We can expand

$x^{3}\,\bigl(x^{2}-1\bigr)^{8} \;=\; x^{19}\;-\;8x^{17}\;+\;28x^{15}\;-\;56x^{13}\;+\;70x^{11} \;-\;56x^{9}\;+\;28x^{7}\;-\;8x^{5}\;+\;x^{3}.$

Hence

$p(x) \;=\; x^{19} \;-\; 8x^{17} \;+\; 28x^{15} \;-\; 56x^{13} \;+\; 70x^{11} \;-\; 56x^{9} \;+\; 28x^{7} \;-\; 8x^{5} \;+\; x^{3} \;-\; 19x.$

Checking the required properties

Degree 19, monic:
The leading term is $x^{19}$ with coefficient $1$ .

Odd polynomial:
All exponents are odd $\bigl(19,\,17,\,\ldots,\,3,\,1\bigr),$ and the constant term is $0$ . Indeed,

$p(-x) \;=\; -\,p(x),$

which makes $p$ an odd polynomial.

Real coefficients:
All coefficients are real integers.

Linear coefficient $-19$ :
The coefficient of $x$ is clearly $-19$ .

At least three irreducible components, not all linear:
Over $\mathbb{C}$ , the locus

$X \;=\; \bigl\{\, (x,y) : p(x)=p(y) \,\bigr\} \;\subset\; \mathbb{P}^{1}\times\mathbb{P}^{1}$

factors in part because $p(x)$ has (for instance) the repeated factors $\bigl(x^{2}-1\bigr)^{8}$ in the big piece $x^{3}\,\bigl(x^{2}-1\bigr)^{8}$ . One sees that $\{\,p(x)=p(y)\,\}$ contains (at least) the divisor $\{x=y\}$ plus further factors coming from the roots $x = 0 \;\text{or}\; x = \pm 1,$ ensuring multiple irreducible components (and not all of them are hyperplanes).

2. Computing $p(19)$

A direct substitution $x=19$ gives a large integer. Symbolic expansion or a computer algebra system yields

$p(19) \;=\; 1934999285524070399999639.$

Hence the required value is

$\boxed{\,p(19) \;=\; 1934999285524070399999639\,}.$
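The arithmetic in this o1 response can be checked separately from the question of whether the construction meets the component requirement. The sketch below confirms the binomial expansion, the oddness, and the value at $x = 19$ for the polynomial stated above; it says nothing about how many irreducible components $\{\,p(x)=p(y)\,\}$ has. The names `p`, `coeffs`, and `p_expanded` are introduced here for illustration.

```python
from math import comb

def p(x):
    """The polynomial proposed in the o1 response: x^3 (x^2 - 1)^8 - 19x."""
    return x**3 * (x * x - 1) ** 8 - 19 * x

# Coefficients of x^3 (x^2 - 1)^8 from the binomial theorem, keyed by exponent.
coeffs = {2 * k + 3: (-1) ** (8 - k) * comb(8, k) for k in range(9)}
coeffs[1] = -19  # the added linear term

def p_expanded(x):
    """Evaluate the expanded form term by term."""
    return sum(c * x**e for e, c in coeffs.items())

print(coeffs[19], coeffs[17], coeffs[1])  # 1 -8 -19
print(p(-7) == -p(7))                     # True: p is odd
print(p_expanded(19) == p(19))            # True
print(p(19))                              # 1934999285524070399999639
```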

OpenAI o3 gets the response correctly without using search, whereas o1 fails to deliver a correct response.

Cost-efficient reasoning

Cost vs. performance: o3‑mini and o4-mini

AIME evals for OpenAI o3-mini and o4-mini

GPQA evals for OpenAI o3-mini and o4-mini

Cost vs. performance: o1 and o3

OpenAI o3 and o4-mini are the most intelligent models we have ever released, and in most cases they are also more efficient than their predecessors, OpenAI o1 and o3‑mini. For example, on the 2025 AIME math competition, the cost-performance trade-off for o3 is substantially improved over o1, and similarly, the trade-off for o4-mini is substantially improved over o3‑mini. More generally, we expect that for most real-world usage, o3 and o4-mini will also be both smarter and cheaper than o1 and o3‑mini, respectively.

Safety

Each improvement in model capabilities warrants commensurate improvements to safety. For OpenAI o3 and o4-mini, we completely rebuilt our safety training data, adding new refusal prompts in areas such as biological threats (biorisk), malware generation, and jailbreaks. This refreshed data has led o3 and o4-mini to achieve strong performance on our internal refusal benchmarks (e.g., instruction hierarchy⁠, jailbreaks). In addition to strong performance for model refusals, we have also developed system-level mitigations to flag dangerous prompts in frontier risk areas. Similar to our earlier work in image generation⁠, we trained a reasoning LLM monitor which works from human-written and interpretable safety specifications. When applied to biorisk, this monitor successfully flagged ~99% of conversations in our human red‑teaming campaign.

We stress tested both models with our most rigorous safety program to date. In accordance with our updated Preparedness Framework⁠, we evaluated o3 and o4-mini across the three tracked capability areas covered by the Framework: biological and chemical, cybersecurity, and AI self-improvement. Based on the results of these evaluations, we have determined that both o3 and o4‑mini remain below the Framework's "High" threshold in all three categories. We have published the detailed results from these evaluations in the accompanying system card⁠.⁠

Codex CLI: frontier reasoning in the terminal

We're also sharing a new experiment: Codex CLI, a lightweight coding agent you can run from your terminal. It works directly on your computer and is designed to maximize the reasoning capabilities of models like o3 and o4-mini, with upcoming support for additional API models like GPT‑4.1.

You can get the benefits of multimodal reasoning from the command line by passing screenshots or low-fidelity sketches to the model, combined with local access to your code. We think of it as a minimal interface to connect our models to users and their computers. Codex CLI is fully open source and available at github.com/openai/codex today.

Alongside this, we are launching a $1 million initiative to support projects using Codex CLI and OpenAI models. We will evaluate and accept grant applications in increments of $25,000 USD in the form of API credits. Proposals can be submitted here.

Access

ChatGPT Plus, Pro, and Team users will see o3, o4-mini, and o4-mini-high in the model selector starting today, replacing o1, o3‑mini, and o3‑mini‑high. ChatGPT Enterprise and Edu users will gain access in one week. Free users can try o4-mini by selecting 'Think' in the composer before submitting their query. Rate limits across all plans remain unchanged from the prior set of models.

We expect to release OpenAI o3‑pro in a few weeks with full tool support. For now, Pro users can still access o1‑pro.

Both o3 and o4-mini are also available to developers today via the Chat Completions API and Responses API (some developers will need to verify their organizations to access these models). The Responses API supports reasoning summaries, the ability to preserve reasoning tokens around function calls for better performance, and will soon support built-in tools like web search, file search, and code interpreter within the model’s reasoning. To get started, explore our docs and stay tuned for more updates.

What's next

Today's updates reflect the direction our models are heading in: we're converging the specialized reasoning capabilities of the o-series with more of the natural conversational abilities and tool use of the GPT‑series. By unifying these strengths, our future models will support seamless, natural conversations alongside proactive tool use and advanced problem-solving.

Update on July 28, 2025: The SWE-Lancer dataset and results, updated as of July 17, 2025, are available at https://github.com/openai/preparedness and in our system cards. This update resolves several issues that affected the dollar-denominated results and removes the need for an Internet connection at runtime, eliminating a major source of variance in model performance.

Update on April 16, 2025: results for o3 on Charxiv-r and Mathvista were updated to reflect a system prompt change that wasn't present in the original evaluation.

Livestream replay

Author

OpenAI

Footnotes

* tau-bench eval numbers are averaged across 5 runs to reduce variance, and run without any custom tools or prompting. We found tau-bench retail rollouts are more prone to user-model errors. Numbers for the shaded bar are run with GPT-4.1 as the user model because it is significantly better than GPT-4o at instruction following.

* SWE-bench uses a 256k max context length, which improves the solve rate for o4-mini by about 3% and impacts o3's solve rate by less than 1%. We also exclude 23 samples that are not runnable on our internal infrastructure.

** When browsing is enabled, the model can sometimes find exact answers online, for example, by reading blog posts with sample problems from a dataset. We mitigate concerns of the model cheating while browsing with two strategies:

Blocking domains where we have observed the model cheating in the past.
Using an additional reasoning model as a monitor to examine all tokens in each attempt in order to identify suspicious behavior. Suspicious behavior is defined as "a page, file, or snippet whose main purpose is to provide the exact answer to this specific question, for example an official grading key, leaked 'solutions', or a discussion quoting the finished answer verbatim." Benign behavior is defined as "any authoritative resource a diligent human might consult (documentation, manuals, scholarly papers, reputable articles), even if it incidentally contains the correct answer." Any attempts where the monitor deemed the rollout suspicious are counted as incorrect. Most samples that failed this check were problems whose exact solution was available on multiple internet sources unrelated to HLE.

Finally, our evals with browsing may not be perfectly reproducible in the OpenAI API due to differences in search engine backends between ChatGPT and the OpenAI API. These results are intended to be representative of the ChatGPT user experience, but depending on demand, the search configuration may change over time.
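The scoring rule in this footnote, where any attempt the monitor flags is counted as incorrect, can be sketched as follows. The record layout and the function name `browsing_accuracy` are assumptions made for illustration; the footnote does not describe the actual evaluation code.

```python
def browsing_accuracy(attempts):
    """Fraction of attempts that are correct and were not flagged by the monitor.

    attempts: list of dicts with boolean 'correct' and 'flagged' fields (hypothetical layout).
    """
    credited = sum(1 for a in attempts if a["correct"] and not a["flagged"])
    return credited / len(attempts)

# Hypothetical attempts: the second one is right but flagged, so it scores zero.
attempts = [
    {"correct": True, "flagged": False},
    {"correct": True, "flagged": True},
    {"correct": False, "flagged": False},
    {"correct": True, "flagged": False},
]
print(browsing_accuracy(attempts))  # 0.5
```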

Contributors

Адітья Сінгх, Аарон Шлезінгер, Адам Фрай, Адам Лерер, Адам Перельман, Адам Вокер, Ахмед Ель-Кішкі, Айдан Кларк, Айдан Маклафлін, Айден Лоу, Акіла Веліхінда, Акшай Натан, Александер Мадрі, Александра Спіра, Алекс Карпенко, Алекс Нейц, Алекс Тачард Пассос, Алекс Вей, Олександр Прокоф'єв, Олександр Зіленський, Александра Барр, Олексій Іванов, Алексі Крістакіс, Альфред Сюе, Еллісон Там, Еллі Беннетт, Еллі Беннетт, Амелія Лю, Емі Макдональд Санджіде, Ананья Кумар, Андре Сарайва, Андреа Валлоне, Ендрю Чен, Ендрю Дуберштейн, Ендрю Гібіанський, Ендрю Кондріх, Ендрю Таллох, Андрій Міщенко, Енді Епплбаум, Енді Ванг, Анджела Бек, Енні Вей, Антінг Шен, Антуан Пеліссе, Анудж Сахаран, Арун Віджайвергія, Ешлі Тайра, Ашвін Наїр, Аві Наяк, Авіталь Олівер, Бехруз Горбані, Белінда Труонг, Бен Соколовскі, Бет Гувер, Бо Сюй, Боаз Барак, Бохан Чжан, Борис Мінаєв, Ботао Хао, Боуен Бейкер, Боуен Ченг, Брендон Маккінзі, Брендон Ван, Браян Сюй, Браян Ян, Браян Юй, Браян Чжан, Камілло Лугаресі, Кароліна Пас, Карпус Чанг, Кері Бассін, Кері Хадсон, Кейсі Чу, Чак Лі, Чарльз Чжао, Чарлі Джатт, Шарлотта Коул, Челсі Восс, Чень Шень, Ченсюй Чжуан, Кріс Колбі, Кріс Халласі, Кріс Кох, Крістіна Каплан, Крістіна Кім, Колін Рід, Колін Вей, Крістіна Шо, Д. 
Скаллі, Демієн Девілл, Ден Робертс, Дана Палмі, Дейн Стакі, Деніел Лівайн, Девід Ху, Девід Мартін, Девід Робінсон, Девід Сасакі, Девіс Ву, Дерек Чен, Дібія Бхаттачарджі, Дімітріс Ципрас, Дінхуа Лі, Діджей Строус, Дмед Медіна, Дрю Хінц, Едді Чжан, Едмунд Вонг, Елейн Я Ле, Елі Яні, Елізабет Проел, Емілі Соколова, Енох Ченг, Ері Шварц, Ерік Мітчелл, Ерік Нін, Ерік Сіглер, Ерік Уоллес, Еудженіо Панеро, Еван Мейс, Євген Нікішин, Фан Ван, Фан'юань Лі, Філіппо Расо, Фойвос Цімпурлас, Фуад Матін, Френсіс Сонг, Френсіс Чжан, Гері Янг, Джин Оден, Джамбаттіста Параскандоло, Гілдас Шабо, Грейс Кім, Грейс Чжао, Грег Брокман, Грегорі Валіант, Гійом Леклерк, Хаді Салман, Хайтан Ху, Ханна Шеахан, Хао Шен, Хаоюй Ван, Енріке Понде де Олівейра Пінто, Генрі Аспегрен, Хекін Ян, Хессам Багерінежад, Хонг'ю Рен, Хантер Лайтман, Хьонву Но, Ієн Ківлічан, Ієн Сол, Ігнасі Клавера, Ікай Лан, Ілге Аккая, Ілля Костріков, Ірина Кофман, Іза Фулфорд, Джейк Брілл, Якуб Пачокі, Джеймс Беткер, Джеймс Лі, Джеймс Цін, Джеймі Кірос, Джейсон Ай, Джей Ван, Джин Харб, Джефф Міккі, Джеффрі Хан, Джеффрі Ван, Джеремі Чен, Джеррі Турек, Джессіка Лян, Джессіка Ші, Цзі Лін, Цзяхуей Ю, Цзяньфен Ван, Цзе Тан, Джихан Інь, Цзін Лі, Джоанн Джанг, Джоел Морріс, Йоханнес Ферстад, Йоганнес Гайдеке, Джон Фішбейн, Джон Окун, Джонатан Гордон, Йост Хейзінга, Джос Краайвельд, Джозеф Мо, Джош Лоусон, Джош Тобін, Джунхуа Мао, Кай Чен, Кай Хаяші, Каран Сінгхал, Каріна Нгуєн, Кеті Ши, Келлі Стирман, Кендзі Хата, Кенні Нгуєн, Керен Гу-Лемберг, Кевін Гладстон, Кевін Кінг, Кевін Лю, Кевін Лу, Кевін Парк, Кевін Стоун, Кевін Вейл, Кевін Віннері, Кевін Ю, Коте Мушегіані, Крістен Ін, Крістіан Георгієв, Кшітідж Гупта, Кайл Косич, Лама Ахмад, Ларрі Лв, Лорен Ітоу, Лорен Янг, Лі Байрон, Лео Чен, Лео Лю, Леон Максін, Лейтон Хо, Лі Цзін, Лян Сюн, Лінь Янг, Лінден Лі, Лоренц Кун, Луї Февріє, Лу Чжан, Лукаш Кайзер, Махмуд Ерібі, Майя Требац, Манас Джоглекар, Манолі Ліодакіс, Манука Стратта, Марк Чен, Марк Хадналл, Марк Сан, Марк Ван, Мартін 
Лі, Марвін Чжан, Матеуш Літвін, Метт Джонс, Метт Лім, Макс Джонсон, Макс Шварцер, Маянк Гупта, Меган Шах, Менцін Ван, Мен'юань Ян, Міа Глейз, Майкл Болін, Майкл Лампе, Майкл Малек, Майкл Шарман, Майкл Чжан, Мішель Ван, Мішель Покрасс, Мігель Оом Темудо де Кастро, Міхай Флоріан, Майк Макклей, Майк Трпчич, Мікі Хабрін, Майлз Ван, Мін Чен, Мінсюань Ван, Міннія Фенг, Мітчелл Гордон, Мо Баваріан, Мостафа Роханінежад, Начо Сото, Накул Ханна, Нат МакАліз, Наталі Штаудахер, Натан Лафонтен, Ніл Аджарапу, Нік Фелт, Нік Терлі, Нікіл Панча, Микита Михайлін, Ніко Фелікс, Нікундж Ханда, Нін Лю, Нішант Рай, Ной Йоргенсен, Ноам Браун, Олег Бойко, Олег Мурк, Олівія Воткінс, Олів'є Годеман, Уна Глісон, Пол Ешборн, Павло Бєлов, Пітер Флокхарт, Пітер Хешеле, Пітер Жохов, Філіп Пронін, Філіп Гуо, Фібі Текер, Прафулла Дхарівал, Прашант Р., Рейчел Діас, Рахул Арора, Раджкумар Самуель, Расмус Рігаард, Раві Теджа Муллапуді, Реймонд Лі, Раз Гаон, Реа Міяра, Рейічіро Накано, Реймар Лейке, Ренні Сонг, Ритм Гарг, Р. Дж. 
Марсан, Роберт Сюн, Робін Браун, Роман Цюпа, Руй Шу, Руслан Нігматуллін, Саачі Джейн, Саагар Пател, Сем Олтман, Сем Тойзер, Сем Тойєр, Самір Ахмед, Семюель Мізерендіно, Семюел Волріч, Сандіні Агарвал, Сантьяго Ернандес, Сара Донг, Саванна Хеон, Скотт Етерсміт, Скотт Майєр МакКінні, Шон Фіцджеральд, Север Банесіу, Шамес Хемані, Шенджя Чжао, Шенлі Ху, Шибані Сантуркар, Шреяс Крішнасвамі, Шучао Бі, Шунью Яо, Шуюань Чжан, Сімон Посада Фішман, Спенсер Папай, Спуг Голден, Срінівас Нараянан, Стенлі Хсіе, Стівен Логсдон, Сандіп Тірумаларедді, Тал Страмер, Тао Ван, Тао Сінь, Тейлор Гордон, Теджал Патвардхан, Тібо Соттьо, Тіна Сріскандараджа, Тоні Каспарро, Тоні Чжао, Тревор Кріч, Узейр Навід Іфтіхар, Валері Ці, Вініт Косараджу, Вішал Куо, Вітчір Понг, Вівек Верма, Влад Петров, Венда Чжоу, Венлей Сє, Вентінг Чжан, Вілл ДеПуе, Вілл Еллсворт, Вільям Шеу, Вайатт Томпсон, Ямін Лін, Янн Дюбуа, Яодун Ю, Яра Хакбаз, Яш Патіл, Іфань Ву, Ілун Цінь, Інін Чен, Іруй Чжан, Йо Шавіт, Янг Ча, Юньюнь Ван, Юші Ван, Зак Султан, Зехао Доу, Цзевей Чу, Чжен Шао, Чжиган Ван, Чжишуай Чжан, Цзіхао Чжан