跳至主要內容
OpenAI

2025年12月16日

產品發布

全新「ChatGPT 圖像」功能登場

載入中…

全新版本的「ChatGPT 圖像」即日起正式推出,由最新的圖像生成旗艦模型驅動。現在,無論你是從零開始創作,還是後製編輯照片,都能得到心中所想的成果。新版本可在保留人物外貌細節的同時精確編輯,且圖像生成速度提升多達 4 倍。同時,我們也在 ChatGPT 中推出全新「圖像」專區,圖像生成現在更加趣味好玩,讓你靈感滿滿,盡情玩創意。

新的圖像模型和功能現已在 ChatGPT 中向所有使用者推出,並在 API 中以 gpt-image-1.5 提供。

精確編輯,保留重要細節

現在,當你上傳圖像並要求編輯時,模型會更可靠地遵循你的意圖,連細微之處也能準確掌握,只更改你所要求的部分,同時在原始圖像、生成結果與後續多次編輯之間,維持光線、構圖和人物外觀的一致性。

這讓生成結果能更貼近你的意圖,包括更實用的照片編輯、更逼真的服裝與髮型模擬穿搭,以及在保留原始圖像精髓的同時,套用風格濾鏡和進行概念轉換。綜合這些改進,ChatGPT 就像你的口袋創意工作室,既能完成實用的編輯需求,也能重塑創意,以更具表現力的方式呈現。

編輯

新推出的模型擅長各種編輯操作,包括新增、移除、合併、融合與重新配置,讓你精準完成想要的修改,卻又不失圖像原有的特色。

chatgpt-images-example-1-input-2chatgpt-images-example-1-input-1chatgpt-images-example-1-input-2

Combine the two men and the dog in a 2000s film camera-style photo of them looking bored at a kids birthday party.

chatgpt-images-example-1-output-1

Add chaotic kids in the background throwing things and screaming.

chatgpt-images-example-1-output-2

Change the man on the left to a hand-drawn retro anime style, the dog to plushie style, keep the man on the right and background scenery the way they are.

chatgpt-images-example-1-output-3
Screenshot 2025-12-12 at 10.23.01 AM

Put them all in OpenAI sweaters that look like this.

chatgpt-images-example-1-output-4

Now remove the two men, just keep the dog, and put them in an OpenAI livestream that looks like the attached image.

chatgpt-images-example-1-output-5

創意轉換

模型的創意在各種轉換中自然展現,透過調整或加入文字、版面等元素,讓構想逐步成形,同時保留關鍵細節。這些轉換對簡單或複雜的構想都適用,你也可直接透過全新 ChatGPT 圖像(在新視窗中開啟)功能中的預設風格與靈感快速嘗試,無需撰寫任何提示詞。

chatgpt-images-example-3-output-1

Make an old school golden age hollywood movie poster of a movie called 'codex' from the image of these two men. feel free to change their costumes to fit the times

Change the names of the actors to Wojciech Zaremba (left) and Greg Brockman (right) 

Directed by Sam Altman, produced by Fidji Simo. A Feel the AGI Pictures Production.

chatgpt-images-example-3-output-2

指令遵循

這一代模型在遵循指令方面的表現,比最初的版本更加可靠。不僅能進行更精準的編輯,也能完成更細緻的原創構圖,而且維持畫面邏輯,保留各個元素之間原本設定的關係。

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14

chatgpt-images-instruction-following-new

先前版本

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14

chatgpt-images-instruction-following-old

文字呈現

這個模型在文字呈現方面再進化,能清楚呈現更密集、字級更小的文字內容。

There is a newspaper on a desk. The newspaper shows the markdown below laid out as a **natural** newspaper article. Preserve all content, formatting, and numbers exactly. The image should be tall.

# Introducing GPT‑5.2

### *The most advanced frontier model for professional work and long-running agents*

**December 11, 2025**

---

We are introducing **GPT‑5.2**, the most capable model series yet for professional knowledge work.

Already, the average ChatGPT Enterprise user says AI saves them 40–60 minutes a day, and heavy users say it saves them more than 10 hours a week. We designed GPT‑5.2 to unlock even more economic value for people; it’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects.

GPT‑5.2 sets a new state of the art across many benchmarks, including GDPval, where it outperforms industry professionals at well-specified knowledge work tasks spanning 44 occupations.

---

## Benchmark highlights

| Benchmark | Domain | GPT‑5.2 Thinking | GPT‑5.1 Thinking |

|---|---|---:|---:|

| GDPval (wins or ties) | Knowledge work tasks | **70.9%** | 38.8% (GPT‑5) |

| SWE-Bench Pro (public) | Software engineering | **55.6%** | 50.8% |

| SWE-bench Verified | Software engineering | **80.0%** | 76.3% |

| GPQA Diamond (no tools) | Science questions | **92.4%** | 88.1% |

| CharXiv Reasoning (w/ Python) | Scientific figure questions | **88.7%** | 80.3% |

| AIME 2025 (no tools) | Competition math | **100.0%** | 94.0% |

| FrontierMath (Tier 1–3) | Advanced mathematics | **40.3%** | 31.0% |

| FrontierMath (Tier 4) | Advanced mathematics | **14.6%** | 12.5% |

| ARC-AGI-1 (Verified) | Abstract reasoning | **86.2%** | 72.8% |

| ARC-AGI-2 (Verified) | Abstract reasoning | **52.9%** | 17.6% |

---

Notion, Box, Shopify, Harvey, and Zoom observed that GPT‑5.2 demonstrates state-of-the-art long-horizon reasoning and tool-calling performance. Databricks, Hex, and Triple Whale found GPT‑5.2 to be exceptional at agentic data science and document analysis tasks. Cognition, Warp, Charlie Labs, JetBrains, and Augment Code report that GPT‑5.2 delivers state-of-the-art agentic coding performance, with measurable improvements in areas such as interactive coding, code reviews, and bug finding.

In ChatGPT, GPT‑5.2 Instant, Thinking, and Pro will begin rolling out today, starting with paid plans. In the API, they are available now to all developers.

Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision—making it better at executing complex, real-world tasks end-to-end than any previous model.

chatgpt-images-text-rendering-2

Now change the article to the markdown below:

# Introducing GPT‑Image‑1.5

### *The new and improved ChatGPT Images*

**December 16, 2025**

---

Today, we’re introducing a new and improved version of ChatGPT Images, powered by our best image generation model yet. With stronger instruction following and more precise editing, ChatGPT Images delivers the changes you ask for while keeping important details like facial likeness consistent across edits—now with generation speeds up to **4× faster**, making it easier to iterate and explore ideas with less waiting.

This is our most capable general-purpose text-to-image model to date, with more expressive transformations, improved dense text rendering, and more natural-looking results. Whether you’re making a tiny fix or a total reinvention, you can simply say what you want—or choose from preset styles and ideas in the new Images experience—and ChatGPT handles the rest, delivering results that are both useful and compelling, and better match your intent.

The new Images model and experience is beginning to roll out today in ChatGPT for all users, and in the API as **GPT‑Image‑1.5**.

---

## Results that match your intent

The model now follows instructions more reliably—down to the small details—changing what you ask for while able to keep elements like lighting, composition, and likeness consistent across inputs, outputs, and subsequent edits.

This unlocks results that match your intent—more useful photo edits, more believable clothing and hairstyle try-ons, alongside stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.

### Editing

The model excels at different types of editing so you get the changes you want without losing what makes the image special.

### Creative Transformations

The model’s creativity shines with creative transformations, changing and adding elements—like text and layout—that help the concept come to life while maintaining important details.

### Instruction Following

The model is able to better follow instructions versus GPT Image 1.0.

### Text Rendering

The model takes another step ahead in text rendering, capable of handling denser and smaller text.

---

## A new creation space

In addition to asking for images through ChatGPT by describing what you’d like to see, we’re also introducing a dedicated Images experience in the ChatGPT sidebar to make exploring and trying images easier and quicker. This includes preset filters and trending prompts to jump-start inspiration, as well as a one-time likeness upload so you can reuse your appearance across future creations without the need to go through your camera roll again.

Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings. Images now render up to four times faster, and you can continue generating new images while others are still in progress—so you can explore more ideas without waiting.

chatgpt-images-text-rendering-3

其他品質提升

該模型在其他方面也有所改進,更快生成可用的輸出結果,例如呈現多人畫面中眾多細小的人臉細節,整體視覺效果看起來更加自然。

make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…

chatgpt-images-quality-1

先前版本

make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…

chatgpt-images-quality-2

全新創作空間

除了透過描述你想看到的內容來生成圖像,我們也在 ChatGPT 中推出一個專屬的圖像(在新視窗中開啟)空間,可在行動應用程式與 chatgpt.com 的側邊欄中使用,讓探索和嘗試圖像的過程變得更快速、更簡單。其中包含數十種預設的風格濾鏡與提示詞靈感,並會定期更新,掌握最新創作趨勢。

上述升級讓你能更忠實呈現創作構想,無論是小幅調整,還是完整重塑,都能做出你想要的圖像成果。

ChatGPT 圖像:工作應用

此模型透過更快速的圖像生成、更精準的編輯能力,以及在多次調整過程中維持一致的視覺細節,協助簡化工作流程。團隊能更輕鬆地探索想法、進行重點調整,並將複雜或偏抽象的概念轉化為清楚的視覺呈現,支援行銷、設計、電子商務與內部溝通等多種應用情境。

改進與限制

我們重新測試了初次推出圖像生成功能時的多個範例,用來評估整體表現。結果顯示,模型在多種情境下都有明顯進步,但仍未臻理想。這次更新是一個重要的里程碑,但後續版本仍有相當大的改善空間。

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

chatgpt-images-output-1

先前版本

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

chatgpt-images-output-2

仍有部分科學細節不夠精確,但整體正確率約為 70%,畫面表現更加生動,也避免不當裁切。

API 中的 GPT 圖像 1.5

API 中的 gpt-image-1.5 同樣具備 ChatGPT 圖像的所有改進,在圖像保留與編輯能力方面,表現均優於 GPT 圖像 1。

使用者會發現在多次編輯過程中,品牌標誌與關鍵視覺元素都能維持一致呈現,因此特別適合用於行銷與品牌設計工作,例如圖像與標誌製作,也適合電子商務團隊從單一來源圖像生成完整的產品圖像目錄(包含不同款式、場景與拍攝角度)。

相較於 GPT 圖像 1,GPT 圖像 1.5 的圖像輸入與輸出成本降低 20%,讓你在相同預算下生成並反覆調整更多圖像。

你可以前往
OpenAI Playground(在新視窗中開啟) 試用新模型,或閱讀提示詞指南(在新視窗中開啟),發掘更多靈感。

目前已有來自創意工具、電子商務、行銷軟體等不同產業的企業與新創團隊,開始採用 GPT 圖像 1.5。以下是我們精選的幾個使用範例,歡迎參考。

chatgpt-images-API-output-1

先前版本

chatgpt-images-API-output-2

「GPT 圖像 1.5 能生成高擬真度的圖像,準確理解並回應指令,同時保留構圖、光影與細節層次。產出的畫面乾淨自然、穩定可靠,成功協助 Wix 等平台能更快把構想推進到實際製作階段。根據我們的測試結果與 Wix 的實際使用案例,這款模型憑藉穩定的一致性與出色的整體品質,躋身最先進旗艦級圖像生成模型之列。」

— Hila Gat,Wix AI 研究與資料科學部門主管

適用情況

全新的 ChatGPT 圖像功能即日起於各介面向全球所有 ChatGPT 與 API 使用者逐步推出。此功能可跨模型使用,無需額外選擇即可直接體驗。

我們相信,圖像生成的潛力才剛開始發揮。這次更新是一個重要進展,未來精彩可期,包括更細緻的編輯能力,以及在多語言情境下更豐富、細節更完整的輸出。

作者

OpenAI

Contributors

Project Leadership

Gabriel Goh — Research Lead

Adele Li — Product Lead

Bill Peebles — Sora Lead 

Aditya Ramesh — World Simulation Lead

Mark Chen — Chief Research Officer

Prafulla Dhariwal — Multimodal Lead

Core Team 

Alex Fang, Alex Yu, Ben Wang, Bing Liang, Boyuan Chen, Charlie Nash, David Medina, Dibya Bhattacharjee, Jianfeng Wang, Kenji Hata, Kiwhan Song, Mengchao Zhong, Mike Starr, Yuguang Yang

Research Contributors

Bram Wallace, Dmytro Okhonko, Haitang Hu, Kshitij Gupta, Li Jing, Lu Liu, Peter Zhokhov, Qiming Yuan, Senthil Purushwalkam, Yizhen Zhang

Core Inference

Adam Tart, Alyssa Huang, Andrew Braunstein, Jane Park, Karen Li, Tomer Kaftan

Research Collaborators

Aditya Ramesh, Alex Nichol, Andrew Kondrich, Andrew Liu, Benedikt Winter, Bill Peebles, Connor Holmes, Cyril Zhang, Daniel Geng, Eric Mintun, James Betker, Jamie Kiros, Manuka Stratta, Martin Li, Raoul de Liedekerke, Ricky Wang, Ruslan Vasilev, Vladimir Chalyshev, Welton Wang, Wyatt Thompson, Yaming Lin

Inference Collaborators

Jiayu Bai, Kevin King, Stanley Hsieh, Weiyi Zheng

Data & Evaluation

Alexandra Barr, Aparna Dutta, Arshi Bhatnagar, Chao Yu, Charlotte Cole, Dragos Oprica, Emma Tang, Gowrishankar Sunder, Henry Baer, Ian Sohl, James Park Lennon, Jason Xu, Peilin Yang, Somay Jain, Szi-chieh Yu, Wesam Manassra, Xiaolei Zhu, Yilei Qian

Applied

Affonso Reis, Alan Gou, Alexandra Vodopianova, Amandeep Grewal, Andi Liu, Andrew Sima, Angus Fletcher, Antonia Woodford, Arun Eswara, Benny Wong, Bharat Rangan, Boyang Niu, Bridget Collins, Bryan Brandow, Callie Riggins Zetino, Chris Wendel, Ethan Chang, Gilman Tolle, Greg Hochmuth, Ibrahim Okuyucu, Jesse Chand, Jesse Hendrickson, Jiayu Bai, Jimmy Lin, Johan Cervantes, Kan Wu, Liam Esparraguera, Maja Wichrowska, Matthew Ferrari, Murat Yesildal, Nikunj Handa, Nithanth Kudige, Ola Okelola, Osman Khwaja, Peter Argany, Peter Bakkum, Peter Vidani, Richard Zadorozny, Rohan Sahai, Savelii Bondini, Sean Chang, Vickie Duong, Victoria Huang, Xiaolin Hao, Xueqing Li

Safety, Safety Systems, Integrity, Policy & Trust

Abby Fanlo Susk, Adam Wells, Aleah Houze, Annie Cheng, Artyi Xu, Carolina Paz, David Abelman, Femi Alamu, Jay Wang, Jeremiah Currier, Jesika Haria, Mariya Guryeva, Max Burkhardt, Paige Walker, Pedro Aguilar, Rutsu Koshimizu, Sam Toizer, Savannah Heon, Tom Rubin, Tonia Osadebe, Willow Primack, Zoe Stoll

Product Operations, Program Management and Governance

Antonio Di Francesco, Filippo Raso, Grace Wu, Josh Metherd, Ruth Costigan

Legal

Ally Bennett, Tony Song, Tyce Walters

Communications, Marketing, Community, Design & Creative

Akash Iyer, Alex Baker-Whitcomb, Angie Luo, Anne Oburgh, Antonia Richmond, Annie Tsang, Ashley Tyra, Bailey Richardson, Brandon McGraw, Cary Hudson, Dana Palmie, Evan Corrigan, Gaby Raila, Indgila Samad Ali, James Anderson, Jeremy Schwartz, Jordan Liss, Juan Garza, Julie Steele, Kara Zichittella, Karn Piluntanadilok, Kendal Peirce, Kim Baschet, Leah Anise, Livvy Pierce, Maria Clara M. Fleury Osorio, Minnia Feng, Nick Ciffone, Nick Forland, Niko Felix, Paige Ford, Rachel Puckett, Rishabh Aggarwal, Rusty Rupprecht, Souki Mansoor, Tasia Potasinski, Taya Christianson, Vasundhara Mudgil, Whitney Ferris, Yara Khakbaz, Zach Brock, Zoë Silverman

Special Thanks

Amy Yang, Arvin Wu, Avital Oliver, Brandon McKinzie, Chak Li, Chris Lu, David Duxin, Dian Ang Yap, Gabriel Petersson, Guillaume Leclerc, Hazel Byrne, Henry Aspegren, Jennifer Luckenbill, Ji Lin, Joseph Mo, Julius Hochmuth, Liunian (Harold) Li, Long Ouyang, Mariano López, Michael Zhang, Ravi Teja Mullapudi, Suvansh Sanjeev, Varun Shetty, Wenda Zhou

Exec

Fidji Simo, Hannah Wong, Jakub Pachocki, Jason Kwon, Johannes Heidecke, Kate Rouch, Lauren Itow, Mark Chen, Mia Glaese, Nick Ryder, Nick Turley, Prafulla Dhariwal, Sam Altman, Sulman Choudhry