December 16, 2025

The new ChatGPT Images is here


Today, we're releasing a new version of ChatGPT Images, powered by our new flagship image generation model. Now, whether you're creating something from scratch or editing a photo, you'll get the result you have in mind. It makes precise edits while keeping details such as people's appearance intact, and it generates images up to 4x faster. Alongside this, we're introducing a new Images feature in ChatGPT, designed to make image generation delightful, spark inspiration, and make creative exploration easy.

The new Images model and feature are rolling out today in ChatGPT for all users, and are available in the API as gpt-image-1.5.

Precise edits that preserve what matters

Now, when you ask for edits to an uploaded image, the model adheres to your intent more reliably, down to the small details. It changes only what you ask for while keeping elements like lighting, composition, and people's appearance consistent across inputs, outputs, and subsequent edits.

This unlocks results that match your intent: more useful photo edits, more believable clothing and hairstyle try-ons, and stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.

Editing

The model excels at different types of editing, including adding, subtracting, combining, blending, and transposing, so you can make the changes you want without losing what makes the image special.

Combine the two men and the dog in a 2000s film camera-style photo of them looking bored at a kids birthday party.

Add chaotic kids in the background throwing things and screaming.

Change the man on the left to a hand-drawn retro anime style, the dog to plushie style, keep the man on the right and background scenery the way they are.

Put them all in OpenAI sweaters that look like this.

Now remove the two men, just keep the dog, and put them in an OpenAI livestream that looks like the attached image.

Creative transformations

The model's creativity shines in transformations that change and add elements, such as text and layout, to bring ideas to life while preserving important details. These transformations work for both simple and more intricate concepts, and you can easily try them using preset styles and ideas in the new ChatGPT Images feature, with no written prompt required.

Make an old school golden age hollywood movie poster of a movie called 'codex' from the image of these two men. feel free to change their costumes to fit the times

Change the names of the actors to Wojciech Zaremba (left) and Greg Brockman (right)

Directed by Sam Altman, produced by Fidji Simo. A Feel the AGI Pictures Production.

Instruction following

The model follows instructions more reliably than our initial version. This enables more precise edits as well as more intricate original compositions, in which relationships between elements are preserved as intended.

New

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14


Previous

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14


Text rendering

The model takes another step forward in text rendering and can handle denser and smaller text.

There is a newspaper on a desk. The newspaper shows the markdown below laid out as a **natural** newspaper article. Preserve all content, formatting, and numbers exactly. The image should be tall.

# Introducing GPT‑5.2

### *The most advanced frontier model for professional work and long-running agents*

**December 11, 2025**

---

We are introducing **GPT‑5.2**, the most capable model series yet for professional knowledge work.

Already, the average ChatGPT Enterprise user says AI saves them 40–60 minutes a day, and heavy users say it saves them more than 10 hours a week. We designed GPT‑5.2 to unlock even more economic value for people; it’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects.

GPT‑5.2 sets a new state of the art across many benchmarks, including GDPval, where it outperforms industry professionals at well-specified knowledge work tasks spanning 44 occupations.

---

## Benchmark highlights

|---|---|---:|---:|

| GDPval (wins or ties) | Knowledge work tasks | **70.9%** | 38.8% (GPT‑5) |

| SWE-Bench Pro (public) | Software engineering | **55.6%** | 50.8% |

| SWE-bench Verified | Software engineering | **80.0%** | 76.3% |

| GPQA Diamond (no tools) | Science questions | **92.4%** | 88.1% |

| CharXiv Reasoning (w/ Python) | Scientific figure questions | **88.7%** | 80.3% |

| AIME 2025 (no tools) | Competition math | **100.0%** | 94.0% |

| FrontierMath (Tier 1–3) | Advanced mathematics | **40.3%** | 31.0% |

| FrontierMath (Tier 4) | Advanced mathematics | **14.6%** | 12.5% |

| ARC-AGI-1 (Verified) | Abstract reasoning | **86.2%** | 72.8% |

| ARC-AGI-2 (Verified) | Abstract reasoning | **52.9%** | 17.6% |

---

Notion, Box, Shopify, Harvey, and Zoom observed that GPT‑5.2 demonstrates state-of-the-art long-horizon reasoning and tool-calling performance. Databricks, Hex, and Triple Whale found GPT‑5.2 to be exceptional at agentic data science and document analysis tasks. Cognition, Warp, Charlie Labs, JetBrains, and Augment Code report that GPT‑5.2 delivers state-of-the-art agentic coding performance, with measurable improvements in areas such as interactive coding, code reviews, and bug finding.

In ChatGPT, GPT‑5.2 Instant, Thinking, and Pro will begin rolling out today, starting with paid plans. In the API, they are available now to all developers.

Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision—making it better at executing complex, real-world tasks end-to-end than any previous model.

Now change the article to the markdown below:

# Introducing GPT‑Image‑1.5

### *The new and improved ChatGPT Images*

**December 16, 2025**

---

Today, we’re introducing a new and improved version of ChatGPT Images, powered by our best image generation model yet. With stronger instruction following and more precise editing, ChatGPT Images delivers the changes you ask for while keeping important details like facial likeness consistent across edits—now with generation speeds up to **4× faster**, making it easier to iterate and explore ideas with less waiting.

This is our most capable general-purpose text-to-image model to date, with more expressive transformations, improved dense text rendering, and more natural-looking results. Whether you’re making a tiny fix or a total reinvention, you can simply say what you want—or choose from preset styles and ideas in the new Images experience—and ChatGPT handles the rest, delivering results that are both useful and compelling, and better match your intent.

The new Images model and experience is beginning to roll out today in ChatGPT for all users, and in the API as **GPT‑Image‑1.5**.

---

## Results that match your intent

The model now follows instructions more reliably—down to the small details—changing what you ask for while able to keep elements like lighting, composition, and likeness consistent across inputs, outputs, and subsequent edits.

This unlocks results that match your intent—more useful photo edits, more believable clothing and hairstyle try-ons, alongside stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.

### Editing

The model excels at different types of editing so you get the changes you want without losing what makes the image special.

### Creative Transformations

The model’s creativity shines with creative transformations, changing and adding elements—like text and layout—that help the concept come to life while maintaining important details.

### Instruction Following

The model is able to better follow instructions versus GPT Image 1.0.

### Text Rendering

The model takes another step ahead in text rendering, capable of handling denser and smaller text.

---

## A new creation space

In addition to asking for images through ChatGPT by describing what you’d like to see, we’re also introducing a dedicated Images experience in the ChatGPT sidebar to make exploring and trying images easier and quicker. This includes preset filters and trending prompts to jump-start inspiration, as well as a one-time likeness upload so you can reuse your appearance across future creations without the need to go through your camera roll again.

Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings. Images now render up to four times faster, and you can continue generating new images while others are still in progress—so you can explore more ideas without waiting.

More quality improvements

The model also improves along additional dimensions that make outputs more immediately usable, such as rendering many small faces and producing more natural-looking results.

New

make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…

Previous

A new space for creation

In addition to generating images by describing what you'd like to see in a message, we've introduced a dedicated home for Images in ChatGPT, available through the mobile app and in the sidebar on chatgpt.com, to make exploring and trying images faster and easier. It includes dozens of preset filters and prompts to jump-start inspiration, updated regularly to reflect emerging trends.

Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings.

ChatGPT Images for work

With faster image generation, precise edits, and visual details that stay consistent across iterations, the model streamlines business workflows. Teams can explore ideas, make targeted changes, and visualize complex or dry concepts, supporting use cases across marketing, design, e-commerce, and internal communications.

Improvements and limitations

We re-ran many of the examples from our initial image generation launch to evaluate performance. The model shows clear improvements across a range of cases, though results are still imperfect. While this release represents meaningful progress, there is still significant room for improvement in future versions.

New

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

Previous

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

There are still some scientific inaccuracies, but the result is roughly 70% correct, has much more vivid graphics, and avoids premature cropping.

GPT Image 1.5 in the API

gpt-image-1.5 in the API delivers all the same improvements as ChatGPT Images: it is stronger than GPT Image 1 at both image preservation and editing.

You'll see more consistent preservation of brand logos and key visuals across edits, which makes it well suited to marketing and brand work such as creating graphics and logos, and to e-commerce teams generating full product image catalogs (variants, scenes, and angles) from a single source image.

Image inputs and outputs are now 20% cheaper in GPT Image 1.5 than in GPT Image 1, so you can generate and iterate on more images with the same budget.
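As a rough worked example of what a 20% price cut means for a fixed budget, the sketch below compares how many images the same budget covers before and after the discount. The per-image cost is a made-up placeholder, not a published rate, and the helper name `images_per_budget` is hypothetical.

```python
from decimal import Decimal


def images_per_budget(budget: Decimal, cost_per_image: Decimal) -> int:
    """Return how many whole images a budget covers at a given per-image cost."""
    return int(budget // cost_per_image)


# Hypothetical baseline cost per image (placeholder, not a real price).
OLD_COST = Decimal("0.040")
DISCOUNT = Decimal("0.20")              # the 20% reduction stated in the text
NEW_COST = OLD_COST * (1 - DISCOUNT)    # 0.032 per image after the discount

budget = Decimal("100")
old_count = images_per_budget(budget, OLD_COST)   # 2500 images
new_count = images_per_budget(budget, NEW_COST)   # 3125 images

print(old_count, new_count, new_count / old_count)
```

Note that a 20% lower price buys 25% more images for the same spend (1 / 0.8 = 1.25), not 20% more.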

You can try the new model in the OpenAI Playground or read the prompting guide for inspiration.
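For readers who want to call the model from code rather than the Playground, here is a minimal sketch using the OpenAI Python SDK. It assumes that gpt-image-1.5 accepts the same Images API parameters as GPT Image 1 and returns base64-encoded image data; the helper names, the default size, and the output path are illustrative choices, not part of the announcement.

```python
import base64
from pathlib import Path


def build_generation_request(prompt: str, size: str = "1024x1024") -> dict:
    """Assemble keyword arguments for an image generation call (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    # The model name comes from the announcement; the size value is an assumption.
    return {"model": "gpt-image-1.5", "prompt": prompt, "size": size}


def generate_image(prompt: str, out_path: str = "output.png") -> Path:
    """Send the request and save the first image (needs network access and OPENAI_API_KEY)."""
    from openai import OpenAI  # imported lazily so the helper above works without the SDK

    client = OpenAI()
    result = client.images.generate(**build_generation_request(prompt))
    # Assumes the response carries base64 image data, as with GPT Image 1.
    image_bytes = base64.b64decode(result.data[0].b64_json)
    path = Path(out_path)
    path.write_bytes(image_bytes)
    return path


request = build_generation_request(
    "a poster of deep sea creatures at different depths, with a vertical ocean cutaway"
)
print(request["model"], request["size"])
```

Calling `generate_image(...)` with the same prompt would perform the actual request; keeping request construction separate makes the parameters easy to inspect or log before spending any budget.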

Enterprises and startups across industries, including creative tools, e-commerce, marketing software, and more, are already using GPT Image 1.5. We're happy to share a few of these examples below.


"GPT Image 1.5 produces high-resolution images with close adherence to instructions, preserving composition, lighting, and fine details. The results are clean, realistic, and dependable, supporting faster concept-to-production workflows on platforms like Wix. Based on our testing and the core use cases we see at Wix, its consistency and quality combine to make it one of today's flagship image generation models."

- Hila Gat, Director of AI Research and Data Science at Wix

Availability

The new ChatGPT Images is rolling out now to all ChatGPT users and API users globally. It works across all models, so you don't need to select anything to use it.

We believe we're still at the beginning of what image generation can enable. Today's update is a meaningful step forward, and more is on the way, from finer-grained edits to richer, more detailed outputs across languages.

2025

Author

OpenAI

Contributors

Project Leadership

Gabriel Goh — Research Lead

Adele Li — Product Lead

Bill Peebles — Sora Lead

Aditya Ramesh — World Simulation Lead

Mark Chen — Chief Research Officer

Prafulla Dhariwal — Multimodal Lead

Core Team

Alex Fang, Alex Yu, Ben Wang, Bing Liang, Boyuan Chen, Charlie Nash, David Medina, Dibya Bhattacharjee, Jianfeng Wang, Kenji Hata, Kiwhan Song, Mengchao Zhong, Mike Starr, Yuguang Yang

Research Contributors

Bram Wallace, Dmytro Okhonko, Haitang Hu, Kshitij Gupta, Li Jing, Lu Liu, Peter Zhokhov, Qiming Yuan, Senthil Purushwalkam, Yizhen Zhang

Core Inference

Adam Tart, Alyssa Huang, Andrew Braunstein, Jane Park, Karen Li, Tomer Kaftan

Research Collaborators

Aditya Ramesh, Alex Nichol, Andrew Kondrich, Andrew Liu, Benedikt Winter, Bill Peebles, Connor Holmes, Cyril Zhang, Daniel Geng, Eric Mintun, James Betker, Jamie Kiros, Manuka Stratta, Martin Li, Raoul de Liedekerke, Ricky Wang, Ruslan Vasilev, Vladimir Chalyshev, Welton Wang, Wyatt Thompson, Yaming Lin

Inference Collaborators

Jiayu Bai, Kevin King, Stanley Hsieh, Weiyi Zheng

Data & Evaluation

Alexandra Barr, Aparna Dutta, Arshi Bhatnagar, Chao Yu, Charlotte Cole, Dragos Oprica, Emma Tang, Gowrishankar Sunder, Henry Baer, Ian Sohl, James Park Lennon, Jason Xu, Peilin Yang, Somay Jain, Szi-chieh Yu, Wesam Manassra, Xiaolei Zhu, Yilei Qian

Applied

Affonso Reis, Alan Gou, Alexandra Vodopianova, Amandeep Grewal, Andi Liu, Andrew Sima, Angus Fletcher, Antonia Woodford, Arun Eswara, Benny Wong, Bharat Rangan, Boyang Niu, Bridget Collins, Bryan Brandow, Callie Riggins Zetino, Chris Wendel, Ethan Chang, Gilman Tolle, Greg Hochmuth, Ibrahim Okuyucu, Jesse Chand, Jesse Hendrickson, Jiayu Bai, Jimmy Lin, Johan Cervantes, Kan Wu, Liam Esparraguera, Maja Wichrowska, Matthew Ferrari, Murat Yesildal, Nikunj Handa, Nithanth Kudige, Ola Okelola, Osman Khwaja, Peter Argany, Peter Bakkum, Peter Vidani, Richard Zadorozny, Rohan Sahai, Savelii Bondini, Sean Chang, Vickie Duong, Victoria Huang, Xiaolin Hao, Xueqing Li

Safety, Safety Systems, Integrity, Policy & Trust

Abby Fanlo Susk, Adam Wells, Aleah Houze, Annie Cheng, Artyi Xu, Carolina Paz, David Abelman, Femi Alamu, Jay Wang, Jeremiah Currier, Jesika Haria, Mariya Guryeva, Max Burkhardt, Paige Walker, Pedro Aguilar, Rutsu Koshimizu, Sam Toizer, Savannah Heon, Tom Rubin, Tonia Osadebe, Willow Primack, Zoe Stoll

Product Operations, Program Management and Governance

Antonio Di Francesco, Filippo Raso, Grace Wu, Josh Metherd, Ruth Costigan

Legal

Ally Bennett, Tony Song, Tyce Walters

Communications, Marketing, Community, Design & Creative

Akash Iyer, Alex Baker-Whitcomb, Angie Luo, Anne Oburgh, Antonia Richmond, Annie Tsang, Ashley Tyra, Bailey Richardson, Brandon McGraw, Cary Hudson, Dana Palmie, Evan Corrigan, Gaby Raila, Indgila Samad Ali, James Anderson, Jeremy Schwartz, Jordan Liss, Juan Garza, Julie Steele, Kara Zichittella, Karn Piluntanadilok, Kendal Peirce, Kim Baschet, Leah Anise, Livvy Pierce, Maria Clara M. Fleury Osorio, Minnia Feng, Nick Ciffone, Nick Forland, Niko Felix, Paige Ford, Rachel Puckett, Rishabh Aggarwal, Rusty Rupprecht, Souki Mansoor, Tasia Potasinski, Taya Christianson, Vasundhara Mudgil, Whitney Ferris, Yara Khakbaz, Zach Brock, Zoë Silverman

Special Thanks

Amy Yang, Arvin Wu, Avital Oliver, Brandon McKinzie, Chak Li, Chris Lu, David Duxin, Dian Ang Yap, Gabriel Petersson, Guillaume Leclerc, Hazel Byrne, Henry Aspegren, Jennifer Luckenbill, Ji Lin, Joseph Mo, Julius Hochmuth, Liunian (Harold) Li, Long Ouyang, Mariano López, Michael Zhang, Ravi Teja Mullapudi, Suvansh Sanjeev, Varun Shetty, Wenda Zhou

Exec

Fidji Simo, Hannah Wong, Jakub Pachocki, Jason Kwon, Johannes Heidecke, Kate Rouch, Lauren Itow, Mark Chen, Mia Glaese, Nick Ryder, Nick Turley, Prafulla Dhariwal, Sam Altman, Sulman Choudhry
