25 tháng 3, 2025

Giới thiệu tính năng Tạo sinh ảnh 4o

Mở khóa khả năng tạo sinh ảnh hữu ích và có giá trị với một mô hình đa phương thức tự nhiên có khả năng tạo ra kết quả chính xác, chi tiết và chân thực như ảnh chụp.

Thử trong ChatGPT

Đang tải…

Tại OpenAI, từ lâu chúng tôi tin rằng tạo sinh ảnh nên là một khả năng chính trong các mô hình ngôn ngữ của chúng tôi. Đó là lý do tại sao chúng tôi đã tích hợp trình tạo hình ảnh tiên tiến nhất của mình vào GPT‑4o. Kết quả là—tạo sinh ảnh không chỉ đẹp mà còn hữu ích.

A wide image taken with a phone of a glass whiteboard, in a room overlooking the Bay Bridge. The field of view shows a woman writing, sporting a tshirt wiith a large OpenAI logo. The handwriting looks natural and a bit messy, and we see the photographer's reflection.

The text reads:

(left)
"Transfer between Modalities:

Suppose we directly model
p(text, pixels, sound) [equation]
with one big autoregressive transformer.

Pros:
* image generation augmented with vast world knowledge
* next-level text rendering
* native in-context learning
* unified post-training stack

Cons:
* varying bit-rate across modalities
* compute not adaptive"

(Right)
"Fixes:
* model compressed representations
* compose autoregressive prior with a powerful decoder"

On the bottom right of the board, she draws a diagram:
"tokens -> [transformer] -> [diffusion] -> pixels"

^{Best of 8}

selfie view of the photographer, as she turns around to high five him

^{Best of 8}

Tạo sinh ảnh hữu ích

Từ những bức tranh hang động đầu tiên đến các đồ họa thông tin hiện đại, con người đã sử dụng hình ảnh trực quan để giao tiếp, thuyết phục và phân tích—không chỉ để trang trí. Các mô hình tạo sinh ngày nay có thể tạo ra những cảnh tượng siêu thực, ngoạn mục, nhưng lại gặp khó khăn với những hình ảnh thông dụng mà con người sử dụng để chia sẻ và tạo thông tin. Từ logo đến sơ đồ, hình ảnh có thể truyền tải ý nghĩa chính xác khi được bổ sung với các biểu tượng liên quan đến ngôn ngữ và trải nghiệm chung.

Tính năng tạo sinh ảnh của GPT‑4o xuất sắc trong việc tái hiện chính xác văn bản, tuân thủ chặt chẽ các lời nhắc, và tận dụng cơ sở kiến thức cùng ngữ cảnh trò chuyện vốn có của 4o—bao gồm cả việc biến đổi các hình ảnh đã tải lên hoặc sử dụng chúng làm nguồn cảm hứng hình ảnh. Những khả năng này giúp dễ dàng hơn trong việc tạo ra chính xác hình ảnh mà bạn hình dung, giúp bạn giao tiếp hiệu quả hơn qua hình ảnh và đưa việc tạo sinh ảnh trở thành một công cụ thực tiễn với độ chính xác và mạnh mẽ.

Nhiều khả năng được tăng cường

Chúng tôi huấn luyện các mô hình của mình trên phân bố kết hợp giữa hình ảnh và văn bản trực tuyến, học không chỉ cách các hình ảnh liên quan đến ngôn ngữ, mà còn cách chúng liên quan đến nhau. Kết hợp với quá trình huấn luyện bổ sung mạnh mẽ, mô hình kết quả có độ lưu loát thị giác đáng ngạc nhiên, có khả năng tạo ra những hình ảnh hữu ích, nhất quán và nhận biết ngữ cảnh.

Kết xuất văn bản

Một bức tranh đáng giá ngàn lời nói, nhưng đôi khi việc tạo ra vài từ đúng chỗ có thể nâng cao ý nghĩa của một bức ảnh. Khả năng của 4o trong việc kết hợp các ký hiệu chính xác với hình ảnh biến việc tạo sinh ảnh thành một công cụ giao tiếp trực quan.

Create a photorealistic image of two witches in their 20s (one ash balayage, one with long wavy auburn hair) reading a street sign.

Context:
a city street in a random street in Williamsburg, NY with a pole covered entirely by numerous detailed street signs (e.g., street sweeping hours, parking permits required, vehicle classifications, towing rules), including few ridiculous signs at the middle: (paraphrase it to make these legitimate street signs)"Broom Parking for Witches Not Permitted in Zone C" and "Magic Carpet Loading and Unloading Only (15-Minute Limit)" and "Reindeer Parking by Permit Only (Dec 24–25)
Violators will be placed on Naughty List." The signpost is on the right of a street. Do not repeat signs. Signs must be realistic.

Characters:
one witch is holding a broom and the other has a rolled-up magic carpet. They are in the foreground, back slightly turned towards the camera and head slightly tilted as they scrutinize the signs.

Composition from background to foreground:
streets + parked cars + buildings -> street sign -> witches. Characters must be closest to the camera taking the shot

^{Best of ~8}

Tạo sinh đa lượt

Vì tạo sinh ảnh hiện đã được tích hợp vào GPT‑4o, bạn có thể tinh chỉnh ảnh thông qua trò chuyện tự nhiên. GPT‑4o có thể xây dựng dựa trên ảnh và văn bản trong ngữ cảnh chat, đảm bảo tính nhất quán xuyên suốt. Ví dụ, nếu bạn đang thiết kế một ký tự trong trò chơi điện tử, ngoại hình của ký tự sẽ vẫn nhất quán qua nhiều lần lặp lại khi bạn tinh chỉnh và thử nghiệm.

Give this cat a detective hat and a monocle

^{Best of 1}

turn this into a triple A video games made with a 4k game engine and add some User interface as overlay from a mystery RPG where we can see a health bar and a minimap at the top as well as spells at the bottom with consistent and iconography

^{Best of 1}

update to a landscape image 16:9 ratio, add more spells in the UI, and unzoom the visual so that we see the cat in a third person view walking through a steampunk manhattan creating beautiful contrast and lighting like in the best triple A game, with cool-toned colors

^{Best of 2}

create the interface when the player opens the menu and we see the cat's character profile with his equipment and another page showing active quests (and it should make sense in relationship with the universe worldbuilding we are describing in the image)

^{Best of 8}

credit creator: Manuel Sainsily

Thực hiện theo hướng dẫn

Việc tạo sinh ảnh của GPT‑4o tuân theo các lời nhắc chi tiết với sự chú ý đến từng chi tiết. Trong khi các hệ thống khác gặp khó khăn với khoảng 5-8 đối tượng, GPT‑4o có thể xử lý lên đến 10-20 đối tượng khác nhau. Việc liên kết chặt chẽ hơn giữa các đối tượng với các thuộc tính và mối quan hệ của chúng cho phép kiểm soát tốt hơn.

A square image containing a 4 row by 4 column grid containing 16 objects on a white background. Go from left to right, top to bottom. Here's the list:
1. a blue star
2. red triangle
3. green square
4. pink circle
5. orange hourglass
6. purple infinity sign
7. black and white polka dot bowtie
8. tiedye "42"
9. an orange cat wearing a black baseball cap
10. a map with a treasure chest
11. a pair of googly eyes
12. a thumbs up emoji
13. a pair of scissors
14. a blue and white giraffe
15. the word "OpenAI" written in cursive
16. a rainbow-colored lightning bolt

^{Best of 5}

Học tập trong ngữ cảnh

GPT‑4o có thể phân tích và học hỏi từ các hình ảnh do người dùng tải lên, tích hợp liền mạch các chi tiết của chúng vào ngữ cảnh để hỗ trợ việc tạo sinh ảnh.

draw a design for a vehicle with triangular wheels, using these images as reference.
label the front wheel, the back wheel, and at the of the diagram say (in small caps)
TRIANGLE WHEELED VEHICLE. English Patent. 2025. OPENAI.

^{Best of ~16}

now put this in a photo taken in new york city.

^{Best of ~16}

Tri thức về thế giới

Tính năng tạo sinh ảnh gốc cho phép 4o liên kết kiến thức giữa văn bản và hình ảnh, tạo ra một mô hình có vẻ thông minh hơn và hiệu quả hơn.

Code Example (Three.js)

HTML

1<!DOCTYPE html>
2<html lang="en">
3  <head>
4    <meta charset="UTF-8" />
5    <title>OpenAI Banner</title>
6    <style>
7      body { margin: 0; overflow: hidden; }
8      canvas { display: block; }
9    </style>
10  </head>
11  <body>
12    <script type="module">
13      import * as THREE from 'https://cdn.jsdelivr.net/npm/three@0.160.0/build/three.module.js';
14      import { OrbitControls } from 'https://cdn.jsdelivr.net/npm/three@0.160.0/examples/jsm/controls/OrbitControls.js';
15      import { FontLoader } from 'https://cdn.jsdelivr.net/npm/three@0.160.0/examples/jsm/loaders/FontLoader.js';
16      import { TextGeometry } from 'https://cdn.jsdelivr.net/npm/three@0.160.0/examples/jsm/geometries/TextGeometry.js';
17
18      const scene = new THREE.Scene();
19      const camera = new THREE.PerspectiveCamera(45, window.innerWidth / window.innerHeight, 0.1, 1000);
20      const renderer = new THREE.WebGLRenderer({ antialias: true });
21      renderer.setSize(window.innerWidth, window.innerHeight);
22      document.body.appendChild(renderer.domElement);
23
24      // Lighting
25      const light = new THREE.AmbientLight(0xffffff, 1);
26      scene.add(light);
27
28      const dirLight = new THREE.DirectionalLight(0xffffff, 1);
29      dirLight.position.set(0, 5, 10);
30      scene.add(dirLight);
31
32      // Camera position
33      camera.position.z = 20;
34
35      // Controls
36      const controls = new OrbitControls(camera, renderer.domElement);
37
38      // Banner background
39      const bannerGeometry = new THREE.PlaneGeometry(20, 10);
40      const bannerMaterial = new THREE.MeshStandardMaterial({ color: 0x1a1a1a });
41      const banner = new THREE.Mesh(bannerGeometry, bannerMaterial);
42      scene.add(banner);
43
44      // OpenAI Logo texture (placeholder)
45      const loader = new THREE.TextureLoader();
46      loader.load('https://upload.wikimedia.org/wikipedia/commons/4/4d/OpenAI_Logo.svg', texture => {
47        const logoGeometry = new THREE.PlaneGeometry(4, 4);
48        const logoMaterial = new THREE.MeshBasicMaterial({ map: texture, transparent: true });
49        const logo = new THREE.Mesh(logoGeometry, logoMaterial);
50        logo.position.set(-5, 0, 0.1); // Slightly in front of the banner
51        scene.add(logo);
52      });
53
54      // Load font and add text
55      const fontLoader = new FontLoader();
56      fontLoader.load('https://threejs.org/examples/fonts/helvetiker_regular.typeface.json', font => {
57        const textGeometry = new TextGeometry("I am 4-o", {
58          font: font,
59          size: 1,
60          height: 0.2,
61          curveSegments: 12,
62          bevelEnabled: true,
63          bevelThickness: 0.02,
64          bevelSize: 0.02,
65          bevelOffset: 0,
66          bevelSegments: 5
67        });
68
69        textGeometry.center();
70
71        const textMaterial = new THREE.MeshStandardMaterial({ color: 0x00ffcc });
72        const textMesh = new THREE.Mesh(textGeometry, textMaterial);
73        textMesh.position.set(5, -0.5, 0.1); // Opposite side of logo
74        scene.add(textMesh);
75      });
76
77      // Resize handler
78      window.addEventListener('resize', () => {
79        camera.aspect = window.innerWidth / window.innerHeight;
80        camera.updateProjectionMatrix();
81        renderer.setSize(window.innerWidth, window.innerHeight);
82      });
83
84      // Render loop
85      function animate() {
86        requestAnimationFrame(animate);
87        controls.update();
88        renderer.render(scene, camera);
89      }
90
91      animate();
92    </script>
93  </body>
94</html>

make an image of what this means to you

Chủ nghĩa hiện thực và phong cách

Việc huấn luyện trên các hình ảnh phản ánh nhiều phong cách hình ảnh khác nhau cho phép mô hình tạo hoặc chuyển đổi hình ảnh một cách thuyết phục.

A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.
A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.
A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.

A cat looking into a puddle of water on a street, but its reflection is that of a tiger, and both reflections are realistically distorted by ripples in the water — A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.
A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.
A candid paparazzi-style photo of Karl Marx hurriedly walking through the parking lot of the Mall of America, glancing over his shoulder with a startled expression as he tries to avoid being photographed. He’s clutching multiple glossy shopping bags filled with luxury goods. His coat flutters behind him in the wind, and one of the bags is swinging as if he’s mid-stride. Blurred background with cars and a glowing mall entrance to emphasize motion. Flash glare from the camera partially overexposes the image, giving it a chaotic, tabloid feel.

Hạn chế

Mô hình của chúng tôi không hoàn hảo. Chúng tôi nhận thức được nhiều hạn chế hiện tại mà chúng tôi sẽ làm việc để khắc phục thông qua việc cải thiện mô hình sau khi ra mắt ban đầu.

Chúng tôi nhận thấy GPT‑4o đôi khi có thể cắt ảnh dài hơn, như áp phích, quá chặt, đặc biệt là gần phía dưới.

An toàn

Trên cơ sở tuân theo Đặc tả mô hình của chúng tôi, chúng tôi hướng tới việc tối đa hóa sự tự do sáng tạo bằng cách hỗ trợ các trường hợp sử dụng có giá trị như phát triển trò chơi, khám phá lịch sử và giáo dục—mà vẫn duy trì các tiêu chuẩn an toàn nghiêm ngặt. Đồng thời, việc ngăn chặn các yêu cầu vi phạm các tiêu chuẩn này vẫn là ưu tiên hàng đầu. Dưới đây là các đánh giá về các lĩnh vực rủi ro bổ sung mà chúng tôi đang làm việc để cho phép nội dung an toàn, có tính ứng dụng cao và hỗ trợ biểu đạt sáng tạo rộng rãi hơn cho người dùng.

Khả năng truy nguyên qua C2PA và tìm kiếm đảo ngược nội bộ
Tất cả các hình ảnh được tạo ra đều đi kèm với siêu dữ liệu C2PA, giúp xác định hình ảnh là từ GPT‑4o, nhằm cung cấp sự minh bạch. Chúng tôi cũng đã phát triển một công cụ tìm kiếm nội bộ sử dụng các thuộc tính kỹ thuật của các thế hệ để hỗ trợ xác minh xem nội dung có xuất phát từ mô hình của chúng tôi hay không.

Chặn nội dung xấu
Chúng tôi đang tiếp tục chặn các yêu cầu tạo ảnh có thể vi phạm chính sách nội dung của chúng tôi, chẳng hạn như tài liệu lạm dụng tình dục trẻ em và deepfake tình dục. Khi hình ảnh của người thật nằm trong ngữ cảnh, chúng tôi áp dụng các hạn chế nghiêm ngặt hơn về loại hình ảnh có thể được tạo ra, với các biện pháp bảo vệ đặc biệt mạnh mẽ đối với khỏa thân và bạo lực đồ họa. Giống như bất kỳ lần ra mắt nào, an toàn không bao giờ hoàn tất mà là một lĩnh vực đầu tư liên tục. Khi chúng tôi tìm hiểu thêm về việc sử dụng thực tế của mô hình này, chúng tôi sẽ điều chỉnh các chính sách của mình cho phù hợp.

Để biết thêm về phương pháp của chúng tôi, hãy truy cập phụ lục của thẻ hệ thống GPT‑4o⁠ về tạo sinh ảnh.

Sử dụng khả năng lập luận để tăng cường an toàn
Tương tự như công việc điều chỉnh có suy xét⁠ của chúng tôi, chúng tôi đã huấn luyện một mô hình ngôn ngữ lớn (LLM) biết lập luận để làm việc trực tiếp từ các đặc tả an toàn do con người viết và có thể diễn giải. Chúng tôi đã sử dụng mô hình ngôn ngữ lớn (LLM) biết lập luận này trong quá trình phát triển để giúp chúng tôi xác định và giải quyết những điểm mơ hồ trong các chính sách của mình. Cùng với những tiến bộ đa phương thức và các kỹ thuật an toàn hiện có được phát triển cho ChatGPT và Sora, điều này cho phép chúng tôi kiểm duyệt⁠ cả văn bản đầu vào và hình ảnh đầu ra theo các chính sách của chúng tôi.

Khả năng truy cập và tính khả dụng

4o image generation rolls out starting today to Plus, Pro, Team, and Free users as the default image generator in ChatGPT, with access coming soon to Enterprise and Edu. It’s also available to use in Sora. For those who hold a special place in their hearts for DALL·E, it can still be accessed through a dedicated DALL·E GPT.

Developers will soon be able to generate images with GPT‑4o via the API, with access rolling out in the next few weeks.

Creating and customizing images is as simple as chatting using GPT‑4o - just describe what you need, including any specifics like aspect ratio, exact colors using hex codes, or a transparent background. Because this model creates more detailed pictures, images take longer to render, often up to one minute.

credit creator: [Alex Duffy](https://every.to/@AlxAi)
credit creator: [Alex Duffy](https://every.to/@AlxAi)
credit creator: [Alex Duffy](https://every.to/@AlxAi)

credit creator: [August Kamp](https://www.instagram.com/august.kamp/?igsh=MTRpeG9xd3F2MzEyeg#) — credit creator: [Alex Duffy](https://every.to/@AlxAi)
credit creator: [Alex Duffy](https://every.to/@AlxAi)
credit creator: [Alex Duffy](https://every.to/@AlxAi)

Phát lại buổi livestream

Tác giả

OpenAI

Lãnh đạo

Gabriel Goh: Tạo sinh ảnh

Jackie Shannon: Sản phẩm ChatGPT

Mengchao Zhong, Wayne Chang: Kỹ thuật ChatGPT

Rohan Sahai: Sản phẩm và Kỹ thuật Sora

Brendan Quinn, Tomer Kaftan: Suy luận

Prafulla Dhariwal: Tổ chức Đa phương thức

Nghiên cứu

Nghiên cứu Nền tảng

Allan Jabri, David Medina, Gabriel Goh, Kenji Hata, Lu Liu, Prafulla Dhariwal

Nghiên cứu Cốt lõi

Aditya Ramesh, Alex Nichol, Casey Chu, Cheng Lu, Dian Ang Yap, Heewoo Jun, James Betker, Jianfeng Wang, Long Ouyang, Li Jing, Wesam Manassra

Người đóng góp nghiên cứu

Aiden Low, Brandon McKinzie, Charlie Nash, Huiwen Chang, Ishaan Gulrajani, Jamie Kiros, Ji Lin, Kshitij Gupta, Yang Song

Hành vi Mô hình

Laurentia Romaniuk

Tổ chức Đa phương thức

Andrew Gibiansky, Yang Lu

Dữ liệu

Trưởng nhóm Dữ liệu

Gildas Chabot, James Park Lennon

Dữ liệu

Arshi Bhatnagar, Dragos Oprica, Rohan Kshirsagar, Spencer Papay, Szi-chieh Yu, Wesam Manassra, Yilei Qian

Người điều hành

Hazel Byrne, Jennifer Luckenbill, Mariano López

Human Data Advisors

Long Ouyang

Mở rộng quy mô

Trưởng nhóm Suy luận

Brendan Quinn, Tomer Kaftan

Suy luận

Alyssa Huang, Jacob Menick, Nick Stathas, Ruslan Vasilev, Stanley Hsieh

Ứng dụng

Trưởng nhóm Sản phẩm ChatGPT

Jackie Shannon

Trưởng nhóm Kỹ thuật ChatGPT

Mengchao Zhong, Wayne Chang

Trưởng nhóm Thiết kế Sản phẩm

Matt Chan

Khoa học Dữ liệu

Xiaolin Hao

ChatGPT

Andrew Sima, Annie Cheng, Benjamin Goh, Boyang Niu, Dian Ang Yap, Duc Tran, Edede Oiwoh, Eric Zhang, Ethan Chang, Jeffrey Dunham, Jay Chen, Kan Wu, Karen Li, Kelly Stirman, Mengyuan Xu, Michelle Qin, Ola Okelola, Pedro Aguilar, Rocky Smith, Rohit Ramchandani, Sara Culver, Sean Fitzgerald, Vlad Fomenko, Wanning Jiang, Wesam Manassra, Xiaolin Hao, Yilei Qian

Sora

Sora Product Leads

Rohan Sahai, Wesam Manassra

Sản phẩm và Kỹ thuật Sora

Boyang Niu, David Schnurr, Gilman Tolle, Joe Taylor, Joey Flynn, Mike Starr, Rajeev Nayak, Rohan Sahai, Wesam Manassra

An toàn

Trưởng nhóm An toàn

Somay Jain

An toàn

Alex Beutel, Andrea Vallone, Botao Hao, Brendan Quinn, Cameron Raymond, Chong Zhang, David Robinson, Eric Wallace, Filippo Raso, Huiwen Chang, Ian Kivlichan, Irina Kofman, Keren Gu-Lemberg, Kristen Ying, Madelaine Boyd, Meghan Shah, Michael Lampe, Owen Campbell-Moore, Rohan Sahai, Rodrigo Riaza Perez, Sam Toizer, Sandhini Agarwal, Troy Peterson

Chiến lược

Adam Cohen, Adam Wells, Ally Bennett, Ashley Pantuliano, Carolina Paz, Claudia Fischer, Declan Grabb, Gaby Sacramone-Lutz, Lauren Jonas, Ryan Beiermeister, Shiao Lee, Tom Stasi, Tyce Walters, Ziad Reslan, Zoe Stoll

Tiếp thị & Truyền thông

Trưởng nhóm Truyền thông và Tiếp thị

Minnia Feng, Natalie Summers, Taya Christianson

Truyền thông

Alex Baker-Whitcomb, Ashley Tyra, Bailey Richardson, Gaby Raila, Marselus Cayton, Scott Ethersmith, Souki Mansoor

Thiết kế & Sáng tạo

Trưởng nhóm

Kendra Rimbach, Veit Moeller

Thiết kế

Adam Brandon, Adam Koppel, Angela Baek, Cary Hudson, Dana Palmie, Freddie Sulit, Jeffrey Sabin Matsumoto, Leyan Lo, Matt Nichols, Thomas Degry, Vanessa Antonia Schefke, Yara Khakbaz

Lời cảm ơn đặc biệt

Aditya Ramesh, Aidan Clark, Alex Beutel, Ben Newhouse, Ben Rossen, Che Chang, Greg Brockman, Hannah Wong, Ishaan Singal, Jason Kwon, Jiacheng Feng, Jiahui Yu, Joanne Jang, Johannes Heidecke, Kevin Weil, Mark Chen, Mia Glaese, Nick Turley, Raul Puri, Reiichiro Nakano, Rui Shu, Sam Altman, Shuchao Bi, Vinnie Monaco

Giới thiệu tính năng Tạo sinh ảnh 4o

Tạo sinh ảnh hữu ích

Nhiều khả năng được tăng cường

Kết xuất văn bản

Tạo sinh đa lượt

Thực hiện theo hướng dẫn

Học tập trong ngữ cảnh

Tri thức về thế giới

HTML

Chủ nghĩa hiện thực và phong cách

Hạn chế

An toàn

Khả năng truy cập và tính khả dụng

Phát lại buổi livestream

Tác giả

Lãnh đạo

Nghiên cứu

Dữ liệu

Mở rộng quy mô

Ứng dụng

Sora

An toàn

Chiến lược

Tiếp thị &amp; Truyền thông

Thiết kế &amp; Sáng tạo

Lời cảm ơn đặc biệt

Tiếp thị & Truyền thông

Thiết kế & Sáng tạo