全新 ChatGPT 图像现已上线-青云TOP-AI综合资源站平台|青云聚合API大模型调用平台|全网AI资源导航平台

📢 转载信息

原文链接：https://openai.com/index/new-chatgpt-images-is-here

原文作者：OpenAI

今天，我们正式推出全新的 ChatGPT 图像，由最新旗舰级图像生成模型驱动。无论你是从零开始创作，还是对照片进行编辑，都能更轻松地获得心仪效果。它能在保持人物等细节不变的前提下进行精确修改，生成速度最高提升 4 倍。同时，我们还在 ChatGPT 中加入了全新的图像功能，让图像创作变得更愉悦、更直观，帮助你激发灵感，让创意探索变得毫不费力。

全新的图像模型和相关功能将从今天起在 ChatGPT 中向所有用户陆续开放，并会在 API 中以 gpt-image-1.5 的形式提供。

精准编辑，保留重要细节

现在，当你对上传的图片提出修改需求时，模型会更准确地理解你的意向，细致到每一个小变化。它只会调整你指定的部分，同时保持光线、构图以及人物外观等元素在输入、输出和后续编辑中始终一致。

这让结果更贴近你的构思：更实用的照片编辑、更逼真的服装与发型试穿，以及兼具创意的风格滤镜和概念化变换，同时保留原始图像的核心神韵。这些升级让 ChatGPT 仿佛变成了随身携带的创意工作室，既能处理实用的编辑任务，也能支持更具表现力的全新想象。

编辑

模型在多种编辑方式上都表现出色。无论是添加、移除、组合、融合，还是位置调整，都能准确实现你想要的变化，同时不丢失让图片独特的那些细节。

Combine the two men and the dog in a 2000s film camera-style photo of them looking bored at a kids birthday party.

Add chaotic kids in the background throwing things and screaming.

Change the man on the left to a hand-drawn retro anime style, the dog to plushie style, keep the man on the right and background scenery the way they are.

Put them all in OpenAI sweaters that look like this.

Now remove the two men, just keep the dog, and put them in an OpenAI livestream that looks like the attached image.

创意变换

模型的创造力在各种转化中展现得淋漓尽致。它能改变或添加元素，例如文字和版式，让你的想法真正成形，同时保留关键细节。无论是简单的点子还是更复杂的概念，这些转化都能轻松实现。你可以直接在全新的 ChatGPT 图像⁠功能中尝试预设的风格和创意，无需输入文字提示。

Make an old school golden age hollywood movie poster of a movie called 'codex' from the image of these two men. feel free to change their costumes to fit the times

Change the names of the actors to Wojciech Zaremba (left) and Greg Brockman (right)

Directed by Sam Altman, produced by Fidji Simo. A Feel the AGI Pictures Production.

遵守指令

模型的指令遵循能力比最初版本更加稳定可靠。这让它不仅能进行更精细的编辑，也能创作更复杂的原创构图，并在其中准确保留各元素之间应有的关系。

新

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14

chatgpt-images-instruction-following-new

上一版本

draw a 6x6 grid

Make a 6 (columns) by 6 (rows) grid grid of:

Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog

Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope

Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z

Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet

Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet

Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14

chatgpt-images-instruction-following-old

文本渲染

模型在文字呈现方面再次进化，能够更稳定地处理更密集、更小号的文字。

There is a newspaper on a desk. The newspaper shows the markdown below laid out as a natural newspaper article. Preserve all content, formatting, and numbers exactly. The image should be tall.

# Introducing GPT‑5.2

### *The most advanced frontier model for professional work and long-running agents*

December 11, 2025

---

We are introducing GPT‑5.2, the most capable model series yet for professional knowledge work.

Already, the average ChatGPT Enterprise user says AI saves them 40–60 minutes a day, and heavy users say it saves them more than 10 hours a week. We designed GPT‑5.2 to unlock even more economic value for people; it’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects.

GPT‑5.2 sets a new state of the art across many benchmarks, including GDPval, where it outperforms industry professionals at well-specified knowledge work tasks spanning 44 occupations.

---

## Benchmark highlights

|---|---|---:|---:|

| GDPval (wins or ties) | Knowledge work tasks | 70.9% | 38.8% (GPT‑5) |

| SWE-Bench Pro (public) | Software engineering | 55.6% | 50.8% |

| SWE-bench Verified | Software engineering | 80.0% | 76.3% |

| GPQA Diamond (no tools) | Science questions | 92.4% | 88.1% |

| CharXiv Reasoning (w/ Python) | Scientific figure questions | 88.7% | 80.3% |

| AIME 2025 (no tools) | Competition math | 100.0% | 94.0% |

| FrontierMath (Tier 1–3) | Advanced mathematics | 40.3% | 31.0% |

| FrontierMath (Tier 4) | Advanced mathematics | 14.6% | 12.5% |

| ARC-AGI-1 (Verified) | Abstract reasoning | 86.2% | 72.8% |

| ARC-AGI-2 (Verified) | Abstract reasoning | 52.9% | 17.6% |

---

Notion, Box, Shopify, Harvey, and Zoom observed that GPT‑5.2 demonstrates state-of-the-art long-horizon reasoning and tool-calling performance. Databricks, Hex, and Triple Whale found GPT‑5.2 to be exceptional at agentic data science and document analysis tasks. Cognition, Warp, Charlie Labs, JetBrains, and Augment Code report that GPT‑5.2 delivers state-of-the-art agentic coding performance, with measurable improvements in areas such as interactive coding, code reviews, and bug finding.

In ChatGPT, GPT‑5.2 Instant, Thinking, and Pro will begin rolling out today, starting with paid plans. In the API, they are available now to all developers.

Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision—making it better at executing complex, real-world tasks end-to-end than any previous model.

Now change the article to the markdown below:

# Introducing GPT‑Image-1.5

### *The new and improved ChatGPT Images*

December 16, 2025

---

Today, we’re introducing a new and improved version of ChatGPT Images, powered by our best image generation model yet. With stronger instruction following and more precise editing, ChatGPT Images delivers the changes you ask for while keeping important details like facial likeness consistent across edits—now with generation speeds up to 4× faster, making it easier to iterate and explore ideas with less waiting.

This is our most capable general-purpose text-to-image model to date, with more expressive transformations, improved dense text rendering, and more natural-looking results. Whether you’re making a tiny fix or a total reinvention, you can simply say what you want—or choose from preset styles and ideas in the new Images experience—and ChatGPT handles the rest, delivering results that are both useful and compelling, and better match your intent.

The new Images model and experience is beginning to roll out today in ChatGPT for all users, and in the API as GPT‑Image-1.5.

---

## Results that match your intent

The model now follows instructions more reliably—down to the small details—changing what you ask for while able to keep elements like lighting, composition, and likeness consistent across inputs, outputs, and subsequent edits.

This unlocks results that match your intent—more useful photo edits, more believable clothing and hairstyle try-ons, alongside stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.

### Editing

The model excels at different types of editing so you get the changes you want without losing what makes the image special.

### Creative Transformations

The model’s creativity shines with creative transformations, changing and adding elements—like text and layout—that help the concept come to life while maintaining important details.

### Instruction Following

The model is able to better follow instructions versus GPT Image 1.0.

### Text Rendering

The model takes another step ahead in text rendering, capable of handling denser and smaller text.

---

## A new creation space

In addition to asking for images through ChatGPT by describing what you’d like to see, we’re also introducing a dedicated Images experience in the ChatGPT sidebar to make exploring and trying images easier and quicker. This includes preset filters and trending prompts to jump-start inspiration, as well as a one-time likeness upload so you can reuse your appearance across future creations without the need to go through your camera roll again.

Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings. Images now render up to four times faster, and you can continue generating new images while others are still in progress—so you can explore more ideas without waiting.

新

make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…

上一版本

全新的创作空间

除了在对话中描述你想看到的内容来生成图像外，我们还在 ChatGPT 中推出了专属的图像⁠区域。你可以在移动端应用的侧边栏或 chatgpt.com 上轻松进入，让图像体验变得更快捷、更直观。这里提供数十种预设滤镜与提示，并会定期更新，紧跟最新趋势，迅速激发创意。

这些升级让你能够创作出更贴近心中愿景的图像。从细微的编辑到完整的再创作，都能轻松实现。

ChatGPT 图像的商用场景

这款模型让业务流程更高效：图像生成更快、编辑更精准、视觉细节在多次迭代中保持一致。团队可以更轻松地探索创意、进行针对性修改，并将复杂或枯燥的概念可视化，适用于营销、设计、电商和内部沟通等多种场景。

新

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

上一版本

create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

虽然还有些科学细节不太准确，但大概有七成内容是正确的，画面也更鲜活，不会再出现过早裁切。

API 中的 GPT Image 1.5

API 提供的 gpt-image-1.5 带来了与 ChatGPT 图像相同的全面升级：在图像保真与编辑能力上都比 GPT Image 1 更强大。

在多次编辑中，你会看到品牌徽标和关键视觉元素得到更稳定的保留。这让模型非常适合用于营销与品牌相关的创意工作，如图形设计和徽标制作；也能帮助电商团队从一张源图生成完整的产品图集，包括不同款式、场景与角度。

在 GPT Image 1.5 中，图像输入与输出的费用相比 GPT Image 1 降低了 20%，让你在相同预算下能够生成并迭代更多图像。

你可以在 OpenAI Playground⁠ 中体验新模型，或者阅读提示指南⁠获取灵感。

各类企业与初创团队 — 从创意工具、电商到营销软件等行业 — 已经在使用 GPT Image 1.5。我们也很高兴在下方分享其中的一些案例。

新

上一版本

“GPT Image 1.5 能生成高保真图像，并严格遵循提示要求，能够很好地保留构图、光线和细节。其输出干净、逼真且高度可靠，能在 Wix 等平台上加速从概念到成品的工作流程。根据我们的测试以及 Wix 的主要使用场景，它的稳定性和质量足以让它成为当下最出色的图像生成模型之一。”

— Hila Gat，Wix 人工智能研究与数据科学负责人

适用地区

全新的 ChatGPT 图像功能正在面向全球所有 ChatGPT 用户与 API 用户陆续推出，覆盖各类使用界面。它可在不同模型间通用，无需额外选择即可直接使用。

我们相信，图像生成的潜力才刚刚开始释放。今天的更新是向前迈出的重要一步，未来能力还会大幅提升，包括更精准的编辑，以及在多语言环境下生成更丰富、更细致的内容。

🚀 想要体验更好更全面的AI调用？

欢迎使用青云聚合API，约为官网价格的十分之一，支持300+全球最新模型，以及全球各种生图生视频模型，无需翻墙高速稳定，文档丰富，小白也可以简单操作。

目录CONTENT

全新 ChatGPT 图像现已上线

精准编辑，保留重要细节

编辑

创意变换

遵守指令

新

上一版本

文本渲染

更多品质提升

新

上一版本

全新的创作空间

ChatGPT 图像的商用场景

新

上一版本

API 中的 GPT Image 1.5

新

上一版本

适用地区

评论区