I’ve spent quite a while testing the new 4o ImageGen from OpenAI and comparing it to models released just yesterday, like Reve, Midjourney, and Imagen 3, as well as models not yet out.
Gray Swan: https://app.grayswan.ai/ai-explained
AI Insiders ($9!): https://www.patreon.com/AIExplained
Rarely in AI is one model so much better than the rest, as the chatbot side of things shows. Yes, I have a video imminent on Gemini 2.5 and DeepSeek. But for ImageGen, I was very impressed, as you’ll see. It’s still not perfect (don’t show it a mirror, for example) and definitely not photorealistic, but it is incredibly obedient. You’ll see what I mean. What Sam Altman calls ‘Images in ChatGPT’ will apparently be available to everyone, even free users. There are some filters, but I am sure everyone will soon have access to an unfiltered model of its strength, and it’s easy to imagine what will come of that.
Chapters:
00:00 – Intro
01:07 – Prompt Adherence, vs Reve, Midjourney, Imagen 3 + one other
03:39 – Idioms
04:20 – Thumbnails?
05:56 – Captions / Infographics
07:20 – Filters and Public Figures + Gray Swan
08:30 – Sora?
08:49 – Ethnicities/hands
09:09 – Where’s Waldo?
10:33 – Selfies and Photorealism
Images in ChatGPT/4o ImageGen: https://chatgpt.com/
Imagen 3: https://labs.google/fx/tools/image-fx
Reve: https://preview.reve.art/app
Altman Announcement: https://x.com/sama/status/1904598788687487422
Non-hype Newsletter: https://signaltonoise.beehiiv.com/