ChatGPT Enhances its Image-Generation Capabilities

By Jorge Paka

During a livestream on Tuesday, OpenAI CEO Sam Altman announced the first major update to ChatGPT’s image-generation capabilities in over a year.

ChatGPT can now use the GPT-4o model to create and edit images directly. GPT-4o has long powered OpenAI’s chatbot, but until now it could generate and edit only text, not images.

Altman stated that GPT-4o’s image-generation feature is now available in ChatGPT and Sora, OpenAI’s AI video tool, for subscribers of the company’s $200-a-month Pro plan. The feature will soon expand to Plus and free-tier ChatGPT users, as well as developers using OpenAI’s API.

Enhanced Image Precision and Editing Capabilities

GPT-4o takes slightly longer to generate images than DALL-E 3, the model it replaces, but OpenAI claims it produces more detailed and accurate visuals. It can also edit existing images, including those featuring people, by altering elements or “inpainting” details like foreground and background objects.

To develop this feature, OpenAI told the Wall Street Journal that it trained GPT-4o using publicly available data and proprietary content from partnerships with companies like Shutterstock.

Generative AI companies often treat their training data as a key competitive asset and keep the details tightly guarded. Concern over intellectual property disputes gives them a further reason to limit disclosures.

“We respect artists’ rights in how we generate outputs and have policies in place to prevent the creation of images that closely replicate the work of living artists,” OpenAI COO Brad Lightcap told the Wall Street Journal.

Creator Control and Data Privacy Measures

OpenAI provides an opt-out form that lets creators request the removal of their works from its training datasets. The company also states that it honors requests to block its web-scraping bots from gathering training data, including images, from websites.
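
Site owners typically make such a blocking request through a robots.txt rule. The sketch below is illustrative only. It assumes OpenAI's crawler identifies itself with the user-agent token GPTBot, which is the name OpenAI documents for its crawler and is not stated in this article. The example.com URLs and the is_allowed helper are hypothetical. It uses Python's standard urllib.robotparser module to check what such a rule would permit.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block the assumed GPTBot crawler everywhere,
# while leaving the site open to all other crawlers.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    image_url = "https://example.com/images/artwork.png"  # hypothetical URL
    print("GPTBot allowed:", is_allowed(ROBOTS_TXT, "GPTBot", image_url))
    print("Other bot allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", image_url))
```

A rule like this only expresses the site's request. Whether the request is honored depends on the crawler, which is why OpenAI's statement that it complies matters.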

The upgraded image-generation feature in ChatGPT comes shortly after Google introduced experimental native image output in its Gemini 2.0 Flash model. The feature quickly gained attention on social media, though not all of it was positive. Gemini 2.0 Flash’s image tool had minimal safeguards, enabling users to remove watermarks and generate images featuring copyrighted characters.

This article was updated at 12 p.m. PT to include OpenAI’s statement to the Wall Street Journal regarding GPT-4o’s training data.

Read the original article on: TechCrunch

