OpenAI, the creator of ChatGPT, announced on Monday the release of its new AI model, GPT-4o. This model enhances voice conversations, allowing users to interact with ChatGPT through both voice and images. With new audio features, users can now speak directly to ChatGPT and receive immediate responses, including the ability to interrupt it, making conversations more realistic.
At a live event, OpenAI demonstrated these capabilities, including a demonstration where ChatGPT helped a researcher solve a math problem using both voice and vision. Another demo showcased the model's real-time language translation abilities.
OpenAI's CEO, Sam Altman, praised the new model, likening the experience to conversing with a computer as seen in movies. The company also emphasized that GPT-4o will be available for free to increase its accessibility, though paid users will have access to more features.
This announcement comes just before Google's annual developer conference, where Google is expected to reveal its own AI advancements. The timing places OpenAI in direct competition with Google and other tech giants in the AI field.
Key Points
OpenAI introduced GPT-4o, enhancing voice and image interactions in ChatGPT for more realistic user experiences.
The launch strategically positions OpenAI as a key competitor to Google, especially as Google prepares to unveil its own AI advancements.
FAQs
Q1: What is GPT-4o?
GPT-4o is a new AI model from OpenAI that enhances ChatGPT's voice and image interaction capabilities, making conversations more realistic.
Q2: How does GPT-4o differ from previous models?
GPT-4o introduces immediate voice responses and the ability for users to interrupt ChatGPT, along with real-time language translation and image interaction features.
Q3: Is GPT-4o free to use?
Yes, GPT-4o is available for free, but there are additional features and greater capacity for paid users.
Reference
Comments