ChatGPT's New Image and Voice Capabilities

OpenAI, the company behind ChatGPT, has announced new capabilities for the chatbot. In an upcoming update, ChatGPT will gain the ability to analyze photos and hold spoken conversations.

  1. Image Recognition: Users will be able to upload photos of scenes or objects and ask ChatGPT to describe what it sees or answer questions about them.
  2. Voice Capabilities: ChatGPT will also be able to generate speech and mimic voices after listening to "just a few seconds" of someone speaking.

However, OpenAI acknowledges that these features carry risks, such as malicious actors using the technology for impersonation or fraud. To mitigate this, ChatGPT will only use voices that OpenAI has approved in advance.

This update is expected to provide a more intuitive and interactive experience for users, enabling voice conversations and image-based interactions. Spotify, for example, is already using OpenAI's voice generation technology for podcast translations, enhancing the listening experience by retaining the original podcaster's voice and inflections.

The rollout of these voice and image features will begin for ChatGPT Plus and Enterprise users in the next two weeks.

Sources

  1. TechNewsWorld - OpenAI's ChatGPT to Add Image and Voice Recognition
