登录查看更多内容

ChatGPT's New Image and Voice Capabilities

Mohamed Ibrahim

SEO Manger AI Prompt Engineer

发布日期: 2023年9月27日

OpenAI, the company behind ChatGPT, has announced exciting new capabilities for the AI model. In an upcoming update, ChatGPT will gain the ability to analyze photos and engage in audio conversations.

Image Recognition: Users will be able to upload photos of scenes or objects and ask ChatGPT to describe what it sees or answer questions related to the images using image recognition technology.
Voice Capabilities: ChatGPT will also have voice capabilities, allowing it to mimic voices and generate speech. It can do this after listening to "just a few seconds" of someone speaking.

However, OpenAI acknowledges that these new features come with potential risks, such as the possibility of malicious actors using the technology for impersonation or fraud. To mitigate this, ChatGPT will only use voices that have been previously approved by OpenAI.

This update is expected to provide a more intuitive and interactive experience for users, enabling voice conversations and image-based interactions. Spotify, for example, is already using OpenAI's voice generation technology for podcast translations, enhancing the listening experience by retaining the original podcaster's voice and inflections.

The rollout of these voice and image features will begin for ChatGPT Plus and Enterprise users in the next two weeks.

?? Sources

TechNewsWorld - OpenAI's ChatGPT to Add Image and Voice Recognition

要查看或添加评论，请登录

Mohamed Ibrahim的更多文章

Meta Unveils Game-Changing Generative AI Tools for Advertisers

2023年10月5日

Meta Unveils Game-Changing Generative AI Tools for Advertisers

In a bid to revolutionize the advertising landscape, Meta, the social media giant formerly known as Facebook, has…
Jony Ive and Sam Altman in talks to create the 'iPhone of AI'

2023年9月29日

Jony Ive and Sam Altman in talks to create the 'iPhone of AI'

Former Apple designer Jony Ive and OpenAI's Sam Altman are reportedly in advanced talks with SoftBank's Masayoshi Son…

ChatGPT's New Image and Voice Capabilities

Mohamed Ibrahim

SEO Manger AI Prompt Engineer

?? Sources

Mohamed Ibrahim的更多文章

社区洞察

其他会员也浏览了

My First Impressions of ChatGPT o1: A Quantum Leap in AI Reasoning

ChatGPT 101 (what to know in proposals) & Never do this in a Q&A

The HUGE issue with ChatGPT that We Don't Talk About

ChatGPT: A Game-Changing Innovation in Natural Language Processing

DeepSeek vs. ChatGPT

Can ChatGPT Make Stone Tools?

ChatGPT good or bad for the channel

ChatGPT Introduces Mobile Application

After ChatGPT and DALL-E, meet VALL-E - the text-to-speech AI that can mimic anyone’s voice

AI Rewind: ChatGPT is Getting Lazy

?? Sources

Mohamed Ibrahim的更多文章

Meta Unveils Game-Changing Generative AI Tools for Advertisers

Jony Ive and Sam Altman in talks to create the 'iPhone of AI'

社区洞察

其他会员也浏览了

My First Impressions of ChatGPT o1: A Quantum Leap in AI Reasoning

ChatGPT 101 (what to know in proposals) & Never do this in a Q&A

The HUGE issue with ChatGPT that We Don't Talk About

ChatGPT: A Game-Changing Innovation in Natural Language Processing

DeepSeek vs. ChatGPT

Can ChatGPT Make Stone Tools?

ChatGPT good or bad for the channel

ChatGPT Introduces Mobile Application

After ChatGPT and DALL-E, meet VALL-E - the text-to-speech AI that can mimic anyone’s voice

AI Rewind: ChatGPT is Getting Lazy