登录查看更多内容

OpenAI Unveils Hyper-Realistic Voice Feature for ChatGPT Plus Users

ChandraKumar R Pillai

Top AI Voice | AI & Blockchain Expert | Tech Advisor | Leadership Insights

发布日期: 2024年7月31日

OpenAI ’s Advanced Voice Mode: A Leap in Hyper-Realistic AI Interaction

OpenAI is once again at the forefront of AI innovation with the introduction of ChatGPT’s Advanced Voice Mode. This new feature promises to revolutionize the way we interact with AI by offering hyper-realistic audio responses that closely mimic human speech. This groundbreaking development is set to gradually roll out to ChatGPT Plus users, with the alpha version available to a select group starting this fall.

The Journey to Advanced Voice Mode

OpenAI first showcased the capabilities of GPT-4o’s voice in May 2024, stunning audiences with its quick and lifelike responses. The voice, dubbed Sky, bore an uncanny resemblance to actress Scarlett Johansson’s voice in the movie “Her.” This led to some controversy, as Johansson had reportedly declined multiple requests from OpenAI to use her voice. Despite OpenAI’s denial, the voice was removed from the demo, and the release was delayed to improve safety measures.

What Sets Advanced Voice Mode Apart?

Unlike the previous Voice Mode, which relied on three separate models to convert voice to text, process the prompt, and convert text back to voice, Advanced Voice Mode utilizes GPT-4o’s multimodal capabilities. This integration allows for significantly lower latency conversations and the ability to detect emotional intonations such as sadness, excitement, or even singing.

Gradual Rollout and Safety Measures

OpenAI is taking a cautious approach with the release of Advanced Voice Mode. Initially, a small group of ChatGPT Plus users will have access, and the feature will be gradually made available to all Plus users by the end of the year. Users in the alpha group will receive notifications through the ChatGPT app and detailed instructions via email.

To ensure the safety and reliability of this feature, OpenAI has conducted extensive testing with over 100 external red teamers across 45 different languages. A report detailing these safety efforts is expected in early August. Additionally, the Advanced Voice Mode will be limited to four preset voices – Juniper, Breeze, Cove, and Ember – created in collaboration with paid voice actors.

Avoiding Deepfake Controversies

In light of past controversies surrounding deepfake technologies, OpenAI has implemented measures to prevent ChatGPT from impersonating individuals or public figures. The AI will also block requests to generate copyrighted audio, aiming to avoid legal issues similar to those faced by AI startups like ElevenLabs, Suno, and Udio.

Artificial Inspiration 1 年前

Are GPTs GPTs?

Madison Mohns 2 个月前

Insider's Edit: ChatGPT Performance Drift - a New Risk…

AI Business 1 年前

The Future of AI Interaction

OpenAI’s Advanced Voice Mode is a significant step forward in making AI interactions more natural and human-like. By combining advanced voice synthesis with robust safety measures, OpenAI is setting a new standard for AI technology.

Critical Questions for Discussion

1. How will hyper-realistic AI voices impact user interaction and trust in AI technologies?

2. What are the potential ethical implications of AI-generated voices that closely mimic human speech?

3. How can companies balance innovation with the need to prevent misuse of AI technologies?

4. What additional safety measures should be implemented to protect against deepfake abuses?

5. How will the introduction of Advanced Voice Mode influence the competitive landscape of AI voice technologies?

The release of ChatGPT’s Advanced Voice Mode is poised to reshape the landscape of AI interaction. Share your thoughts and insights on this groundbreaking development. Engage with us and let’s discuss the future of AI together.

Join me and my incredible LinkedIn friends as we embark on a journey of innovation, AI, and EA, always keeping climate action at the forefront of our minds. ?? Follow me for more exciting updates https://lnkd.in/epE3SCni

#AI #VoiceTechnology #OpenAI #ChatGPT #AIInnovation #TechNews #ArtificialIntelligence #DigitalTransformation #AIRegulation #LinkedInDiscussion

Sources: TechCrunch ; OpenAI

AI Daily Nutshell

13,964 位关注者

Jacques Gérard Bérubé

MD; MBA; Civil Eng.; LL.B.; Organic Farmer; Nonprofit Consultant Cybersecurity; 1st Officer Merchant Marine; Ethical Hacker; AI (Photo, Video, Post, Investigation).

2 个月

Very informative

Woodley B. Preucil, CFA

Senior Managing Director

2 个月

ChandraKumar R Pillai Very interesting. Thank you for sharing

Bragadeesh Sundararajan

2 个月

Great information, I'm wondering whether this will reduce the usage of NLP APIs for voice to text and language translation. I'm also curious to see the efficacy, does it translate or transliterate before passing to the RAG to elicit a coherent response. Eager to use the Plus model.

Indira B.

2 个月

OpenAI's Advanced Voice Mode is truly remarkable, paving the way for more immersive AI interactions. Your insights on the subject are always enlightening, ChandraKumar R Pillai.

Clint Engler

CEO/Principal: CERAC Inc. FL USA..... ?? ????????Consortium for Equitable Research, Analysis & Communication

2 个月

Thank you for sharing that information......Have a blessed day!!! ??

2 次回应

查看更多评论

要查看或添加评论，请登录

查看全部

OpenAI Unveils Hyper-Realistic Voice Feature for ChatGPT Plus Users

ChandraKumar R Pillai

Top AI Voice | AI & Blockchain Expert | Tech Advisor | Leadership Insights

领英推荐

AI Daily Nutshell

13,964 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

Embracing AI: Breakthroughs, Controversies, and Societal Impact

From "Her" to Here: When Sci-Fi Almost Becomes Reality with OpenAI's ChatGPT-4o

Apple's Partnership with OpenAI?-?What to?Expect

From Text to Talk: The Rise of ChatGPT in Conversational AI

The Cold War of Chatbots!!

OpenAI's ChatGPT: Should Google be worried?

Google Has Finally Dethroned ChatGPT

OpenAI GPT-4o Unveiled Alongside Enhanced Features for ChatGPT Users

Face-PaLM, ChatGPT ??

Exciting news for Apple users About OpenAI!

领英推荐

AI Daily Nutshell

13,964 位关注者

The Future of AI is Local: Mistral’s Edge Models for Privacy-First Innovation

2024年10月17日

Is AI Really as Smart as We Think? Breaking Down AI's Limitations

2024年10月16日

Robots and Humans Working Together: Amazon’s Vision for the Future of Fulfillment

2024年10月15日

Can AI Truly Transform the World? Examining the Bold Predictions of Anthropic’s CEO

2024年10月14日

AI, Ethics, and War: Can Technology Make Better Decisions Than Humans?

2024年10月13日

Amazon’s Vision-Assisted Package Retrieval: The Future of AI-Powered Delivery

2024年10月12日

Meta AI’s Global Rollout: Are AI Assistants the Future of Social Media?

2024年10月11日

Rovo AI : Unlocking New Levels of Efficiency in Jira, Confluence, and Beyond

2024年10月10日

AI Regulation in California: A New Era of Transparency or Legal Trouble?

2024年10月9日

Speech-to-Speech Innovation: Exploring the Power of OpenAI’s Realtime API

2024年10月8日

社区洞察

其他会员也浏览了

Embracing AI: Breakthroughs, Controversies, and Societal Impact

From "Her" to Here: When Sci-Fi Almost Becomes Reality with OpenAI's ChatGPT-4o

Apple's Partnership with OpenAI?-?What to?Expect

From Text to Talk: The Rise of ChatGPT in Conversational AI

The Cold War of Chatbots!!

OpenAI's ChatGPT: Should Google be worried?

Google Has Finally Dethroned ChatGPT

OpenAI GPT-4o Unveiled Alongside Enhanced Features for ChatGPT Users

Face-PaLM, ChatGPT ??

Exciting news for Apple users About OpenAI!