登录查看更多内容

OpenAI's Voice Mode is finally here!

AJ Green

Founder & CEO, AI Advantage | Keynote Speaker on Generative AI Chairman, Washington County Chamber Young Professionals Featured in AI Business Journal "Top 10 AI CEOs" Subscribe to my newsletter for AI daily news??

发布日期: 2024年9月25日

Welcome, AI entrepreneurs & enthusiasts.

After a multi-month wait, OpenAI finally announced that Advanced Voice Mode is rolling out to all ChatGPT Plus and Team subscribers this week (outside of the EU).

But with no current o1 integration, image upload option, or live video, will it live up to the hype? Let’s get into it…

In today’s AI news:

OpenAI rolls out Advanced Voice Mode
Google releases production-ready models
Customize images fast with PuLID-FLUX
James Cameron joins Stability AI’s board
More AI & tech news

OpenAI rolls out Advanced Voice Mode

The News: OpenAI is finally rolling out an enhanced Advanced Voice Mode (AVM) to all ChatGPT Plus and Teams subscribers this week, featuring new voices and improved functionality to make AI interactions feel more natural and personalized.

The details:

The initial rollout for OpenAI’s new Advanced Voice Mode started in July, but it only ever reached a select few ChatGPT users.
During the delay, OpenAI updated its AVM to integrate Custom Instructions and Memory, allowing for more personalized interactions and conversation recall.
OpenAI also improved AVM’s ability to understand accents and claims smoother, faster conversations, while adding five new nature-inspired voices (and removing the “Sky” voice that sounded like Scarlett Johansson).
AVM will not yet be available in several regions, including the EU, the UK, Switzerland, Iceland, Norway, and Liechtenstein.

Why it matters: With OpenAI CEO Sam Altman writing about AI agents and superintelligence, ChatGPT Advanced Voice Mode feels more relevant than ever. If we’re going to interact with AI every day—it has to sound and feel human—which is exactly what AVM is attempting to accomplish.

Editors note: If you still don’t have access to Advanced Voice Mode on your ChatGPT app, try uninstalling and reinstalling the app.

Google releases production-ready models

The News: Google just announced significant updates to its Gemini AI models, including performance improvements, cost reductions, and increased accessibility for developers.

The details:

Two new production-ready models came out today: Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, offering improved quality across various tasks, including a 20% boost in math-related benchmarks.
Pricing for Gemini 1.5 Pro has been reduced by over 50% for both input and output on prompts under 128K tokens, while rate limits have been increased significantly.
The models boast 2x faster output and 3x lower latency compared to previous versions, with improvements in long context understanding and vision capabilities.
Google also updated its default filter settings, giving developers more control over model configuration for their specific use cases.

Why it matters: Google is iterating quickly and pushing the boundaries of affordability for developers building with AI. While this isn’t Gemini 2 — it is a significant upgrade over the experimental models and will help builders create faster, smarter, cheaper applications.

James Cameron joins Stability AI’s board

The News: James Cameron, the acclaimed director of Titanic, Avatar, and The Terminator, recently joined the board of directors at Stability AI, the company behind the popular Stable Diffusion text-to-image AI model.

Generative AI 3 周前

OpenAI adds web search to ChatGPT, puts web ecosystem…

Fast Company 3 周前

AI Revolution: From Siri Upgrade to Robo-Taxis - This…

Dr. Joerg Storm 5 个月前

The details:

Cameron, known for pushing technological boundaries in filmmaking, sees the convergence of generative AI and CGI as “the next wave” in visual media creation.
Stability AI’s CEO, Prem Akkaraju, formerly led visual effects company WETA Digital, highlighting the firm’s focus on creative applications of AI.
The move comes as Hollywood grapples with AI’s potential, with some studios embracing the technology while others express concerns over content rights.

Why it matters: Just days after Lionsgate teamed up with AI startup Runway to create a custom video generation model, this move by one of Hollywood’s biggest directors could signal a significant shift in how influential filmmakers are thinking about navigating AI.

Customize images fast with PuLID-FLUX

The News: Hugging Face’s new PuLID-Flux space offers a tuning-free solution for quick image customization with your own likeness using just one reference photo.

Step-by-step:

Visit PuLID-Flux Hugging Face space (also available in Replicate)
Upload your reference image in the "ID Image" section
Play with the different parameters it offers and write a descriptive prompt for your desired output
Click "Generate" and refine as needed

Pro tip: Adjust the "timestep to start inserting ID" parameter to balance fidelity and creativity.

Trending AI Tools

Adobe GenStudio - Helps marketing teams measure on-brand content
FactBot by Snopes - Fact-checking for urban legends and misinformation
JustPaid - Automate invoice follow-ups and payment tracking
Pathway - Helps product teams test UX solutions and gather insights
Tubit AI - AI that summarizes YouTube videos for a deeper understanding

QUICK HITS

Warner Bros. Discovery adopted Google Cloud’s AI for caption generation, aiming to cut production time and costs for unscripted programming.

Intel launched Xeon 6 processors and Gaudi 3 AI accelerators, doubling performance for AI workloads and offering improved price and performance compared to Nvidia’s H100.

OpenAI increased API access for o1 models, adding tier 4 to the list of authorized users at 100 requests per minute and upping tier 5 users to 1000 requests per minute.

Suno AI announced a new cropping feature available to AI-generated songs, allowing Pro and Premier users to adjust the start and end of their creations.

Duolingo introduced AI-powered Adventures mini-games and a Video Call feature to enhance language learning through immersive, practical experiences for its users.

Apple unveiled its plan to roll out Siri’s major AI-powered updates gradually, with the most significant enhancements expected in iOS 18.3, likely launching in January 2025.

Thank you for reading our newsletter! If you want to stay two steps ahead of the competition, subscribe to this newsletter. If you want to leave your competition in the past, hop on a quick, complimentary, no-obligation call with our team to explore our consulting and custom development services.

We've proudly worked with over 400+ companies to revolutionize their business with AI, and our team of 4,000+ developers, engineers, consultants, and experts are more than ready to help you take advantage of all the latest and greatest AI technology for your business.

Ready to get started? Book a Consultation today!

OpenAI's Voice Mode is finally here!

AJ Green

Founder & CEO, AI Advantage | Keynote Speaker on Generative AI Chairman, Washington County Chamber Young Professionals Featured in AI Business Journal "Top 10 AI CEOs" Subscribe to my newsletter for AI daily news??

Welcome, AI entrepreneurs & enthusiasts.

OpenAI rolls out Advanced Voice Mode

Google releases production-ready models

James Cameron joins Stability AI’s board

领英推荐

Customize images fast with PuLID-FLUX

Trending AI Tools

QUICK HITS

The AI Advantage

1,557 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

ChatGPT Is Letting Users Customize the Experience

ChatGPT and Generative AI Tips & News; Bing is no longer a search engine; Google invested in OpenAI's rival; AI models spit out photos of real people;

This AI newsletter is all you need #34

AI Breakthroughs: OpenAI Agents, Meta’s Llama 3.2, Google’s Video AI and more...

Latest In Web3, AI & Emerging Tech

Foundation Model Battle: Google Bard vs OpenAI ChatGPT vs Claude vs Meta Llama

AI Newsletter

AI Weekly Digest – August 19 2024

Sunday Signal #14: Google's Bard Stumbles, Bing Back With a Bang & Staggering ChatGPT Plug-in Announcement

The Latest News in AI: Key Developments and Insights

Welcome, AI entrepreneurs & enthusiasts.

OpenAI rolls out Advanced Voice Mode

Google releases production-ready models

James Cameron joins Stability AI’s board

领英推荐

Customize images fast with PuLID-FLUX

Trending AI Tools

QUICK HITS

The AI Advantage

1,557 位关注者

Unlock Unbelievable AI Creativity with Runway’s ‘Frames’ – Here’s How

2024年11月26日

$4B Raise for Anthropic: Is Amazon Doing Enough to Beat OpenAI and xAI?

2024年11月25日

AI Week in Review: Specialized Agents, Breakthrough Models, and What They Mean for You!

2024年11月23日

OpenAI’s Evidence ‘Glitch’: The Mistake That Could Cost Billions

2024年11月22日

1/1500 the Size, 2 Months Flat: Chinese Startup Unveils Open-Source OpenAI o1 Alternative

2024年11月21日

The Future of Work Is Here: Microsoft’s AI Agents Are Redefining Work for 1 Billion+ Users

2024年11月20日

Mistral vs. Silicon Valley: Is Europe Closing the AI Innovation Gap?

2024年11月19日

Musk vs. Altman Exposed: Early Power Struggles That Shaped OpenAI’s AI Empire

2024年11月18日

AI Week in Review: AGI Predictions, Agentic Tools, and What It Means for You

2024年11月16日

ChatGPT + Desktop = New Era of AI Collaboration...

2024年11月15日

社区洞察

其他会员也浏览了

ChatGPT Is Letting Users Customize the Experience

ChatGPT and Generative AI Tips & News; Bing is no longer a search engine; Google invested in OpenAI's rival; AI models spit out photos of real people;

This AI newsletter is all you need #34

AI Breakthroughs: OpenAI Agents, Meta’s Llama 3.2, Google’s Video AI and more...

Latest In Web3, AI & Emerging Tech

Foundation Model Battle: Google Bard vs OpenAI ChatGPT vs Claude vs Meta Llama

AI Newsletter

AI Weekly Digest – August 19 2024

Sunday Signal #14: Google's Bard Stumbles, Bing Back With a Bang & Staggering ChatGPT Plug-in Announcement

The Latest News in AI: Key Developments and Insights