A simulated interview with Andrew Ng about Voice GenAI

I thought it would be a refreshing idea to simulate an interview with Andrew Ng using ChatGPT-4 and his newsletter "The Batch - What Matters in AI Right Now," published February 27, 2025.

Here is what I did:

I copied the full newsletter into ChatGPT-4 and wrote the following prompt:

"Hello, today I want you to simulate an interview between me and Andrew Ng based on the following newsletter he published very recently. I am the interviewer and will ask you the questions, and you answer based on the newsletter context. Here is the newsletter: [Newsletter text]"

ChatGPT: Sounds great! Go ahead and ask your first question, and I'll respond as Andrew Ng based on the newsletter's insights.


Here is our conversation about voice-in voice-out generative models.

Andy:

Good morning Andrew, how are you doing?

"Andrew" said:

Good morning! I’m doing great, thanks for having me. Excited to talk about voice AI and share some insights!

Andy:

Generative voice AI is the big promise for natural user interfaces, which are currently in the making. Surprisingly, I have not yet experienced any improvement when calling the call centers of my large utility and telco providers; they still use very slow IVR systems. Why is that?

"Andrew" said:

That’s a great question! While generative voice AI is advancing rapidly, large utilities and telecom providers tend to adopt new technologies more slowly due to a few key reasons:

  1. Legacy Systems & Infrastructure – Many call centers still rely on older IVR (Interactive Voice Response) systems, which are deeply integrated into their existing workflows. Upgrading to AI-driven voice interfaces requires significant investment and restructuring.
  2. Accuracy & Control Challenges – Unlike text-based AI, controlling the output of voice-in, voice-out models is still difficult. Businesses need to ensure responses align with policies, comply with regulations, and provide accurate information - something IVR, though slow, still guarantees.
  3. Latency Sensitivity – Real-time voice AI must respond in under a second for a smooth conversation. Today’s AI pipelines (speech-to-text → LLM reasoning → text-to-speech) still introduce delays, which can make interactions feel unnatural. Techniques like "pre-responses" help, but many enterprises aren’t yet comfortable deploying them at scale.
  4. Risk Aversion – Large companies prioritize reliability. A voice AI making a mistake - like offering a refund it can’t process - could lead to compliance issues and customer dissatisfaction. Until voice AI is as controllable as traditional systems, adoption will be cautious.

That said, I believe we’ll see more advanced AI-driven voice assistants in these industries soon. The technology is maturing, and as companies see the benefits of improved customer experience, they’ll start making the transition.
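As an aside: the pipeline and pre-response ideas described above can be sketched in a few lines of Python. The stage delays below are illustrative placeholders, not real model latencies, and the function names are hypothetical stand-ins for actual speech-to-text, LLM, and text-to-speech calls.

```python
import asyncio
import time

# Hypothetical stage timings in seconds, for illustration only;
# real STT/LLM/TTS latencies vary by model and infrastructure.
STT_DELAY, LLM_DELAY, TTS_DELAY = 0.2, 1.0, 0.3

async def transcribe(audio: str) -> str:
    await asyncio.sleep(STT_DELAY)   # stand-in for a speech-to-text call
    return f"text({audio})"

async def reason(text: str) -> str:
    await asyncio.sleep(LLM_DELAY)   # stand-in for LLM reasoning
    return f"answer to {text}"

async def speak(text: str) -> str:
    await asyncio.sleep(TTS_DELAY)   # stand-in for text-to-speech
    return f"audio({text})"

async def handle_turn(audio: str) -> list[str]:
    """Run the pipeline, emitting a pre-response while the LLM thinks."""
    spoken = []
    text = await transcribe(audio)
    # Start the slow reasoning step and the filler phrase concurrently.
    answer_task = asyncio.create_task(reason(text))
    spoken.append(await speak("Let me check that for you."))  # pre-response
    spoken.append(await speak(await answer_task))             # real answer
    return spoken

start = time.perf_counter()
result = asyncio.run(handle_turn("user question"))
elapsed = time.perf_counter() - start
print(result, round(elapsed, 1))
```

Because the filler phrase plays while the LLM is still reasoning, the user hears something after roughly 0.5 s instead of waiting the full pipeline length in silence.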

Andy:

Sounds promising, but it brings us back to a hot topic: responsible and secure AI. How can that be solved while still responding quickly to users?

"Andrew" said:

Great point! Responsible and secure AI in voice interactions is a major challenge, especially when balancing speed, accuracy, and control. Here’s how we can address it:

  1. Hybrid Workflows for Control – Instead of relying solely on end-to-end voice models, companies can use structured pipelines (speech-to-text → LLM reasoning → text-to-speech), where each stage can be checked and constrained independently.
  2. Pre-Responses for Speed – To minimize latency, AI can generate quick pre-responses like “Let me check that for you”, while the system runs deeper reasoning checks. This keeps the conversation fluid without compromising accuracy.
  3. Real-time Monitoring & Intervention – AI models should operate with audit trails and human-in-the-loop oversight for sensitive cases. Some companies already flag uncertain responses for human review before execution.
  4. Security & Compliance Layers – AI models must respect data privacy regulations (GDPR, CCPA) and implement identity verification to prevent fraud, especially in financial or healthcare applications.
  5. Continuous Learning & Updates – Responsible AI means ongoing monitoring and fine-tuning based on user feedback to reduce biases and improve trustworthiness over time.

While we’re not fully there yet, these strategies make AI-driven voice systems both secure and practical. The key is blending automation with structured oversight, ensuring AI is helpful but also accountable.

Andy:

So you are suggesting a sort of short warm-up conversation to keep users engaged while the AI works behind the scenes to provide the most accurate answer? How much time is needed before those answers arrive? Consider multiple questions coming in from the user in sequence.

"Andrew" said:

Exactly! A short "warm-up" response keeps users engaged while AI processes their request in the background. The goal is to mask latency and create a smoother experience, much like how humans naturally pause or acknowledge before answering.

How much time is needed?

For a single response, we aim for 0.3–1 second (human-like conversational speed). The breakdown:

  • Pre-response (e.g., "Let me check that for you") → ~0.1–0.3 sec
  • AI processing (LLM reasoning, validation, API calls, etc.) → ~0.5–1.5 sec
  • Final response via TTS → ~0.2–0.5 sec

Advanced optimizations (like streaming responses or parallel processing) can reduce total time to ~0.5–1 second, making it feel seamless.
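To make the latency math concrete, here is a tiny calculation, using only the ranges quoted above, of how soon the user first hears something with and without the pre-response trick.

```python
# The stage ranges quoted above, in seconds: (min, max).
stages = {
    "pre_response": (0.1, 0.3),
    "processing":   (0.5, 1.5),
    "final_tts":    (0.2, 0.5),
}

# Time until the user first hears *anything*.
# Without a pre-response, nothing plays until processing and TTS finish.
first_audio_without = tuple(
    round(stages["processing"][i] + stages["final_tts"][i], 2) for i in range(2)
)
# With a pre-response, the filler phrase starts almost immediately.
first_audio_with = stages["pre_response"]

print("first audio without pre-response:", first_audio_without)
print("first audio with pre-response:   ", first_audio_with)
```

The total time to the final answer barely changes, but the silence the user experiences shrinks from up to two seconds to a few hundred milliseconds, which is what makes the conversation feel fluid.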

Handling Multiple Sequential Questions

When users ask follow-ups in quick succession, the AI can:

  1. Process in parallel – Continue reasoning about Question 1 while pre-responding to Question 2.
  2. Use context memory – Maintain conversation history so each response is contextual and flows naturally.
  3. Prioritize faster queries – Simple requests (e.g., "What's my balance?") get answered instantly, while complex ones (e.g., "Can I change my plan?") use the pre-response technique.

With ongoing improvements in LLM efficiency and multimodal AI, I expect real-time voice conversations with AI to feel much more natural within the next 1–2 years.
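The three points above can be sketched with `asyncio`. Here `answer` is a hypothetical stand-in for LLM reasoning, and the sleep times merely simulate simple versus complex queries; the history snapshot illustrates context memory.

```python
import asyncio

async def answer(question: str, history: list[str]) -> str:
    """Stand-in for LLM reasoning; a real system would call a model here."""
    # Pretend simple queries are fast and complex ones slow.
    await asyncio.sleep(0.1 if "balance" in question else 0.3)
    return f"reply({question}; context={len(history)} turns)"

async def run_dialog(questions: list[str]) -> list[str]:
    history: list[str] = []
    tasks = []
    for q in questions:
        # Start reasoning about each question as soon as it arrives,
        # snapshotting the conversation history it should see.
        tasks.append(asyncio.create_task(answer(q, list(history))))
        history.append(q)
    # Deliver replies in conversation order, even though they ran in parallel.
    return [await t for t in tasks]

replies = asyncio.run(run_dialog(["What's my balance?", "Can I change my plan?"]))
for r in replies:
    print(r)
```

Launching the tasks eagerly means the total wait approaches the slowest single question rather than the sum of all of them, while the ordered await keeps the dialog coherent.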

Andy:

Great to hear about the advancements GenAI is making in the voice-in, voice-out field, Andrew. I hope we can have another interview like this soon, where you can share the first real-life examples with us.

"Andrew" said:

Absolutely! The progress in voice-in, voice-out AI is exciting, and I look forward to seeing more real-world applications emerge. As the technology matures, we’ll see more real-time, intelligent, and natural voice interactions across industries.

Let’s definitely reconnect soon; I’d love to share new breakthroughs and real-life deployments when they happen. Thanks for the great conversation! Keep building!


I hope you liked this simulated-interview conversation with ChatGPT.

I truly enjoyed it and hope to be able to have a real conversation with Andrew Ng one day soon!


P.S. This edition of "The Batch" includes an incredible article titled:

Reading Minds, No Brain Implant Required

To date, efforts to decode what people are thinking from their brain waves have often relied on electrodes implanted in the cortex. New work used sensors outside the head to pick up brain signals that enabled an AI system to accurately guess what a subject was typing as they typed.
