Why should you measure bots like humans?

You’ll get a variety of answers if you ask anyone who runs a call centre bot how they measure its success. They might measure containment rate (the share of callers served by the bot without any help from a human agent). They might also measure CSAT (customer satisfaction score) and NPS (net promoter score).
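For concreteness, here’s a minimal Python sketch of how those three numbers are typically computed from call logs. The `Call` record and its field names are hypothetical stand-ins, not any vendor’s API.

```python
# A minimal sketch of how these common call-centre metrics are computed.
# The Call record and its field names are hypothetical, not a vendor API.
from dataclasses import dataclass
from typing import Optional

@dataclass
class Call:
    escalated_to_agent: bool    # did a human have to take over?
    csat: Optional[int] = None  # post-call survey, 1-5 (None = no answer)
    nps: Optional[int] = None   # "would you recommend us?", 0-10

def containment_rate(calls: list[Call]) -> float:
    """Share of calls the bot handled without any human help."""
    contained = sum(1 for c in calls if not c.escalated_to_agent)
    return contained / len(calls)

def csat_score(calls: list[Call]) -> float:
    """Mean satisfaction across callers who answered the survey."""
    scores = [c.csat for c in calls if c.csat is not None]
    return sum(scores) / len(scores)

def nps_score(calls: list[Call]) -> float:
    """Percentage of promoters (9-10) minus percentage of detractors (0-6)."""
    answers = [c.nps for c in calls if c.nps is not None]
    promoters = sum(1 for n in answers if n >= 9)
    detractors = sum(1 for n in answers if n <= 6)
    return 100 * (promoters - detractors) / len(answers)
```

Note that all three treat the call, not the caller’s problem, as the unit of measurement – which is exactly where the trouble starts.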

Those are flawed success metrics, as Frank Schneider, AI Evangelist at Verint, explains to Kane Simms in his VUX World interview.

How do you measure any conversation?

We’re too focused on the numbers these days. You could focus on the accuracy of your NLU (Natural Language Understanding), for example - it’s a vital component of any conversational AI system - but what about actually resolving customer problems?

Bear with me here. I want to look at this from a different angle. Let’s turn this around and take the focus off AI for a moment. How would you measure the success of a human-to-human conversation between customer and brand in any context?

We’ll boil it down to a simple everyday scenario.


Let’s say you go into your neighbourhood bakery to buy a loaf of bread. You don’t know the staff personally, but you've been there before. The first baker says “what would you like?”, so you say that you want that sesame loaf they bake so well. Another baker overhears you and places your loaf of bread on the counter. They say nothing and return to what they were doing. As you pay, the first baker says “I know you love this sesame seed loaf – we’ve been trying different recipes so you should come back next Monday to try our improved version.”

You leave the shop with a smile on your face, and then someone appears with a clipboard and asks you to rate each baker. How would you do it? Although the total experience was just fine, if you put each interaction under a microscope you could say the bakers were underperforming.

Baker one asked what you wanted, but didn’t give it to you. Baker two said nothing but gave you what you wanted. Finally, baker one teased you with a new product which you might never buy. So baker one was pretty ineffective, really: they asked you a question and floated an advert at you, but delivered nothing. How high would you rate them? Probably quite low. Baker two could be seen as antisocial because they said nothing, but in fact they gave you exactly what you wanted.

The combined result was that everything went smoothly and you got what you came for with minimal friction! Measuring each baker’s effectiveness individually gives a rather different perspective though, doesn’t it?

That's the challenge

It’s not easy to measure any customer experience, whether it involves humans or bots. In the above scenario, baker one didn’t need to get the bread because baker two did it. Baker two didn’t need to say anything – would “here’s the loaf of bread you just asked for” or “there you are” have helped the situation at all?

You could say some call centre bots are like baker one – asking questions and then escalating to an expert. But bots can also be baker two – delivering what you need so seamlessly, in the background, that you barely notice them.

And when someone talks to a call centre, they don’t stay on one subject. They might suddenly remember something else they wanted, or get distracted and ask to pause the conversation so they can return to it later. Can you measure the success of a conversation in those scenarios?
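One possible answer – a sketch of my own, not an industry standard – is to score each goal the caller raises rather than the call as a whole, so a topic the caller deliberately parks doesn’t count against the bot:

```python
# A sketch of goal-level scoring for multi-topic calls. The record
# structure and labels are illustrative assumptions, not a standard.
from dataclasses import dataclass, field

@dataclass
class Goal:
    intent: str             # e.g. "report lost card", "update address"
    resolved: bool          # did the caller get what they came for?
    deferred: bool = False  # caller chose to pause and come back later

@dataclass
class Conversation:
    goals: list[Goal] = field(default_factory=list)

def goal_resolution_rate(convs: list[Conversation]) -> float:
    """Share of individual caller goals that were resolved or deliberately
    parked by the caller. Counting a paused goal as a failure would punish
    the bot for the caller's own choice to come back later."""
    goals = [g for c in convs for g in c.goals]
    done = sum(1 for g in goals if g.resolved or g.deferred)
    return done / len(goals)
```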

[Image: Measure bots the same way you'd measure humans]

Speakeasy (now Verint) is thinking differently

As you can see, we’re not dealing with situations that can be assessed against simple criteria. You really should check out Speakeasy AI’s whitepaper. Rather than focusing on metrics such as containment (a poor metric, because a proper bot integration lets the live agent and the bot work as a team rather than as competitors), they propose measuring the ‘correctness’ (how closely a response aligns with the truth) and ‘fluency’ (how smooth and effortless the flow of conversation is) of conversational AI.
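As a rough illustration of what scoring those two qualities per turn might look like, here’s a minimal sketch. The 1-5 rubric and the simple averaging are my own assumptions for the sake of the example, not the whitepaper’s actual methodology.

```python
# An illustrative sketch of turn-level correctness/fluency scoring.
# The 1-5 rubric and simple averaging are assumptions for the sake
# of the example, not Speakeasy's or Verint's actual methodology.
from dataclasses import dataclass
from statistics import mean

@dataclass
class Turn:
    correctness: int  # 1-5: how closely the response aligns with the truth
    fluency: int      # 1-5: how smooth and effortless the exchange felt

def conversation_quality(turns: list[Turn]) -> dict:
    """Average human-annotated scores across one conversation."""
    return {
        "correctness": mean(t.correctness for t in turns),
        "fluency": mean(t.fluency for t in turns),
    }

# A bot like baker two - terse but truthful - might score:
print(conversation_quality([Turn(5, 3), Turn(5, 2), Turn(4, 3)]))
# roughly {'correctness': 4.67, 'fluency': 2.67}
```

The point of scoring the two separately is that they fail independently: baker two is highly correct but not very fluent, and a chatty bot that never delivers is the reverse.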

It's all about measuring your bots as you would measure your live agents, rather than focusing on out-of-date metrics. As Frank puts it:

“If you introduce measures around business KPIs, contextual awareness, courtesy, and generosity, then you’ll train your AI to be so much more than just error-proof. When done right, conversational AI can significantly improve the quality of your customer service — we just need to assess and train virtual assistants with as much enthusiasm as we invest in human agents.”

You can download the whitepaper – it’s baked perfectly and full of goodness.

You can hear Frank’s full interview here, where he gives some great insights into measuring the impact of conversational AI.


This article was written by Benjamin McCulloch. Ben is a freelance conversation designer and an expert in audio production. He has a decade of experience crafting natural-sounding dialogue: recording, editing and directing voice talent in the studio. His work includes dialogue editing for Philips’ ‘Breathless Choir’ series of commercials, a Cannes Pharma Grand-Prix winner; leading teams localizing voices for Fortune 100 clients like Microsoft; and sound design and music composition for video games and film.

---

About Kane Simms

Kane Simms is the front door to the world of AI-powered customer experience, helping business leaders and teams understand why voice, conversational AI and NLP technologies are revolutionising customer experience and business transformation.

He's a Harvard Business Review-published thought-leader and a top 'voice AI influencer' (Voicebot and SoundHound) who helps executives formulate future customer experience strategies and guides teams in designing, building and implementing revolutionary products and services built on emerging AI and NLP technologies.

Dr. Lisa Precht

Head of AI Productivity and Sovereign AI @IBM | Accelerating Conversational AI, Assistants & Agents | Enabling Data Sovereignty, Governance & Measurable Business Value

2y

I'm not sure I'd compare digital assistants - or user interfaces in general - to "human evaluation". With a digital assistant, effects come into play that differ somewhat from an exchange with a human being. In addition to normal usability and user experience issues, I evaluate, for example:
- how transparent is the system?
- how much does the user feel in control, and is the experience in line with expectations?
- how quickly is the desired result achieved?
- how accurate are the answers, and how well are questions understood?
- does the assistant fall into the uncanny valley?
Of course, we expect these characteristics from human interaction as well, but I'm not sure I would rate a human analogously. Interesting question - and as always a super article, Kane Simms.

Frank Schneider

VP, AI Evangelist @ Verint

2y

Kane Simms gave a keynote on this topic at The AI Summit NYC last week. Some great engagement in the room around measuring AI based on actually getting things done, as opposed to deflecting away from humans.

Martin Jahn

Customer Success Executive | Driving Revenue, Retention, & Scalable Growth

2y

I do not like the word "bots" at all. When working with customers, I always use the term "assistant", "digital assistant" or "digital (co-)worker". The term bot has negative connotations and you might create adoption barriers because of it. We've all read the articles claiming that "bots" are coming for our jobs. Mature organizations - those that have been on their digital transformation or intelligent automation journey for some time - usually use "digital worker" and build an entire "digital workforce". And they measure the work these workers do the same way they measure actual humans: with process or business KPIs.

Caio Calado

Conversation Designer and Community Manager

2y

Love this. Here is an example of a product metric for that: it's called "Jobs to be Done". It can be measured and associated with the bot's main strategy ("this bot has 3 main JTBD / skills").

Dustin Laidsaar

Are you solving the problem right or are you solving the right problem?

2y

Maybe you should measure the human that is managing the bot(s), and the bot should be performance-managed accordingly. Otherwise you end up in the same situation as the IVR - the hardest-working employee in your contact centre, which never gets coaching, never gets a performance review or training/upskilling, and then gets replaced with a new technology that replicates a human - rather than actually solving the problem that's driving poor customer interactions with your business.
