How to measure the success of a Conversational AI project

Designing and developing a conversational interface requires a great deal of time and effort, and usually a significant investment. Consequently, it is essential to measure how it performs once it is released and users start interacting with it.

Understanding which data are relevant, and how to read them, is not easy. That is what AI Data Analysts are there for!

Moreover, the release marks the beginning of a whole new phase, usually called ‘continuous improvement’, in which the bot’s ability to understand and answer users will evolve.

But how does this happen? It is often believed that conversational solutions learn and evolve autonomously, while interacting with users, thanks to machine learning.

However, it is not that simple. At least not today, and not in business scenarios.

Behind a bot’s ability to learn, there are experienced professionals who analyze conversations, highlight problems and provide data that allow the team to understand what is not working and to make informed decisions on how to improve it.

Different metrics for different criteria

That said, how can we choose what to analyze?

First of all, we must understand what we want (and can) measure. To do so, we can divide the metrics into three areas:

  • tech performance, which refers to NLU, STT, API calls, etc.
  • business goal achievement, which might mean reducing the number of tickets opened with customer care, generating leads, selling products, etc.
  • user behavior and satisfaction, which refers to how users interact with the virtual assistant and whether they find what they are looking for.

Each area has different success criteria and thus calls for different KPIs.

Secondly, we should understand which metrics make sense for the specific type of solution we are dealing with.

For example, the conversational interface:

  • might be chat-only, voice-only or multimodal;
  • if it has a chat, it might allow interaction through buttons, carousels and other graphical elements, or through text only, so we might want to measure how many users prefer one modality over the other (user behavior)
  • if it has a chat, it might have an NLU engine (while if it is voice-only, it always needs one); in that case, its ability to understand users can be measured with the classic KPIs used in AI: accuracy, precision, recall, error rate and F1 score (tech metrics; see the sketch after this list)
  • if it is a voice interface, the word error rate might be measured, that is the ratio between the number of transcription errors (substitutions, deletions and insertions) and the total number of words spoken (tech metric)
  • might be integrated with third-party systems, so we might want to measure whether the APIs always respond, and within an acceptable time range (tech metric)
  • might be active on multiple channels, such as Facebook Messenger, WhatsApp, Telegram or websites, so we might want to measure the traffic distribution across channels (user behavior)
  • might have human handover by design, so we might want to measure how many users end up needing the help of a human agent (user behavior and business goal achievement)
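
To make these tech metrics concrete, here is a minimal sketch in Python of how intent-classification quality and word error rate could be computed from logged data. It assumes scikit-learn is available; the intent labels and transcripts are hypothetical, standing in for an annotated evaluation set.

```python
# A minimal sketch: intent-classification metrics and word error rate.
# The labels and transcripts below are hypothetical examples.
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Hypothetical evaluation set: human-annotated vs. bot-predicted intents.
y_true = ["track_order", "refund", "refund", "greeting", "track_order"]
y_pred = ["track_order", "refund", "greeting", "greeting", "track_order"]

accuracy = accuracy_score(y_true, y_pred)
precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0
)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} F1={f1:.2f}")


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words spoken,
    computed with a classic edit-distance dynamic program."""
    ref, hyp = reference.split(), hypothesis.split()
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # deleting every reference word
    for j in range(len(hyp) + 1):
        d[0][j] = j  # inserting every hypothesis word
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution
    return d[len(ref)][len(hyp)] / len(ref)


# One substitution over four spoken words -> WER = 0.25.
print(f"WER={word_error_rate('cancel my order please', 'cancel my water please'):.2f}")
```

Note that macro-averaging weighs every intent equally; with a skewed intent distribution, a weighted average may better reflect what users actually experience.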

Quantitative and qualitative metrics

Another distinction can be made between quantitative and qualitative metrics.

The former are certainly easier to extract but can be misleading if not properly interpreted; the latter require more effort but can be more enlightening.

As always, however, preferring one over the other depends on what we want to measure and what we want to achieve with that measurement. Let’s look at an example.

Quantitative metrics can be used, for example, to analyze user behavior. Popular metrics in this regard include the following (a short computation sketch follows the list):

  • Number of total conversations
  • Average duration of conversations
  • Average number of interactions for each session
  • Number of unique users
  • Distribution of users across channels
  • Sentiment for each session
  • Top intents
  • Tasks completed
  • Average time to complete a task
  • Abandoned conversations, and the steps of the flow at which users most often abandon the interaction
  • Human handover rate
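
As an illustration, several of these indicators can be derived from raw session logs with a few lines of code. The sketch below is in Python; the log schema (channel, number of turns, duration, completion and handover flags) is a hypothetical simplification, since every platform exports logs differently.

```python
# A hedged sketch: deriving a few quantitative KPIs from session logs.
# The session records below are illustrative, not real data.
from collections import Counter
from statistics import mean

sessions = [
    {"user_id": "u1", "channel": "whatsapp",  "n_turns": 8,  "duration_s": 95,
     "completed": True,  "handover": False},
    {"user_id": "u2", "channel": "messenger", "n_turns": 3,  "duration_s": 40,
     "completed": False, "handover": True},
    {"user_id": "u1", "channel": "whatsapp",  "n_turns": 12, "duration_s": 180,
     "completed": True,  "handover": False},
]

print("total conversations:", len(sessions))
print("unique users:", len({s["user_id"] for s in sessions}))
print("avg duration (s):", mean(s["duration_s"] for s in sessions))
print("avg interactions per session:", mean(s["n_turns"] for s in sessions))
print("channel distribution:", Counter(s["channel"] for s in sessions))
print("task completion rate:", sum(s["completed"] for s in sessions) / len(sessions))
print("handover rate:", sum(s["handover"] for s in sessions) / len(sessions))
```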

These indicators are certainly useful, but very often they do not explain why a certain phenomenon occurs.

For example, why do users prefer to chat via WhatsApp rather than Facebook Messenger? Why do users tend to abandon the interaction after a certain question? Why do users spend a certain amount of time in a conversation?

These questions might find an answer in a more precise, qualitative analysis, for example by conducting a user test or asking for explicit feedback. Feedback can aim at measuring different things: whether users reached their goal, e.g. with a closed yes/no question such as “Did you find what you were looking for?”, or how satisfying the experience was, e.g. by asking users to rate it with numbers (1-5), emojis (smiling, neutral, sad face), or words (positive, neutral, negative).
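
As a minimal sketch, explicit ratings collected this way could be aggregated into a satisfaction score along the following lines; the 1-5 scale and the convention of counting 4-5 as “satisfied” are illustrative assumptions, not a fixed standard.

```python
# Aggregating explicit end-of-conversation ratings into a simple CSAT
# score. Treating 4-5 on a 1-5 scale as "satisfied" is one common
# convention, assumed here for illustration.
ratings = [5, 4, 2, 5, 3, 1, 4, 4]  # hypothetical 1-5 ratings

csat = sum(1 for r in ratings if r >= 4) / len(ratings)
avg_rating = sum(ratings) / len(ratings)
print(f"CSAT: {csat:.0%} (average rating {avg_rating:.1f}/5 "
      f"over {len(ratings)} responses)")
```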

Analyze, improve, repeat

Summing up, measuring the success of a conversational interface after its release means defining the right KPIs, analyzing conversations, and listening to the users’ voice. All this work should ultimately improve the bot’s ability to understand users and provide helpful answers, but it is not a stand-alone activity. On the contrary, monitoring and improving should be key throughout the entire project lifecycle.
