The 3 types of NLU systems in conversational AI
Kane Simms
Triple Award-Winning AI Transformation Consultancy. VUX helps businesses leverage AI… Properly.
We recently spoke to Raj Koneru and Prasanna Arikala of Kore AI on the VUX World podcast, discussing Large Language Models (LLMs) and the forecasted impact they'll have on the creation of enterprise AI assistants.
Raj shared his thoughts on the types of NLU systems that exist today, and the benefits of each. This will help creators understand a little more about how LLMs work and how you can tune them, compared with the industry-standard intent-based NLU models.
---
Join VUX @ The European Chatbot & Conversational AI | Generative AI Summit, presented by Kore.ai, for a full day of enterprise AI automation best practice, and learn from the brands implementing conversational AI solutions at scale.
We will have experts from brands including Decathlon, loveholidays, Totaljobs Group, London North Eastern Railway and more.
Get 30% off in-person tickets with the code VUXEU23.
Can't make it to Edinburgh? You can also attend online and access all talks live or on demand.
---
Three types of NLU
1. Curated, intent-based model
This approach involves using an intent-based NLU with customised intents and training data, and it is the most common approach in business today. Here, you gather your own training data to form your own intents based on your business needs.
User utterances are then matched and classified against your intents, based on the model's ability to find a pattern between the utterance and the sample training data it was trained on.
This works well for simple utterances, but it struggles with long-form sentences and utterances that differ markedly from your sample training data.
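To make that concrete, here's a minimal sketch of the curated, intent-based approach, using a simple scikit-learn text classifier and an invented banking-style set of intents (purely illustrative assumptions, not a description of any particular vendor's NLU):

```python
# A minimal sketch of a curated, intent-based model: hand-gathered intents and
# training phrases, with new utterances matched against them by a simple
# classifier. The intents and phrases below are illustrative only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

training_data = {
    "check_balance": ["what's my balance", "how much money do I have", "show my account balance"],
    "transfer_money": ["send money to John", "transfer 50 pounds to savings", "make a payment"],
    "report_lost_card": ["I lost my card", "my card was stolen", "please block my card"],
}

texts = [phrase for phrases in training_data.values() for phrase in phrases]
labels = [intent for intent, phrases in training_data.items() for _ in phrases]

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression(max_iter=1000))
model.fit(texts, labels)

# Utterances close to the training data tend to classify well...
print(model.predict(["what is my balance"])[0])
# ...while long, unfamiliar phrasings are exactly where this approach struggles.
print(model.predict(["after my wallet went missing on the train I need to stop anyone using my card"])[0])
```

The upside is full control over your intents and training data; the downside is that coverage is only ever as good as the sample phrases you've gathered.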
Most NLU systems have used this approach so far, but the emergence of Large Language Models over the last 3 or so years is changing this.
2. Zero-shot model
This approach involves using a transformer-based Large Language Model (LLM) to understand and classify a customer utterance without the need to provide training data.
Large Language Models are trained on billions of data points and huge corpora of readily available text from the web. They use sources such as Reddit, Wikipedia and others to learn how to identify and reproduce patterns in language.
These advanced pattern matching systems perform great feats and can be used out-of-the-box to do things like intent classification and entity extraction.
Because most of the LLMs available today are trained on general text data from the web, they're not honed for specific business purposes. This means that out-of-the-box performance might only get you so far.
Also, because of the inherent limitations of pattern recognition, they're prone to making mistakes here and there, which can result in some utterances being misclassified. That said, I haven't seen an assistant built on an intent-based system to date that doesn't trip up and misclassify (or fail to match) some utterances, either.
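As an illustration of the zero-shot idea, here's a minimal sketch using Hugging Face's off-the-shelf zero-shot classification pipeline (one readily available way to try this; the utterance and intent labels are invented, and this isn't a description of Kore.ai's implementation):

```python
# A minimal sketch of zero-shot intent classification: no business-specific
# training data is supplied, only candidate intent labels at inference time.
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

utterance = "I lost my card on the train yesterday and need it blocked"
candidate_intents = ["check balance", "transfer money", "report lost card"]

result = classifier(utterance, candidate_labels=candidate_intents)
print(result["labels"][0], round(result["scores"][0], 3))  # top-scoring intent and its score
```

No training step is involved: the candidate intents are supplied at the moment of classification, which is exactly why the approach works out-of-the-box but isn't tuned to your business.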
3. Few-shot model (hybrid)
This approach takes the best of both worlds, using word embeddings to tune an LLM with a few example phrases of the utterances you'd expect for a given intent.
This is how you can tune a Large Language Model to a specific use case or set of intents. By feeding it a few examples of different training phrases, you can provide it with additional context and influence how it classifies something.
This means you can not only tune it for your specific business use cases, but also, by providing some sample data, reduce the likelihood of it misclassifying.
According to Raj, you could even use an LLM to generate sample training data, which you'd then use to train your few-shot model. This can give you the efficiency of a zero-shot model, whilst ensuring that the model is tuned to your business needs. This gives you even more control, as you’re able to both influence the training and tuning of the model, as well as validate the output from it.
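As a rough sketch of the few-shot, embedding-based idea (assuming a sentence-embedding model such as sentence-transformers; the intents and example phrases are invented, and this isn't any vendor's actual implementation), a handful of example phrases per intent, hand-written or LLM-generated, are embedded once, and each new utterance is matched to the closest intent:

```python
# A minimal sketch of a few-shot, embedding-based approach: embed a handful of
# example phrases per intent, then match new utterances by cosine similarity.
# Intents and phrases are illustrative only.
import numpy as np
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

# A few hand-written (or LLM-generated) example phrases per intent
few_shot_examples = {
    "check_balance": ["what's my balance", "how much is in my account"],
    "report_lost_card": ["I lost my card", "my card was stolen, please block it"],
}

intent_names = list(few_shot_examples.keys())
# Represent each intent as the mean embedding of its example phrases
intent_embeddings = np.stack(
    [model.encode(phrases).mean(axis=0) for phrases in few_shot_examples.values()]
)

utterance = "someone took my wallet and I want to stop the card being used"
scores = util.cos_sim(model.encode(utterance), intent_embeddings)[0]

best = int(scores.argmax())
print(intent_names[best], round(float(scores[best]), 3))
```

The few examples act as the "tuning" signal: they anchor the general-purpose model to your specific intents without a full training run.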
Multi-NLU approach
“LLMs are highly accurate at classifying an intent, except when they get it wrong.” Raj Koneru, CEO, Kore AI
As mentioned, an LLM can misclassify an intent because LLMs are trained on general data from across the internet. They're not highly tuned for your business use cases.
If an end user asks ChatGPT a question, for example, and ChatGPT gets it wrong, it's not especially consequential. If a user asks a question of a business and the business gets it wrong, that is far more consequential, especially for high-emotion or important use cases.
Therefore, the best approach is to utilise all three models above where relevant. And we'll be diving into how you can architect this arrangement with Kore AI in an upcoming post.
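Ahead of that, here's a purely illustrative sketch of what such a routing arrangement could look like (our assumption, not Kore.ai's architecture): try the curated intent model first, and fall back to an LLM-based zero- or few-shot classifier when the curated model isn't confident enough:

```python
# A purely illustrative multi-NLU routing sketch (an assumed architecture, not
# Kore.ai's): prefer the curated intent model, and defer to an LLM-based
# classifier (zero- or few-shot) when the curated model's confidence is low.
def classify(utterance, curated_model, llm_classify, threshold=0.7):
    """Return (intent, source). `curated_model` is assumed to expose
    scikit-learn-style predict_proba/classes_; `llm_classify` is any callable
    that maps an utterance to an intent (e.g. the zero-shot sketch above)."""
    probabilities = curated_model.predict_proba([utterance])[0]
    best = probabilities.argmax()
    if probabilities[best] >= threshold:
        return curated_model.classes_[best], "curated"
    return llm_classify(utterance), "llm"
```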
Stay tuned.
For more information on Kore.ai, you can book a demo with the team or book a free consultation.
---
About Kane Simms
Kane Simms is the front door to the world of AI-powered customer experience, helping business leaders and teams understand why voice, conversational AI and NLP technologies are revolutionising customer experience and business transformation.
He's a Harvard Business Review-published thought leader and a top 'voice AI influencer' (Voicebot and SoundHound), who helps executives formulate future customer experience strategies, and guides teams in designing, building and implementing revolutionary products and services built on emerging AI and NLP technologies.