登录查看更多内容

Building your first chatbot with Python

Valentina Alto

AI and App Innovation Technical Architect at Microsoft | Tech Author | MSc in Data Science

发布日期: 2019年7月11日

Today, if you are about to order some foods on a restaurant's website or you need assistance because your router is not working properly, you will probably get in touch with a chatbot. They appear to you like instant messaging chats, in one of the corners of the screen, and gently ask you whether you need help.

But what is a chatbot? It is a computer program whose aim is interacting with humans as they receive, as inputs, texts or audios. The most sophisticated chatbots are based on Natural Language Processing, that field of AI in which models aim at interpreting the way humans interact among each other, hence their natural language, which might be written or spoken (if you want to read more about NLP, you can read my former article here).

However, many chatbots are programmed so that they capture some keywords which are directly related to pre-defined responses. This process is based on a first training of the algorithm on given libraries, where it will be able to size the connections between answers and questions, as well as some emotional reactions typical of human beings. And this is the idea behind ELIZA, the chatbot programmed by Joseph Weizenbaum: it recognizes clue words as input and returns pre-programmed sentences as output. This method might sound kind of naive, but there are two further elements that need to be considered:

First, when I say 'some clue words' and 'some pre-programmed responses', I'm referring to the enormous amount of data on which the chatbot has been trained. Namely, if you are about to build a chatbot which answers questions about the Empire State Building, you could train it on the Wikipedia page about that building, then on some interviews about the topic, and even on books which takes place in NYC and involves some speeches about it. So, even though responses are 'copied' and not originally created, it doesn't mean your chatbot is not able to reach its goal.
The second element is a bit more psychological. It turned out that chatbots like ELIZA, not intelligent by definition, are capable of being interpreted as intelligent by humans. Indeed, they generate an illusion of understanding which humans tend to appreciate, often forgetting they are not talking with another human being. That's one of the reasons why these naive chatbots are so popular today: they are without doubts useful for several purposes, ranging from online assistance to personal use.

With that being said, let's jump to the practical application. Here, I will show a very easy implementation of a naive chatbot in Python, using a pre-compiled library and training environment called ChatterBot (you can install it via pip install chatterbot in your terminal).

#importing the library

from chatterbot import ChatBot
bot = ChatBot('Chatbot')

#importing a corpus of sample sentences and conversations so that our chatbot
#can learn

from chatterbot.trainers import ChatterBotCorpusTrainer

# Create a new trainer for the chatbot
trainer = ChatterBotCorpusTrainer(bot)


# Train the chatbot based on the english corpus (other languages are available too)

trainer.train('chatterbot.corpus.english')

Non è stato fornito nessun testo alternativo per questa immagine

As you can see, the corpus library contains many domains of sample conversations or sentences, like sports, science and so forth. To give you the intuition, I will replicate the matching procedure manually for the domain 'greetings'. Indeed, looking at the following code:

GREETING_INPUTS = ("hello", "hi", "good morning", "what's up","hey",)
GREETING_RESPONSES = ["hi", "hey","good morning", "hello", "thanks for asking my help"]


def greeting(sentence):
 
    for word in sentence.split():
        if word.lower() in GREETING_INPUTS:
            return random.choice(GREETING_RESPONSES)
            return random.choice(GREETING_RESPONSES)

You can see that I linked some greetings written by the users to other greetings which will be the chatbot's response. Basically, I defined a function 'greeting' which return, for any input in GREETING_INPUTS, a random output within the GREETING_OUTPUTS. This procedure is replicated, more widely, for each category of the chatterbot.corpus.

Now let's initialize out chatbot:

while True:
        # Input message from user
    message=input('You:')
        #if message is different from Bye
    if message.strip()!='Bye':
             #if message is equal to thanks or thank you
        if(message.strip()=='thanks' or message.strip()=='thank you' ):
            flag=False
            print('Chatbot: You are welcome..')
        else:
             #if message is withing greetings
            if(greeting(message.strip())!=None):
                print('Chatbot: '+greeting(message.strip()))
            else:
             #if none of the above is true, use the embedded method '.get_response'
                reply=bot.get_response(message)
                print('Chatbot:',reply)
        # if message is Bye
    if message.strip()=='Bye':
        print('Chatbot: Bye')
        break

It wasn't too bad. Again, if you want your chatbot to be an expert in a specific domain, you should train it on that topic more deeply. Here, I just picked some random domains, poorly assorted compared to the huge range of potential conversations, in order to show the idea behind a naive chatbot.

Of course, intelligent chatbots - those which use NLP techniques to understand human interactions and create original, smart responses - are far more interesting, and this field of study is evolving rapidly. Nevertheless, naive chatbots are still widely used in industries, thanks to their capability of being seen as intelligent by humans.

Max Kleiner

Engineer at kleiner kommunikation

5 年

Great Article again but you have to install corpus too: pip3 install chatterbot_corpus

要查看或添加评论，请登录

Valentina Alto的更多文章

Getting started with Azure GPT-4-Turbo Vision in Industry-based scenarios

2023年12月16日

Getting started with Azure GPT-4-Turbo Vision in Industry-based scenarios

On December 12, Microsoft announced the Public Preview of the GPT-4-turbo Vision model in Azure. GPT-4-turbo vision is…

5 条评论
Getting started with Azure AI Studio

2023年11月16日

Getting started with Azure AI Studio

During the opening keynote of the Microsoft Ignite event, on November 15th, Artificial Intelligence was the undiscussed…

3 条评论
Computer Vision: Feature Matching with OpenCV

2019年7月14日

Computer Vision: Feature Matching with OpenCV

Computer vision is a field of study which aims at gaining a deep understanding from digital images or videos. Combined…
The Bias-Variance trade-off

2019年7月5日

The Bias-Variance trade-off

Machine Learning models' ultimate goal is making reliable predictions on new, unknown data. With this purpose in mind…

1 条评论
Streaming analysis with Kafka, InfluxDB and Grafana

2019年7月2日

Streaming analysis with Kafka, InfluxDB and Grafana

If you are dealing with the streaming analysis of your data, there are some tools which can offer performing and…
Decision Tree and Information Gain

2019年6月27日

Decision Tree and Information Gain

Decision trees are some of the most popular ML algorithms used in industry, as they are quite interpretable and…
Natural Language Processing with TextBlob

2019年6月20日

Natural Language Processing with TextBlob

Natural Language Processing (NPL) is a field of Artificial Intelligence whose purpose is finding computational methods…
Features Engineering: behind the scenes of ML algorithms

2019年6月13日

Features Engineering: behind the scenes of ML algorithms

The majority of people (including me) tend to think that the core activity of building a Machine Learning algorithm is,…
Neural Networks: parameters, hyperparameters and optimization strategies

2019年6月7日

Neural Networks: parameters, hyperparameters and optimization strategies

Neural Networks (NNs) are the typical algorithms used in Deep Learning analysis. NNs can take different shapes and…

1 条评论
Unsupervised Learning: PCA and K-means

2019年6月2日

Unsupervised Learning: PCA and K-means

Machine Learning algorithms can be categorized mainly into two bunches: supervised learning: we are provided with data…

2 条评论

See all articles

Building your first chatbot with Python

Valentina Alto

AI and App Innovation Technical Architect at Microsoft | Tech Author | MSc in Data Science

Valentina Alto的更多文章

社区洞察

其他会员也浏览了

BERT Embeddings for data sets Explained: Key Benefits, Examples, and ML Model Steps

This week in CAI and NLP: 010 - the coded become the coding

WHAT IS TEXT STEMMING IN NLP?

?? Generative AI Agents Revealed

From Idea to App in Minutes: The Impact of AI on Low Code No Code Development

Mastering Sentiment Analysis with ChatGPT, OpenAI, and Python

Word Embedding: Unveiling the Hidden Semantics of Words

Embedded Words And Embedded Sentences Tools

Langchain with Mistral LLM using Embeddings and PostgreSQL with pg_embedding

Valentina Alto的更多文章

Getting started with Azure GPT-4-Turbo Vision in Industry-based scenarios

Getting started with Azure AI Studio

Computer Vision: Feature Matching with OpenCV

The Bias-Variance trade-off

Streaming analysis with Kafka, InfluxDB and Grafana

Decision Tree and Information Gain

Natural Language Processing with TextBlob

Features Engineering: behind the scenes of ML algorithms

Neural Networks: parameters, hyperparameters and optimization strategies

Unsupervised Learning: PCA and K-means

社区洞察

其他会员也浏览了

BERT Embeddings for data sets Explained: Key Benefits, Examples, and ML Model Steps

This week in CAI and NLP: 010 - the coded become the coding

WHAT IS TEXT STEMMING IN NLP?

?? Generative AI Agents Revealed

From Idea to App in Minutes: The Impact of AI on Low Code No Code Development

Mastering Sentiment Analysis with ChatGPT, OpenAI, and Python

Word Embedding: Unveiling the Hidden Semantics of Words

Embedded Words And Embedded Sentences Tools

Langchain with Mistral LLM using Embeddings and PostgreSQL with pg_embedding