Creating Thirukkural with the help of Thiruvalluvar using Artificial Intelligence!!
In this age of texting and short messages instead of phone calls, there comes a moment when you realize you have written something so genuine that you think: if only we wrote with such vigour in our cover letters and high school essays, we might have been the next Thiruvalluvar.
Nowadays, Machine Learning creates a huge buzz and drums up enthusiasm in everyone to learn about it. But for amateurs, the knowledge of neural network algorithms is a hurdle to appreciating the power of Machine Learning.
Keeping that in mind, I thought I would give a brief layman's introduction followed by a technical introduction to the RNN and LSTM neural network models.
Layman's intro to RNN and LSTM
The workings of the brain are the inspiration for most neural network algorithms. To understand this model, we have to recollect how we learned the alphabet and numbers.
For example, we start by learning to pronounce the first four letters (A, B, C, D) individually, then we try to say them from A to D. Then we learn the next four letters (E, F, G, H) and again try to say them from A to H. Notice that every time we learn new letters, we loop back to the first one. Through this, we learn not only the letters but also their sequence.
This is the logic behind Recurrent Neural Networks. While learning a new input, the network uses what it has learned from previous inputs. It struggles with very long inputs, a limitation that is overcome by LSTM.
Technical intro to RNN and LSTM
A Recurrent Neural Network (RNN) is a neural network model in which the hidden layer at each time step receives both the current input and its own output from the previous step. This enables it to carry information across time steps, which can be thought of as memory.
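To make the recurrence concrete, here is a minimal sketch of a single vanilla RNN step in NumPy. The weight names (W_xh, W_hh, b_h), the toy sizes and the random inputs are all illustrative, not taken from any particular library.

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    # The new hidden state depends on the current input AND the previous hidden
    # state, which is what lets the network carry information across time steps.
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

rng = np.random.default_rng(0)
W_xh = rng.normal(size=(3, 4))        # input -> hidden weights
W_hh = rng.normal(size=(4, 4))        # hidden -> hidden (recurrent) weights
b_h = np.zeros(4)

h = np.zeros(4)                       # memory starts out empty
for x_t in rng.normal(size=(5, 3)):   # a toy sequence of 5 inputs
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)
print(h)                              # the final hidden state "remembers" the sequence
```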
The problem with a Recurrent Neural Network is that its memory is very short-term. This is solved by Long Short-Term Memory (LSTM) networks.
A Long Short-Term Memory (LSTM) network is a kind of recurrent neural network capable of learning long-term dependencies. An LSTM has a cell state that runs through all the repeating modules of the network, making it easy for information to flow along it unchanged. Three gates (forget, input and output) regulate the removal of information from, and the addition of information to, the cell state.
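The following is a rough NumPy sketch of one LSTM step under the standard forget/input/output gating described above; the parameter layout (one stacked weight matrix split into four blocks) and the toy sizes are my own simplifications, not the internals of any specific library.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    # W, U, b hold stacked parameters for the forget (f), input (i) and
    # output (o) gates plus the candidate cell update (g).
    z = x_t @ W + h_prev @ U + b
    f, i, o, g = np.split(z, 4)
    f, i, o = sigmoid(f), sigmoid(i), sigmoid(o)
    g = np.tanh(g)
    c_t = f * c_prev + i * g      # forget old information, add new information to the cell state
    h_t = o * np.tanh(c_t)        # expose a filtered view of the cell state
    return h_t, c_t

hidden, inputs = 4, 3
rng = np.random.default_rng(1)
W = rng.normal(size=(inputs, 4 * hidden))
U = rng.normal(size=(hidden, 4 * hidden))
b = np.zeros(4 * hidden)

h = c = np.zeros(hidden)
for x_t in rng.normal(size=(6, inputs)):
    h, c = lstm_step(x_t, h, c, W, U, b)
```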
As an LSTM holds information over the long term, it can be trained on a text file at the character level as input, so that it learns to predict the next character in the sequence. Other interesting applications of RNN/LSTM models include music generation, image captioning, language translation and even writing The Bible.
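As a toy illustration of what "character level" means here, this is how a text can be sliced into (window, next character) training pairs; the English sentence, window length and stride are stand-ins, not the actual Tamil data or settings.

```python
text = "without love in the heart life is like a sapless tree"
maxlen, step = 10, 3                       # window length and stride chosen arbitrarily

sentences, next_chars = [], []
for i in range(0, len(text) - maxlen, step):
    sentences.append(text[i:i + maxlen])   # input: a window of characters
    next_chars.append(text[i + maxlen])    # target: the character that follows

print(sentences[0], "->", next_chars[0])   # 'without lo' -> 'v'
```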
For a deeper understanding of these neural networks, check the references.
About the Dataset - Thirukkural
Thirukkural is one of the most prominent and celebrated works in Tamil literature. It is also one of the most widely translated non-religious works in the world. It was written by the poet Thiruvalluvar, who is believed to have lived around the 6th century. It is a unique ethical guide that lays down a code of conduct for mankind to follow for all time to come. In total, there are 1330 couplets (two lines joined by rhyme), divided into 133 sections of 10 couplets each. Each couplet has exactly 7 words: 4 in the first line and 3 in the second.
Example couplet from Thirukkural (no. 78)
அன்பகத் தில்லா உயிர்வாழ்க்கை வன்பாற்கண் வற்றல் மரந்தளிர்த் தற்று
An English explanation would be:
Without love in the heart, Life is like a sapless tree in a barren desert.
Why this dataset?
Since childhood, I have admired how he expresses great morals in just 7 words. He wrote only 1330 couplets, which deliver almost all the essential morals a person needs to lead a successful life. This provoked my curiosity: what would it be like if Thiruvalluvar had written more Thirukkural? Crazy thought!!
About the model
The first step is to create the dataset. I found the Thirukkural text in JSON format, parsed the JSON file and created a text file containing the 1330 couplets alone.
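A rough sketch of that preparation step is below; the file name thirukkural.json and the keys kural, Line1 and Line2 are assumptions about the JSON layout rather than the exact schema of the file I used.

```python
import json

# Load the JSON source (hypothetical file name and keys).
with open("thirukkural.json", encoding="utf-8") as f:
    data = json.load(f)

# Write the couplets alone into a plain text file, two lines per couplet
# (4 words on the first line, 3 on the second).
with open("thirukkural.txt", "w", encoding="utf-8") as out:
    for couplet in data["kural"]:
        out.write(couplet["Line1"] + "\n" + couplet["Line2"] + "\n")
```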
The next step is to feed this file as input to the LSTM model. I took the sample code from the Keras GitHub repository and adapted it to this dataset. It is a simple model with a single LSTM layer of 128 neurons. It is fed a very small dataset of 1330 couplets of 7 words each, 9310 words in total. The model is trained for 20 iterations.
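For reference, here is a condensed sketch along the lines of the Keras lstm_text_generation example that I adapted; the window length of 40 characters, stride of 3 and batch size of 128 come from that example and may not match my exact run.

```python
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense

# Read the prepared couplet file and build the character vocabulary.
text = open("thirukkural.txt", encoding="utf-8").read()
chars = sorted(set(text))
char_idx = {c: i for i, c in enumerate(chars)}

# Cut the text into overlapping windows and their next characters.
maxlen, step = 40, 3
sentences = [text[i:i + maxlen] for i in range(0, len(text) - maxlen, step)]
next_chars = [text[i + maxlen] for i in range(0, len(text) - maxlen, step)]

# One-hot encode each character window and its target character.
x = np.zeros((len(sentences), maxlen, len(chars)), dtype=bool)
y = np.zeros((len(sentences), len(chars)), dtype=bool)
for i, sentence in enumerate(sentences):
    for t, ch in enumerate(sentence):
        x[i, t, char_idx[ch]] = True
    y[i, char_idx[next_chars[i]]] = True

# A single LSTM layer of 128 units with a softmax over the character vocabulary.
model = Sequential([
    LSTM(128, input_shape=(maxlen, len(chars))),
    Dense(len(chars), activation="softmax"),
])
model.compile(loss="categorical_crossentropy", optimizer="rmsprop")
model.fit(x, y, batch_size=128, epochs=20)
```

Generation then works the same way as in the Keras example: feed a seed window containing the start word, sample a character from the softmax output, append it to the window and repeat.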
Results
The results clearly show that Thiruvalluvar cannot be replaced by a machine any time soon. But if you look closely, the machine (LSTM) has astonishingly learnt to produce sensible Tamil words with punctuation. I was also amazed at how this simple model learnt the structure of a Thirukkural, with roughly four words in the first line and three in the second.
Start Word : “??????? “
??????? ??????? ??????? ????????? ???????? ?????????? ????. ????????? ????????????? ??????????? ??????? ????? ????????? ??????.
Start Word : “ ?????”
??????????? ???????? ?????? ?????? ???????? ???? ????. ??????? ???????? ??????????? ???????????? ?????? ?????????????? ????????.
Start Word : “ ????? “
??????????? ??????? ?????? ????????? ?????????? ?????? ????????? ??????. ??????????? ?????????? ???????? ???????????? ???? ??????? ?????.
Start Word : “ ?????? “
?????? ??? ???????? ???????? ???????? ?????? ????. ?????? ?????????? ?????????? ???????? ???????? ???????? ?????.
Start Word : “ ??????”
???????? ???????? ???????? ?????? ?????????????? ???????? ??????.
Start Word : “ ???????”
??????? ?????? ??????? ????????? ?????????? ????????????? ???????. ??????? ?????? ???????? ???????? ??????? ??????? ??????? ???????.
Start Word : “ ?????? “
?????? ??????? ????????? ??????? ??????? ??????? ????????. ?????? ??????? ????? ??????????? ????? ??????? ?????.
Takeaways
- LSTM is capable of holding information over the long term, hence we can say it has memory
- LSTM networks can model the temporal aspects of data and hence are widely used for text, video and time-series
References
RNN/LSTM
- The Unreasonable Effectiveness of Recurrent Neural Networks
- Understanding LSTM Networks
- Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)
- Understanding LSTM and its diagrams
- Composing Music With Recurrent Neural Networks