
Natural Language Processing _ Part 3

ARNAB MUKHERJEE

Automation Specialist (Python & Analytics) at Capgemini || Master's in Data Science || PGDM (Product Management) || Six Sigma Yellow Belt Certified || Certified Google Professional Workspace Administrator

Published: February 18, 2023

Sentiment Analysis

Sentiment Analysis or opinion mining is an NLP technique used to determine whether data is positive, negative, or neutral.


Types of Sentiment Analysis:

1. Standard Sentiment Analysis: This determines the overall opinion that a piece of writing expresses about its subject, whether that subject is a person, a place, or a thing. The opinion can be positive, negative, or neutral.

E.g.:

"The book 'The Complete Idiot's Guide to Statistics' is a fantastic book" - Positive

"I need a free trial of your course to see if it covers all relevant topics or not" - Neutral

"This book on AI ML is so confusing" - Negative

2. Fine-Grained Sentiment Analysis:

Here the positive and negative classes are split into finer sub-divisions:

Very Positive

Positive

Neutral

Negative

Very Negative
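As a minimal sketch of how the five classes above could be assigned, assume some upstream model already produces a polarity score between -1.0 and 1.0. The function name fine_grained_label and the cut-off values are hypothetical choices made for this illustration, not part of any library.

```python
def fine_grained_label(score: float) -> str:
    """Map a polarity score in [-1.0, 1.0] to one of five sentiment classes."""
    if not -1.0 <= score <= 1.0:
        raise ValueError("score must lie in [-1.0, 1.0]")
    # Hypothetical thresholds; in practice they would be tuned on labelled data.
    if score >= 0.6:
        return "Very Positive"
    if score >= 0.2:
        return "Positive"
    if score > -0.2:
        return "Neutral"
    if score > -0.6:
        return "Negative"
    return "Very Negative"


for s in (0.9, 0.3, 0.0, -0.4, -1.0):
    print(s, fine_grained_label(s))
```

The only difference from standard sentiment analysis is the number of buckets the score is cut into.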

3. Emotion Detection: This detects the emotion behind a piece of text. It can be anger, sadness, happiness, or any other emotion.

4. Aspect-Based Sentiment Analysis: When someone reviews a product they have used, aspect-based sentiment analysis identifies which aspect of the product each opinion in the review refers to.
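To make the idea of aspects concrete, here is a deliberately simple keyword-based sketch. The aspect vocabulary, the sentiment word lists, and the function name aspect_sentiments are all assumptions invented for this example; a real system would learn them from data.

```python
import re

# Hypothetical, hand-picked vocabularies used only for illustration.
ASPECT_KEYWORDS = {
    "battery": {"battery", "charge", "charging"},
    "camera": {"camera", "photo", "photos"},
    "price": {"price", "cost"},
}
POSITIVE_WORDS = {"great", "good", "excellent", "love"}
NEGATIVE_WORDS = {"bad", "poor", "terrible", "hate"}


def aspect_sentiments(review: str) -> dict:
    """Return a sentiment label for every aspect mentioned in the review."""
    results = {}
    # Score each clause separately so opinions do not leak between aspects.
    for clause in re.split(r"[.,;!?]|\bbut\b|\band\b", review.lower()):
        words = set(re.findall(r"\w+", clause))
        score = len(words & POSITIVE_WORDS) - len(words & NEGATIVE_WORDS)
        label = "Positive" if score > 0 else "Negative" if score < 0 else "Neutral"
        for aspect, keywords in ASPECT_KEYWORDS.items():
            if words & keywords:
                results[aspect] = label
    return results


print(aspect_sentiments("The camera is excellent but the battery is terrible."))
```

A single review can therefore be positive about one aspect and negative about another, which an overall label would hide.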

5. Intent Detection: This detects the intent with which a note or comment was written.

E.g.: My app gets shut down as soon as I try to upload a video. Can you help?

Here the intent is to ask for assistance.
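A rule-based sketch of intent detection for the example above could look like the following. The intent names and regular expressions are hypothetical and chosen only to illustrate the idea; production systems usually train a classifier instead.

```python
import re

# Hypothetical intent patterns chosen for illustration only.
INTENT_PATTERNS = {
    "help_request": re.compile(r"\b(can you help|help me|please assist)\b"),
    "complaint": re.compile(r"\b(shuts? down|crash(es|ed)?|not working)\b"),
    "purchase": re.compile(r"\b(buy|purchase|pricing)\b"),
}


def detect_intents(message: str) -> list:
    """Return the names of all intents whose pattern occurs in the message."""
    text = message.lower()
    return [name for name, pattern in INTENT_PATTERNS.items() if pattern.search(text)]


msg = "My app gets shut down as soon as I try to upload a video. Can you help ?"
print(detect_intents(msg))
```

Note that one message can carry more than one intent: the example both reports a problem and asks for help.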

Project on Sentiment Analysis using the 'Bag of Words' model

#X_train

X_train = ["My goal in this chapter is to provide a useful concept of statistics",
           "Here comes your life preserver",
           "Not interpreting statistical information properly can lead to disaster",
           "These decisions can affect our lives in many ways",
           "Today's corporates are making major decisions based on statistical analysis",
           "The field of statistics is not evolving at all",
           "Population surveys appear to be the primary motivation for the historical development of statistics as we know it today"]

y_train = [1,1,0,1,1,0,1]

# 1 - Positive, 0 - Negative

#The class label says whether a sentence is positive or negative: 1 for positive, 0 for negative.

X_test = ["Statistics is very confusing for me"]

X_train

#Data cleaning

from nltk.tokenize import RegexpTokenizer

#Stemming

from nltk.stem.porter import PorterStemmer

#Stop word removal

from nltk.corpus import stopwords

#downloading stopwords package

import nltk

nltk.download('stopwords')

tokenizer = RegexpTokenizer(r'\w+')

#keep only runs of word characters, so punctuation is dropped

en_stopwords = set(stopwords.words('english'))

ps = PorterStemmer()

#using clean data function

def getCleanedText(text):


    #converting the text into lowercase

    text = text.lower()

    #tokenize

    tokens = tokenizer.tokenize(text)

    #stop word removal on the tokenized text

    new_tokens = [token for token in tokens if token not in en_stopwords]

    #stemming

    stemmed_tokens = [ps.stem(token) for token in new_tokens]

    #join the stemmed tokens back into one cleaned string

    clean_text = " ".join(stemmed_tokens)

    return clean_text
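As a rough illustration of the cleaning steps above, the sketch below runs tokenization, stop-word removal and stemming on the test sentence. To stay self-contained it uses a small hand-written stop-word set in place of NLTK's full English list; that set and the helper name clean_text_demo are assumptions made for this example.

```python
from nltk.stem.porter import PorterStemmer
from nltk.tokenize import RegexpTokenizer

# Assumption: a tiny stand-in for stopwords.words('english'), so no download is needed.
DEMO_STOPWORDS = {"is", "very", "for", "me", "the", "a", "an", "of", "to"}

demo_tokenizer = RegexpTokenizer(r"\w+")  # keep runs of word characters, drop punctuation
demo_stemmer = PorterStemmer()


def clean_text_demo(text):
    """Lowercase, tokenize, drop stop words, stem, and re-join the tokens."""
    tokens = demo_tokenizer.tokenize(text.lower())
    kept = [t for t in tokens if t not in DEMO_STOPWORDS]
    return " ".join(demo_stemmer.stem(t) for t in kept)


cleaned = clean_text_demo("Statistics is very confusing for me!")
print(cleaned)
```

Only the two content words survive, each reduced to its stem, which is what the vectorizer later counts.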

#display X_test

X_test

#Use clean text to clean our test data and train data

X_clean = [getCleanedText(i) for i in X_train]

Xt_clean = [getCleanedText(i) for i in X_test]

X_clean

#vectorize

#before classification we need to vectorize our text

#import CountVectorizer from scikit-learn's text feature extraction module

from sklearn.feature_extraction.text import CountVectorizer

cv = CountVectorizer(ngram_range = (1,2))

#vectorize the cleaned training text

X_vec = cv.fit_transform(X_clean).toarray()

X_vec

#so every sentence is now a vector with one count per word or word pair

#getting feature names

print(cv.get_feature_names_out())

#CountVectorizer counts how many times each word or word pair (n-gram) occurs in a sentence

#This kind of model is known as the bag-of-words model.

#vectorization for test value

Xt_vect = cv.transform(Xt_clean).toarray()

#Classification Task

#To perform text classification we use Multinomial Naive Bayes (NB)

#import Multinomial Naive Bayes (NB)

from sklearn.naive_bayes import MultinomialNB

mn = MultinomialNB()

mn.fit(X_vec , y_train)

#Perform Prediction

y_pred = mn.predict(Xt_vect)

#This will give us an array of 1s and 0s: 1 - Positive class, 0 - Negative class

y_pred
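A single 0/1 label hides how confident the classifier is. As a minimal sketch, assuming a toy corpus invented for this example (not the article's training data), predict_proba exposes the per-class probabilities behind the prediction.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Toy, made-up training data: 1 = positive, 0 = negative.
docs = ["great useful book", "clear useful course",
        "confusing book", "confusing boring course"]
labels = [1, 1, 0, 0]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(docs)
model = MultinomialNB().fit(X, labels)

X_new = vectorizer.transform(["confusing course"])
pred = model.predict(X_new)
proba = model.predict_proba(X_new)  # columns follow model.classes_, here [0, 1]

print(pred)   # predicted class label
print(proba)  # probability of class 0 and class 1; each row sums to 1
```

With only a handful of training sentences, as in this project, these probabilities are driven by very few word counts, so they are best read as a rough indication.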
