+ All Categories
Home > Documents > RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP -...

RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP -...

Date post: 13-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
28
> < 1 SENTIMENT ANALYSIS KADRIYE YASEMIN USTA OĞUZHAN MURAT ÇAKMAK CMPE 561:Natural Language Processing RESEARCH PROJECT 1 SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS
Transcript
Page 1: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

>< 1

SENTIMENT ANALYSISKADRIYE YASEMIN USTA

OĞUZHAN MURAT ÇAKMAK

CMPE 561:Natural Language Processing

RESEARCH PROJECT

1SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS

Page 2: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

010203

Introduction

Sentiment Classification

The state-of-the-art

040506

Tools&Corpora

Challenges

Conclusions

2

Agenda

SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS

Page 3: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

01Introduction

3

SentimentSentiment := <Holder, Target, Polarity, Auxiliary>

Holder: who expresses the sentiment

Target: what/whom the sentiment is expressed to

Polarity: the nature of the sentiment (positive, negative, or neutral)

Auxiliary: strength, summary, confidence, time

Page 4: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

01

4SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS

Page 5: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

015

Introduction

Sentiment Analysis

Computational study of opinions, sentiments, appraisal, and emotions expressed in text

Ex: Reviews, blogs, discussions, microblogs, social networks

Basic tasks:

Holder detection

Target recognition

Sentiment (Polarity) classification

Opinion summarization

Opinion spam detection

Page 6: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

016

Introduction

Application Domains

Politics/political science: to find public opinions about political candidates and issues

Law/policy making: to examine public opinions about law candidates and issues

Sociology: to understand Idea propagation through groups

Psychology: to investigate dream sentiment analysis

Page 7: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

02Sentiment

Classification

7

Types of Sentiment Classification

Lexicon Based Sentiment Classification

Dictionary-based Approach

Corpora-based Approach

Statistical and Semantic

Machine Learning Based Sentiment Classification

Unsupervised Learning

Supervised Learning

Decision Tree,Linear,Rule-based,Probabilistic

Page 8: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

028

Sentiment Classification

Lexicon-based Sentiment Classification

Use the dominant polarity of the opinion words (ADJ, ADV and VERB) in the sentence to determine its polarity If positive/negative opinion prevails, the opinion sentence is regarded as positive/negative

Optimization Methods:Lexicon + Grammar Rule + Inference Lexicon + Counting

Page 9: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

02

9SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS

General Work-Flow Diagram of Lexicon-based Classification

Page 10: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

0210

Machine Learning-based Sentiment Classification

Sentiment Classification

Treat sentiment classification simply as a special case of topic-based categorization

Ex : With the two “topics” being positive sentiment and negative sentiment

Common Method Data + Feature + Model

Page 11: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

02

11SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS

General Work-Flow Diagram of Machine Learning-based Classification

Page 12: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

0212

Sentiment Classification

Machine Learning-based vs Lexicon-based Classification

No explicit result The success depends on details, data, domains, scenarios

Observation Lexicon-based: simple, intuitive, and understandable ML-based: data-driven, uninterruptible, and large-scaled

Briefly, Sentence: Lexicon-based is better Document: ML-based is better

Page 13: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

03The state-of-the-art

13

Current Studies• Turney(2002)

an unsupervised learning algorithm for classifying reviews as recommended or not recommended

• Starts with tagging phrases, then calculates PMI(Point-wise Mutual Information)

Page 14: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

03The state-of-the-art

14

Current Studies

• Pang and Lee(2008) classify documents not by topic, by overall sentiment

Page 15: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

03The state-of-the-art

15

Current Studies• Socher et al.(2013) uses

Deep Learning

• Introduce a sentiment treebank

• Recursive Neural Tensor Network(RNTN)

• Very good results

Page 16: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

04Tools&Corpora

16

Tools

General Inquirer :A computer-assisted approach for content analysis of textual data.

• Classifies as positive and negative

Page 17: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

04Tools&Corpora

17

Tools

LIWC(Linguistic Inquiry and Word Count) • The program includes the primary text analysis

module along with a group of built-in dictionaries.

• Reads computer-readable form.

• Contains 6400 words, word stems, and selected emoticons

• Affective(negative, positive) and cognitive(tentative, inhibition)

Page 18: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

04Tools&Corpora

18

Tools

WordNet is a lexical database.

• Include sets of synonyms called senses, short definitions, use cases, and various relations among the words.

Page 19: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

04Tools&Corpora

19

Tools

SentiWordNet is a strategy for examination of definition related to synnets of the WordNet data set.

• The synnet has three numerical score including objective, negative, and positive polarity.

Page 20: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

04Tools&Corpora

20

Tools

SentiTurkNet is the first Turkish polarity resource.

• Three scores: positivity, negativity and objectivity

• Results for Turkish better than SentiWordNet

Page 21: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

0421

Corpora

Tools&Corpora

• Pang & Lee Data SetsPolarity(positive or negative) Subjective Rate(from 1 to 5) Subjective or Objective

• Blitzer et al. Multi-domain sentiment analysisAmazon dataClassify as Pos or Neg

• MPQAPrivate states(beliefs, emotions, sentiments, speculations

Page 22: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

05Challenges

• Sentiments words and phrases(lexicon) is necessary but not sufficient for high performance sentiment analysis 22

Current Challenges

Page 23: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

05Challenges

• A positive or negative opinion word may have inverse introductions in diverse application areas.

• “This camera sucks”

• “This vacuum cleaner really sucks"

23

Current Challenges

Page 24: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

05Challenges

• Sarcastic sentences with or without sentiments words are hard to handle

• “What a great car! It stopped working in two days."

24

Current Challenges

Page 25: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

05Challenges

• A sentence containing opinion words may not express any sentiment.

• e.g “Can you tell me which Sony camera is good?”

• “Does anyone know how to repair this terrible printer?"

25

Current Challenges

Page 26: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

05Challenges

• Many sentences without sentiment words can likewise suggest opinions.

• “This washer uses a lot of water”

26

Current Challenges

Page 27: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

><

06Conclusions

27

Sentiment analysis is a difficult task

The difficulty increases with the nuance and complexity of opinions expressed

There are many different methods for sentiment analysis

- Lexicon-based- Machine Learning-based

Page 28: RESEARCH PROJECT SENTIMENT ANALYSIS 561/… · RESEARCH PROJECT SPRING 2017 CMPE 561: NLP - SENTIMENT ANALYSIS < > 01 02 03 Introduction Sentiment Classification The state-of-the-art

THANKS FOR LISTENING

28


Recommended