+ All Categories
Home > Documents > Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney...

Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney...

Date post: 17-Dec-2015
Category:
Upload: spencer-perkins
View: 217 times
Download: 3 times
Share this document with a friend
Popular Tags:
26
Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman Rutgers University ACM Trans. on Information Systems 2003
Transcript
Page 1: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Measuring Praise and Criticism: Inference of

Semantic Orientation from AssociationPeter D. Turney

National research Council CanadaMichael L. LittmanRutgers University

ACM Trans. on Information Systems 2003

Page 2: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Outline

• Introduction.• Semantic Orientation from Association.• Related Work.• Experiments.• Conclusions.

Page 3: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Introduction

• The evaluative character of a word is called its semantic orientation.– It is also known as valence in the linguistics literature.

• A positive semantic orientation denotes a positive evaluation (i.e., praise) and a negative semantic orientation denotes a negative evaluation (i.e., criticism).

• Semantic orientation has both direction (positive or negative) and intensity (mild or strong).– Okay – fabulous (mild – strong).

Page 4: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Introduction (cont.)

• We introduce a method for automatically inferring the direction and intensity of the semantic orientation of a word from its statistical association with a set of positive and negative paradigm words.– Using two different measures of word association:

• Pointwise Mutual Information (PMI).

• Latent Semantic Analysis (LSA).

– PMI and LSA are based on co-occurrence.• A word is characterized by the company it keeps.

• The semantic orientation of a word tends to correspond to the semantic orientation of its neighbors.

Page 5: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Applications

• Semantic orientation may be used to– Classify reviews (movie reviews or automobile reviews) as positi

ve or negative [Turney 2002].

– Provide summary statistics for search engines.• Query “Paris travel review”, return “5,000 hits, 80% positive, 20% ne

gative). [Hearst 1992].

– Filter “flames” for newsgroups. [Spertus 1997]

– Software game, chat system … etc.

Page 6: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Semantic Orientation from Association

• The semantic orientation of a given word is calculated from the strength of its association with a set of positive words, minus the strength of its association with a set of negative words.

– Pwords: a set of words with positive semantic orientation.

– Nwords: a set of words with negative semantic orientation.

– A(w1,w2): a measure of association between w1 and w2.

• Maps to a real number. • Positive/negative – presence/absence.

NwordsnwordPwordspword

nwordwordApwordwordAwordASO ),(),()(

(Semantic Orientation from Association): positive/negative, magnitude – strength.

Page 7: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Semantic Orientation from Association (cont.)

• Seven positive and seven negative words are used as paradigms of positive and negative semantic orientation:– Good, nice, excellent, positive, fortunate, correct, and superior.– Bad, nasty, poor, negative, unfortunate, wrong, and inferior.

• Supervised or unsupervised learning ??– It seems more appropriate to say that the paradigm words are

defining semantic orientation, rather than training the algorithm.

Page 8: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI

• The Pointwise Mutual Information (PMI) between two words is defined as follows:

– If the words are statistically independent, PMI 0.• PMI = log2(p(w1)p(w2) / p(w1)p(w2)) = log2(1) = 0.

– Tend to co-occur positive.• PMI = log2(p(w1) or p(w2) / p(w1)p(w2)) = log2(1/p(w1) or p(w2)) = log2

(a value > 1) positive.

– Presence – absence negative.• PMI = log2( p(w1 & w2) ≈0) / p(w1)p(w2)) = log2(a value≈0) = -∞.

)()(

)&(log),(

21

21221 wordpwordp

wordwordpwordwordPMI

Page 9: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI (cont.)

• We estimates PMI by issuing queries to a search engine (AltaVista) and noting the number of hits (matching documents).– AltaVista was chosen over other search engines because it has

a NEAR operator.• Which constrains the search to documents that contain the words wi

thin ten words of one another, in either order.

• Previous work [Turney 2001] has shown that NEAR performs better than AND when measuring the strength of semantic association between words.

Page 10: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI (cont.)

)(1

)(1

)(1

log),(

)()(

)()(

log

),(),()(

21

21

221

2

wordhitsN

wordhitsN

word NEAR wordhitsNwordwordPMI

where

nword NEAR wordhitspwordhits

nwordhitspword NEAR wordhits

nwordwordPMIpwordwordPMIwordPMISO

NwordsnwordPwordspword

NwordsnwordPwordspword

NwordsnwordPwordspword

• To avoid division by zero, 0.01 was added to the number of hits.– This is a form of Laplace

smoothing.

• Other alternatives to PMI:– Likelihood ratios,

– Z-score.

N: number of document in AltaVistahits(pword) & hits(nword) constants

Page 11: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-LSA

• SO-LSA applies Latent Semantic Analysis (LSA) to calculate the strength of the semantic association between words.– LSA uses the Singular Value Decomposition (SVD) to analyze

the statistical relationships among words in a corpus.

• The first step is to use the text to construct a matrix X in which the row vectors represent words and the column vectors represent chunks of text (e.g., sentences, paragraphs, documents).– Each cell represents the weight of the corresponding word in the

corresponding chunk of text.– TF-IDF weighting.

Page 12: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-LSA (cont.)

• The next step is to apply SVD to X, to decompose X into a product of three matrix U∑VT.– U and V are in column orthonormal form.– ∑ is a diagonal matrix of singular value.

• X can be approximated by the matrix Uk∑kVkT by selectin

g the top k singular values and vectors.

Xwords

sentences

Uk≈

words

k hidden semantics

VkT∑k

sentences

Page 13: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-LSA (cont.)

• The similarity of two words LSA(word1, word2) is measured by the cosine of the angle between their corresponding row vectors of Uk.

• Then, SO-LSA of a word is defined as follows:– SO-LSA(word) = [LSA(word, good) + … + LSA(word, superior)]

- [LSA(word, bad) + … + LSA(word, inferior)].

– Positive/negative, magnitude – strength.

• 有個問題 (or assumption)…corpus (X)必須要包含要 predict的 word與 14 paradigm.

Page 14: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Experiments

• Lexicons and Corpora:– The experiments use two different lexicons and three different corpora.

• The corpora are used for unsupervised learning.– AltaVista-ENG, AltaVista-CA, TASA.

• The lexicons are used to evaluate the results of the learning.

– Lexicons:• The HM lexicon is a list of 1,336 labeled adjectives created by human

experts.– 657 positive and 679 negative.

• The GI lexicon is a list of 3,596 labeled words extracted from the General Inquirer lexicon (http://www.wjh.harvard.edu/~inquirer/).

– 1,614 positive and 1,982 negative adjectives, adverbs, nouns, and verbs.

Page 15: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI - Baseline

• A small corpus not only result in lower accuracy, but also results in less stability.

Page 16: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI – Laplace Smoothing Factor

• The smoothing factor has relatively little impact until it rises above 10, at which point the accuracy begins to fall off.

• For the small TASA corpus, the performance is quite sensitive to the choice of smoothing factor.

• There is less need for smoothing when a large quantity of data is available.

Page 17: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI – Neighborhood Size

• We can vary the neighborhood size with the TASA corpus.

• A small neighborhood:– Words that occur closer to

each other are more likely to be semantically related.

• A large neighborhood:– There will usually be more

occurrences of the pair within a large neighborhood than within a small neighborhood.

– Tend to have higher statistical reliability.

• A larger corpus should yield better statistical reliability than a smaller corpus, so the optimal neighborhood size will be smaller with a larger corpus.

• It seems best to have a neighborhood size of at least 100 words.

– 10 words is clearly suboptimal for TASA.

Page 18: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI – Neighborhood Size (cont.)

• With AltaVista, we can use the AND operator instead of the NEAR operator to test the effect of the neighborhood size.

• NEAR is clearly superior to AND, but the gap closes as the threshold decreases.

• The smaller corpus show more clearly the greater sensitivity of a small neighborhood.

Page 19: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-PMI – Product versus Disjunction

• We investigate the effect of the OR operator.– Pquery = (good OR nice OR

… OR superior).

– Nquery = (bad OR nasty OR … OR inferior).

• There is a clear advantage to using our original equation, but the two equations have similar performance with the smaller corpora.

)()(

)()(log)( 2 PqueryhitsNquery NEAR wordhits

NqueryhitsPquery NEAR wordhitswordPMISO

Page 20: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-LSA - Baseline

• The TASA corpus was used to generate a matrix X with 92,409 rows (words) and 37,651 columns (documents), and SVD was used to reduce the matrix to 300 dimensions.

• SO-PMI and SO-LSA have approximately the same accuracy when evaluated on the full test set, but SO-LSA rapidly pulls ahead as we decrease the percentage of the test set that is classified.

– SO-LSA appears more stable than SO-PMI.

Page 21: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

SO-LSA – Number of Dimensions

• The behavior of LSA is known to be sensitive to the number of dimensions of the matrix.

• The optimal value is likely near 250 dimensions.

Page 22: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Varying the Paradigm Words

• The experiment examines the behavior of SO-A when the paradigm words are randomly selected.

• Since rare words would tend to require a larger corpus for SO-A to work well, we controlled for frequency effects.

• For each original paradigm word, we found the word in the General Inquirer lexicon with the same tag (Pos or Neg) and the most similar frequency.– The frequency was measured by the number of hits in AltaVista.

Page 23: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Varying the Paradigm Words (cont.)

Page 24: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Varying the Paradigm Words (cont.)

• The inclusion of some of the words, such as “pick”, “raise”, and “capital”, may seem surprising.

– These words are only negative in certain contexts, such as “pick on your brother”, “raise a protest”.

• It is clear that the original words perform much better than the new words.

• We hypothesized that the poor performance of the new paradigm words was due to their sensitivity to context.

Page 25: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Related Work• Sentiment (orientation or polarity) classification:

– Classification words by positive or negative semantic orientation.

• Subjectivity analysis:– To distinguish sentences (or paragraph or documents or other suitable

chunks of text) that present opinions and evaluations from sentences that objectively present factual information.

• (Product or movie) review mining:– To extract the positive and negative features from reviews.– Became a popular research issues since the emergence of Web 2.0.

• CIKM 2006/2007 CFP, WWW 2007 CFP.– An application (instance) of the sentiment classification and subjectivity

analysis.– Need to classify the orientation of a review (sentence),

• Which require a orientation lexicon, usually composed by human experts.

Page 26: Measuring Praise and Criticism: Inference of Semantic Orientation from Association Peter D. Turney National research Council Canada Michael L. Littman.

Conclusions

• This paper has presented a general strategy for measuring semantic orientation from semantic association.

• Two instance of this strategy have been empirically evaluated. – A high accuracy is attained on the test set.


Recommended