Date post: | 29-May-2018 |
Category: |
Documents |
Upload: | khang-nguyen |
View: | 223 times |
Download: | 0 times |
of 24
8/9/2019 meghneel
1/24
Text Categorization With Support
Vector Machines: Learning WithMany Relevant Features
By Thornsten JoachimsPresented By Meghneel Gore
8/9/2019 meghneel
2/24
Goal of Text Categorization Classify documents into a number of pre-
defined categories.
Documents can be in multiple categories
Documents can be in none of the categories
8/9/2019 meghneel
3/24
Applications of Text
Categorization Categorization of news stories for online
retrieval
Finding interesting information from theWWW
Guiding a user's search through
hypertext
8/9/2019 meghneel
4/24
Representation of Text Removal of stop words
Reduction of word to its stem
Preparation of feature vector
8/9/2019 meghneel
5/24
Representation of Text
.......................
......................
......................
......................
......................
...........................................
2 Comput
1 Process2 Buy3 Memory
....
This is a Document Vector
8/9/2019 meghneel
6/24
What's Next... Appropriateness of support vector
machines for this application
Support vector machine theory
Conventional learning methods
Experiments
Results
Conclusions
8/9/2019 meghneel
7/24
Why SVMs? High dimensional input space
Few irrelevant features
Sparse document vectors
Text categorization problems are linearlyseparable
8/9/2019 meghneel
8/24
Support Vector Machines
Visualization of a Support Vector Machine
8/9/2019 meghneel
9/24
8/9/2019 meghneel
10/24
Support Vector Machines We define a structure of hypothesis
spaces Hi such that their respective VCdimensions di increases
8/9/2019 meghneel
11/24
Support Vector Machines Lemma [Vapnik, 1982]
Consider hyperplanes}{)( bdwsigndh !
TTT
As hypotheses
8/9/2019 meghneel
12/24
Support Vector Machines
Awwithb
dw!u
TTT,
1
If all example vectors are contained in
Ahypersphere of radius R and it isRequired that
8/9/2019 meghneel
13/24
Support Vector Machines Then this set of hyperplane has a VC
dimension d bounded by
1)],min([ 22 e nARd
8/9/2019 meghneel
14/24
8/9/2019 meghneel
15/24
Conventional Learning
Methods Nave Bayes classifier
Rocchio algorithm
K-nearest Neighbors
Decision tree classifier
8/9/2019 meghneel
16/24
Nave Bayes Classifier Consider a document vector with
attributes a1, a2 an with target values v
Bayesian approach:
),,,(maxarg 21 njVv
map aaavPvj
-
!
8/9/2019 meghneel
17/24
Nave Bayes Classifier We can rewrite that using Bayes
theorem as
)()...,(maxarg
)...,(
)()...,(maxarg
21
21
21
jjnVv
n
jjn
Vvmap
vPvaaaP
aaaP
vPvaaaPv
j
j
!
!
8/9/2019 meghneel
18/24
Nave Bayes Classifier Nave Bayes method assumes that the
attributes are independent
)""(
...)""()""()(maxarg
)()(maxarg
11
21},{
1},{
j
jjjdislikelikev
n
i
jijdislikelikev
NB
vsnowaP
vhadaP
vMary
aPv
P
vaPvPv
j
j
!
!!!
!
!
-
8/9/2019 meghneel
19/24
Experiments Datasets
Performance measures
Results
8/9/2019 meghneel
20/24
Datasets Reuters-21578 dataset
9603 training examples
3299 testing documents
Ohsumed Corpus
10000 training documents
10000 testing examples
8/9/2019 meghneel
21/24
Performance Measures Precision
Probability that a document predicted to be
in class x truly belongs to that class
Recall
Probability that a document belonging to
class x is classified into that class Precision/recall breakeven point
8/9/2019 meghneel
22/24
Results
Precision/recall break-even point on Ohsumed dataset
8/9/2019 meghneel
23/24
Results
Precision/recall break-even point on Reuters dataset
8/9/2019 meghneel
24/24
Conclusions Introduces SVMs for text categorization
Theoretical and empirical evidence thatSVMs are well suited for textcategorization
Consistent improvement in accuracy over
other methods