+ All Categories
Home > Documents > Presenter : Chang,Chun-Chih Authors : David Milne * , Ian H. Witten 2012, AI

Presenter : Chang,Chun-Chih Authors : David Milne * , Ian H. Witten 2012, AI

Date post: 22-Feb-2016
Category:
Upload: mireya
View: 28 times
Download: 0 times
Share this document with a friend
Description:
An open-source toolkit for mining Wikipedia. Presenter : Chang,Chun-Chih Authors : David Milne * , Ian H. Witten 2012, AI. Outlines. Motivation Objectives Methodology Experiments Conclusions Comments. Motivation. - PowerPoint PPT Presentation
Popular Tags:
15
Intelligent Database Systems Presenter : Chang,Chun-Chih Authors : David Milne * , Ian H. Witten 2012, AI An open-source toolkit for mining Wikipedia
Transcript
Page 1: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Presenter : Chang,Chun-Chih

Authors : David Milne * , Ian H. Witten

2012, AI

An open-source toolkit for mining Wikipedia

Page 2: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

OutlinesMotivationObjectivesMethodologyExperimentsConclusionsComments

Page 3: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Motivation The online encyclopedia Wikipedia is a vast,

constantly evolving tapestry of interlinked articles.

For developers and researchers it represents a giant multilingual database of concepts and semantic relations, a potential resource for natural language processing

Page 4: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Objectives

• The Wikipedia Miner toolkit, an open-source software system that allows researchers and developers to integrate Wikipedia’s rich semantics into their own applications.

• Wikipedia Miner is intended to be a platform for sharing data mining techniques.

Page 5: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Methodology - Architecture of the wikipedia Miner toolkit

Page 6: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Methodology - Measuring relatedness between concepts

Page 7: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Methodology - Measuring relatedness between concepts

Page 8: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Methodology -Features for measuring artucle relatedness

Page 9: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Experiments - Impact of thresholds for disambiguation and detection

Page 10: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Experiments - Impact of relatedness dependencies

Page 11: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Experiments - Impact of traning data

Page 12: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Experiments - performance of the disambiguator

Page 13: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Experiments - performance of the detector

Page 14: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Conclusions

• Our aim in releasing this work open source is not to provide a complete and polished product,

• but rather a resource for the research community to collaborate around and continue building together.

Page 15: Presenter   :  Chang,Chun-Chih Authors      : David Milne  * , Ian H. Witten 2012,  AI

Intelligent Database Systems Lab

Comments

• Advantages• Applications - wikipedia - Disambiguation - Annotation


Recommended