+ All Categories
Home > Documents > Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Date post: 22-Feb-2016
Category:
Upload: xanti
View: 44 times
Download: 0 times
Share this document with a friend
Description:
Chapters 1 - 5. Ontology Learning and Population from Text: Algorithms, Evaluation and Applications. Presented by Sole. Introduction. Artificial intelligence - PowerPoint PPT Presentation
Popular Tags:
35
ONTOLOGY LEARNING AND POPULATION FROM TEXT: ALGORITHMS, EVALUATION AND APPLICATIONS Presented by Sole Chapters 1 - 5
Transcript
Page 1: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

ONTOLOGY LEARNING AND POPULATION FROM TEXT: ALGORITHMS, EVALUATION

AND APPLICATIONS

Presented by Sole

Chapters 1 - 5

Page 2: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Introduction Artificial intelligence

Build systems that incorporate knowledge about a domain to reason on the basis of this knowledge and solve problems not encountered before Include explicit and symbolic representation of

knowledge about a domain Symbolic representation and procedural aspects

are separated so that it can be reused across systems

Which symbols to use and what they stand for?

2

Page 3: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Introduction Ontology

Defines what is important in a domain and how concepts are related Knowledge-based system: determine which

symbols are needed and how they are interpreted Logical level: interpretation can be constraint

according to the ontology by axiomatizing symbols Issues

Costly to construct Time-consuming Significant coverage of domain is needed Meaning and consistent generalization are required

Knowledge

Acquisition

Bottleneck

3

Page 4: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Introduction Solution

Automatically learn ontologies from data Goal: bridging the gap between

World of symbols (words used in natural language) World of concepts (abstractions of human thought)

Challenge Correctness and consistency of the model can not

be guaranteed Human post-processing definitely necessary

Automatically learned ontologies need to be inspected, validated, and modified by humans before they can be applied for applications relying on logical reasoning

4

Page 5: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Ontologies Definition

Philosophical discipline Science of existence or the study of being

Computer Science Formal specifications of a conceptualization

Resources representing the conceptual model underlying a certain domain, describing it in a declarative fashion and thus cleanly separating it from procedural aspects

5

Page 6: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Ontologies Example

6

Page 7: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text Ontology learning

Acquire a domain model from data Lifting : XML-DTDs, UML diagrams, databases Semi-structured sources: HTML, XML Unstructured sources: ontology learning from text

7

Page 8: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text Meaning triangle

Every language has symbols that evoke a concept that refers to a concrete individual in the world

8

Page 9: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text Ontology population

Learning concepts and relations Knowledge markup or annotation: select text

fragments and assign them to an ontological concept

Applications Several methods have been developed in

recent years Challenge

No consensus within ontology learning community on concrete tasks for ontology learning

Comparison between approaches is difficult

9

Page 10: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text10

Ontology learning tasks (layer cake)

Page 11: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text11

Terms: Task: find a set of relevant concepts and

relations E.g., words, multi-word compounds

State-of-the-art IR methods NLP methods: POS tagger, statistical

approaches

Page 12: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text12

Synonyms: Task: find words which denote the same

concept E.g., synsets on WordNet

State-of-the-art Semantically-similar words Sense disambiguation and synonym discovery Latent Semantic Indexing (LSI) Statistical information measures defined over

the Web to detect synonyms

Page 13: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text13

Concepts: Task: find intentional definitions of concept,

their extension, and lexical signs used to refer to them

State-of-the-art Clusters of related terms LSI-based techniques Discovery of hierarchies of named entities Know-it-all system OntoLearn system

Page 14: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text14

Hierarchies: Task: concept hierarchy induction,

refinement and lexical extension State-of-the-art

Lexico-syntactic patterns Clustering algorithm to automatically derive

concept hierarchies Analysis of term co-occurrence in same

sentence/document

Page 15: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text15

Relations: Task: learn relations identifiers or labels as

well as their appropriate domain and range State-of-the-art

Association rules Syntactic-dependencies

Very few approaches address the issue of learning ontology relations from text

Page 16: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text16

Axiom schemata instantiations: Task: learn which concepts, relations, or

pair of concepts the axioms in a given system apply to

General axioms Task: derive more complex relationships

and connections between concepts and relations Logical interpretations constraining the

interpretation of concepts and relations

Page 17: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Learning from Text17

Population: Task: learn instances of concepts and

relations State-of-the-art

Associated to well-known tasks for which a variety of approaches have been developed

Information extraction Named entity recognition

Page 18: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics18

Natural Language Processing

Page 19: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics19

Pre-processing steps

Chunking Syntactic analysis: parsing

NLP

Page 20: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics20

Pre-processing

Contextual features

Syntactic dependencies

Bank

River FinancialInstitution

The museum houses an impressive collection of medieval and modern art. The building combines geometric abstraction with classical references that allude to the Roman influence on the region.

NLP

Page 21: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics21

Similarity measures

NLP

Page 22: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics22

Similarity measures Binary similarity measures

Geometric similarity measures

NLP

Page 23: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics23

Similarity measures Measures based on probability distribution

Hypothesis testing

NLP

Page 24: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics24

Term relevance Weight the importance of a term in a

document

NLP

Page 25: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics25

WordNet Lexical database for the English language

NLP

Page 26: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics26

Formal concept analysis Formal objects: concepts+ Formal attributes: characteristics describing

objects+ Incidence relation: information about which

attributes hold for each object= Formal context

Page 27: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics27

Example

FCA

Page 28: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics28

Example

FCA

Page 29: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics29

Machine learning Automatic recognition/detection of patterns

and regularities within sample data Patterns can be used to understand/describe the

data or to make predictions Learning process

Supervised Predicts the appropriate category for an example

from a set of categories represented by a set of labels

Unsupervised Search for common and frequent structures within

the data (data exploration)

Page 30: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics30

Supervised learning Regression

Numeric prediction (labels are continue values) Classification

Assign proper category to a given example

ML

Target value

Feature vector

Page 31: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics31

Classifiers Bayesian Classifiers Decision Trees Instance-Based Learning Support Vector Machines Artificial Neural Networks

Tools WEKA RapidMiner

ML

Page 32: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics32

Examples

ML

Page 33: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics33

Unsupervised learning Clustering: find groups of similar objects in data

There is no labeled data to train from Classification

Hierarchical vs. non-hierarchical Non-hierarchical algorithms produce a set of groups Hierarchical algorithms order groups in a tree

structure Hard vs. soft

Hard: elements are assigned to distinct clusters Soft: elements are assigned to clusters with a

certain degree of membership

ML

Page 34: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Basics34

Algorithms K-means Hierarchical clustering Hierarchical Agglomerative (Bottom-Up)

Clustering Divisive (Top-Down) Clustering

ML

Page 35: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Datasets35

Corpus description


Recommended