+ All Categories
Home > Documents > Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC...

Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC...

Date post: 01-Mar-2021
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
112
Tutorial on Ontology Matching Pavel Shvaiko erˆomeEuzenat Trento, Italy Monbonnot, France [email protected] [email protected] December 18, 2006
Transcript
Page 1: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Tutorial on Ontology Matching

Pavel Shvaiko Jerome Euzenat

Trento, Italy Monbonnot, [email protected] [email protected]

December 18, 2006

Page 2: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Goals of the tutorial

� Illustrate the role of ontology matching

� Provide an overview of basic matching techniques

� Demonstrate the use of basic matching techniquesin state of the art systems

� Motivate future research

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 2 / 55

Page 3: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 3 / 55

Page 4: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 4 / 55

Page 5: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching operation

Matching operation takes as input ontologies, each consisting of a set ofdiscrete entities (e.g., tables, XML elements, classes, properties) anddetermines as output the relationships (e.g., equivalence, subsumption)holding between these entities

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 5 / 55

Page 6: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two XML schemas

Electronics

Personal Computers

Microprocessors

PIDNameQuantityPrice

Accessories

Photo and Cameras

PIDNameQuantityPrice

Electronics

PC

PC board

IDBrand

AmountPrice

Cameras and Photo

Accessories

Digital Cameras

IDBrand

AmountPrice

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 6 / 55

Page 7: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two XML schemas

Electronics

Personal Computers

Microprocessors

PIDNameQuantityPrice

Accessories

Photo and Cameras

PIDNameQuantityPrice

Electronics

PC

PC board

IDBrand

AmountPrice

Cameras and Photo

Accessories

Digital Cameras

IDBrand

AmountPrice

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 6 / 55

Page 8: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 9: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

pricetitledoicreatortopic

author

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

isbnauthor

title

subject

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 10: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

pricetitledoicreatortopic

author

integer

string

uri

Person

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

isbnauthor

title

subject

Human

Writer

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 11: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

pricetitledoicreatortopic

author

integer

string

uri

Person

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

isbnauthor

title

subject

Human

Writer

Bertrand Russell: My life

Albert Camus: La chute

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 12: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

pricetitledoicreatortopic

author

integer

string

uri

Person

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

isbnauthor

title

subject

Human

Writer

Bertrand Russell: My life

Albert Camus: La chute

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 13: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Motivation: two ontologies

Product

DVD

Book

CD

pricetitledoicreatortopic

author

Person

Monograph

Essay

Litterary critics

Politics

Biography

Autobiography

Literature

isbnauthor

title

subject

Human

Writer

Bertrand Russell: My life

Albert Camus: La chute

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 7 / 55

Page 14: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Schema matching vs. ontology matching: differences

� Schemas often do not provide explicit semantics for their data� Relational schemas provide no generalization

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 8 / 55

Page 15: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Schema matching vs. ontology matching: differences

� Schemas often do not provide explicit semantics for their data� Relational schemas provide no generalization

� Ontologies are logical systems that constrain the meaning� Ontology definitions as a set of logical axioms

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 8 / 55

Page 16: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Schema matching vs. ontology matching: commonalities

� Schemas and ontologies provide a vocabulary of terms that describes adomain of interest

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 9 / 55

Page 17: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Schema matching vs. ontology matching: commonalities

� Schemas and ontologies provide a vocabulary of terms that describes adomain of interest

� Schemas and ontologies constrain the meaning of terms used in thevocabulary

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 9 / 55

Page 18: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Schema matching vs. ontology matching: commonalities

� Schemas and ontologies provide a vocabulary of terms that describes adomain of interest

� Schemas and ontologies constrain the meaning of terms used in thevocabulary

Techniques developed for both problems are of a mutual benefit

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 9 / 55

Page 19: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

Heterogeneity between ontologies can occur when

� different languages are used

� different terminologies are used

� different modeling is used

� . . .

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 10 / 55

Page 20: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

Heterogeneity between ontologies can occur when

� different languages are used

� different terminologies are used

� different modeling is used

� . . .

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 10 / 55

Page 21: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

� Reducing heterogeneity can be performed in 2 steps� Match, thereby determine the alignment

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 11 / 55

Page 22: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

� Reducing heterogeneity can be performed in 2 steps� Match, thereby determine the alignment� Process the alignment (merging, transforming, etc.)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 11 / 55

Page 23: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

� Reducing heterogeneity can be performed in 2 steps� Match, thereby determine the alignment� Process the alignment (merging, transforming, etc.)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 11 / 55

Page 24: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

� Reducing heterogeneity can be performed in 2 steps� Match, thereby determine the alignment� Process the alignment (merging, transforming, etc.)

� When do we match?� Design time� Run time

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 11 / 55

Page 25: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Scope

� Reducing heterogeneity can be performed in 2 steps� Match, thereby determine the alignment� Process the alignment (merging, transforming, etc.)

� When do we match?� Design time� Run time

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 11 / 55

Page 26: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Correspondence

Definition (Correspondence)

Given two ontologies O and O ′, a correspondence M between O and O ′ isa 5-uple: 〈id , e, e′,R, n〉 such that:

� id is a unique identifier of the correspondence

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 12 / 55

Page 27: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Correspondence

Definition (Correspondence)

Given two ontologies O and O ′, a correspondence M between O and O ′ isa 5-uple: 〈id , e, e′,R, n〉 such that:

� id is a unique identifier of the correspondence

� e and e′ are entities of O and O ′ (e.g., XML elements, classes)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 12 / 55

Page 28: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Correspondence

Definition (Correspondence)

Given two ontologies O and O ′, a correspondence M between O and O ′ isa 5-uple: 〈id , e, e′,R, n〉 such that:

� id is a unique identifier of the correspondence

� e and e′ are entities of O and O ′ (e.g., XML elements, classes)

� R is a relation (e.g., equivalence (=), more general (�), disjointness(⊥))

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 12 / 55

Page 29: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Correspondence

Definition (Correspondence)

Given two ontologies O and O ′, a correspondence M between O and O ′ isa 5-uple: 〈id , e, e′,R, n〉 such that:

� id is a unique identifier of the correspondence

� e and e′ are entities of O and O ′ (e.g., XML elements, classes)

� R is a relation (e.g., equivalence (=), more general (�), disjointness(⊥))

� n is a confidence measure in some mathematical structure (typically inthe [0,1] range)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 12 / 55

Page 30: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Alignment

Definition (Alignment)

Given two ontologies O and O ′, an alignment (A) between O and O ′:� is a set of correspondences on O and O ′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 13 / 55

Page 31: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Alignment

Definition (Alignment)

Given two ontologies O and O ′, an alignment (A) between O and O ′:� is a set of correspondences on O and O ′

� with some cardinality: 1-1, 1-*, etc.

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 13 / 55

Page 32: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Alignment

Definition (Alignment)

Given two ontologies O and O ′, an alignment (A) between O and O ′:� is a set of correspondences on O and O ′

� with some cardinality: 1-1, 1-*, etc.

� some additional metadata (method, date, properties, etc.)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 13 / 55

Page 33: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 34: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

matching

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 35: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

matching A′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 36: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

matching A′A

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 37: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

matching A′A

parameters

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 38: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching process

O

O ′

matching A′A

parameters

resources

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 14 / 55

Page 39: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application domains

� Traditional� Ontology evolution

� Schema integration

� Catalog integration

� Data integration

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 15 / 55

Page 40: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application domains

� Traditional� Ontology evolution

� Schema integration

� Catalog integration

� Data integration

� Emergent� P2P information sharing

� Agent communication

� Web service composition

� Query answering on the web

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 15 / 55

Page 41: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application: catalog integration (simplified)

DB

O O ′

DBPortal

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 16 / 55

Page 42: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application: catalog integration (simplified)

DB

O O ′

DBPortal

Matcher

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 16 / 55

Page 43: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application: catalog integration (simplified)

DB

O O ′

DBPortal

Matcher

A

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 16 / 55

Page 44: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application: catalog integration (simplified)

DB

O O ′

DBPortal

Matcher

A

Generator

Transformation

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 16 / 55

Page 45: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Application: catalog integration (simplified)

DB

O O ′

DBPortal

Matcher

A

Generator

Transformation

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 16 / 55

Page 46: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 47: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′Matcher

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 48: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′Matcher

A

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 49: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′Matcher

A

Generator

mediator

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 50: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′Matcher

A

Generator

mediatorquery query

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 51: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: P2P information sharing

peer1

O

peer2

O ′Matcher

A

Generator

mediatorquery query

answeranswer

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 17 / 55

Page 52: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Applications: summary

Application inst

ance

s

run

tim

e

auto

mat

ic

corr

ect

com

ple

te

oper

atio

n

Ontology evolution√ √ √

transformationSchema integration

√ √ √merging

Catalog integration√ √ √

data translationData integration

√ √ √query answering

P2P information sharing√

query answeringWeb service composition

√ √ √data mediation

Multi agent communication√ √ √ √

data translationQuery answering

√ √query reformulation

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 18 / 55

Page 53: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 19 / 55

Page 54: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching dimensions

� Input dimensions� Underlying models (e.g., XML, OWL)� Schema-level vs. Instance-level

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 20 / 55

Page 55: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching dimensions

� Input dimensions� Underlying models (e.g., XML, OWL)� Schema-level vs. Instance-level

� Process dimensions� Approximate vs. Exact� Interpretation of the input

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 20 / 55

Page 56: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching dimensions

� Input dimensions� Underlying models (e.g., XML, OWL)� Schema-level vs. Instance-level

� Process dimensions� Approximate vs. Exact� Interpretation of the input

� Output dimensions� Cardinality (e.g., 1-1, 1-*)� Equivalence vs. Diverse relations (e.g., subsumption)� Graded vs. Absolute confidence

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 20 / 55

Page 57: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Matching dimensions

� Input dimensions� Underlying models (e.g., XML, OWL)� Schema-level vs. Instance-level

� Process dimensions� Approximate vs. Exact� Interpretation of the input

� Output dimensions� Cardinality (e.g., 1-1, 1-*)� Equivalence vs. Diverse relations (e.g., subsumption)� Graded vs. Absolute confidence

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 20 / 55

Page 58: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Three layers

� The upper layer� Granularity of match� Interpretation of the input information

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 21 / 55

Page 59: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Three layers

� The upper layer� Granularity of match� Interpretation of the input information

� The middle layer represents classes of elementary (basic) matchingtechniques

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 21 / 55

Page 60: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Three layers

� The upper layer� Granularity of match� Interpretation of the input information

� The middle layer represents classes of elementary (basic) matchingtechniques

� The lower layer is based on the kind of input which is used byelementary matching techniques

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 21 / 55

Page 61: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Classification of schema-based techniques (simplified)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 22 / 55

Page 62: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Classification of schema-based techniques (simplified)

Element-level

Syntactic External

Structure-level

Syntactic External Semantics

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 22 / 55

Page 63: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Classification of schema-based techniques (simplified)

Element-level

Syntactic External

Structure-level

Syntactic External Semantics

String-basedname,

descriptionsimilarity

Language-based

tokenization,elimination

Linguisticresourceslexicons,thesauri

Constraint-basedtype

similarity,key

properties

Upper,domainspecificformal

ontologiesDOLCE,

FMA

Graph-based

graph ho-momorphism

children,leaves

Taxonomy-based

taxonomystructure

Repositoryof

structuresstructuremetadata

Model-based

SAT solvers,DL

reasoners

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 22 / 55

Page 64: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Classification of schema-based techniques (simplified)

Element-level

Syntactic External

Structure-level

Syntactic External Semantics

String-basedname,

descriptionsimilarity

Language-based

tokenization,elimination

Linguisticresourceslexicons,thesauri

Constraint-basedtype

similarity,key

properties

Upper,domainspecificformal

ontologiesDOLCE,

FMA

Graph-based

graph ho-momorphism

children,leaves

Taxonomy-based

taxonomystructure

Repositoryof

structuresstructuremetadata

Model-based

SAT solvers,DL

reasoners

Terminological

Linguistic

Structural

Internal Relational

Semantic

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 22 / 55

Page 65: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Classification of schema-based techniques (simplified)

Element-level

Syntactic External

Structure-level

Syntactic External Semantics

String-basedname,

descriptionsimilarity

Language-based

tokenization,elimination

Linguisticresourceslexicons,thesauri

Constraint-basedtype

similarity,key

properties

Upper,domainspecificformal

ontologiesDOLCE,

FMA

Graph-based

graph ho-momorphism

children,leaves

Taxonomy-based

taxonomystructure

Repositoryof

structuresstructuremetadata

Model-based

SAT solvers,DL

reasoners

Terminological

Linguistic

Structural

Internal Relational

Semantic

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 22 / 55

Page 66: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 23 / 55

Page 67: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: String-based

� Prefix� takes as input two strings and checks whether the first string starts with

the second one� net = network; but also hot = hotel

(e.g., COMA, SF, S-Match, OLA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 24 / 55

Page 68: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: String-based

� Prefix� takes as input two strings and checks whether the first string starts with

the second one� net = network; but also hot = hotel

� Suffix� takes as input two strings and checks whether the first string ends with

the second one� ID = PID; but also word = sword

(e.g., COMA, SF, S-Match, OLA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 24 / 55

Page 69: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: String-based

� Edit distance� takes as input two strings and calculates the number of edition

operations, (e.g., insertions, deletions, substitutions) of charactersrequired to transform one string into another, normalized by length ofthe maximum string

� EditDistance(NKN,Nikon) = 0.4

(e.g., S-Match, OLA, Anchor-Prompt)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 25 / 55

Page 70: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: Language-based

� Tokenization� parses names into tokens by recognizing punctuation, cases� Hands-Free Kits → 〈 hands, free, kits 〉

(e.g., COMA, Cupid, S-Match, OLA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 26 / 55

Page 71: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: Language-based

� Tokenization� parses names into tokens by recognizing punctuation, cases� Hands-Free Kits → 〈 hands, free, kits 〉

� Lemmatization� analyses morphologically tokens in order to find all their possible basic

forms� Kits → Kit

(e.g., COMA, Cupid, S-Match, OLA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 26 / 55

Page 72: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: Language-based

� Elimination� discards “empty” tokens that are articles, prepositions, conjunctions . . .� a, the, by, type of, their, from

(e.g., Cupid, S-Match)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 27 / 55

Page 73: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: Linguistic resources

� Sense-based: WordNet� A B if A is a hyponym or meronym of B

� Brand � Name

� A � B if A is a hypernym or holonym of B� Europe � Greece

� A = B if they are synonyms� Quantity = Amount

� A ⊥ B if they are antonyms or the siblings in the part of hierarchy� Microprocessors ⊥ PC Board

(e.g., Artemis, CtxMatch, S-Match)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 28 / 55

Page 74: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Element-level techniques: Linguistic resources

� Gloss-based: WordNet gloss comparison� The number of the same words occurring in both input glosses increases

the similarity value. The equivalence relation is returned if the resultingsimilarity value exceeds a given threshold

� Maltese dog is a breed of toy dogs having a long straight silky white coatAfghan hound is a tall graceful breed of hound with a long silky coat

(e.g., S-Match)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 29 / 55

Page 75: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Taxonomy-based

Ontologies are viewed as graph-like structures containing terms and theirinter-relationships.

� Bounded path matching� These take two paths with links between classes defined by the

hierarchical relations, compare terms and their positions along thesepaths, and identify similar terms

(e.g., Anchor-Prompt, NOM, QOM)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 30 / 55

Page 76: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Taxonomy-based

Ontologies are viewed as graph-like structures containing terms and theirinter-relationships.

� Bounded path matching� These take two paths with links between classes defined by the

hierarchical relations, compare terms and their positions along thesepaths, and identify similar terms

� Super(sub)-concepts rules� If super-concepts are the same, the actual concepts are similar to each

other

(e.g., Anchor-Prompt, NOM, QOM)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 30 / 55

Page 77: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Tree-based

� Children� Two non-leaf schema elements are structurally similar if their immediate

children sets are highly similar

(e.g., Cupid, COMA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 31 / 55

Page 78: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Tree-based

� Children� Two non-leaf schema elements are structurally similar if their immediate

children sets are highly similar

� Leaves� Two non-leaf schema elements are structurally similar if their leaf sets

are highly similar, even if their immediate children are not

(e.g., Cupid, COMA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 31 / 55

Page 79: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Tree-based

Electronics

Personal computers

Photos and cameras

PID

Name

Quantity

Price

Electronics

PC

Cameras and photos

Digital cameras

ID

Brand

Amount

Price

(e.g., Cupid, COMA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 32 / 55

Page 80: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Tree-based

Electronics

Personal computers

Photos and cameras

PID

Name

Quantity

Price

Electronics

PC

Cameras and photos

Digital cameras

ID

Brand

Amount

Price

(e.g., Cupid, COMA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 32 / 55

Page 81: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Tree-based

Electronics

Personal computers

Photos and cameras

PID

Name

Quantity

Price

Electronics

PC

Cameras and photos

Digital cameras

ID

Brand

Amount

Price

(e.g., Cupid, COMA)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 32 / 55

Page 82: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

� Propositional satisfiability (SAT)

Axioms→rel(context1, context2)

(e.g., CtxMatch, S-Match)Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 33 / 55

Page 83: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

� Propositional satisfiability (SAT)

Axioms→rel(context1, context2)

Electronics

Personal Computers

Microprocessors

PID

Electronics

PC

PC board

ID

(e.g., CtxMatch, S-Match)Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 33 / 55

Page 84: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

� Propositional satisfiability (SAT)

Axioms→rel(context1, context2)

Electronics

Personal Computers

Microprocessors

PID

Electronics

PC

PC board

ID

Axioms︷ ︸︸ ︷

(Electronics1 ↔ Electronics2) ∧ (Personal Computers1 ↔ PC2)→context1

︷ ︸︸ ︷

(Electronics1 ∧ Personal Computers1)↔context2

︷ ︸︸ ︷

(Electronics2 ∧ PC2)

(e.g., CtxMatch, S-Match)Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 33 / 55

Page 85: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

Description logics (DL)-based

micro-company = company

≤5 employeeSME = firm

≤10 associate

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 34 / 55

Page 86: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

Description logics (DL)-based

micro-company = company

≤5 employeeSME = firm

≤10 associate

=≥

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 34 / 55

Page 87: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

Description logics (DL)-based

micro-company = company

≤5 employeeSME = firm

≤10 associate

=≥

company = firm ; associate employee

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 34 / 55

Page 88: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Structure-level techniques: Model-based

Description logics (DL)-based

micro-company = company

≤5 employeeSME = firm

≤10 associate

=≥

company = firm ; associate employee

micro-company SME

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 34 / 55

Page 89: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 35 / 55

Page 90: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Sequential composition

O

O ′

A matching A′ matching′ A′′

parameters

resources

parameters ′

resources ′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 36 / 55

Page 91: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Parallel composition

O

O ′

A

matching A′

matching′ A′′

aggregation A′′′

resources ′

parameters ′

resources

parameters

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 37 / 55

Page 92: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Selecting the final alignment

� Ranking strategies� Thresholds� MaxDelta

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 38 / 55

Page 93: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Selecting the final alignment

� Ranking strategies� Thresholds� MaxDelta

� Cardinalities� 1-1, 1-*, *-*

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 38 / 55

Page 94: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Selecting the final alignment

� Ranking strategies� Thresholds� MaxDelta

� Cardinalities� 1-1, 1-*, *-*

� Optimization� stable marriage� maximal weight match

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 38 / 55

Page 95: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Selecting the final alignment

� Ranking strategies� Thresholds� MaxDelta

� Cardinalities� 1-1, 1-*, *-*

� Optimization� stable marriage� maximal weight match

� Directionality� O → O ′, O ′ → O (SmallLarge, LargeSmall)� O → O ′ and O ′ → O (Both)

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 38 / 55

Page 96: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 39 / 55

Page 97: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

State of the art systems

∼50 matching systems exist, . . . we consider some of them

� Cupid (U. of Washington, Microsoft Corporation and U. of Leipzig)

� Falcon-AO (China Southwest U.)

� OLA (INRIA Rhone-Alpes and U. de Montreal)

� S-Match (U. of Trento)

� . . .

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 40 / 55

Page 98: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Cupid

� Schema-based

� Computes similarity coefficients in the [0 1] range

� Performs linguistic and structure matching

� Sequential system

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 41 / 55

Page 99: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Cupid architecture

O

O ′

MLinguisticmatching M ′ Structure

matching M ′′ Weighting

M ′′′A′

thesauri

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 42 / 55

Page 100: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

OLA

� Schema- and Instance-based

� Computes dissimilarities + extracts alignments (equivalences in the[0 1] range)

� Based on terminological (including linguistic) and structural (internaland relational) distances

� Neither sequential nor parallel

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 43 / 55

Page 101: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

OLA architecture

O

O ′

A Msimilaritycompu-tation

M ′ A′

parameters

resources

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 44 / 55

Page 102: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Falcon-OA architecture

O

O ′

MLinguisticmatching M ′ Structure

matching M ′′ A′

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 45 / 55

Page 103: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

S-Match

� Schema-based

� Computes equivalence (=), more general (�), less general (),disjointness (⊥)

� Analyzes the meaning (concepts, not labels) which is codified in theelements and the structures of ontologies

� Sequential system with a composition at the element level

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 46 / 55

Page 104: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

S-Match architecture

O

O ′

Translator APre-

processingPTrees

Matchmanager A′

oraclessemanticmatchers

SATsolvers

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 47 / 55

Page 105: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Outline

Matching problem

Classification

Basic techniques

Matching process

Systems

Conclusions

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 48 / 55

Page 106: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Summary

� We have discussed the ontology matching problem and its applicationdomains

� We have provided classificatory elements for approaching ontologymatching techniques

� We have presented a number of basic matching techniques as well asdifferent strategies for building the matching process

� We have reviewed some existing matching systems

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 49 / 55

Page 107: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Uses of classification

� It provides a common conceptual basis, and hence, can be used forcomparing (analytically) different existing ontology matching systems

� It can help in designing a new matching system, or an elementarymatcher, taking advantages of state of the art solutions

� It can help in designing systematic benchmarks, e.g., by discardingfeatures one by one from ontologies, namely, what class of basictechniques deals with what feature

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 50 / 55

Page 108: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Challenges

� Missing background knowledge

� Performance of systems

� Interactive approaches

� Explanations of matching

� Social aspects of ontology matching

� Large-scale evaluation

� Infrastructures

� . . .

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 51 / 55

Page 109: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Acknowledgments

We thank all the participants of the Heterogeneity workpackage of theKnowledge Web network of excellence

In particular, we are grateful to T.-L. Bach, J. Barrasa, P. Bouquet, J. Bo,R. Dieng-Kuntz, M. Ehrig, E. Franconi, R. Garcıa Castro, F. Giunchiglia, M.Hauswirth, P. Hitzler, M. Jarrar, M. Krotzsch, R. Lara, D. Maynard, A.Napoli, L. Serafini, G. Stamou, H. Stuckenschmidt, Y. Sure, S. Tessaris, P.Traverso, P. Valchev, S. van Acker, M. Yatskevich, and I. Zaihrayeu for theirsupport and insightful comments

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 52 / 55

Page 110: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

...coming up soon

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 53 / 55

Page 111: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Thank you

for your attention and interest!

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 54 / 55

Page 112: Pavel Shvaiko J´erˆome Euzenat - UniTrentop2p/matching/SWAP06-OMtutorial.pdfElectronics PC PC board ID Brand Amount Price Cameras and Photo Accessories Digital Cameras ID Brand Amount

Matching problem Classification Basic techniques Matching process Systems Conclusions

Questions?

[email protected]@inrialpes.fr

http://www.ontologymatching.org

Tutorial on Ontology Matching at SWAP-2006, Pisa, Italy 55 / 55


Recommended