KRDB2010-GoodRelations

Post on 17-Dec-2014

2,052 views 0 download

description

Slides from my talk at the 3rd KRDB school on Trends in the Web of Data, September 18, Brixen-Bressanone, Italy. http://www.inf.unibz.it/krdb/school/2010/

transcript

The GoodRelations Ontology for E-Commerce

3rd KRDB School on

Trends in the Web of Data (KRDB-2010)

Brixen-Bressanone, Italy,

17-18 September 2010

Prof. Dr. Martin Hepp Professur für Allgemeine BWL, insbesondere E-Business

Part 1: Why bother?

18.09.2010 2

6. Upcoming Research Challenges

Part 1: Why bother?

18.09.2010 4

Matchmaking in Market Economies

18.09.2010 5

Macroeconomic Impact

18.09.2010 6

John Joseph Wallis and Douglas C. North:

Measuring the Transaction Sector in the

American Economy, 1870 – 1970

(1986)

Transaction Costs:

> 50 % of the

US GDP (1970)

Key Driver of Search Costs: Specificity

How much you loose when you can‘t

use a good for what it was designed.

Growth in Specificity

1920: 5168 Types of Goods

8 18.09.2010

Examples 2010

9 18.09.2010

Examples 2010

10 18.09.2010

Examples 2010

11 18.09.2010

Specificity Increases the Search Space

12 18.09.2010

WWW: Dramatic Reduction of Search Effort

Lower search costs per search than ever before in history.

18.09.2010 13

1993 2010

But ….

The WWW: A Giant Data Shredder

18.09.2010 15

Source: Structured Data

Recipient: Unstructured Text

What is Linked Data Linked

18.09.2010 16

Susi Martin

loves

1 2 3 4

What is Special About E-Commerce Data?

18.09.2010 17

$$$

RDBMS

1

2

3

4

GoodRelations: A Global Schema for Commerce Data on the Web

18

Product Model Master Data Shop

Offerings Auctions Spare Parts & Consumables

Warranty

Delivery Payment

Retailers Manufacturers

Arbitrary Query

Extraction and Reuse

18.09.2010

On the Shoulders of Giants

19

A Unified View of Commerce Data on the Web

18.09.2010

GoodRelations Deployment: Small Data Packets Inside Your Page (RDFa)

20 18.09.2010

Valuable Types of Links: Product - Product Model

18.09.2010 21

Ford T Data-sheet

Photo

cre

dits: F

lickr.

com

, availa

ble

u

nder

CC

BY

2.0

by b

sabarn

ow

l

gr:hasMakeAndModel

Often via strong, non-URI identifiers like EAN/UPC

Valuable Types of Links: Offer – Store(s)

18.09.2010 22

XYZ for $ 99

gr:availableAtOrFrom

Valuable Types of Links: Company – Store(s)

18.09.2010 23

gr:hasPOS

Part 2: Ontology Engineering Revisited

18.09.2010 24

Immanuel Kant on Ontologies & Linked Data

„Thoughts without content are empty,

intuitions without concepts are blind.“ Critique of Pure Reason (1781)

1. Ontologies without data are useless

2. Data without ontologies is blind

18.09.2010 25

In other words: Schemas Matter

18.09.2010 26

Otherwise your data is just landfill… Photo

cre

dits: F

lickr.

com

, availa

ble

under

CC

BY

2.0

by d

norm

an

Albert Einstein on Schema Design

"Make everything as simple as possible, but

not simpler.“

Albert Einstein

27 18.09.2010

Data, Standards, Ontologies

28 18.09.2010

Subtle Distinctions Foster Data Reuse

• Product Offer

– „You can buy or lease my house“

• Store Business entity

– „How many Tesco stores are in London?“

• Product Product Model

– „How many digital cameras by Canon are

listed on eBay“?

18.09.2010 29

Sophisticated Category Systems:

Foundation for Intelligence and Judgment

18.09.2010 30

Ontology Economics

18.09.2010 31 Hepp,

Mart

in:

Possib

le O

nto

logie

s:

How

Realit

y C

onstr

ain

s t

he

Develo

pm

ent

of

Rele

vant

Onto

logie

s,

in:

IEE

E I

nte

rnet

Com

puting,

V

ol. 1

1, N

o. 1, pp. 90-9

6, Jan-F

eb 2

007

Incremental Granularity & Lexical Carry-Over

18.09.2010 32

Ontology Engineering

• Generic model

– Stable distinctions

– Easy to populate

– Incremental Enrichment

• Good textual elements

• Good documentation

• Tool support for the entire tool chain

18.09.2010 33

Part 3: GoodRelations Overview

18.09.2010 34

Basic Structure of Offers: Agent-Promise-Object Principle

35

Agent 1 Object or

Happening Promise

Agent 2

Compensation Transfer of Rights

The Minimal Scenario

• Scope

– Business entity

– Points-of-sale

– Opening hours

– Payment options

• Suitable for

– Every business

– E-commerce and brick-and-mortar

36

The Simple Scenario

• Scope: Minimal scenario plus

– Range of products or services

– Business functions

– Eligible regions or customer types

– Delivery options

• Suitable for

– Any business: E-Commerce and brick-and-mortar

– Specific products or services 37

The Comprehensive Scenario

• Scope: Simple scenario plus

– Individual products or services

– Product features

– Pricing, rebates, etc.

– Availability

• Suitable for

– Any business: E-commerce and brick-and-mortar

– Specific products or services

– Structured product database

38

Product Model Data Scenario

• Scope

– Individual product models

– Quantitative and qualitative features

• Suitable for

– Manufacturers of commodities

39

Developer Resources, Data, Tools

http://purl.org/goodrelations/

18.09.2010 40

The Minimal Scenario (UML & RDF/N3)

18.09.2010 41

The Simple Scenario: UML

18.09.2010 42

The Simple Scenario: RDF/N3 - Details

18.09.2010 43

Alternative Ways of Describing the Product or Service

• Omit it

– Minimal Example: Describe just your business & store

• gr:ProductOrServiceSomeInstancesPlaceholder + rdfs:comment – Textual

• Product or service ontology

– eclassOWL

– freeClass

• DBPedia URIs

• Turn proprietary hierarchy into pseudo-ontology

18.09.2010 44

Impact and Success

• One of the few vocabularies implemented by major businesses out of their own budgets

• BestBuy, O’Reilly, Overstock.com,…

• Ca. 16 % of all triples as of now

• Supported by Yahoo

• Bing, Google may join

18.09.2010 45

Yahoo Enhanced by SearchMonkey

46 18.09.2010

Incredible Success

47 18.09.2010

GoodRelations #2 of all Web Ontologies

48

…and this does not yet include the > 10 Mio. offers from Amazon and eBay!

18.09.2010

GoodRelations #2 of all Web Ontologies

49 18.09.2010

GoodRelations Design Principles

• Keep simple things simple and make complex things possible

• Cater for LOD and OWL DL worlds

• Academically sound

• Industry-strength engineering

• Practically relevant

50

Lightweight Web of Data

LOD RDF + a little bit

Heavyweight Web of Data

OWL DL

18.09.2010

Syntax-neutral

• RDF/XML, Turtle

• RDFa

• OData

• GData

• Microdata

• dataRSS

18.09.2010 51

http://www.ebusiness-unibw.org/wiki/Syntaxes4GoodRelations

Part 4: Publishing GoodRelations Data

18.09.2010 52

RDFa in Snippet Style

http://www.ebusiness-unibw.org/tools/rdf2rdfa/

18.09.2010 53

Publishing GoodRelations Data

• RDFa in Snippet Style

• sitemap.xml with proper lastmod attribute

• robots.txt

18.09.2010 54

Microdata in Snippet Style

http://www.ebusiness-unibw.org/tools/rdf2microdata/

18.09.2010 55

Part 5: GoodRelations Advanced Topics

18.09.2010 56

GoodRelations-compliant Domain Ontologies

18.09.2010 57

Meta-Model for Quantitative Data

18.09.2010 58

Both Sides Can Help Build a Bridge

59 18.09.2010

gr:seeks property

Ownership & Self Exposure

• gr:owns property

18.09.2010 60

6. Upcoming Research Challenges

Research Challenges

(1) Natural Language Processing

(2) Ontology Mapping and Alignment

(3) Collaborative Ontology Engineering

(4) Crawling, Update, Federation

(5) Matchmaking & Query Learning

(6) Applications and Interaction Patterns

(7) Storage and Reasoning

18.09.2010 62

Natural Language Processing

18.09.2010 63

Ontology Mapping and Alignment

18.09.2010 64

Collaborative Ontology Engineering

• OpenVocab

• Knoodl

• Protégé Collaboration Support

• OntoVerse

• MyOntology

• Twine Ontology Editor

• Neologism

• MoKi

18.09.2010 65

http://www.ebusiness-unibw.org/wiki/Own_GoodRelations_Vocabularies

Crawling, Update, Federation

(1) Shop data changes every 1..24 h

(2) Can you harvest the data from 1,000,000 shop sites just via

– Sitemap.xml with proper lastmod attribute

– RDFa inside the pages

18.09.2010 66

Matchmaking & Query Learning

18.09.2010 67

Applications and Interaction Patterns

18.09.2010 68

Storage and Reasoning

• RDFS-style reasoning

• Non-standard inference rules

• Massive scale

– 1 Mio shops etc.

– 1 k – 100 k items,let’s say 10 k

– 100 triples per item

– 1 Mio * 10 k * 100 = 1,000,000,000,000

– 1 trillion triples 18.09.2010 69

Storage and Reasoning

• Hybrid queries

18.09.2010 70

Data Quality Management

18.09.2010 71

http://www.ebusiness-unibw.org/tools/goodrelations-validator/

Thank you!

http://purl.org/goodrelations/

Prof. Dr. Martin Hepp

Chair of General Management and E-Business Universitaet der Bundeswehr Muenchen

Werner-Heisenberg-Weg 39 D-85579 Neubiberg, Germany

Phone: +49 89 6004-4217 Fax: +49 89 6004-4620

http://www.unibw.de/ebusiness/

http://purl.org/goodrelations/

mhepp@computer.org

72 18.09.2010