+ All Categories
Home > Technology > The Semantic Web #1 - Overview

The Semantic Web #1 - Overview

Date post: 19-Jan-2015
Category:
Upload: myungjin-lee
View: 1,124 times
Download: 0 times
Share this document with a friend
Description:
This is a lecture note #1 for my class of Graduate School of Yonsei University, Korea. It describes overview of the Semantic Web, its recommendations, and case studies.
48
Linked Data & Semantic Web Technology The Semantic Web Part 1. Overview of the Semantic Web Dr. Myungjin Lee
Transcript
Page 1: The Semantic Web #1 - Overview

Linked Data &Semantic WebTechnology

The Semantic WebPart 1. Overview of the Semantic Web

Dr. Myungjin Lee

Page 2: The Semantic Web #1 - Overview

2Linked Data & Semantic Web Technology

Overview of the Semantic Web

• What is the Semantic Web?

• Semantic Web Technologies

• Semantic Web Case Studies

Page 3: The Semantic Web #1 - Overview

3Linked Data & Semantic Web Technology

Overview of the Semantic Web

• What is the Semantic Web?

• Semantic Web Technologies

• Semantic Web Case Studies

Page 4: The Semantic Web #1 - Overview

4Linked Data & Semantic Web Technology

Vision

• Knowledge Navigator (1987)– http://www.youtube.com/watch?v=QRH8eimU_20

• IBM Watson– http://www.youtube.com/watch?v=FC3IryWr4c8– http://www.youtube.com/watch?v=lI-M7O_bRNg

Page 5: The Semantic Web #1 - Overview

5Linked Data & Semantic Web Technology

Internet

• a global system of interconnected computer networks• a network of networks • Network

– a collection of computers interconnected by communication channels

Network

Internet

Page 6: The Semantic Web #1 - Overview

6Linked Data & Semantic Web Technology

Internet Services before the Web

• E-Mail Communication: SMTP, POP3• File Transfer: FTP• Remote Control: Telnet

• Problem of these services:– Information access requires expert knowledge– Information access is expensive...– Information retrieval is very expensive...

Page 8: The Semantic Web #1 - Overview

8Linked Data & Semantic Web Technology

Characteristics of Web

• Hyperlink and Multimedia

• Advantages:– No expert knowledge required– Simple information access– Information retrieval via search engines

webpage

webpage

webpage

hyperlink

hyperlink

hyperlink

Page 9: The Semantic Web #1 - Overview

9Linked Data & Semantic Web Technology

Web Architecture

Document

IdentifierProtocol

URIHTTP

HTML

an application protocol for distrib-uted, collaborative, hypermedia in-formation systems

a string of characters used to identify a name or a resource

the main markup language for dis-playing web pages and other informa-tion that can be displayed in an web browser

Page 10: The Semantic Web #1 - Overview

10Linked Data & Semantic Web Technology

Page 11: The Semantic Web #1 - Overview

11Linked Data & Semantic Web Technology

Problem of HTML

• HTML describes– how information is presented, displayed, and linked for human read-

ers

• There is no meaning of information.

Cars.comAutoTrader.com

sameinformation,

but …

Page 13: The Semantic Web #1 - Overview

13Linked Data & Semantic Web Technology

What we want on the Web?

• to process the meaning of information automatically• to relate and integrate heterogeneous data• to deduce implicit information from existing information in an

automated way

The Web was designed as an information space, with the goal that it should be use-ful not only for human-human communi-cation, but also that machines would be able to participate and help.

Page 15: The Semantic Web #1 - Overview

15Linked Data & Semantic Web Technology

What is the Semantic Web

• “The Semantic Web provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries.” – W3C

• “The first step is putting data on the Web in a form that machines can naturally understand, or converting it to that form. This cre-ates what I call a Semantic Web -- a web of data that can be processed directly or indirectly by machines.” – Tim Berners-Lee

• “The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation.” – Tim Berners-Lee

Page 16: The Semantic Web #1 - Overview

16Linked Data & Semantic Web Technology

Overview of the Semantic Web

• What is the Semantic Web?

• Semantic Web Technologies

• Semantic Web Case Studies

Page 17: The Semantic Web #1 - Overview

17Linked Data & Semantic Web Technology

Semantic Web Layer Cake

an elemental syntaxfor content structurewithin documents

a simple languagefor expressing data models,

which refer to objects ("resources")and their relationships

more vocabularyfor describing properties and classes

a vocabulary for describingproperties and classes

of RDF-based resources

a protocol and query languagefor semantic web data sources

to exchange rulesbetween many "rules languages"

a string of characters used to identify a name or a resource

Page 18: The Semantic Web #1 - Overview

18Linked Data & Semantic Web Technology

URI (Uniform Resource Identifier)

• a string of characters used to identify a name or a resource

URN(Uniform Resource Name)

URL(Uniform Resource Locator)+

URI

urn:isbn:0451450523urn:isan:0000-0000-9E59-0000-O-0000-0000-2urn:issn:0167-6423

ftp://[email protected]://en.example.org/wiki/url

Page 19: The Semantic Web #1 - Overview

19Linked Data & Semantic Web Technology

RDF (Resource Description Framework)

• to be used as a general method for conceptual description or modeling of information that is implemented in web resources, using a variety of syntax formats

http://www.cars.com/car#A6

http://www.cars.com/car#Car

http://www.cars.com/car#Gasoline

http://www.cars.com/car#GDI

http://www.cars.com/car#Auto_8-Speedhttp://www.cars.com/car#Sedan

4

http://www.cars.com/car#AWD

115”

http://www.w3.org/1999/02/22-rdf-syntax-ns#type

http://www.cars.com/car#transmission

http://www.cars.com/car#wheelbase

http://www.cars.com/car#engine

http://www.cars.com/car#fuel

http://www.cars.com/car#drivetrain

http://www.cars.com/car#doors

http://www.cars.com/car#body_style

Page 20: The Semantic Web #1 - Overview

20Linked Data & Semantic Web Technology

XML (Extensible Markup Language)

• a markup language that defines a set of rules for encoding doc-uments in a format that is both human-readable and machine-readable

<?xml version="1.0" encoding="utf-8"?><note>

<to>Tove</to><from>Jani</from><heading>Reminder</heading><body>Don't forget me this weekend!</body>

</note>

Page 21: The Semantic Web #1 - Overview

21Linked Data & Semantic Web Technology

RDFS (RDF Schema)

• a set of classes with certain properties using the RDF extensible knowledge representation language, providing basic elements for the description of ontologies, otherwise called RDF vocabu-laries, intended to structure RDF resources

car:Car

car:Vehicle

rdfs:subClassOf

rdf:Property

car:body_stylerdfs:domain

rdfs:range

rdfs:Class

rdf:type

rdf:type

car:Style

rdf:type

car:A6

rdf:type

car:Sedanrdf:typecar:body_style

TBox - terminological component

ABox - assertion component

Page 22: The Semantic Web #1 - Overview

22Linked Data & Semantic Web Technology

Ontology

• knowledge representation as a set of concepts within a domain, and the relationships between those concepts formal, explicit specification of a shared conceptualisation

"Ontologies are often equated with taxonomic hierarchies of classes, class definitions, and the subsumption relation, but ontolo-gies need not be limited to these forms. Ontologies are also not lim-ited to conservative definitions — that is, definitions in the tradi-tional logic sense that only introduce terminology and do not add any knowledge about the world. To specify a conceptualization, one needs to state axioms that do constrain the possible interpreta-tions for the defined terms."

Page 23: The Semantic Web #1 - Overview

23Linked Data & Semantic Web Technology

OWL (Web Ontology Language)

• a family of knowledge representation languages for authoring ontologies on the Semantic Web

Page 24: The Semantic Web #1 - Overview

24Linked Data & Semantic Web Technology

Semantics of RDF, RDFS, and OWL

• Each language for the Semantic Web provides a formal mean-ing based on a model-theoretic semantics in its abstract syn-tax.

<x, y> is in IEXT(I(rdfs:subClassOf))

if and only if x and y are in IC

and ICEXT(x) is a subset of ICEXT(y)

car:Car

car:Vehicle

rdfs:subClassOf

car:A6

rdf:type

rdf:type

Page 25: The Semantic Web #1 - Overview

25Linked Data & Semantic Web Technology

Language for the Rule Description

• SWRL (Semantic Web Rule Language) is a proposal for a Se-mantic Web rules-language, combining sublanguages of the OWL Web Ontology Language (OWL DL and Lite) with those of the Rule Markup Language (Unary/Binary Datalog).

<ruleml:imp> <ruleml:_rlab ruleml:href="#example1"/> <ruleml:_body> <swrlx:individualPropertyAtom swrlx:property="hasParent"> <ruleml:var>x1</ruleml:var> <ruleml:var>x2</ruleml:var> </swrlx:individualPropertyAtom> <swrlx:individualPropertyAtom swrlx:property="hasBrother"> <ruleml:var>x2</ruleml:var> <ruleml:var>x3</ruleml:var> </swrlx:individualPropertyAtom> </ruleml:_body> <ruleml:_head> <swrlx:individualPropertyAtom swrlx:property="hasUncle"> <ruleml:var>x1</ruleml:var> <ruleml:var>x3</ruleml:var> </swrlx:individualPropertyAtom> </ruleml:_head> </ruleml:imp>

hasParent(?x1,?x2) hasBrother(?x2,?x3) hasUncle(?x1,?x3)∧ ⇒

Page 26: The Semantic Web #1 - Overview

26Linked Data & Semantic Web Technology

Inference

• being able to derive new data from data that you already know

hasParenthasParent

hasWife

if hasParent(?x, ?y)

hasParent(?x, ?z)

Man(?y)

Woman(?z)

then hasWife(?y, ?z)

Page 27: The Semantic Web #1 - Overview

27Linked Data & Semantic Web Technology

SPARQL

• Why do we need a query language for RDF?– Why de we need a query language for RDB?– to get to the knowledge from RDF

• SPARQL Protocol and RDF Query Language– to retrieve and manipulate data stored in RDF format– to use SPARQL via HTTP

PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?name ?emailWHERE { ?person a foaf:Person. ?person foaf:name ?name. ?person foaf:mbox ?email.}

RDF Knowledge Base

?name ?email

Myungjin Lee [email protected]

Gildong Hong [email protected]

Grace Byun [email protected]

Page 28: The Semantic Web #1 - Overview

28Linked Data & Semantic Web Technology

Overview of the Semantic Web

• What is the Semantic Web?

• Semantic Web Technologies

• Semantic Web Case Studies

Page 30: The Semantic Web #1 - Overview

30Linked Data & Semantic Web Technology

Naver Semantic Movie Search

Page 31: The Semantic Web #1 - Overview

31Linked Data & Semantic Web Technology

Apple’s Siri

• an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS

• a natural language user interface to answer questions, make recommen-dations, and perform actions by delegating requests to a set of Web ser-vices

Siri’s knowledge is represented in a unified modeling system that com-bines ontologies, inference net-works, pattern matching agents, dic-tionaries, and dialog models. ... Siri isn’t a source of data, so it doesn’t expose data using Semantic Web standards.

Page 32: The Semantic Web #1 - Overview

32Linked Data & Semantic Web Technology

Google’s Knowledge Graph

• a knowledge base used by Google to enhance its search engine's search results with semantic-search information gathered from a wide variety of sources

• over 570 million objects and more than 18 billion facts about and relationships between these different objects

They decided to call it “Knowledge Graph”.

Page 33: The Semantic Web #1 - Overview

33Linked Data & Semantic Web Technology

Facebook’s Open Graph Protocol

• simple protocol for enabling any web page to become a rich object in a social graph

cook

http://example.com/cookie.html

Social Object

http://www.facebook.com/mjinlee

http://example.com/cookie.html

http://samples.ogp.me/Recipe

Stuffed Cookiesme:cook rdf:type

http://example.com/zhen/cookie.jpg

The Turducken of Cookies

http://example.com/zhen/cookie.html

og:title

og:image

og:descriptionog:url

Page 34: The Semantic Web #1 - Overview

34Linked Data & Semantic Web Technology

Twitter Annotations

• to add one or more annotations that represent structured meta-data about the tweet

http://r.github.com/annotationsformatter/

First element is a type.Every Annotations has a type.

Type maps to attribute and value pair.

Second element is one or more attribute names with values.

Page 35: The Semantic Web #1 - Overview

35Linked Data & Semantic Web Technology

Linking Open Data

• a method of publishing structured data to share information in a way that can be read automatically by computers based on standard Web technologies such as HTTP and URIs

Page 36: The Semantic Web #1 - Overview

36Linked Data & Semantic Web Technology

The Linking Open Data cloud diagram

Page 37: The Semantic Web #1 - Overview

37Linked Data & Semantic Web Technology

Media

User Generated Content

Publications

Government

Geographic

Cross-Domain

Life Sciences

Domain Number of datasets Triples (Out-)Links

Media 25 18,4185,2061 5044,0705

Geographic 31 61,4553,2484 3581,2328

Government 49 133,1500,9400 1934,3519

Publications 87 29,5072,0693 1,3992,5218

Cross-domain 41 41,8463,5715 6318,3065

Life Sciences 41 30,3633,6004 1,9184,4090

User-generated Content 20 1,3412,7413 344,9143

Total 295 316,3421,3770 5,0399,8829

Page 38: The Semantic Web #1 - Overview

38Linked Data & Semantic Web Technology

DBPedia

• a project aiming to extract structured content from the informa-tion created as part of the Wikipedia project using the Resource Description Framework (RDF) to represent the extracted infor-mation

• more than 3.64 million things, out of which 1.83 million are classified in a consistent ontology

• 2,724,000 links to images and 6,300,000 links to external web pages

• over 1 billion pieces of information (RDF triples)

Page 39: The Semantic Web #1 - Overview

39Linked Data & Semantic Web Technology

DBPedia

Page 40: The Semantic Web #1 - Overview

40Linked Data & Semantic Web Technology

Linked Data on BBC

Data from Wikipedia

Data from MusicBrainz

Page 41: The Semantic Web #1 - Overview

41Linked Data & Semantic Web Technology

Best Buy with GoodRalations

<div class="vcard" typeof="gr:LocationOfSalesOrServiceProvisioning" about="#store_1796"><div class="hours" rel="gr:hasOpeningHoursSpecification"><li class="day0" typeof="gr:OpeningHoursSpecification" about="#storehours_sun"><span rel="gr:hasOpeningHoursDayOfWeek" resource="http://purl.org/goodrelations/v1#Sunday" class="day"><span property="gr:opens" datatype="xsd:time" content="11:00:00" class="open">...

Page 42: The Semantic Web #1 - Overview

42Linked Data & Semantic Web Technology

• By “open”, “open” data is free for anyone to use, re-use and re-distribute.

• By “government data” we mean data and information produced or commissioned by government or govern-ment controlled entities.

Open Government Data

Open

GovData

OpenData

OpenGov

DataGov

OpenGovData

Page 43: The Semantic Web #1 - Overview

43Linked Data & Semantic Web Technology

Data.gov (the United States Govern-ment)

Page 44: The Semantic Web #1 - Overview

44Linked Data & Semantic Web Technology

Data.gov.uk (HM Government)

Page 45: The Semantic Web #1 - Overview

45Linked Data & Semantic Web Technology

Data-Gov Wiki

• a project for investigating open government datasets using se-mantic web technologies– 417 RDFlized datasets covering the content of 703 out of 5762 datasets

with 6.46 billion RDF triples.– additional RDF-ized datasets including 35 Non-Data.gov Datasets with

0.9 billion more RDF triples.

• http://data-gov.tw.rpi.edu/wiki/The_Data-gov_Wiki

Page 46: The Semantic Web #1 - Overview

46Linked Data & Semantic Web Technology

KDATA (Linked Data for Korea)Domain Triples

국가코드 3,899

엔터테인먼트 44,278

행정구역 2,969

초중고등학교 126,469

교육청 1,130

대학교 2,833

사회적 기업 5,539

서울시 개방 화장실 47,340

야구선수 및 팀 228,872

지하철역 4,450

역사 5,392

행정데이터표준용어 109,101

한옥마을 1,155

공공 WiFi 설치정보 1,671

KDATA 분류용어 808

전통시장 4,535

국립공원 10,605

문화재 80,156

공공체육시설 49,799

생물분류 3,256

문화시설 9,418

공원정보 및 프로그램 2,429

가격안정모범업소 16,212

가격안정모범업소 상품목록 14,300

공공시설물 인증제품 6,931

제설함 위치정보 39,218

야생동식물정보 115,099

야생동식물 출현정보 139,608

합계 1,077,472

Page 47: The Semantic Web #1 - Overview

47Linked Data & Semantic Web Technology

References• http://en.wikipedia.org/wiki/Internet• http://en.wikipedia.org/wiki/Computer_network• http://en.wikipedia.org/wiki/World_Wide_Web• http://www.slideshare.net/lysander07/openhpi-11• http://en.wikipedia.org/wiki/Html• http://www.google.com/insidesearch/howsearchworks/thestory/• http://www.go-gulf.com/blog/60-seconds/• http://www.slideshare.net/lysander07/openhpi-15• http://www.w3.org/DesignIssues/Semantic.html• http://en.wikipedia.org/wiki/Semantic_web• http://www.slideshare.net/lysander07/openhpi-13• http://www.w3.org/2001/sw/• Tim Berners-Lee, James Hendler, and Ora Lassila, "The Semantic Web", Scientific American Magazine, 2001.• http://www.w3.org/2007/Talks/0130-sb-W3CTechSemWeb/#(24)• http://www.slideshare.net/onlyjiny/semantic-web-13288556• http://www.slideshare.net/onlyjiny/linked-open-government-data-15708234• http://www.slideshare.net/onlyjiny/linkeddata• http://www.slideshare.net/sonagi/ss-16734202• http://www.slideshare.net/lysander07/13-semantic-web-technologies-linked-data-semantic-search• http://kdata.kr/index.jsp• http://linkeddata.org/• http://lod-cloud.net/

Page 48: The Semantic Web #1 - Overview

48Linked Data & Semantic Web Technology

Dr. Myungjin Lee

e-Mail : [email protected] : http://twitter.com/MyungjinLee

Facebook : http://www.facebook.com/mjinlee

SlideShare : http://www.slideshare.net/onlyjiny/

Thanks foryour attention.


Recommended