+ All Categories
Home > Software > KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in...

KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in...

Date post: 10-Feb-2017
Category:
Upload: nishita-jaykumar
View: 600 times
Download: 2 times
Share this document with a friend
26
KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science WWW - LDOW 2016, Canada Nishita Jaykumar, Pavankalyan Yallamelli, Vinh Nguyen, Sarasi Lalithsena, Krishnaprasad Thirunarayan, Amit Sheth Kno.e.sis, Wright State University Clare Paul *Air Force Research Laboratory, Wright-Patterson AFB
Transcript
Page 1: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in

Materials Science

WWW - LDOW 2016, Canada

Nishita Jaykumar, Pavankalyan Yallamelli, Vinh Nguyen,

Sarasi Lalithsena, Krishnaprasad Thirunarayan, Amit Sheth Kno.e.sis, Wright State University

Clare Paul*Air Force Research Laboratory, Wright-Patterson AFB

Page 2: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

2

• Collaboration with AFRL

Context for Research

ASM HNDBK

MILHNDBK-5

MILHNDBK-

17

(Standardized Vocabularies)

SKOSDublin Core

QUDTVAEM

Crowdsourcing from domain

experts

Consolidated

vocabulary

(MatVocab)

Page 3: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

3

Motivating ExampleFacts:

Name Definition Source

A-Basis The mechanical property value is the value above which …

ASM Handbook, Volume 21: Composites.

ABasis A statistically-based material property; a 95% lower…

Composite Materials Handbook - Volume 1. MIL-HDBK-17F-1F, 17 June 2002

A-Basis The lower of either a statistically calculated number…

Metallic Materials and Elements for Aerospace Vehicle Structures, MIL-HDBK-5J, 31 January 2003

Page 4: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

4

Facts:Name Definition Source

YoungsModulus The ratio of normal stress to corresponding …

ASM Handbook, Volume 21: Composites.

ModulusYoungs The ratio of change in stress to change …

MIL-HDBK-17

• Same term has multiple definitions that needs to be

represented with its provenance information, that

includes data such as, source, time etc.

Motivating Example

Page 5: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

5

Related Work

Auxiliary node approach

A-Basis Auxiliary node1

…A statistically-

based material …

P26v

P26s

P580q

P582q

• Properties represented in the wikidata model do not

correspond to RDF properties

• Lack of formal semantics

Page 6: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

6

• Extension to Mediawiki

• We use the Semantic Form extension of Semantic

Mediawiki for our task• Inability to represent metadata about the metadata

Semantic Mediawiki

http://www.slideshare.net/cool_uk/semantic-mediawiki-simple-tutorial

Representing entities and simple metadata

The '''United Kingdom''' is a

country located in

[[Located in::Europe]].

Page 7: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

7

• Adopted the Singleton Property method for capturing

triple metadata in SMW

• Importing legacy data with provenance in bulk using

the Singleton Property method

• Importing existing RDF datasets with provenance into

SMW for curation

Our Approach

Page 8: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

8

• Adopted the Singleton Property method for capturing

triple metadata in SMW

• Importing legacy data with provenance in bulk using

the Singleton Property method

• Importing existing RDF datasets with provenance into

SMW for curation

Our Approach

Page 9: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

Subject Predicate Object Source LicenseAutoclave hasDefinition “A closed vessel for

producing…”MIL-HDBK-17F-1F, 17

All rights reserved

Singleton PropertyFacts:

Subject Predicate Object

hasDefinition#1 rdf:sp hasDefinition

Autoclave hasDefinition#1 “A closed vessel for producing…”

hasDefinition#1 hasSource MIL-HDBK-17

hasDefinition#1 hasLicense All rights reserved

Singleton Property Translation

9"Don't like RDF reification?: making statements about statements using singleton property."Proceedings of the 23rd international conference on World wide web. ACM, 2014.

Page 10: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

10

• Formal semantics defined

• Scalable, e.g., to LOD

• Compatible with existing standards

– RDF, RDFS, SPARQL

• Can be used to capture multiple types of metadata

– Provenance, time, location

Why use Singleton Property?

Fu, Gang, et al. "Exposing Provenance Metadata Using Different RDF Models." arXiv preprint arXiv:1509.02822 (2015). Nguyen, Vinh, Olivier Bodenreider, and Amit Sheth. Hernández, Daniel, Aidan Hogan, and Markus Krötzsch. "Reifying RDF: What Works Well With Wikidata?." Proceedings of the 11th International Workshop on Scalable Semantic Web Knowledge Base Systems co-located with 14th International Semantic Web Conference (ISWC 2015), Bethlehem, PA, USA. 2015.

Page 11: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

Property

Category

TemplateDefinition

Field

Form DefinitionWiki Article

Has a default formAssign article to a category

Identifies

Edit with Form

Uses template

Has value

for

Data type Input type

Represents

Bundled in

Is a part of

Can use (by default)

Has a Has an

Belo

ngs t

o

Semantic Mediawiki Dataflow

Singleton TemplateDefinition

Regular TemplateDefinition

Page 12: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

12

Overall Architecture

Page 13: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

13

Singleton v/s Regular Template

Autoclave

Definition TextImage

Source

Rights

Autoclave

Definition Text

Image

SourceRights

SourceRights

Page 14: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

14

Regular Vs Singleton templates

Subject Predicate ObjectAutoclave hasDefinition#1 “A closed vessel…”

hasDefinition#1 singletonPropertyOf skos:definition

hasDefinition#1 source “ASM Handbook”hasDefinition#1 license “Reproduced by…”Autoclave hasImage#1 “Image.jpg”hasImage#1 singletonPropertyOf mv:image

Subject Predicate ObjectAutoclave hasDefinition “A closed vessel…”

Autoclave source “ASM Handbook”

Autoclave license “Reproduced by…”

Autoclave hasImage “Image.jpg”

Page 15: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

15

• Singleton Templates are our enhancement to SMW

• Implemented parser function to handle the Singleton

Template parsing using magicWord

• Registering the magicWord

• Parser function responsible to handle/process the

Singleton Templates and generate the RDF triples

Implementation

Page 16: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

16

• Properties of interest to domain experts: – Definition Text– Source– License– Creator– Abbreviation– Synonyms– Units– …..

Use Case in Materials Science

mv: is matvocab namespace

Page 17: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

17

Steps to create vocabularies

Page 18: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

Create the Form

Create the regular template and

singleton templates

Create relevant properties

Import existing vocabularies into SMW

Create Term pages

Material Manufacturing and Design From

11 Templates: 6 Singleton templates5 Regular templates

SKOS, QUDT, etc.

[[Property name:: property value]]

20

Page 19: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

MatVocab Form to create a term

21

Page 20: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

20

• Adopted the Singleton Property method for capturing

triple metadata in SMW

• Importing legacy data with provenance in bulk using

the Singleton Property method

• Importing existing RDF datasets with provenance into

SMW for curation

Our Approach

Page 21: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

21

• Data from Excel spreadsheet files – 3 vocabularies

• We map CSV data into the predefined 11 templates

• Some of the data mapped to regular templates

• Others mapped to singleton templates

Import legacy data with provenance

Structural Materials Vocabulary

Page 22: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

22

Statistics of the Use Case

Type SMW

1 Number of vocabularies imported 3

2 Total number of terms imported from ASM 1295

3 Total number of terms imported from MILHNDBK-5 19

4 Total number of terms imported from MILHNDBK-17 179

5 Total number of Singleton Templates created 6

6 Total number of Regular Templates created 5

7 Total number of pages created 1,685

Page 23: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

23

• Adopted the Singleton Property method for capturing

triple metadata in SMW

• Importing legacy data with provenance in bulk using

the Singleton Property method

• Importing existing RDF datasets with provenance into

SMW for curation

Our Approach

Page 24: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

24

• We developed an extension similar to the “CSV

Import”

• We experimented with the YAGO-SP2 dataset

• Ongoing work

• More details can be found in our paper

RDF import with provenance

Page 25: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

25

We enabled Semantic Mediawiki with the Singleton property model to facilitate creation

and curation of the Materials Science vocabulary with provenance metadata

To Conclude…

Page 26: KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in Materials Science

26

Further information, please visit http://wiki.knoesis.org/index.php/KnowledgeWiki


Recommended