Date post: | 10-Feb-2017 |
Category: |
Software |
Upload: | nishita-jaykumar |
View: | 600 times |
Download: | 2 times |
KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, with a Use Case in
Materials Science
WWW - LDOW 2016, Canada
Nishita Jaykumar, Pavankalyan Yallamelli, Vinh Nguyen,
Sarasi Lalithsena, Krishnaprasad Thirunarayan, Amit Sheth Kno.e.sis, Wright State University
Clare Paul*Air Force Research Laboratory, Wright-Patterson AFB
2
• Collaboration with AFRL
Context for Research
ASM HNDBK
MILHNDBK-5
MILHNDBK-
17
(Standardized Vocabularies)
SKOSDublin Core
QUDTVAEM
…
Crowdsourcing from domain
experts
Consolidated
vocabulary
(MatVocab)
3
Motivating ExampleFacts:
Name Definition Source
A-Basis The mechanical property value is the value above which …
ASM Handbook, Volume 21: Composites.
ABasis A statistically-based material property; a 95% lower…
Composite Materials Handbook - Volume 1. MIL-HDBK-17F-1F, 17 June 2002
A-Basis The lower of either a statistically calculated number…
Metallic Materials and Elements for Aerospace Vehicle Structures, MIL-HDBK-5J, 31 January 2003
4
Facts:Name Definition Source
YoungsModulus The ratio of normal stress to corresponding …
ASM Handbook, Volume 21: Composites.
ModulusYoungs The ratio of change in stress to change …
MIL-HDBK-17
• Same term has multiple definitions that needs to be
represented with its provenance information, that
includes data such as, source, time etc.
Motivating Example
5
Related Work
Auxiliary node approach
A-Basis Auxiliary node1
…A statistically-
based material …
P26v
P26s
P580q
P582q
…
• Properties represented in the wikidata model do not
correspond to RDF properties
• Lack of formal semantics
6
• Extension to Mediawiki
• We use the Semantic Form extension of Semantic
Mediawiki for our task• Inability to represent metadata about the metadata
Semantic Mediawiki
http://www.slideshare.net/cool_uk/semantic-mediawiki-simple-tutorial
Representing entities and simple metadata
The '''United Kingdom''' is a
country located in
[[Located in::Europe]].
7
• Adopted the Singleton Property method for capturing
triple metadata in SMW
• Importing legacy data with provenance in bulk using
the Singleton Property method
• Importing existing RDF datasets with provenance into
SMW for curation
Our Approach
8
• Adopted the Singleton Property method for capturing
triple metadata in SMW
• Importing legacy data with provenance in bulk using
the Singleton Property method
• Importing existing RDF datasets with provenance into
SMW for curation
Our Approach
Subject Predicate Object Source LicenseAutoclave hasDefinition “A closed vessel for
producing…”MIL-HDBK-17F-1F, 17
All rights reserved
Singleton PropertyFacts:
Subject Predicate Object
hasDefinition#1 rdf:sp hasDefinition
Autoclave hasDefinition#1 “A closed vessel for producing…”
hasDefinition#1 hasSource MIL-HDBK-17
hasDefinition#1 hasLicense All rights reserved
Singleton Property Translation
9"Don't like RDF reification?: making statements about statements using singleton property."Proceedings of the 23rd international conference on World wide web. ACM, 2014.
10
• Formal semantics defined
• Scalable, e.g., to LOD
• Compatible with existing standards
– RDF, RDFS, SPARQL
• Can be used to capture multiple types of metadata
– Provenance, time, location
Why use Singleton Property?
Fu, Gang, et al. "Exposing Provenance Metadata Using Different RDF Models." arXiv preprint arXiv:1509.02822 (2015). Nguyen, Vinh, Olivier Bodenreider, and Amit Sheth. Hernández, Daniel, Aidan Hogan, and Markus Krötzsch. "Reifying RDF: What Works Well With Wikidata?." Proceedings of the 11th International Workshop on Scalable Semantic Web Knowledge Base Systems co-located with 14th International Semantic Web Conference (ISWC 2015), Bethlehem, PA, USA. 2015.
Property
Category
TemplateDefinition
Field
Form DefinitionWiki Article
Has a default formAssign article to a category
Identifies
Edit with Form
Uses template
Has value
for
Data type Input type
Represents
Bundled in
Is a part of
Can use (by default)
Has a Has an
Belo
ngs t
o
Semantic Mediawiki Dataflow
Singleton TemplateDefinition
Regular TemplateDefinition
12
Overall Architecture
13
Singleton v/s Regular Template
Autoclave
Definition TextImage
Source
Rights
Autoclave
Definition Text
Image
SourceRights
SourceRights
14
Regular Vs Singleton templates
Subject Predicate ObjectAutoclave hasDefinition#1 “A closed vessel…”
hasDefinition#1 singletonPropertyOf skos:definition
hasDefinition#1 source “ASM Handbook”hasDefinition#1 license “Reproduced by…”Autoclave hasImage#1 “Image.jpg”hasImage#1 singletonPropertyOf mv:image
Subject Predicate ObjectAutoclave hasDefinition “A closed vessel…”
Autoclave source “ASM Handbook”
Autoclave license “Reproduced by…”
Autoclave hasImage “Image.jpg”
15
• Singleton Templates are our enhancement to SMW
• Implemented parser function to handle the Singleton
Template parsing using magicWord
• Registering the magicWord
• Parser function responsible to handle/process the
Singleton Templates and generate the RDF triples
Implementation
16
• Properties of interest to domain experts: – Definition Text– Source– License– Creator– Abbreviation– Synonyms– Units– …..
Use Case in Materials Science
mv: is matvocab namespace
17
Steps to create vocabularies
Create the Form
Create the regular template and
singleton templates
Create relevant properties
Import existing vocabularies into SMW
Create Term pages
Material Manufacturing and Design From
11 Templates: 6 Singleton templates5 Regular templates
SKOS, QUDT, etc.
[[Property name:: property value]]
20
MatVocab Form to create a term
21
20
• Adopted the Singleton Property method for capturing
triple metadata in SMW
• Importing legacy data with provenance in bulk using
the Singleton Property method
• Importing existing RDF datasets with provenance into
SMW for curation
Our Approach
21
• Data from Excel spreadsheet files – 3 vocabularies
• We map CSV data into the predefined 11 templates
• Some of the data mapped to regular templates
• Others mapped to singleton templates
Import legacy data with provenance
Structural Materials Vocabulary
22
Statistics of the Use Case
Type SMW
1 Number of vocabularies imported 3
2 Total number of terms imported from ASM 1295
3 Total number of terms imported from MILHNDBK-5 19
4 Total number of terms imported from MILHNDBK-17 179
5 Total number of Singleton Templates created 6
6 Total number of Regular Templates created 5
7 Total number of pages created 1,685
23
• Adopted the Singleton Property method for capturing
triple metadata in SMW
• Importing legacy data with provenance in bulk using
the Singleton Property method
• Importing existing RDF datasets with provenance into
SMW for curation
Our Approach
24
• We developed an extension similar to the “CSV
Import”
• We experimented with the YAGO-SP2 dataset
• Ongoing work
• More details can be found in our paper
RDF import with provenance
25
We enabled Semantic Mediawiki with the Singleton property model to facilitate creation
and curation of the Materials Science vocabulary with provenance metadata
To Conclude…
26
Further information, please visit http://wiki.knoesis.org/index.php/KnowledgeWiki