Mashing Up Linked Open Government Data
Li Ding
Tetherless World ConstellationRensselaer Polytechnic Institute
Nov 8, 2010
2
LOGD Application UI
TWC LOGDdata.gov.uk
dbpedia
W W W S
PA
RQ
LQ
uery
SP
AR
QL
Results
Format Data
JSO
N
XM
L
CS
V
Visualize DataQuery Data Integrate Data
LOGD Consumption Workflow
3
Query Data: SPARQL-based Data Mashup
LOGD data dump file 1 LOGD data dump file 2
SPARQL Endpoint
RDF Dataset 1
<record_x> <p1> obj
RDF Dataset 2
<record_y> <p2> obj
same value
load load
SELECT ?x ?yWHERE { graph <RDF Dataset 1> { ?x <p1> ?obj }
graph <RDF Dataset 2> { ?y <p2> ?obj } }
query
4
Format Data
google.visualization.Query.setResponse({version:0.6,status:'ok',reqId:0,table:{cols:[{id:'g',label:'g',type:'string'},{id:'number_of_triples',label:'number_of_triples',type:'number'}],rows:[{c:[{v:'http://data-gov.tw.rpi.edu/vocab/Dataset_353'},{v:23721}]},{c:[{v:'http://data-gov.tw.rpi.edu/vocab/Dataset_1623'},{v:446}]},{c:[{v:'http://data-gov.tw.rpi.edu/vocab/Dataset_1580'},{v:778}]},….
<sparql xmlns="http://www.w3.org/2005/sparql-results#" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.w3.org/2001/sw/DataAccess/rf1/result2.xsd"> <head> <variable name="g"/> <variable name="number_of_triples"/> </head> <results distinct="false" ordered="true"> <result> <binding name="g"><uri>http://data-gov.tw.rpi.edu/vocab/Dataset_353</uri></binding> <binding name="number_of_triples"><literal>23721</literal></binding> </result>…..
gNumberof_triples
http://data-gov.tw.rpi.edu/vocab/Dataset_353 23721
http://data-gov.tw.rpi.edu/vocab/Dataset_1623 446
http://data-gov.tw.rpi.edu/vocab/Dataset_1580 778
http://data-gov.tw.rpi.edu/vocab/Dataset_1356 188358
http://data-gov.tw.rpi.edu/vocab/Dataset_1187 202
SPARQL/XML
JSON (Google Viz)
HTML Table
5
Integrate & Visualize Data
• Dynamic content generated by SPARQL query • Aggregate data from multiple sources• Visualize data using different technologies
Javascript
XSLT
Google AJAXAPI
Yahoo Pipes
PHP + Java
6
Types of Mashups
• Aspects– Data Mashup– Visualization Mashup– Application Mashup
• Subject– Temporal Mashup– Spatial Mashup– Entity Mashup– …
• Other topics– Semantic Search– Social mashup– Provenance
7US and UK Foreign AID FY2007:Integrating data from two countries
AID Major aids from US Major aids from UK
Brazil US >UK Development Assistance Gov & civil society, Economic
India UK > US Child Survival and Health Health, Economic
Created by James Michaelis, RPI, http://data-gov.tw.rpi.edu/demo/linked/aidviz-1554-10030.html
Data Sources:
[Spatial Mashup] Data.gov (USAID) + Data.gov.uk (DFID)
8Gov Data for Pop Science:Integrating different sources for discovery
Created by Sarah Magidson, U. Chicago. http://data-gov.tw.rpi.edu/demo/stable/tobacco-smoker/demo-state-10026-smoke-rate-statevarsapi.html
[Spatial Mashup] Data.gov (Population) + NIH (Tobacco Tax, Smoking rate)
Gov data provides knowledge for poplation science study
9
XHTML+RDFa
ARC2
http://data-gov.tw.rpi.edu/
Data-gov Semantic Search
HTML
RDFa
WebSearchResults
RDF
Annotation
Consume LOGD data in Semantic Search
10
Adding Social Factor to Mashups
RDFPublish*
Enhance*
UserRawData
consume*
feedback
• Import socially contributed data, e.g. DBpedia • Let users contribute
– links– feedbacks
OtherSocial Web
AppsImport/export
11Social Mashup - US Wildland Fire:Linking to Wikipedia (socially contributed)
Wildland fire(NIFC)
Budget on wildfire“DOI” and “USDA”(OMB)
Category:Wildfires In The United States
Created by Li Ding, RPI, http://data-gov.tw.rpi.edu/demo/stable/demo-1187-40x-wildfire-budget.html
[Temporal Mashup] Data.gov (statistics+ budget) + Wikipedia (famous fires)
12Social Mashup - White House Visitor Search:Linking social network data using semantic wiki
“POTUS”
dbpedia:Barack_Obama
Created by Dominic DiFranzo, Evan Patton, RPI, http://data-gov.tw.rpi.edu/demo/stable/white-house-visitor/top100-visitees.php
[Person Mashup] Data.gov (statistics) + Wikipedia (personal profiles)
The White House
Semantic Wiki
WikipediaNYTimes
13Social Mashup: USPS Spending and Newsgovernment data + User Feedbacks
Created by Sarah Magidson, http://data-gov.tw.rpi.edu/demo/linked/demo-401-usps-news.html
[Temporal Mashup] Data.gov (budget) + USPS + User Contributed News
14
Using Web Tools
Information networks can be integrated via the Semantic Wiki and visualized a number of different ways: social networks, human-language technology, workflows, …
(a) white house visitor search
(e) [Health] Tobacco Prevalence and Correlated Factors
(h) [Yahoo! Pipes] World Earthquake Map
(b) US-UK Foreign Aid Comparison
(c) Agency Budget and NYTimes
(d) Wildland fire and DBpedia
(f) [Policy] About Supreme Court Justices
(k) [RSS] data.gov updates(i) [IBM ManyEyes] white house visitor network
(g) [Financial] Stock price and Twitter events
(j) [RDFa] semantic search
(d) Wildland fire and DBpedia
More Mashups
16
Resources
• Online Tutorials– Mashing up LOGD data with SPARQL – Retrieving SPARQL Results – Building LOGD Visualizations
• Standards and Tools– SPARQL Endpoint– SparqlProxy
• Visualization APIs– Google Visualization API– MIT SIMILE Exhibit
• Language (name it): Javascript, PHP, Python,…