+ All Categories
Home > Documents > George thomas gtra2010

George thomas gtra2010

Date post: 07-Dec-2014
Category:
Upload: george-thomas
View: 1,193 times
Download: 1 times
Share this document with a friend
Description:
 
Popular Tags:
35
Data.gov: Semantic Web, Linked Data, and Social Media George Thomas, GTRA June 2010
Transcript
Page 1: George thomas gtra2010

Data.gov: Semantic Web, Linked Data,

and Social Media

George Thomas, GTRA June 2010

Page 2: George thomas gtra2010

2

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 3: George thomas gtra2010

3

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 4: George thomas gtra2010

• Using HTML, there is only 1 type of link– An important 20th century invention for info workers!

Page 1 hyperlink Page 2

Source hyperlink Target

• My page contains markup that links to Your page– http://myserver.tld/mypage.html

• <a href=‘http://yourserver.tld/yourpage.html>link text</a>

The Existing Web of Documents

4

Page 5: George thomas gtra2010

• Using RDF, we can create custom ‘tagged’ links– An innovation where things relate to other things

Entity 1 ‘relatesTo’ Entity 2

Subject predicate Object

• My data resource contains triple markup like– http://myserver.tld/Subject

• <somePredicate> <http://yourserver.tld/Object>

The Emerging Web of Data

5

Page 6: George thomas gtra2010

• The result is a ‘graph’ of triples– A ‘Giant Global

Graph’ – of interrelated

(open gov) data

• Making the Web a Database– With ubiquitous

access over HTTP

Linked (Open Gov) Data Cloud

6

Page 7: George thomas gtra2010

1. Use URI’s as names for thingsconneg: user agent preference (HTML, RDF/XML, etc.)

2. Use HTTP URI’s so that people can look up those names

Via HTTP GET, ‘dereferencing’

3. When someone looks up a URI, provide useful information, using the standards

(RDF and SPARQL)

4. Include links to other URI’s so that they can discover more things

‘traversing the data graph’

Linked Data Design Principles

7 source

Page 8: George thomas gtra2010

8

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies– Triples

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data– On Freebase

– Using Semantic MediaWiki + Halo

Page 9: George thomas gtra2010

9

A Management Vocabulary

OMG’s Business Motivation Model

Page 10: George thomas gtra2010

• (1-Strategy) (2-channels effort towards) (3-Goal); (2-defines) (3-Offering).

• Objective quantifies Goal.

• Tactic implements Strategy; deploys, Offering.

• Offering requires Resource.

• Lets look at some BMM metadata triples

Triples: Management Metadata

10

Page 11: George thomas gtra2010

• Now lets look at data triples that use this Vocab

• EHR Exchange is a Goal.• Meaningful Use is an Objective; quantifies EHR Exchange.• NHIN is a Strategy.• Connect is a Tactic.• Connect implements NHIN; channels effort towards

Meaningful Use.

EHR Exchange

Triples: Management Data

11

Page 12: George thomas gtra2010

12

An IT Investment Vocabulary

• Creating Triples;

– Get some domain specific CSV data

– (From http://it.usaspending.gov)

• Worksheet name = Class name

• Worksheet Row = instance of Class

• Unique ID, Primary Key = Subject

• Column Name = predicate

• Cell data = Object

Page 13: George thomas gtra2010

13

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 14: George thomas gtra2010

• Let’s take a look at the Linked Data info ecosystem on the Web

– OMB publishes a management vocabulary

schema.omb.gov/Objective<omb:quantifies> <omb:Goal>.

– HHS publishes data that (re)uses OMB’s vocabulary

data.hhs.gov/MeaningfulUse <is a type of> <omb:Objective>;<omb:quantifies> <data.hhs.gov/EHR_Exchange>.

data.hhs.gov/Connect<omb:implements> <hhs:NHIN>;<omb:channelsEffortToward> <hhs:MeaningfulUse>.

– Data.gov aggregates Linked Open Gov Data for mashers and more linking!

data.gov/sparql (a query point on the Web)– Select Agency Objectives and IT Expenditures and …

Example URI Scheme

14

Page 15: George thomas gtra2010

15

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 16: George thomas gtra2010

• Common vocabularies come from standards orgs– Everyone uses these

• Catalogs, licensing, publishing, etc.• Business Management, IT Investments

• Authoritative domain specific vocabularies– Are associated with Agency mission

• And their Web site ‘namespaces’ (a base URI)

• Interlinking vocabularies connect cross agency data– Driven by various interested parties

• Inside and outside of Gov• Adoption leads to voluntary consensus standardization

• Integration is achieved at lower coordination cost!

Federation and Integration

16

Page 17: George thomas gtra2010

17

An Interlinking Vocabulary

• Here is a view inside an Ontology editor

– TopBraid Composer (Free Edition)

• Open in the editor is ‘bizmo.owl’

– Which ‘imports’ other vocabs from the Web

• The Management Vocabulary

• The IT Investment Vocabulary

– On the left is a classes tree browser

– On the right is a properties tree browser

• Bizmo is being used to interlink and

extend imported concepts and terms

– Through sub-classes and sub-properties

• a ‘Program’ is a type of ‘Means’

• A ‘FederalAgency’ is a type of ‘OrgUnit’

– And through interlinking properties

Page 18: George thomas gtra2010

18

Autonomous Owners of Integrated Metadata

• Combine classes and properties from multiple publisher namespaces

Page 19: George thomas gtra2010

19

– supportsABC properties link IT investments• To Management goals and objectives, etc.

– Extending metadata specs, without changing them!

– maintainsXYZ properties link an Organizational Unit• To IT investment information

Close Up: Interlinking Properties

Page 20: George thomas gtra2010

20

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 21: George thomas gtra2010

21

Social Metadata on Freebase

• A Type specification HTML page, defining a vocabulary term (Goal)

– And can see the properties that link (relate) this Type to other Types

• Members of this ‘base’ get notified that George created/edited this Type

– And see instances of this Type (called Topics on Freebase)

Page 22: George thomas gtra2010

22

Social Metadata History on Freebase

• The activity-stream for this Type (Goal)

– The ‘base’ that these belong to can also be looked at this way

• Contributions filtered by User and Date range

– Individual User contributions can also be examined across any/all Types

Page 23: George thomas gtra2010

23

Social Data on Freebase

• A Type instance (Topic) HTML page, defining a specific Goal

– ‘Health Care Reform’

• Data about this topic/instance can be retrieved as JSON or RDF (N-triple)

– And it’s revision history and member contributions can be viewed

Page 24: George thomas gtra2010

24

Social Data History on Freebase

• The activity-stream for this ‘Topic’ (Health Care Reform)

– You can see when and by whom relationships with this data resource was created

• Contributions filtered by User and Date range

– Individual User contributions can also be examined across any/all ‘Topics’ (resources)

Page 25: George thomas gtra2010

25

This Presentation

• From the Web of Docs to a Web of Data

• Common and Domain Specific Vocabularies

• URI Schemes

• Federation and Integration via Interlinking

Vocabularies

• Social Metadata and Data

– On Freebase

– Using Semantic MediaWiki + Halo

Page 26: George thomas gtra2010

26

Browsing SMW KB via Classes

Page 27: George thomas gtra2010

27

Browsing SMW KB via Properties

Page 28: George thomas gtra2010

28

Ontology Classes/Properties = Tags

• Create semantic annotations (part of wysiwyg editor tools)– auto-completion suggests tags to reuse from ontologies that have

been imported into the wiki (note existing tags from BMM ontology)

Page 29: George thomas gtra2010

29

Ontology based SMW KB Query

Page 30: George thomas gtra2010

30

SMW KB Report Generation

Page 31: George thomas gtra2010

31

NHIN – Automagic Data Merging

This UPI keyed data is captured from Ex300/53 submissions…

But this data is ‘merged’ from other independently published sources on the Web of Data!!

Page 32: George thomas gtra2010

32

NHIN RDF/XML IEP: Export Excerpt

• Wiki instance data and metadata curation (SME edits) maintain class and property specs of ontologies imported off the Web when exported or accessed by other sites/users/apps

Page 33: George thomas gtra2010

33

Browsing Integrated Data on SMW

• Filtering through properties defined by the interlinking vocabulary• Finds IT topics (instances) that are linked to Management topics

Page 34: George thomas gtra2010

Conclusion

• Data.gov is today:– One of the largest sets of Open Government RDF

data on the Web

• Data.gov tomorrow will explore:– Focus on common, domain specific and interlinking

vocabularies, and corresponding URI schemes– And treat those vocabularies and their instance data

as Social media objects on the Social Web

• Questions?

Page 35: George thomas gtra2010

Thanks For Your Time!

• Contact me:

– http://xri.net/=george.thomas

• About me:

– HHS Enterprise Architect• CTO Council, member• Innovation Council, member

– Federal CIO Council • Architecture and Infrastructure Committee (AIC), Services Subcommittee co-chair• Cloud Computing Advisory Committee (CCAC), member

– W3C eGov IG, Invited Expert

– OMG GovDTF, Steering Committee member

– Graduate School, SOA Faculty Instructor


Recommended