+ All Categories
Home > Technology > RDFa From Theory to Practice

RDFa From Theory to Practice

Date post: 06-May-2015
Category:
Upload: adrian-stevenson
View: 2,021 times
Download: 5 times
Share this document with a friend
Description:
Talk given at Institutional Web Management Workshop 2010, University of Sheffield
66
A centre of expertise in digital information management www.ukoln.ac.u k www.bath.ac.u k UKOLN is supported by: RDFa From Theory to Practice - Part 1 - A gentle introduction to Linked Data and the Semantic Web? 12th July 2010 Institutional Web Management Workshop 2010 University of Sheffield, UK Adrian Stevenson
Transcript
Page 1: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

UKOLN is supported by:

RDFa From Theory to Practice - Part 1 - A gentle introduction to Linked Data and the Semantic Web?

12th July 2010

Institutional Web Management Workshop 2010University of Sheffield, UK

Adrian Stevenson

Page 2: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

semantics is … devoted to the study of meaning … on the syntactic levels of words, phrases, sentences

http://en.wikipedia.org/wiki/Semantic

Page 3: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

“The Semantic Web is a web of data, in some ways like a global database”1

“first step is putting data on the Web in a form that machines can naturally understand...  This creates what I call a Semantic Web - a web of data that can be processed directly or indirectly by machines”2

1. http://www.w3.org/DesignIssues/Semantic.html

2. Tim Berners-Lee, Weaving the Web. Harper, San Francisco. 1999.

Page 4: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

“The term Linked Data refers to a set of best practices for publishing and connecting structured data on the Web.”

“the Semantic Web is the goal or end result… Linked Data provides the means to reach that goal”

From ‘Linked Data: The Story So Far’ - Heath, Bizer and Berners-Lee 2009

Page 5: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

The Web We’re Used To

• Made by humans for humans

• Primarily documents

• Machines not very welcome

• Data silos

Page 6: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Web of Linked Data

• In 1998 the idea from Tim Berners-Lee of ‘linked data’ took shape

• Designed for machines first

• It primarily links data about ‘things’, not documents

• …but it is for humans in the end

Page 7: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

• But haven’t we been putting data on the web for years?– In CSV , relational databases, XML etc?

• Well yes, but these approaches are not so easy to integrate

• Web 2.0 mashups work against a fixed set of data sources

• Linked Data applications operate on top of an unbound, global data space.

Page 8: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

So what’s happening now?

Page 9: RDFa From Theory to Practice
Page 10: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

• “Sir Tim Berners-Lee, the inventor of the world wide web, will help the British government to make its data more easily available online … I have asked Sir Tim Berners-Lee … to help us drive the opening up of access to Government data in the web” Prime Minister Gordon Brown, 10th June 2009

• "What you find if you deal with people in government departments is that they hug their database, hold it really close”. Tim Berners-Lee, 10th June 2009

• An Institute of Web Science proposed• Why?

– Openness – MPs expenses, etc.– Saving money

Page 11: RDFa From Theory to Practice

http://www.guardian.co.uk/technology/2010/may/25/berners-lee-institute-web-science-statement

Page 12: RDFa From Theory to Practice

http://www.ecs.soton.ac.uk/about/news/3223

Page 13: RDFa From Theory to Practice

Data.gov.uk

Officially launched 21st January 2010

Page 14: RDFa From Theory to Practice

Data.gov.uk – search for ‘traffic’

Page 15: RDFa From Theory to Practice

Central Office of Information - http://coi.gov.uk/

Page 16: RDFa From Theory to Practice

BBC Music BETA

http://www.bbc.co.uk/music/developers

Page 17: RDFa From Theory to Practice

• Provides access to raw data (Excel spreadsheets, PDF files, and more)

• UK is adhering more closely to Berners- Lee’s Linked Data rules

Page 18: RDFa From Theory to Practice

http://www.readwriteweb.com/archives/cnet_partners_with_thomson_reuters_on_linked_data.php

Page 19: RDFa From Theory to Practice

http://open.blogs.nytimes.com/2009/06/26/nyt-to-release-thesaurus-and-enter-linked-data-cloud/

Page 20: RDFa From Theory to Practice

Graphs house prices over time - combines house price data with information from Yahoo! Placemaker, Nestoria and OpenStreetMap

Page 21: RDFa From Theory to Practice
Page 22: RDFa From Theory to Practice

Postcode Paper - bus timetables, doctors surgeries, allotmentshttp://blog.newspaperclub.co.uk/2009/10/16/data-gov-uk-newspaper/

Page 23: RDFa From Theory to Practice

Owls Near You - http://owlsnearyou.com/

Page 24: RDFa From Theory to Practice

12 month project funded by JISC 2/10 jiscExpo callhttp://blogs.ukoln.ac.uk/locah/ http://www.twitter.com/projectlocah tag: #locah

Page 25: RDFa From Theory to Practice

http://richard.cyganiak.de/2007/10/lod/

Page 26: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

A little bit of the techy stuff

Page 27: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Linked Data is …

• A way of publishing data on the web that:– Encourages reuse– Reduces redundancy– Maximises inter-connectedness– Enables network effects

• So how is this achieved?

Page 28: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Presentational tagging – HTML

• <h1>Agilitas Physiotherapy Centre</h1> <p>Welcome to the Agilitas Physiotherapy Centre home page. Do you feel pain? Have you had an injury? Let our staff Lisa Davenport, our secretary Kelly Townsend, and Steve Matthews take care of your body and soul.</p>

<h2>Consultation hours</h2> Mon 11am - 7pm<br/> Tue 11am - 7pm<br/> Wed 3pm - 7pm<br/> Thu 11am - 7pm<br/> Fri 11am - 3pm

• <p> But note that we do not offer consultation during the weeks of the <a href=". . .">State Of Origin</a> games.</p>

Page 29: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Semantic tagging<company>

<treatmentOffered>Physiotherapy</treatmentOffered>

<companyName>Agilitas Physiotherapy Centre</companyName>

<staff>

<therapist>Lisa Davenport</therapist><therapist>Steve Matthews</therapist>

<secretary>Kelly Townsend</secretary>

</staff>

</company>

Page 30: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Tim BL’s Linked Data Design Issues• Use URIs as names for things • Use HTTP URIs so that people can look up those

names. • When someone looks up a URI, provide useful

information, using the standards (RDF, SPARQL) • Include links to other URIs so that they can

discover more things.

• From http://www.w3.org/DesignIssues/LinkedData.html

Page 31: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

URIs and HTTP

• A “Uniform Resource Identifier (URI) provides a simple and extensible means for identifying a resource –RFC 3986

• A URL is a type of URI• HTTP URIs can be ‘de-referenced’

• HTTP URIs are used for “real world” things– http://adrianstevenson.com/id/me– http://dbpedia.org/page/Tim_Berners-Lee

Page 32: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

RDF• Resource Description Framework

– “a language for representing information about resources in the World Wide Web”

– “RDF can also be used to represent information about things that can be identified on the Web, even when they cannot be directly retrieved on the Web”

• Describes relations based on triples– Subject-object-predicate

• http://www.w3.org/TR/REC-rdf-syntax/

Page 33: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Heroes

has a

creator whose name is

David Bowie

Subject

Predicate

Object

Page 34: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

RDFa• ‘Resource Description Framework in

attributes’

• Adds attribute level extensions to XHTML

• Enables embedding RDF triples within XHTML

• Google and Yahoo process RDFa

Page 35: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

RDFa example<html xmlns:dc="http://purl.org/dc/terms/">

<head>

<title>RDFa: From Theory to Practice</title>

</head>

<body>

<h1>RDFa: From Theory to Practice</h1>

Author: <em property="dc:creator" content=”Adrian Stevenson">Adrian Stevenson</em>

Created: <em property="dc:created" content="2010-07-12"> July 12th, 2010</em>

License: <a rel="license" href="http://creativecommons.org/licenses/ » by-sa/3.0/">CC Attribution-ShareAlike</a>

</body>

</html>

Page 36: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

RDFa example<html xmlns:dc="http://purl.org/dc/terms/">

<head>

<title>RDFa: From Theory to Practice</title>

</head>

<body>

<h1>RDFa: From Theory to Practice</h1>

Author: <em property="dc:creator" content=”Adrian Stevenson">Adrian Stevenson</em>

Created: <em property="dc:created" content="2010-07-12"> July 12th, 2010</em>

License: <a rel="license" href="http://creativecommons.org/licenses/ » by-sa/3.0/">CC Attribution-ShareAlike</a>

</body>

</html>

Page 37: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Linked Data in Use

Page 38: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Publishing Linked Data• RDFizers – convert data formats into

RDF

• D2R Server – creates linked data from relational databases

• SparqPlug – Extracts linked data from HTML

• …. Many others

Page 39: RDFa From Theory to Practice
Page 40: RDFa From Theory to Practice
Page 41: RDFa From Theory to Practice

D2R server publishes Linked Data view of database and allows clients to query the database via SPARQL

Page 42: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Linked Data Applications

• Linked Data Browsers – navigate between data sources– Disco– Tabulator– Marbles

• Linked Data Search Engines– For humans – Falcons, SWSE– For apps – Swoogle, Sindice

Page 43: RDFa From Theory to Practice

• Tracks provenance of data• Merges data about the same thing from different sources

http://marbles.sourceforge.net/

Page 44: RDFa From Theory to Practice

• User can explore the underlying data structures

• Can search for objects, concepts or documents

http://iws.seu.edu.cn/services/falcons/

Page 45: RDFa From Theory to Practice

• Provides interface (API) that other linked data apps can use• Rationale: new linked data apps shouldn’t need to implement their own infrastructure for crawling and indexing web of data

http://sindice.com/

Page 46: RDFa From Theory to Practice

http://sindice.com/search?q=jazz&qt=term

Page 47: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Some issues

• To RDF or not to RDF• Usability• Sustainability• Provenance• Licensing• Reliability

Page 48: RDFa From Theory to Practice
Page 49: RDFa From Theory to Practice
Page 50: RDFa From Theory to Practice
Page 51: RDFa From Theory to Practice
Page 52: RDFa From Theory to Practice
Page 53: RDFa From Theory to Practice
Page 54: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Sustainability

• Ed Summers at the Library of Congress createdhttp://lcsh.info

• Linked Data interface for LOC subject headings

• People started using it

Page 55: RDFa From Theory to Practice

Library of Congress Subject Headings

Page 56: RDFa From Theory to Practice
Page 57: RDFa From Theory to Practice

Data Licensing

• Uses Amazon Web Services but contravenes their terms and conditions

http://www4.wiwiss.fu-berlin.de/bizer/bookmashup/

Page 58: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Provenance

• OK if data ‘watermarked’

• But can often be a problem

• VOID can help

Page 59: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Page 60: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

• Can we convince IT Managers, VC etc. it’s worth it?– Realistic expectations– “..the people sort of in charge of the kind

of data thing knew so little about their data structures”

– “I’ve had a whole bunch of meetings to get one dataset, been fobbed off, and literally just never get anywhere”Tom Steinberg, Director of MySociety (from Nodalities issue 8)

The Business Case

Page 61: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

• What’s the payoff for O’Reilly, BBC etc of using Linked Data?

• Why didn’t it work the first time?– What’s different now?• Need to work out what Linked Data does

that other things don’t• prove a simple tangible benefit

The Business Case

Page 62: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Universities and Colleges in the Giant Global Graph

• Session at CETIS Conference 2009

• Case for Linked Data / Semantic Web discussed

• Some cases:– Freedom of Information– Improves data quality– Joining the party

http://wiki.cetis.ac.uk/Universities_and_Colleges_in_the_Giant_Global_Graph

Page 63: RDFa From Theory to Practice

http://wiki.cetis.ac.uk/Image:Conf2009_GGG_Group1B.jpg

Page 64: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Conclusion• Interesting developments and sense of

momentum• Central Gov’t still seem committed• JISC is funding 10 Linked Data projects

starting around July 2010• … but still much to do if the semantic web

and linked data are to really take hold

Page 65: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

Questions?

• http://blogs.ukoln.ac.uk/adrianstevenson• http://www.twitter.com/adrianstevenson• [email protected]

Page 66: RDFa From Theory to Practice

                                                             

A centre of expertise in digital information management

www.ukoln.ac.uk www.bath.ac.uk

CC Attribution

• Some sections of this presentation adapted from:– An Introduction to Linked Data, by Tom Heath– The Semantic Web – An Introduction by Owen Stephens– Using Linked Data as a Learning Resource

Recommendation System by Chris Clarke

• This presentation available under creative commons Noncommercial-Share Alike


Recommended