+ All Categories
Home > Technology > rNews Embedded Data for the News Industry by Evan Sandhaus

rNews Embedded Data for the News Industry by Evan Sandhaus

Date post: 22-Nov-2014
Category:
Upload: search-marketing-expo-smx
View: 1,098 times
Download: 0 times
Share this document with a friend
Description:
From SMX East 2013 - Structured Data Superstars - rNews Embedded Data for the News Industry by Evan Sandhaus of the NY Times
40
rNews: EMBEDDED DATA FOR THE NEWS INDUSTRY # Evan Sandhaus NYTimes @kansandhaus #SMX #22C October 2, 2013 #
Transcript
Page 1: rNews   Embedded Data for the News Industry by Evan Sandhaus

rNews:

EMBEDDED DATA

FOR THE

NEWS

INDUSTRY

#

Evan Sandhaus

NYTimes

@kansandhaus

#SMX #22C

October 2, 2013

#

Page 2: rNews   Embedded Data for the News Industry by Evan Sandhaus

2

Agenda

Why we need rNews

Intro to rNews

Benefits of rNews

Road to rNews

Schema.org integration

Adoption

NYT Case Study

Discussion

Page 3: rNews   Embedded Data for the News Industry by Evan Sandhaus
Page 4: rNews   Embedded Data for the News Industry by Evan Sandhaus

4

...And 50

Others

Page 5: rNews   Embedded Data for the News Industry by Evan Sandhaus

5

Why we need

Semantic

Markup

5

The Burning Question

Page 6: rNews   Embedded Data for the News Industry by Evan Sandhaus
Page 7: rNews   Embedded Data for the News Industry by Evan Sandhaus

STORY

PHOTO

Story components which are obvious to a person…

Page 8: rNews   Embedded Data for the News Industry by Evan Sandhaus

STORY

PHOTO

...are not so obvious to a machine.

Page 9: rNews   Embedded Data for the News Industry by Evan Sandhaus

9

The Problem Of Structured Data: Continued

Label Type Value

id number 1248069162607

Headline text New Web Code Draws Concern...

Byline text By TANZINA VEGA

Date date 20101010

Body text In the next few years, a powerful...

Length number 1123

Tag text Privacy

Tag text Computers and the Internet

Tag text Web Browsers

<html> <head> <title> New Web Code

Draws Concern... </title> </head>

<body> <div>

New Web Code Draws Concern...

</div> <div>

By TANZINA VEGA

</div> <div>

October 10, 2010

</div> <div>

In the next few years, a powerful...

</div> </body></html>

Data Tier Display TierLogic Tier

Content very well structured on Data

Tier, but all of this structure is lost in

translation to presentation tier.

Page 10: rNews   Embedded Data for the News Industry by Evan Sandhaus

10

The Problem Of Structured Data: Continued

<html> <head> <title> New Web Code

Draws Concern... </title> </head>

<body> <div>

New Web Code Draws Concern...

</div> <div>

By TANZINA VEGA

</div> <div>

October 10, 2010

</div> <div>

In the next few years, a powerful...

</div> </body></html>

Display Tier

=

?

Search engines, social

networks, aggregators and

other sites only see the

Display Tier, and cannot

leverage the underlying

structure of the data.

Page 11: rNews   Embedded Data for the News Industry by Evan Sandhaus

11

The Problem Of Structured Data: Continued

Without structured data search engines, social

networks and other sites cannot attractively format

links back to our site, potentially decreasing

referral traffic.

With Structured

Data

No Structured

Data

Page 12: rNews   Embedded Data for the News Industry by Evan Sandhaus

12

The Case of the Missing Structured Data

Page 13: rNews   Embedded Data for the News Industry by Evan Sandhaus

13

Semantic Markup Standards

Microformats RDFa Microdata JSON

First

Simple

Rigid

Official

Complex

OpenGraph

Unofficial

Flexible

Schema.org

Official

Developers

External

Page 14: rNews   Embedded Data for the News Industry by Evan Sandhaus

14

Page 15: rNews   Embedded Data for the News Industry by Evan Sandhaus

15

Page 16: rNews   Embedded Data for the News Industry by Evan Sandhaus

1616

rNews

Page 17: rNews   Embedded Data for the News Industry by Evan Sandhaus

17

rNews Defined

rNews is a data model for embedding

machine-readable publishing metadata in

web documents and a set of suggested

implementations.

slightly shorter

Page 18: rNews   Embedded Data for the News Industry by Evan Sandhaus

18

rNews is a data model

Page 19: rNews   Embedded Data for the News Industry by Evan Sandhaus

19

for embedding machine-readable publishing

metadata in web documents

Headline

Byline

Tags

Creator

...

Page 20: rNews   Embedded Data for the News Industry by Evan Sandhaus

20

and a set of suggested implementations

RDFa Microdata JSON

Today Today Maybe?

Page 21: rNews   Embedded Data for the News Industry by Evan Sandhaus

21

rNews - Class Diagram

Page 22: rNews   Embedded Data for the News Industry by Evan Sandhaus

22

rNews - Working Example

Page 23: rNews   Embedded Data for the News Industry by Evan Sandhaus

23

Page 24: rNews   Embedded Data for the News Industry by Evan Sandhaus

24

HTML 5 Microdata<!DOCTYPE HTML>

<html itemscope itemtype="http://schema.org/NewsArticle" >

<head>

<style type="text/css">@import url(css/iptc_times2.css);</style>

<meta itemprop="dateCreated" content="2011-03-23"/>

<meta itemprop="description" content="The questions about the command..."/>

<meta itemprop="inLanguage" content="en-US"/>

<meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/>

<meta itemprop="genre" content="Current"/>

<meta itemprop="id" content="1248069687395"/>

<meta itemprop="version" content="2"/>

<meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/>

<meta itemprop="wordCount" content="879"/>

</head>

<body>

<div style="height:900px" class="article">

<div class="a_column">

<div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>

<div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>

<div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">

<img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/>

<div class="image_credit">Credit:

<span itemprop="creator" itemscope itemtype="http://schema.org/Person">

<span itemprop="name">Goran Tomasevic</span>

</span>

/

<span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization">

<span itemprop="name">Reuters</span>

<meta itemprop="tickerSymbol" content="NYSE TRI"/>

</span>

</div>

Page 25: rNews   Embedded Data for the News Industry by Evan Sandhaus

rNews

BenefitsOr Why You Should Care

About rNews

Page 26: rNews   Embedded Data for the News Industry by Evan Sandhaus

26

Benefit #1: Superior Algorithmically-

Generated Links

Using structured data search engines, social

networks and other sites can attractively format

links back to our site, potentially increasing referral

traffic.

With Structured

Data

No Structured

Data

Page 27: rNews   Embedded Data for the News Industry by Evan Sandhaus

27

Benefit #2: Superior Tool Support

Vertical search

Commenting Platforms

Rights Management

Page 28: rNews   Embedded Data for the News Industry by Evan Sandhaus

28

Benefit #3: Better Analytics

Page 29: rNews   Embedded Data for the News Industry by Evan Sandhaus

29

The Way to rNews

Page 30: rNews   Embedded Data for the News Industry by Evan Sandhaus

30

rNews - Timeline

September 2010 - rNews proposed to IPTC at fall

meeting

March 2011 - rNews draft version 0.1 approved by

IPTC at summer meeting.

March - May 2011 - IPTC solicits feedback on draft

standard.

June 2011 - IPTC to vote on revised standard at

summer meeting

October 2011 - rNews 1.0 Approved by IPTC

November 2011 - Start of rNews Implementation

on nytimes.com

Page 31: rNews   Embedded Data for the News Industry by Evan Sandhaus

31

Engaging Our Community

Page 32: rNews   Embedded Data for the News Industry by Evan Sandhaus

32

Engaging Our Community

Page 33: rNews   Embedded Data for the News Industry by Evan Sandhaus

33

Engaging Our Community

Page 34: rNews   Embedded Data for the News Industry by Evan Sandhaus

3434

Engaging Our Community

Page 35: rNews   Embedded Data for the News Industry by Evan Sandhaus

And Then

Schema.org

Page 36: rNews   Embedded Data for the News Industry by Evan Sandhaus

36

Page 37: rNews   Embedded Data for the News Industry by Evan Sandhaus

37

Page 38: rNews   Embedded Data for the News Industry by Evan Sandhaus

38

This class contains derivatives of

IPTC rNews properties. rNews is a

data model of publishing metadata

with serializations currently

available for RDFa as well as

HTML5 Microdata. More

information about the IPTC and

rNews can be found at rnews.org.

Page 39: rNews   Embedded Data for the News Industry by Evan Sandhaus

3939

rNewsThank You!

Page 40: rNews   Embedded Data for the News Industry by Evan Sandhaus

http://www.slideshare.net/SearchMarketingExpo

#

see more presentations at:

#

4

0


Recommended