+ All Categories
Home > Documents > Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Date post: 24-Dec-2015
Category:
Upload: roger-ford
View: 217 times
Download: 0 times
Share this document with a friend
Popular Tags:
23
Open Content By Daniel Jacobson and Harold Neal National Public Radio
Transcript
Page 1: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Open ContentBy Daniel Jacobson and Harold Neal

National Public Radio

Page 2: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Overview

‣ Who is NPR?

‣ Landscape of Open Content

‣ RSS

‣ NPR’s Solution

‣ NPR’s Architecture

‣ NPR API Demo

‣ API Stats and Details

‣ The Future of NPR’s API

‣ Questions?

Page 3: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Who is NPR?

‣ NPR (National Public Radio)

‣ Leading producer and distributor of radio programming

‣ All Things Considered, Morning Edition, Fresh Air, Wait, Wait, Don’t Tell Me, etc.

‣ Broadcasted on over 800 local radio stations nationwide

‣ NPR Digital Media

‣ Website (NPR.org) with audio content from radio programs

‣ Web-Only content including blogs, slideshows, editorial columns

‣ About 250 produced podcasts, with over 600 in directory

‣ Mobile sites

‣ API and other syndication

Page 4: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Open Content Landscape

Content Providers

Amount of

Content Available in APIs

ContentAggregators

UGCAggregators

E-CommerceSites

Major MediaProducers

Page 5: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

What is Major Media Doing?

‣ Most offer RSS for very specific feeds

‣ Some offer extended RSS or comparable

‣ MediaRSS extensions

‣ Podcast enclosures

‣ Very few comprehensive APIs (although seems to be changing)

Really Successful Syndication

‣ Gets some content out there

‣ Drives traffic back to the site

‣ A lot of traction in the marketplace

Really Stingy Syndication

‣ There is meaty real content there

‣ Namespace extensions are limited

‣ Embraces content lock-down model

Page 6: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Full Content Must Be Where The Users Are

‣ RSS is not enough (anymore) to be where the users are!

‣ Users are looking for rich content, multi-media, full text, etc.

‣ There are infinite ways to get content

‣ Loyal patronage is limited to your audience, at best

‣ No guarantee users will come to you for content

‣ Google helps total page views, but page views per session are often low

‣ Facebook, Myspace, etc., is where people go

‣ More content is appearing in these forums

‣ If content is there, users don’t need to go elsewhere

‣ Platforms are constantly changing

‣ It is difficult, but necessary, to keep up

‣ Your site cannot do it alone!

Page 7: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

NPR’s Solution…Open API

‣ Distribute the full content

‣ Allows users to innovate and be creative with our content

‣ A few of us, millions of you

‣ Unlimited people thinking about what can be done

‣ Unlimited people building things

Page 8: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

So Easy, Our CEO Can Do It

Page 9: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

But enables more tech savvy users to do build complex apps

Page 10: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Philosophy of NPR Digital Media

‣ Build Content Management tools, not Web Publishing tools

‣ COPE (Create Once Publish Everywhere)

‣ Separate Content from Display

‣ Eliminate markup from content upon storage

‣ Understand the Atom

‣ Story is the Atom of NPR

‣ Story contains relationships to assets

‣ Stories are grouped into lists

‣ Know when to build and know when to integrate

‣ Tools for assets are always internally managed and centrally stored

‣ For everything else, depends on cost-benefit analysis

‣ When integrating, first option is open source tools

Page 11: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

High-Level System Architecture

Page 12: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Central Oracle 10g Database(planning to migrate to an open source database)

Page 13: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Custom Built CMS

Page 14: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

External Facing Templates(including all transforms and presentations)

Page 15: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Caching and Performance

Page 16: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Output Formats

‣ Currently Supported Formats

‣ NPRML

‣ RSS

‣ MediaRSS

‣ JSON

‣ Atom

‣ JavaScript Widget

‣ HTML Widget

‣ Possible Future Formats

‣ Full Story Widget

‣ NewsML

‣ PBCore

Page 17: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

What is NPRML?

‣ Custom XML structure

‣ most closely represents NPR’s data model

‣ NPR’s “native” model

‣ Foundation of NPR.org

‣ The basis of all other API transformations

‣ Libraries to retrieve and manipulate data from layered data storage

‣ Retrieved via SimpleXML and DOM

‣ NPRML is not meant to be a new standard

Page 18: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Details on the Content

‣ Content available in the NPR API:

‣ 13 years worth of NPR content

‣ About 250,000 unique stories

‣ About 400,000 unique audio files available

‣ Over 5700 unique types of lists, with infinite combination possibilities

‣ Over 90 topics

‣ Twelve programs

‣ Nearly 4000 musical artists

‣ Almost 400 NPR personalities

‣ Over 700 editorial columns and series

Page 19: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Current Statistics on Usage

‣ Since launch on Wednesday, July 16th

‣ Over 300 registrants for the API

‣ Over 235,000 requests to the API

‣ Nearly 10,000 requests based on search terms

‣ Nearly 15,000 requests based on date ranges

‣ Over 23,000 page views of the NPR Tech Center

Page 20: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Distribution of Requested Output Formats

55%

29%

7%

6%

2%< 1%

1%

NPRML

RSS

MediaRSS

HTML Widget

JavaScript Widget

JSON

Atom

Page 21: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Current Rights and Exclusions

‣ Everything that NPR has the rights to is in the API

‣ Includes Morning Edition and All Things Considered

‣ Some NPR programming is excluded due to rights

‣ Car Talk, Fresh Air and This I Believe

‣ Other popular Public Radio Programs are excluded due to rights

‣ * This American Life, Marketplace and A Prairie Home Companion

‣ Some text, images and audio is not available due to rights

‣ Video and blogs are not offered… yet

‣ * These programs are not produced or distributed by NPR.

Page 22: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Future Enhancements for API

‣ Short Term

‣ Full Story HTML Widget

‣ geo information for stories

‣ station finder API

‣ video

‣ Possible Mid to Long Term

‣ more station content from more stations

‣ posting to the API

‣ create your own podcasts

‣ blogs

‣ other formats, including NewsML and PBCore

Page 23: Open Content By Daniel Jacobson and Harold Neal National Public Radio.

Questions?

‣ Feel free to contact us directly:

Daniel Jacobson

[email protected]

Harold Neal

[email protected]

To see the API:http://www.npr.org/api

To follow the API development:http://www.npr.org/blogs/inside


Recommended