+ All Categories
Home > Technology > Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Date post: 15-May-2015
Category:
Upload: bradley-allen
View: 537 times
Download: 1 times
Share this document with a friend
Popular Tags:
32
Navigation for the Digital Universe Multimedia Data Navigation and the Semantic Web Valery A. Petrushin and Bradley P. Allen
Transcript
Page 1: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Navigation for the Digital Universe

Multimedia Data Navigation and the Semantic Web

Valery A. Petrushin and Bradley P. Allen

Page 2: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 2

Outline• About the authors• Faceted Navigation• Semantic Web Techniques

– RDF(S)

– Dublin Core

– SKOS

– TGM

– LSCOM, SMIL & MPEG-7• Case Study: BBC Rushes• Implementation

– BBC Rushes Navigator• Metadata representation

• Architecture

• User interface

• Future work• Contact Information• Demo

Page 3: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 3

About the Authors• Valery A. Petrushin, Ph.D.

– Sr. Researcher, Accenture Technology Labs

– Semantics of programming languages

– Multimedia data mining, analysis, annotation and retrieval

– Georgia Tech, Glushkov Institute for Cybernetics

• Bradley P. Allen– Founder and CTO Siderean Software, Inc.

– Semantic-based navigation, Web personalization services, case-based reasoning

– Former founder and CTO of Limbex Corp. and TriVida Corp.

– Carnegie-Mellon University

Page 4: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 4

Faceted navigation• Facets are metadata properties whose ranges

form a near-orthogonal set of controlled vocabularies

• Creator: “Dickens, Charles”• Subject: Arsenic, Antimony• Location: World > U.S. > California > Venice

• Facets form a frame of reference for information overview, access and discovery

• Other properties serve as landmarks and cues

• Faceted navigation uses facets to provide end user access and discovery in the context of large collections of semi-structured information

Page 5: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 5

Faceted Navigation BuiltUsing Semantic Web Standards

• Define/reuse ontologies expressed in RDF(S)/OWL

• Classes for defining instances and controlled vocabularies• Properties for facets and additional asset metadata attributes

• Import/transform aggregated instance metadata into an RDF representation

• Resources referred to via URIs• Content and controlled vocabularies

• Write application profiles in terms of RDF

Page 6: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 6

Building Faceted Navigation Applications

… then represented as instances of concepts in

ontologies and tagged using controlled vocabularies…

… then application profilesare created…

… that define navigation services for user applications

Metadata is aggregated…

Term

Event

Person

PlaceText

Application Profiles

Page 7: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 7

Semantic Web Technology

• RDF(S) – Resource Description Framework (Schema) • Dublin Core• SKOS – Simple Knowledge Organization System• TGM-I & II – Thesaurus for Graphic Materials • LSCOM – Large Scale Concept Ontology for Multimedia• SMIL – Synchronized Multimedia Integration Language• MPEG-7 – Multimedia Content Description Interface

Page 8: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 8

RDF (S)• RDF (S) - Resource Description Framework (Schema)

• http://www.w3.org/RDF/• http://www.w3.org/TR/rdf-schema/• language for representing metadata about Web resources• Triple : subject – predicate -- > object • Example:

<?xml version="1.0"?>

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:contact="http://www.w3.org/2000/10/swap/pim/contact#">

<contact:Person rdf:about="http://www.accenture.com/techlabs/VAP/contact#me">

<contact:fullName>Valery A. Petrushin</contact:fullName>

<contact:mailbox rdf:resource="mailto:[email protected]"/>

</contact:Person>

</rdf:RDF>

Page 9: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 9

Dublin Core (DC)• Dublin Core

• http://dublincore.org/documents/ • vocabulary for describing documents (title, creator, subject,

description, publisher, contributor, date, type, format, identifier, source, language, relation, coverage, rights)

• Example:

<?xml version="1.0"?>

<!DOCTYPE rdf:RDF PUBLIC "-//DUBLIN CORE//DCMES DTD 2002/07/31//EN"

"http://dublincore.org/documents/2002/07/31/dcmes-xml/dcmes-xml-dtd.dtd">

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:dc="http://purl.org/dc/elements/1.1/">

<rdf:Description rdf:about="http://www.accenture/techlabs/Petrushin">

<dc:title> Multimedia Data Mining and Knowledge Discovery</dc:title>

<dc:creator> Valery A. Petrushin </dc:creator >

<dc:publisher>Springer Verlag</dc:publisher>

</rdf:Description>

</rdf:RDF>

Page 10: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 10

SKOS• SKOS – Simple Knowledge Organization System

• http://www.w3.org/2004/02/skos/ • model for expressing structure and content of concept schemes

(thesauri, taxonomies, etc.)• Specifies concepts, collections of concepts and relations between

concepts (broader, narrower, related)• Example:

<rdf:RDF

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:skos="http://www.w3.org/2004/02/skos/core#">

<rdf:Description rdf:about="http://www.example.com/concepts#people">

<skos:broader rdf:resource="http://www.example.com/concepts#mammals"/>

<skos:narrower rdf:resource="http://www.example.com/concepts#children"/>

<skos:narrower rdf:resource="http://www.example.com/concepts#adults"/>

</rdf:Description>

</rdf:RDF>

Page 11: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 11

TGM – I & II• TGM – Thesaurus for Graphic Materials (The

Library of Congress)• TGM-I – Subject Terms (6,300)

– http://www.loc.gov/rr/print/tgm1/toc.html

• TGM-II – Genre and Physical Characteristic Headings (600)– http://www.loc.gov/rr/print/tgm2/

• Example:

TGM-I:

Term: Sand

Narrower Term: Quicksand

Related Term: Dunes, Sand sculpture, Sandpaintings

TGM-II:

Term: Aerial views

Public Note: Views from a high vantage point.

Used For: Air views, Balloon views, Views, Aerial

Broader Term: Views

Narrower Term: Aerial photographs

Related Term: Bird's-eye views, Panoramic views

Page 12: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 12

LSCOM, SMIL & MPEG-7• LSCOM – Large Scale Concept Ontology for

Multimedia• http://www.acemedia.org/aceMedia/files/multimedia_ontology/

presentations_1st_meeting/arda.pdf

• SMIL – Synchronized Multimedia Integration Language

• http://www.w3.org/TR/REC-smil/ • Simple language for representing multiple synchronized media

streams

• MPEG-7 – Multimedia Content Description Interface

• http://www.chiariglione.org/mpeg/standards/mpeg-7/mpeg-7.htm • Advanced language for representing multimedia content• ISO Standard

Page 13: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 13

Case Study: BBC Rushes• Rushes are raw footage …

with a promise to turn into golden nuggets of stockshots

• TRECVID 2005• Video Retrieval Competition at NIST• http://www-nlpir.nist.gov/projects/trecvid/

• Problem:• create a system that helps a TV program maker

compose a video using current clips and rushes

• Data Statistics:– Duration: 49.3 hours– Content:

– Clips about vacation and travel– 4 issues of “Summer Holiday” (~ 2 hours)– BBC One News (30’) + fragment (~3’)

Page 14: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 14

BBC Rushes: Data Statistics - 1• Statistics: clip level

• 615 clips (308 development + 307 test sets)

• Duration (mm:ss) :– Minimal / Maximal - 00:03.48 / 47:11

– Mean / Median – 04:49 / 02:25

– Std - 06:02.73

• Keywords:– Different keywords / Occurrences –

1036 / 4908

– Mean / Median – 7.98 / 7

– Minimal / Maximal – 0 / 34

Page 15: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 15

BBC Rushes: Data Statistics - 2• Statistics: shot level

• Number of shots 10,064• Shot duration (mm:ss)

» Minimal - 0:00.04» Maximal –

22:45.16» Mean – 0:17.51» Median – 0:09.74» Std -

0:33.97

• Number of key frames» Total: 39,132» Median per shot: 2» Mean per shot: 3.8» Maximal: 377» Minimal: 1

Page 16: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 16

BBC Rushes: representation• Ontologies

– RDFS, Dublin Core, SKOS• Controlled vocabularies

– TGM-1 (reflecting Light Scale Concept Ontology for Multimedia), ISO8601 (temporal hierarchy of dates), MPEG-7 (visual features)

• Instances– trecvid:Shot, trecvid:Clip

• Application profile– Retrieve instances of type trecvid:Clip

• Textual facets: dc:title (clip title), dc:subject (keywords), dc:creator (director), dcterms:created (production date), dcterms:issued (show date), dc:extent (duration)

– Retrieve instances of type trecvid:Shot• Visual facets: dc:subject with values skos:narrower than trecvid:color,

trecvid:texture and trecvid:colorplustexture

• Textual facets through reference to containing clip

Page 17: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 17

Ontology Schema

Clip

Shot

KeyFrame

Color

Texture

Color+Texture

Title

Creator

Subject

Date

dcterms: partOf

dc: title

dc: creator

dc: subject

dc: subject

dc: created

skos: broader

skos: broader

skos: broader

skos: broaderskos: broader

skos: broader

skos: broader

skos: broader

VISUAL

TEXTUAL

Page 18: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 18

BBC Rushes: visual facets• Facets: color, texture, [shape] +

combinations• Color, texture, color+texture

• To build facets• Extract features (MPEG-7):

– Color: dominantColor(24), colorStructure (256), colorLayout (12)

– Texture: edgeHistogram (80), homogenousTexture (60)

• SOM Clustering of keyframes– Select as a visual “word” the closest

keyframe to node centroid• Represent keyframes as SKOS concepts,

centroids as skos:broader of cluster members

• Example: – SOM for color 35x28 (=980 nodes)

Page 19: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 19

Self-organizing Maps• SOM = Kohonen NN = Topology-preserving map• Unsupervised learning (Clustering + Visualization)• X = {xi} , xi Rd - input data• M = {mk} , mk Rd - prototype vectors (codebook) =

neurons on 1D or 2D grid• Training:

• 1. Start with random mk

• 2. For xi find best-matching unit (BMU) mc

• 3. Update prototype vectors in neighborhood

where is the neighborhood kernel is radius at time t

• Two phases: rough and fine tuning

kk

c mxmx min

)()()()()()1( tmtxthttmtm ickkk

tkccktckck rrddth ,),2/exp()( 22 )(thck

Page 20: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 20

BBC Rushes: RDF subgraph

Chilli_peppers

v159_001.wmv v159.mpg

“michelle jones”

2000-03-01

dc:subject

dc:creator

dcterms:partOf

dc:created

dc:subject

color#26547

f000000000.jpg

skos:broader

skos:broader

2000

2000-03

Hot_peppers

PeppersYear

skos:broader

skos:broader

skos:broader

skos:broader

“thailand, chiang mai/chillis”

dc:title

Color

skos:broader

Page 21: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 21

BBC Rushes: RDF/XML serialization <trecvid:Clip rdf:about="http://swvideo.techlabs.accenture.com/v159.mpg"> <rdf:type rdf:resource="&dctype;MovingImage" /> <dc:title>thailand, chiang mai/chillis</dc:title> <dcterms:extent>202200</dcterms:extent> <dc:creator>michelle jones</dc:creator> <dc:identifier>mrs320354</dc:identifier> <dcterms:created rdf:resource="tag:siderean.com,1752-09-14:2000-03-01" /> <dcterms:issued rdf:resource="tag:siderean.com,1752-09-14:2000-07-18" /> <dc:subject rdf:resource="&trecvid;thailand" /> <dc:subject rdf:resource="&trecvid;chiang_mai" /> <dc:subject rdf:resource="&trecvid;chillis" /> <dc:subject rdf:resource="&trecvid;peppers" /> <dc:subject rdf:resource="&trecvid;chilli_peppers" /> <dc:subject rdf:resource="&trecvid;vegetables" /> <dc:subject rdf:resource="&trecvid;markets" /> <dc:subject rdf:resource="&trecvid;street_markets" /> <dc:subject rdf:resource="&trecvid;food_markets" /> <dc:subject rdf:resource="&trecvid;food" /> <dc:subject rdf:resource="&trecvid;herbs" /> <dc:relation>http://swvideo.techlabs.accenture.com/v159.fset/f000000000.jpg </dc:relation> </trecvid:Clip>

<skos:Concept rdf:about="&trecvid;chilli_peppers"> <skos:broader rdf:resource="&tgm1;Hot_peppers"/> <skos:prefLabel>chilli peppers</skos:prefLabel> </skos:Concept>

<skos:Concept rdf:about='tag:siderean.com,1752-09-14:2000-03-01'> <skos:prefLabel>2000-03-01</skos:prefLabel> <skos:broader rdf:resource='tag:siderean.com,1752-09-14:2000-03'/> </skos:Concept>

<trecvid:Shot rdf:about="http://swvideo.techlabs.accenture.com/shotsWMV/v159_001.wmv"> <rdf:type rdf:resource="&dctype;MovingImage" /> <dcterms:isPartOf rdf:resource="http://swvideo.techlabs.accenture.com/v159.mpg" /> <dcterms:extent>21000</dcterms:extent> <dc:relation>http://swvideo.techlabs.accenture.com/v159.fset/f000000000.jpg</dc:relation> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000000000.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000000240.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000000280.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000001440.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000003120.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000005440.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000009680.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000011520.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000012040.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000013800.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000014800.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000015120.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000016760.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000018280.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000019360.jpg"/> <dc:subject rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000021000.jpg"/> </trecvid:Shot>

<skos:Concept rdf:about="http://swvideo.techlabs.accenture.com/v159.fset/f000000000.jpg"> <skos:broader rdf:resource="http://swvideo.techlabs.accenture.com/color#26547" /> <skos:prefSymbol rdf:resource="http://swvideo.techlabs.accenture.com/v159.fset/f000000000.jpg" /> </skos:Concept>

<skos:Concept rdf:about="http://swvideo.techlabs.accenture.com/color#26547"> <skos:broader rdf:resource="&trecvid;color" /> <skos:prefSymbol rdf:resource="http://swvideo.techlabs.accenture.com/v289.fset/f000048880.jpg" /> </skos:Concept>

Page 22: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 22

BBC Rushes Navigator: Architecture

AJAX client in FirefoxMetadataAggregator

MetadataStore

NavigationWeb Services

XRBRquery

XRBRresponse

BBC Rushes RDF

http://www.siderean.com/bbcrush/bbcrush.jsp (with Firefox 1.5)

Page 23: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 23

Lessons Learned• Data preparation

• Robust shot boundary detection• Careful selection of keyframes

– Motion based– Salient object based– Filtering redundant keyframes

• Using group-of-frames (GOF) features

• Concept recognition/propagation• Propagate keywords from clip to shots• Recognize concepts from visual data• Probabilistic reasoning• Derive concepts from data (data mining) + labeling

Page 24: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 24

Summary• Methodology of Multimedia Data

Representation– Semantic Web Technology– Multimedia Data Mining

• Prototype of Multimedia Retrieval System– BBC Rushes– Web-based Interface using AJAX

Page 25: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 25

Future work• More facets

• Shape + combinations• Geographical location

• More Interfaces• Map of the world for browsing

places• Hierarchy of SOM for browsing clips

and shots

• More Tools• Tagging tool for creating and

managing metadata• Tools for creating video databases

(shot extraction, feature extraction, clustering, classification of events, etc.)

• Tools for creating audio-video compositions (TV programs, commercials, etc.)

Page 26: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 26

BBC Rushes Navigator:Navigation with LSCOM

Page 27: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 27

BBC Rushes Navigator:Hierarchical Drill-down on People Facet

Page 28: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 28

BBC Rushes Navigator:Faceted View of All Shots

Page 29: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 29

BBC Rushes Navigator:Searching by Subject

Page 30: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 30

BBC Rushes Navigator: Searching by Color, Playlist composition

Page 31: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 31

BBC Rushes Navigator: Drill-down using Subject and Color

Page 32: Multimedia Data Navigation and the Semantic Web (SemTech 2006)

Copyright © 2005 Accenture, LLP / Siderean Software, Inc. All rights reserved. 32

Contact Information

• Valery A. [email protected]

• Bradley P. [email protected]


Recommended