+ All Categories
Home > Documents > Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State...

Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State...

Date post: 24-Dec-2015
Category:
Upload: elaine-wilkinson
View: 218 times
Download: 1 times
Share this document with a friend
Popular Tags:
49
Multimedia Databases Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University
Transcript
Page 1: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Multimedia DatabasesMultimedia Databases

Prepared by Pradeep Konduri

Instructor: Dr. Yingshu LiGeorgia State University

Page 2: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Plan of AttackPlan of AttackIntroductionArchitectureImage Content AnalysisModeling ConstructsLogical ImplementationReal-World ApplicationsConclusion

Page 3: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Types of multimedia dataTypes of multimedia dataText: using a standard language

(SGML, HTML)Graphics: encoded in CGM,

postscriptImages: bitmap, JPEG, MPEGVideo: sequenced image data at

specified ratesAudio: aural recordings in a

string of bits in digitized form

Page 4: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Nature of Multimedia Nature of Multimedia ApplicationsApplicationsRepositories: central location for

data maintained by DBMS, organized in storage levels

Presentations: delivery of audio and video data, temporarily stored.

Collaborative: complex design, analyzing data

Page 5: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Management IssuesManagement IssuesModeling: complex objects, wide

range of typesDesign: still in researchStorage: representation,

compression, buffering during I/O, mapping

Queries: techniques need to be modified

Performance: physical limitations, parallel processing

Page 6: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Research ProblemsResearch ProblemsInformation Retrieval in Queries:

Modeling the content of documents

Multimedia/Hypermedia Data Modeling and Retrieval: Hyperlinks, Used in WWW

Text Retrieval: Use of a thesaurus

Page 7: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Multimedia Database Multimedia Database ApplicationsApplicationsDocumentation and keeping

RecordsKnowledge distributionEducation and TrainingMarketing, Advertisement,

Entertainment, TravelReal-time Control, Monitoring

Page 8: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

A Generic Architecture of MMDBMS A Generic Architecture of MMDBMS

Media Object

Feature Extraction

Compression

Indexing

MMDBMS

Query feature construction

Search Engine

Feedback Query construction

Media Object

query

feedback

result

Media organization: organize the features for retrieval(i.e., indexing the features with effective structures)

Media query processing: accommodated with indexing structure, efficient search algorithm with similarity function should be designed

Page 9: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Multimedia Database Multimedia Database ArchitectureArchitecture

Multimedia Data Preprocessing System

Database Processing

MM DataPre-

processor

AdditionalInformation

<!ELEMENT ..>.....<!ATTLIST...>

Multimedia DBMS

Users

Que

ry I

nter

face

MM DataInstance

<article>.....</article>

Recognized components

MM DataInstance

MM Data

Meta-Data

Page 10: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Document Database Document Database ArchitectureArchitecture

Document Processing System

Database Processing

DTD filesD

TD

Par

ser<!ELEMENT ..>

.....<!ATTLIST...>

DTDManager

TypeGenerator

Multimedia DBMS

Users

Que

ry I

nter

faceDTD

Document content

SGML/XMLDocuments

XML or SGMLDocumentInstance

ParseTree

<article>.....</article>

<!ELEMENT ..>.....<!ATTLIST...> DTD

C++ Types

C++ Objects

SGML/XMLParser

InstanceGenerator

Page 11: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Image Database Architecture Image Database Architecture

Image Processing System

Database Processing

Image Analysis and Pattern Recognition

ImageAnnotation

Multimedia DBMS

Users

Que

ry I

nter

face

Image ContentDescription

Image

Image

SyntacticObjects

SemanticObjects

<article>.....</article>

Meta-Data

Page 12: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Image Content AnalysisImage Content AnalysisImage content analysis can be

categorized in 2 groups:◦Low-level features: vectors in a

multi-dimensional space Color Texture Shape

◦Mid- to high-level features: Try to infer semantics

◦Semantic Gap

Page 13: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Image Content Analysis: Image Content Analysis: ColorColorColor space:

◦ Multidimensional space◦ A dimension is a color component◦ Examples of color space: RGB, HSV◦ RGB space: A color is a linear combination of 3

primary colors (Red, Green and Blue)Color Quantization

◦ Used to reduce the color resolution of an imageThree widely used color features

◦ Global color histogram◦ Local color histogram◦ Dominant color

Page 14: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Color HistogramsColor Histograms Color histograms indicate color distribution

without spatial information◦ Color histogram distance metrics

0

10

20

30

40

50

Red Orange Yellow Green Blue Indigo Violet

Page 15: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Image Content Analysis: Image Content Analysis: TextureTextureRefers to visual patterns with properties of

homogeneity that do not result from the presence of only a single color

Examples of texture: Tree barks, clouds, water, bricks and fabrics

Texture features: Contrast, uniformity, coarseness, roughness, frequency, density and directionality

Two types of texture descriptors◦ Statistical model-based

Explores the gray level spatial dependence of texture and extracts meaningful statistics as texture representation

◦ Transform-based DCT transform, Fourier-Mellin transform, Polar Fourier

transform, Gabor and wavelet transform

Page 16: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Image Content Analysis: Image Content Analysis: ShapeShapeObject segmentation

◦Approaches: Global threshold-based approach Region growing, Split and merge approach, Edge detection app

◦Still a difficult problem in computer vision. Generally speaking it is difficult to achieve perfect segmentation

Page 17: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Salient Objects vs. Salient Salient Objects vs. Salient PointsPoints

Original images

Segmented images with region boundaries

Extracted salient points

Generic low-level description of images into salient objects and salient points

Page 18: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Modeling Images – Modeling Images – PrinciplesPrinciples

Support for multiple representations of an image

Support for user-defined categorization of images

Well-defined set of operations on imagesAn image can have (semantic, functional,

spatial) relationships with other images (or documents) which should be represented in the DBMS

An image is composed of salient objects (meaningful image components)

Page 19: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Salient Object ModelingSalient Object Modeling

Multiple representations of a salient object (grid, vector) are allowed

A salient object O is of a particular type which belongs to a user defined salient object types hierarchy

An image component may have some (semantic, functional, spatial) relationships with other salient objects

Page 20: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

““Semantic GapSemantic Gap””

semantics-intensive multimedia systems & applications

non-semanticmultimedia data models

require

model

semantic meaning of the

data

raw data,primitive

properties (size, format,

etc)

Semantic Gap

Page 21: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Semantic modeling of Semantic modeling of multimedia multimedia -- Why hard?-- Why hard?

Context-dependency◦Semantics is not a static and intrinsic

property◦The semantics of an object often depends on:

the application/user who manipulate the object the role that the object plays other objects in the same “context”

Van Gogh’s

paintings

flower

Example:

Page 22: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Why hard? (cont.)Why hard? (cont.)Modality-independency

◦Media objects of different modalities may suggest the similar/related semantic meanings.

◦Example:Harry Potter has never been the star of a Quidditch team, scoring points while riding a broom far above the ground. He knows no spells, has never helped to hatch a dragon, and has never worn a cloak of invisibility.

Query:

Results:

image video text

Page 23: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

MediaViewMediaView –– A A ““Semantic Semantic BridgeBridge””

An object-oriented view mechanism that bridges the semantic gap between multimedia systems and databases

Core concept – media view (MV)◦a customized context for semantic

interpretation of media objects (text docs, images, video, etc)

◦collectively constitute the conceptual infrastructure of a multimedia system & application

Page 24: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

ArchitectureArchitecture

External Schema

mediaview 1

Internal Schema

mediaview 2

mediaview n

. . .

Object-oriented Database

Multimedia Systems

Conceptual Schema

. . .

MediaView Mechanism

Page 25: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Basic ConceptsBasic Concepts

An example…

Image

MultimediaObject

TextDocument

AudioClip

VideoClip

Image

BitmapImage

JPEGImage

keyframe

audiotrack

ImpressionisticArtworks

Name

Artist

Type

Style

Wavelet-Texture

Dominant-Shape

Color-Histogram

Artworks

RealisticArtworks

ImpressionisticPaintings

ImpressionisticArtworks

Post-modernArtworks

(a) Base Class (B) Media View

(d) View Schema(c) Base Schema

SongSpeechImpressionistic

Sculptures

subclasssubclass

subviewsubview

Page 26: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Basic ConceptsBasic Concepts

Semantics-based data reorganization via media views

text

audio

video

image

media view

Page 27: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

View OperatorsView OperatorsA set of operators that take

media views and view instances as operands.

Focus on the operators that are indispensable in supporting queries and navigation over multimedia objects.

Page 28: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

View OperatorsView Operators

type-level

V-overlapsyntax<boolean>:= v-overlap (<media view1, media view2 >)semantics true, if and only if ( o O)(oextent(<media view1>) and oextent(<media view2>))

Crosssyntax{<object>}:= cross (<media view1, media view2 >)semantics{<object>} := {o O | o extent(<media view1>) and oextent(<media view2>)}

Sumsyntax{<object>}:= sum (<media view1, meida-view2 >)semantics{<object>} := {o O | o extent(<media view1>) or oextent(<media view2>)}

Subtractsyntax{<object>}:= subtract (<media view1, media view2>)semantics{<object>}:= {o O | o extent(<media view1>) and oextent(<media view2>)}

Page 29: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

View OperatorsView Operators

instance-level

Classsyntax<base class> := class(<view instance>)semantics<view instance> is a instance of <base class>

componentssyntax{<object>} := components (<view instance>) semantics {<object>} := { oO | o is a component (direct or indirect) of <view instance>}

i-overlapsyntax<boolean> := i-overlap (<view instnace1>, <view instance2>)semantics true, if and only if ( o O) (o components (<view instance1>) and o components(<view instance2>))

Page 30: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

View AlgebraView Algebra Functions

-- derivation of new MVs from existing MVs

Heuristic Enumeration1. Blind enumeration 2. Content-based enumeration 3. Semantics-based enumeration

Page 31: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

View AlgebraView AlgebraAlgebra Operators

◦select from src-MV where <predicate>

◦project <property-list> from src-MV◦intersect (src-MV1, src-MV2)◦union (src-MV1, src-MV2)◦difference (src-MV1, src-MV2)

Page 32: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Comparison (vs. class)Comparison (vs. class)

media view object classmembershi

pheterogeneous objects uniform objects

member acquisition

dynamic inclusion/exclusion of existing objects of other classes

creating new objects

mapping one object can belong to multiple media views

one object has exactly one class

relationship inter-member semantic relationship

N/A

Page 33: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Comparison (vs. traditional Comparison (vs. traditional object view)object view)

media view object viewmembershi

pheterogeneous objects uniform objects

relationship inter-member semantic relationship

N/A

member properties

instance-level properties (user-defined)

inherited or derived properties (for view

instances)global

propertiesMV-level properties (user-

defined)N/A

Page 34: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Logical ImplementationLogical ImplementationMediaView ConstructionMediaView CustomizationMediaView Evolution

Page 35: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

MediaViews ConstructionMediaViews ConstructionWork with CBIR systems to

acquire the knowledge from queries◦Learn from previously performed

queries ◦A multi-system approach to support

multi-modality of media objects

Organize the semantics by following WordNet

Page 36: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Why WordNet?Why WordNet?

Different queries may greatly vary with the liberty of choosing query keywords

We need an approach to organize those knowledge into a logic structure◦A simple “context”: a concept in WordNet◦Common media views: corresponds to

simple contexts ◦We provide all common media views, based

on which users can build complex ones.

Page 37: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Navigating the Multimedia Navigating the Multimedia DatabaseDatabase

Navigating via semantic relationships of WordNetSemantic Relationship ExamplesSynonymy (similar) pipe, tubeAntonymy (opposite) fast, slowHyponymy (subordinate) tree, plantMeronymy (part) chimney, houseTroponomy (manner) march, walkEntailment drive, ride

Page 38: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Navigating the Multimedia Navigating the Multimedia DatabaseDatabase

Multimedia Database

MediaView 1

MediaView 2

MediaView 3

MediaView 4

Semantic Relationship in WordNet

User

browse

Page 39: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

MediaViews Construction MediaViews Construction

CBIRSystem(Video)

CBIRSystem(Image)

CBIR System(Text)

Query

...

Multimedia Database

MediaView Engine

System Feedback

Users

User Feedback

Results

Issue

Page 40: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

MediaView Customization MediaView Customization

Two level MediaView Framework

Basic MediaView

Customized MediaView

Simple Context Advanced Context

Page 41: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

MediaView Customization MediaView Customization Dynamically construct complex-

context-based media views based on simple ones ◦An example complex context: “the

Grand Hall in City University” Several user-level operators are

devised to support more complex/advanced contexts, besides the basic operators

Page 42: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

User-level OperatorsUser-level OperatorsINHERIT_MV(N: mv-name, NS:

set-of-mv-refs, VP: set-of-property-ref, MP: set-of-property-ref): mv-ref

UNION_MV(N: mv-name, NS: set-of-mv-refs): mv-ref

INTERSECTION_MV(N: mv-name, NS: set-of-mv-refs): mv-ref

DIFFERENCE_MV(N1: mv-ref, N2: mv-ref): mv-ref

Page 43: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Build a MediaView in Run-Build a MediaView in Run-timetime

Example: find out info about "Van Gogh"◦ Who is "Van Gogh"?◦ What is his work?◦ Know more about his

whole life.◦ Know more about his

country.◦ See his famous

painting "sunflower"

Legend

Multimedia Document

Media View 1

Text

Sound

Image

Video

Topic 1

Topic 2

Topic 3

Build MediaView

Page 44: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Build a MediaView in Run-Build a MediaView in Run-timetimeWho is “Van Gogh”?

◦ INHERIT_MV(“V. Gogh“, {<painter>},name=”Van Gogh” ,);

What is his work?◦ INTERSECTION_MV(“work”, {<painting>, vg});

Know more about his whole life.◦ INTERSECTION_MV(“life”, {<biography>, vg});

Know more about his country.◦ INTERSECTION_MV(“country”, {<country>, vg});

See his famous painting “sunflower”◦ Set sunflower = INTERSECTION_MV(“sunflower”,

{<sunflower>, <painting>});Set vg_sunflower = INTERSECTION_MV(“vg_sunflower”, {vg_work, sunflower});

Page 45: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Authoring Scenario Authoring Scenario Creates a new media view named after the

subject◦ All multimedia materials used in the document would

be put into this MediaView for further reference. To collect the most relevant materials for

authoring, the user performs the MediaView building process. ◦ Import suitable media objects by browsing media

views Reference the manner and style of authoring,

to find other media views with similar topics. ◦ Drag & Drop◦ “learning-from-references”

Page 46: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

SummarySummaryTypes of multimedia data: Text, Audio, Video,

Images.Management issues: Design, Storage,

Modeling, QueriesImage Content Analysis: Color, Texture,

ShapeMediaView – a semantic multimedia database

modeling mechanism ◦ to bridge the semantic gap between

conventional database and semantics-intensive multimedia applications

A set of user-level operators to accommodate the specialization/generalization relationships among the media views

Page 47: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Summary (contd..)Summary (contd..)MediaView promises more effective access

to the content of media databases◦Users could get the right stuff and tailor it

to the context of their application easily. Providing the most relevant content from

pre-learnt semantic links between media and contexthigh performance database browsing and

multimedia authoring tools can enable more comprehensive applications to the user.

Users could customize specific media view according to their tasks, by using user-level operators

Page 48: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Further IssuesFurther IssuesThe development and transition

of MediaView to a fully-fledged multimedia database system supporting “declarative” queries

Intensive and extensive performance studies

Advanced semantic relations (eg. temporal and spatial ones) can also be incorporated in combining individual media views

Page 49: Multimedia Databases Prepared by Pradeep Konduri Instructor: Dr. Yingshu Li Georgia State University.

Thank you!

Q & A

Email: [email protected]


Recommended