+ All Categories
Home > Documents > Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences...

Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences...

Date post: 28-Dec-2015
Category:
Upload: juniper-sharp
View: 214 times
Download: 1 times
Share this document with a friend
Popular Tags:
61
Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint work: Jia Li Department of Statistics The Pennsylvania State University
Transcript
Page 1: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Research on Intelligent Image Indexing and Retrieval

James Z. WangSchool of Information Sciences and Technology

The Pennsylvania State University

Joint work: Jia Li

Department of StatisticsThe Pennsylvania State University

Page 2: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Photographer annotation: Greece (Athens, Crete, May/June 2003), taken by James Z. Wang

Page 3: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

“Building, sky, lake, landscape, Europe, tree”

Can a computer do this?

Page 4: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Can a computer do this?

EMPERORdatabase byC.-c. Chen

Page 5: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Outline

Introduction Our related SIMPLIcity work ALIP: Automatic modeling and

learning of concepts Conclusions and future work

Page 6: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

The field: Image Retrieval The retrieval of relevant images from an

image database on the basis of automatically-derived image features

Applications: biomedicine, homeland security, law enforcement, NASA, defense, commercial, cultural, education, entertainment, Web, ……

Our approach: Wavelets Statistical modeling Supervised and unsupervised learning… Address the problem in a generic way for

different applications

Page 7: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Chicana Art Project, 1995

1000+ high quality paintings of Stanford Art Library Goal: help students and researchers to find visually

related paintings Used wavelet-based features [Wang+,1997]

Page 8: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Feature-based Approach

+ Handles low-level semantic queries

+ Many features can be extracted

-- Cannot handle higher-level queries (e.g.,objects)

Signature

feature 1feature 2……

feature n

Page 9: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Region-based Approach Extract objects from images first

+ Handles object-based queriese.g., find images with objects that are similar to some given objects

+ Reduce feature storage adaptively

-- Object segmentation is very difficult-- User interface: region marking, feature

combination

Page 10: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

UCB Blobworld[Carson+, 1999]

Page 11: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Outline

Introduction Our related SIMPLIcity work ALIP: Automatic modeling and

learning of concepts Conclusions and future work

Page 12: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Motivations

Observations: Human object segmentation relies on knowledge Precise computer image segmentation is a very difficult open

problem

Hypothesis: It is possible to build robust computer matching algorithms without first segmenting the images accurately

Page 13: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Our SIMPLIcity Work [PAMI, 2001(1)] [PAMI, 2001(9)][PAMI, 2002(9)]

Semantics-sensitive Integrated Matching for Picture LIbraries

Major features Sensitive to semantics: combine

statistical semantic classification with image retrieval

Efficient processing: wavelet-based feature extraction

Reduced sensitivity to inaccurate segmentation and simple user interface: Integrated Region Matching (IRM)

Page 14: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Wavelets

Page 15: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Fast Image Segmentation

Partition an image into 4×4 blocks Extract wavelet-based features from each block Use k-means algorithm to cluster feature vectors into

‘regions’ Compute the shape feature by normalized inertia

Page 16: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

K-means Statistical Clustering

Some segmentation algorithms: 8 minute CPU time per image

Our approach: use unsupervised statistical learning method to analyze the feature space

Goal: minimize the mean squared error between the training samples and their representative prototypes

Learning VQ [Hastie+, Elements of Statistical Learning, 2001]

Page 17: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

IRM: Integrated Region Matching

IRM defines an image-to-image distance as a weighted sum of region-to-region distances

Weighting matrix is determined based on significance constrains and a ‘MSHP’ greedy algorithm

Page 18: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

A 3-D Example for IRM

Page 19: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

IRM: Major Advantages

1. Reduces the influence of inaccurate segmentation

2. Helps to clarify the semantics of a particular region given its neighbors

3. Provides the user with a simple interface

Page 20: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Experiments and Results

Speed 800 MHz Pentium PC with LINUX OS Databases: 200,000 general-purpose image DB

(60,000 photographs + 140,000 hand-drawn arts)70,000 pathology image segments

Image indexing time: one second per image Image retrieval time:

Without the scalable IRM, 1.5 seconds/query CPU time With the scalable IRM, 0.15 second/query CPU time

External query: one extra second CPU time

Page 21: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

RANDOM SELECTION DB: 200,000 COREL

Page 22: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Current SIMPLIcity System

Query Results

Page 23: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

External Query

Page 24: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

EMPEROR Project

C.-C. ChenSimmons College

Page 25: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

(1) Random Browsing

Page 26: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

(2) Similarity Search

Page 27: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

(2) Similarity Search

Page 28: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

(3) External Image Query

Page 29: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Outline

Introduction Our related SIMPLIcity work ALIP: Automatic modeling and

learning of concepts Conclusions and future work

Page 30: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Why ALIP?

Size 1 million images

Understandability &Vision “meaning” depend

on the point-of-view Can we translate

contents and structure into linguistic terms

dogs

Kyoto

Page 31: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

(cont.)

Query formulation SIMILARITY: look similar to a given

picture OBJECT: contains a historical building OBJECT RELATIONSHIP: contains a cat

and a person MOOD: a happy picture TIME/PLACE: sunset of Athens

Page 32: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Automatic Linguistic Indexing of Pictures (ALIP)

An exciting research direction Differences from computer vision

ALIP: deal with a large number of concepts ALIP: rarely find enough number of “good”

(diversified/3D?) training images ALIP: build knowledge bases

automatically for real-time linguistic indexing (generic method)

ALIP: highly interdisciplinary (AI, statistics, mining, imaging, applied math, domain knowledge, ……)

Page 33: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Automatic Modeling and Learning of Concepts for Image Indexing

Observations: Human beings are able to build models about

objects or concepts by mining visual scenes The learned models are stored in the brain

and used in the recognition process Hypothesis: It is achievable for computers to

mine and learn a large collection of concepts by 2D or 3D image-based training

2000-now: [ACM MM, 2002][PAMI 2003(10)]

Page 34: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Concepts to be Trained Concepts: Basic building blocks in

determining the semantic meanings of images

Training concepts can be categorized as: Basic Object: flower, beach Object composition:

building+grass+sky+tree Location: Asia, Venice Time: night sky, winter frost Abstract: sports, sadness High-level

Low-level

Page 35: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Database: most significant Asian paintings (Copyright: museums/governments/…)

Question: can we build a “dictionary” of different painting styles?

Page 36: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Database: terracotta soldiers of the First Emperor of ChinaQuestion: can we train a computer to annotate these?

EMPEROR images of C.-c. Chen

Page 37: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

System Design Train statistical models of a dictionary of

concepts using sets of training images 2D images are currently used 3D-image training can be much better

Compare images based on model comparison

Select the most statistical significant concept(s) to index images linguistically

Initial experiment: 600 concepts, each trained with 40 images 15 minutes Pentium CPU time per concept,

train only once highly parallelizable algorithm

Page 38: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Training Process

Page 39: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Automatic Annotation Process

Page 40: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Training

Training images used to train the concept “male” with description “man, male, people, cloth, face”

Page 41: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Initial Model: 2-D Wavelet MHMM [Li+, 1999]

Model: Inter-scale and intra-scale dependence States: hierarchical Markov mesh, unobservable Features in SIMPLIcity: multivariate Gaussian distributed

given states A model is a knowledge base for a concept

Page 42: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

2D MHMM

Start from the conventional 1-D HMM Extend to 2D transitions Conditional Gaussian distributed feature vectors Then add Markovian statistical dependence across resolutions Use EM algorithm to estimate parameters

Page 43: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Annotation Process

Statistical significances are computed to annotate images

Favor the selection of rare words

When n, m >> k, we have

Page 44: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Preliminary Results

Computer Prediction: people, Europe, man-made, water

Building, sky, lake, landscape,

Europe, tree People, Europe, female

Food, indoor, cuisine, dessert

Snow, animal, wildlife, sky,

cloth, ice, people

Page 45: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

More Results

Page 46: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Results: using our own photographs

P: Photographer annotation Underlined words: words predicted by

computer (Parenthesis): words not in the learned

“dictionary” of the computer

Page 47: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

10 classes:

Africa,beach,buildings,buses,dinosaurs,elephants,flowers,horses,mountains,food.

Systematic Evaluation

Page 48: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

600-class Classification Task: classify a given image to one of the 600

semantic classes Gold standard: the photographer/publisher

classification This procedure provides lower-bounds of the

accuracy measures because: There can be overlaps of semantics among classes (e.g.,

“Europe” vs. “France” vs. “Paris”, or, “tigers I” vs. “tigers II”) Training images in the same class may not be visually

similar (e.g., the class of “sport events” include different sports and different shooting angles)

Result: with 11,200 test images, 15% of the time ALIP selected the exact class as the best choice I.e., ALIP is about 90 times more intelligent than a

system with random-drawing system

Page 49: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Can ALIP learn to be a domain specialist?

For each concept, train the ALIP system with a couple to several EMPEROR images and their metadata

Use the trained models about these concepts to annotate images

Page 50: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

ExperimentsEight (8) images areused to train theconcept “Terracotta Warriors and Horses”

Four (4) images areused to train theconcept “Roof Tile-end”

Page 51: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Classification Tests

Terracotta warriors and horses: 98%

The Great Wall: 98% Roof Tile-end: 82% Terracotta warriors – head: 96% Afang Palace: 82%

Page 52: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

The only wrongly annotated images and their true annotations

Page 53: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Preliminary Results on Painting Images

Page 54: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Classification of Painters

Five painters: SHEN Zhou (Ming Dynasty), DONG Qichang (Ming), GAO Fenghan (Qing), WU Changshuo (late Qing), ZHANG Daqian (modern China)

Page 55: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Outline

Introduction Our related SIMPLIcity work ALIP: Automatic modeling and

learning of concepts Conclusions and future work

Page 56: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Conclusions Automatic Linguistic Indexing of Pictures

Highly challenging but crucially important Interdisciplinary collaboration is critical Approaches: Penn State, UC Berkeley,…

Our ALIP System: Automatic modeling and learning of semantic concepts 600 concepts can be learned automatically

Application of ALIP to historical materials

Page 57: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Advantages of Our Approach

Accumulative learning Highly scalable (unlike CART, SVM,

ANN) Flexible: Amount of training depends

on the complexity of the concept Context-dependent: Spatial relations

among pixels taken into consideration Universal image similarity: statistical

likelihood rather than relying on segmentation

Page 58: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Limitations

Training with 2D images which have no information about the actual object sizes

Training with not enough number of images for complex concepts

Low diversity of available training images Iterative training with negative examples? Training with conflicting opinions?

(learning the same subject from two teachers with different viewpoints)

……

Page 59: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Future Work Explore new methods for better accuracy

refine statistical modeling of images learning from 3D (may use 2D-to-3D

technologies if cannot acquire directly in 3D) refine matching schemes

Apply these methods to special image databases

(e.g., art, biomedicine) very large databases (e.g., Web images)

Integration with large-scale information systems

……

Page 60: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

Acknowledgments NSF ITR (since 08/2002) Endowed professorship from the PNC

Foundation Equipment grants from SUN and NSF Penn State Univ. Earlier funding (1995-2000): IBM

QBIC, NEC AMORA, SRI AI, Stanford Lib/Math/Biomedical Informatics/CS, Lockheed Martin, NSF DL2

Page 61: Research on Intelligent Image Indexing and Retrieval James Z. Wang School of Information Sciences and Technology The Pennsylvania State University Joint.

More Information

http://wang.ist.psu.edu

Papers in PDF, demos, etc

Monographs:1. J. Z. Wang, Integrated Region-

based Image Retrieval, Kluwer, 2001.

2. J. Li, R. M. Gray, Image Segmentation and Compression using Hidden Markov Models, Kluwer, 2000.


Recommended