+ All Categories
Home > Education > ESSIR LivingKnowledge Image Analysis

ESSIR LivingKnowledge Image Analysis

Date post: 02-Dec-2014
Category:
Upload: jonathon-hare
View: 755 times
Download: 1 times
Share this document with a friend
Description:
Slides from ESSIR2011.
Popular Tags:
37
A BRIEF INTRODUCTION TO EXTRACTING INFORMATION FROM IMAGES Jonathon Hare University of Southampton
Transcript
Page 1: ESSIR LivingKnowledge Image Analysis

A BRIEF INTRODUCTION TO EXTRACTING INFORMATION

FROM IMAGES

Jonathon HareUniversity of Southampton

Page 2: ESSIR LivingKnowledge Image Analysis

• What can images tell us?• How are images represented in digital

computers• How do we extract information from

images– Examples of some different extraction

techniques– Analogies with text– Free software!

CONTENTS

Page 3: ESSIR LivingKnowledge Image Analysis

IMAGES CAN…

the main roles of images in the communications process

Page 4: ESSIR LivingKnowledge Image Analysis

ATTRACT ATTENTION AND MAKE DOCUMENTS MORE

APPEALING

Page 5: ESSIR LivingKnowledge Image Analysis
Page 6: ESSIR LivingKnowledge Image Analysis

CONVEY OPINIONS AND EMOTIONAL MESSAGES

Page 7: ESSIR LivingKnowledge Image Analysis
Page 8: ESSIR LivingKnowledge Image Analysis

CONVEY INFORMATION FOR DOCUMENTING A CLAIM

Page 9: ESSIR LivingKnowledge Image Analysis
Page 10: ESSIR LivingKnowledge Image Analysis

REPRESENTATION AND UNDERSTANDINGhow a computer “sees”

Page 11: ESSIR LivingKnowledge Image Analysis

DIGITAL IMAGE REPRESENTATION

87 91 85 ... 86 86 81 ... 88 85 84 ...... ... ... ...137 145 144 ...153 150 137 ...148 139 123 ...... ... ... ...

89 91 89 ... 84 88 90 ... 88 87 90 ...... ... ... ...

Page 12: ESSIR LivingKnowledge Image Analysis

UNDERSTANDING AN IMAGE

Page 13: ESSIR LivingKnowledge Image Analysis

FEATURE EXTRACTION

f(x)

Feature extraction is the process of extracting “descriptors” from an image. Descriptors describe some aspect of the image content.Typically, a descriptor is a numerical vector called a “feature vector”, however other forms of descriptor are possible.

Page 14: ESSIR LivingKnowledge Image Analysis

• Higher-level features– Directly interpretable by humans

• i.e. the number of faces in the image

– Either hand-crafted or trained with machine learning techniques

• Lower-level features– Much more abstract; convey a notion of the

image content• i.e. the colour distribution of the image

IMAGE FEATURE MORPHOLOGY

Page 15: ESSIR LivingKnowledge Image Analysis

EXAMPLE HIGH-LEVEL FEATURES

faces, composition & photoshop disasters

Page 16: ESSIR LivingKnowledge Image Analysis

• The detection of faces in an image is a very useful feature for inferring information about an image– Face detection is the first step of face

recognition• The most popular face detection

algorithm is the “Viola-Jones” detector– Conceptually simple– Uses machine learning; Requires training

(slow).– Very fast detection

HIGH-LEVEL FEATURES: FACE DETECTION

Page 17: ESSIR LivingKnowledge Image Analysis

VIOLA-JONES FACE DETECTION

P. Viola, M. Jones, Robust Real-Time Face Detection, IJCV, Vol. 57(2), 2004. (first version appeared at CVPR 2001)

Bank of filters. Consider all possible position, scale and type parameters(very large numbers of features)

For each feature create a simple (weak) binary classifier (a stump)

Use ADABOOST to select the informative features

Page 18: ESSIR LivingKnowledge Image Analysis

VIOLA-JONES FACE DETECTION

P. Viola, M. Jones, Robust Real-Time Face Detection, IJCV, Vol. 57(2), 2004. (first version appeared at CVPR 2001)

Page 19: ESSIR LivingKnowledge Image Analysis

• Photographers use the “rule-of-thirds” to improve the composition of their photos.– The basic idea is to place main subjects at

roughly one-third of the horizontal or vertical dimension of the photograph.

HIGH-LEVEL FEATURES: COMPOSITION

Page 20: ESSIR LivingKnowledge Image Analysis

It is possible to design features that look for the presence of composition using the rule-of thirds

HIGH-LEVEL FEATURES: COMPOSITION

image saliency map segments + saliency map

distance to closest power-point

area of segment * saliency of segment

Che-Hua Yeh, Yuan-Chen Ho, Brian A. Barsky, and Ming Ouhyoung. "Personalized Photograph Ranking and Selection System". In ACM Multimedia 2010, pages 211–220, October 2010.

Page 21: ESSIR LivingKnowledge Image Analysis

HIGH-LEVEL FEATURES: TAMPERING

Page 22: ESSIR LivingKnowledge Image Analysis

HIGH-LEVEL FEATURES: TAMPERING

Page 23: ESSIR LivingKnowledge Image Analysis

HIGH-LEVEL FEATURES: TAMPERING

A Political Advertisement for George W. BushAutomatic cloning detection (“copy-move” forgery)

Page 24: ESSIR LivingKnowledge Image Analysis

EXAMPLE LOW-LEVEL FEATURES

colour histograms, segments and sift

Page 25: ESSIR LivingKnowledge Image Analysis

• Global features describe the content of an entire image– One of the simplest global

features is the “Global RGB Colour Histogram”

• Quantise each pixel into a discrete number of colours and then build a histogram.

LOW-LEVEL FEATURES: GLOBAL

Page 26: ESSIR LivingKnowledge Image Analysis

• Global features are useful for some tasks, but in many cases are not powerful enough

• Local features attempt to overcome this by breaking the image into smaller parts from which to extract features– Three primary techniques for splitting up the image

LOW-LEVEL FEATURES: LOCAL

segmentation salient regions &interest points

grids & blocks

Page 27: ESSIR LivingKnowledge Image Analysis

• Salient interest regions and their associated features are currently the most popular way of describing an image content.

• Extracting image features using interest regions is a two-part process:– Find regions– Extract feature to describe region properties

• Typically, the resultant image feature will have a variable length, dependent on the number of regions

SALIENT INTEREST REGIONS

Page 28: ESSIR LivingKnowledge Image Analysis

• Important regions portray:– Repeatability– Saliency

• Corners and blobs have these qualities

• Detectable using various techniques– Difference of Gaussian - corners– Harris corner detector - corners– MSER - blobs

SALIENT INTEREST REGION LOCATION

corners

blobs

Page 29: ESSIR LivingKnowledge Image Analysis

• Good region descriptors portray:– Resilience to image transforms– Compactness

• Emphasise different image characteristics:– Pixel intensities, colour, texture, edges etc.

• Common descriptors include:– SIFT: histogram of edge orientation– Shape context: histogram of edge location

SALIENT INTEREST REGION DESCRIPTORS

Page 30: ESSIR LivingKnowledge Image Analysis

SIFT: SCALE INVARIANT FEATURE TRANSFORM

Page 31: ESSIR LivingKnowledge Image Analysis

ANALOGIES WITH TEXT

introducing the visual bag-of-words

Page 32: ESSIR LivingKnowledge Image Analysis

In the computer vision community over recent years it has become popular to model the content of an image in a similar way to a “bag-of-terms” in textual document analysis.

BAGS OF VISUAL WORDS

Page 33: ESSIR LivingKnowledge Image Analysis

• Features localised by a robust region detector and described by a local descriptor such as SIFT.

• A vocabulary of exemplar feature-vectors is learnt.– Traditionally through k-

means clustering.

• Local descriptors can then be quantised to discrete visual terms by finding the closest exemplar in the vocabulary.

BOVW USING LOCAL FEATURES

Page 34: ESSIR LivingKnowledge Image Analysis

• BOVW models have many applications– Auto-annotation and object recognition

– Concept classification

– Large-scale indexing

APPLICATIONS OF BOVW

Page 35: ESSIR LivingKnowledge Image Analysis

OPEN-SOURCE TOOLS FOR IMAGE ANALYSIS AND

INDEXINGintroducing openimaj & imageterrier

Page 36: ESSIR LivingKnowledge Image Analysis

• Open-source (BSD Licence) libraries and tools for multimedia (image, video, sound) analysis and information extraction

• Implemented in Java; use with any JVM language– Implementations of all the techniques mentioned in this

tutorial– Scalability of extraction using Hadoop with the included tools

http://www.openimaj.org

Page 37: ESSIR LivingKnowledge Image Analysis

• Extension to the Terrier retrieval system to allow indexing of images– Collections and documents that read data produced from

image feature extractors.– New indexers and supporting classes to make compressed

augmented inverted indices for visual term data.– New distance measures implemented as WeightingModels.– Geometric re-ranking implemented as

DocumentScoreModifiers.– Command-line tools for indexing and searching.

• Freely available under the Mozilla Licence

http://www.imageterrier.org


Recommended