Zhenbao Liu 1, Shaoguang Cheng 1, Shuhui Bu 1, Ke Li 2 1 Northwest Polytechnical University,...

transcript

Zhenbao Liu1, Shaoguang Cheng1, Shuhui Bu1, Ke Li21 Northwest Polytechnical University, Xi’an, China.2 Information Engineering University, Zhengzhou, China.

ICMEICME 2014 – 2014 – ChengduChengdu, , ChinaChina(1(14-184-18 JulyJuly, 2014), 2014)

High-Level Semantic Feature for 3High-Level Semantic Feature for 3D Shape Based on Deep Belief NetD Shape Based on Deep Belief Net

workwork

Outline

Backgrounds

WhyIdea

WhatMethod

howExperiments

Conclusion

Backgrounds

Feature Representation

LearningAlgorithm

The Key step

BackgroundsQ: how do we extract features in practice?

A: specified manually . Such as SIFT, HoG ...

Backgrounds

NLP Speech Recgnitio

Computer Vision

Backgrounds Why deep learning is difficult for 3D shape (graph data)?

Idea – 3D feature learning framework

DeepLearning

High-level feature

3D shape

Idea – 3D feature learning framework

Off-line

On-line

low-level

feature middle-level

featurehigh-level

feature

Method – Low Level Feature

view images generation

Attention:•Rotation angle must be set carefully to ensure that all cameras are distributed uniformly on a sphere.•A 3D object is represented by 10× 20 images from different views.

SIFT feature extraction

... ...

Robust to noise and illumination and stableto various changes of 3D viewpoints.

20 to 40 SIFT features per image. About 5000 to 7000 SIFT features for a 3D shape.

Method – Low Level Feature

Bag-of-Visual-Feature

Method – Middle Level Feature

SIFT feature from all shapes

K-means

SIFT feature from single

shapeNN

Encode

Visual Words

Method –Deep Belief Network

restricted Bolztman Manchine

joint distribution

Energy function

Math model ：

Method –Deep Belief Network

Stacking a number of the RBMs and learning layer by layer from bottom to top gives rise to a DBN.

The bottom layer RBM is trained with the input data of BoVF.

High-level feature

Classification

Experiments - parameters setting

Experiments - classification

Classification results on SHREC 2007 (left) and McGill (right)

SHREC 2007 McGill

BOVF 83% 78%

Proposed method 93% 89%

Experiments - retrieval

experiment on SHREC 2007

Experiments - retrieval

experiment on McGill

Conclusion

The experiment results demonstrate that the learned high-level features are more discriminative and can achieve better performance both on classification and retrieval tasks.

The number of view images is large. Currently only investigate SIFT as the low-level descriptors.

Thank you for your attention!

Zhenbao Liu 1, Shaoguang Cheng 1, Shuhui Bu 1, Ke Li 2 1 Northwest Polytechnical University,...

Documents