+ All Categories
Home > Documents > Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to...

Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to...

Date post: 12-Jan-2016
Category:
Upload: roberta-fox
View: 212 times
Download: 0 times
Share this document with a friend
Popular Tags:
55
Utility = f(Vision) - A Review
Transcript
Page 1: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Utility = f(Vision)- A Review

Page 2: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Perception

“To perceive is also about how to approach and what to do with an object …”

“Perception/cognition is determined by aspects and form of the agent (Embodiment) …”

Page 3: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Affordances

“An affordance is an intrinsic property of an object, allowing an action to be performed with the object. It also depends on the embodiment of the agent performing the action …”

“Objects which are cars for residents of Lilliput, are merely toys for Gulliver… ”

Page 4: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

A Condition for Survival

“One of the most basic function of all organisms is the cutting up of environment into classifications by which non-identical stimuli can be treated as equivalent …”

Page 5: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Clustering Visual Input• Click to edit Master text styles

• Second level• Third level

• Fourth level• Fifth level

Tremendous variation in shape !(Hard for state of art algorithms based on appearance to recognize them)

BUT

All are sittable surfaces!(for humans)

Or, dimensionality = 1 in affordance space.

Page 6: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

So, the question to ask is:

What are the affordances an object can support given its visual features such as shape, texture and color ?

Page 7: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Why to answer this question ?

• Obtaining semantic clustering of objects Generalization !

• Building vision perception for robotic platforms.

• Generating scene descriptions in a utilitarian framework Visual aid devices for blind !

• For the sake of science !

Page 8: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Points to Note

“The proposition is to use appearance cues as a supplement to affordance learning and not to totally ignore them…”

Shape AffordancesIs not one to one

Page 9: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Continued… Implicit and Explicit Knowledge

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level“Shapes can only represent explicit knowledge ..”

“Knowledge about hooks/fixture is implicit in (b)…”

Page 10: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

“If I have seen further it is by standing on ye sholders of Giants”-Isaac Newton

A Survey of efforts in the past

Page 11: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Affordance Learning

From Activity

From Shape From Simulation

Hand Activity

Local Features

Body Activity

Global Features

Interactive Robot

Page 12: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Freeman & Newell [1971]

• Structure is a unit that provides a set of functions.

• Laid down a formalism for• When• How

Can structures be combined to provide required functions.

Page 13: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

The first efforts ! (Winston, Binford et al

[1983])

• Functional description of an object cup• ako: A kind of• hq: Has quality

Page 14: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Input to System:

Page 15: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

First Vision System using functional information (Connel & Brady [1987])

• Describe functional concepts geometrically.• Generalize !

Page 16: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 17: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Understanding Functional Reasoning ([Di Manzo, Ricci et al 1989])

• Knowledge representation Semantic Networks

• Objects 3D octree models (Synthetic).• Try to account for real-world noise

• Functional Elements: Support, Grasp, Hang, Cut, Equilibrium, Enter, Contain, Pierce, Stop

Page 18: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Understanding Functional Reasoning ([Di Manzo, Ricci et al 1989])

Page 19: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

More Attempts ([Stark et al. 1991-

1994])

• Concept of Knowledge Primitives• Dimensions (length or area of surface)• Relative orientation: between surfaces• Proximity: between surfaces/faces• Clearance : Lack of obstacles in a defined area• Stability: being in rest in certain orientation.

• Pre-define Categories and Sub-Categories

• CAD and Range-sensor data.

Page 20: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

([Stark et al. 1991-1994])

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 21: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

([Stark et al. 1991-1994])

Page 22: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

([Stark et al. 1991-1994])

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

• Categories Considered• Chairs, Tables, Bench, Bookshelf, Bed, Not Known

Page 23: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

A Part based approach ([Rivlin et al

1995])

• Extract 4 parts Reason about their relative configuration• Sticks, Blobs, Plate, Strip

Page 24: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Criticism• Highlight the importance of Knowledge representation

• Hard-Coded definitions

• Almost no testing on real world data

• Instead of trying to recognize surfaces for sitting, sleeping, keeping objects ended up recognizing chairs, beds and tables !• Pseudo-functional Space

Page 25: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Using Affordance Cues for Object Detection

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 26: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Continued..

Page 27: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

• 2 Oranges, 1 Apple, Can, Tissue Packs, Beaker, Bottle• Coarse features generalize the most.• Active Stereo, Multiple Viewpoints

Use of Coarse Features ([Dillman et al.

ICRA 2011])

Page 28: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Learning by ActionsAffordance Learning

Page 29: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Human Actions and Object Context (Moore et al. [ICCV 1999])

• Jointly Model actions and Image features

• Pre-defined object model

• Shape: Pixel area, size of bounding box, L2-distance from known classes

• Action: HMM based hand pose estimation

Page 30: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Results• Click to edit Master text styles

• Second level• Third level

• Fourth level• Fifth level

Page 31: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Interaction Signatures ([Venkatesh et al

ICCV-05])

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 32: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Interaction Signatures ([Venkatesh et al

ICCV-05])

Consider only printer, chair, keyboard and paper !

Page 33: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Observing Humans ([Veloso et al. ICRA

2005])

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 34: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Objects in Action ([Gupta et al CVPR

2007])

• HOG Initial guess on probability of object in a window.• Reach (Mr)• Manipulation (Mm)• Reaction (Or)

Page 35: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Objects in Action ([Gupta et al CVPR

2007])

Page 36: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Interactive Learning (Leonardis et al

2009])

• Object Shape Ellipses (Curvature, area, etc)

• Action features Color and Edge histograms

• SVM object features to clustered action features.

Page 37: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Object-Action Recognition ([Kragic

2011])

• Consider• Book, Magazine, Hammer, Pitcher, Box,Cup• Hammering, opening, pouring

• Video Data

• Object Recognition HOG

• Hand pose (velocity, angle b/w joints, orientation) SVM

• Learn a joint model using Factorial CRF.

Page 38: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Learning by Simulation Affordance Learning

Page 39: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Learning Spatial Relations Using Functional Simulation (Sjoo et al [IROS 2011])

• Learn relation between 2 objects• Support• Protection• Constraint• Move Together

• Features• Pose, closest seperation,

area, distance, contact patch area etc.

• Predict Relation given feature.

Page 40: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

What makes a chair chair ?• Discussed !

Page 41: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Indoor Scenes• Highly Structured !

• Surface Orientations: Mainly Vertical and Horizontal

• Components• Boundaries

• Walls, Floors, Doors• Furniture

• Tables, Chairs, Beds, Shelves, Cabinets• Actions

• Cups, Bottles, Glasses, Books, Pens, Kitchen Appliances etc.

• Current Proposition Discover the first 2 categories of scene components

Page 42: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Scene Interpretation

Page 43: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Most Relevant Work (Rusu et al [2010])

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 44: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Framework

• Kitchen Environment• Co-Register 16 scans Laser and TOF Cameras.

• Bottom and Topmost regions Floor and Roof

• Determine X and Y axes

• Use heuristics on remaining vertical surfaces to get walls.

• Label other vertical surfaces as furniture.

Page 45: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Segmentation

• Click to edit Master text styles• Second level

• Third level• Fourth level

• Fifth level

Page 46: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Furniture Labelling

Page 47: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Moving Ahead (Replacing Heuristics)• Horizontal L-1 features

• Z-Coordinate, Length and Width

• Vertical L-1 features • Height, Floor Distance, Roof Distance, Width

• L2 features • Height, Width• Num Handles, Knobs

• Learn using CRF.

Page 48: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Some Results• Click to edit Master text styles

• Second level• Third level

• Fourth level• Fifth level

• Legend• Horizontal Planes: Floors, Tables, Ceilings• Vertical Planes: Walls, Furniture Candidates• Furniture: Cupboards, Drawers, Kitchen

Appliances

Page 49: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Leftover Objects• Like cups, bottles etc.• Application: Grasping, Manipulation

Page 50: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Geometrical Primitives• Planes, Spheres, Cylinders, Cones, Tori, Edges and Corners

• Use local point features for primitive labeling using CRF.

• Further using point labels, an SVM modeling capturing shape is used for identifying class of object. (4 object classes).

Page 51: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Proposition

Page 52: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Pipeline

Point Cloud

Identify Horizontal and Vertical Surfaces

Surface Normal

Identify floor, roof, Z axis Compute

Features

Walls, X and Y Axes

Segmentation Normal

Edges

Observe Clusters

Clustering

Page 53: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Features• Defined for each horizontal/vertical surface

• Orientation of Surface• Area of Surface• Volume of Object• Distance from floor• Distance from walls

• At a 2nd level• Relation with other surfaces in the object

• Metrics• Human Height• Width• Arm Span• Area

Page 54: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Hopeful Objects• Identification

• Walls, Floors, Navigable Spaces

• Emerge by un-supervised clustering (pure geometrical features)• Tables/Desks • Chairs• Beds• Shelves• Almirahs• Doors• Cabinets• Windows• Dustbins

Page 55: Utility = f(Vision) - A Review. Perception “To perceive is also about how to approach and what to do with an object …” “Perception/cognition is determined.

Further Extensions• Poselet driven affordance learning:

• Human moving around in an environment.• Vision system Tracks humans, associates poses and objects.• Supplement object detection by using poses.

• Eg: Recognizing bean bags for sitting.• Predict the affordance pose given the object.

• fMRI Study:• Learn a model by showing common tools.• Use say a screwdriver for hammering would be interesting to

see if it is predicted as hammer or screw-driver.


Recommended