+ All Categories
Home > Documents > lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by...

lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by...

Date post: 04-Oct-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
68
CS4670/5670: Intro to Computer Vision Instructor: Kavita Bala
Transcript
Page 1: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

CS4670/5670:  Intro  to  Computer  Vision  Instructor:  Kavita  Bala  

Page 2: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Instructor  

•  Kavita  Bala  ([email protected])    

•  Office  hours:    

     F:  10-­‐11,  or  by  appointment  

 

•  Research  interests:  – Computer  graphics  and  vision    

Page 3: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Today  

1. What  is  computer  vision?  

2.  Course  overview  

 

Page 4: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Today  

•  Readings  – Szeliski,  Chapter  1  (IntroducTon)  

Page 5: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Every  image  tells  a  story  

•  Goal  of  computer  vision:  perceive  the  “story”  behind  the  picture  

•  Compute  properTes  of  the  world  – 3D  shape  –   Names  of  people  or  objects  

– What  happened?  

Page 6: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  

Page 7: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Can  the  computer  match  human  percepTon?  

•  Yes  and  no  (mainly  no)  –  computers  can  be  be]er  at  “easy”  things  

–  humans  are  much  be]er  at  “hard”  things  

•  But  huge  progress  has  been  made  –  Especially  in  the  last  10  years  

– What  is  considered  “hard”  keeps  changing  

Page 8: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Human  percepTon  has  its  shortcomings  

Sinha and Poggio, Nature, 1996

Page 9: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

But  humans  can  tell  a  lot  about  a  scene  from  a  li]le  informaTon…  

Source: “80 million tiny images” by Torralba, et al.

Page 10: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching
Page 11: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  

Page 12: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  •  CompuTng  the  3D  shape  of  the  world  

Page 13: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  •  Recognizing  objects  and  people  

Page 14: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

slide credit: Fei-Fei, Fergus & Torralba

Page 15: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

sky

building

flag

wall banner

bus

cars

bus

face

street lamp

slide credit: Fei-Fei, Fergus & Torralba

Page 16: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  •  “Enhancing”  images  (c.f.  ComputaTonal  Photography)  

Texture  synthesis  /  increased  field  of  view  (uncropping)  (image  credit:  Efros  and  Leung)  

InpainTng  /  image  compleTon  (image  credit:  Hays  and  Efros)  

Super-­‐resoluTon  /  denoising  (source:  2d3)  

Page 17: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  goal  of  computer  vision  

•  Forensics  

Source: Nayar and Nishino, “Eyes for Relighting”

Page 18: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Source: Nayar and Nishino, “Eyes for Relighting”

Page 19: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Source: Nayar and Nishino, “Eyes for Relighting”

Page 20: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Why  study  computer  vision?  

•  Millions  of  images  being  captured  all  the  Tme  

 

•  Loads  of  useful  applicaTons  •  The  next  slides  show  the  current  state  of  the  art  

Page 21: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

OpTcal  character  recogniTon  (OCR)  

Digit  recogniTon,  AT&T  labs  h]p://www.research.a].com/~yann/  

•  If  you  have  a  scanner,  it  probably  came  with  OCR  soiware  

 

License  plate  readers  h]p://en.wikipedia.org/wiki/AutomaTc_number_plate_recogniTon  

 

Source: S. Seitz AutomaTc  check  processing  

Sudoku  grabber  h]p://sudokugrab.blogspot.com/  

Page 22: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Face  detecTon  

•  Many  new  digital  cameras  now  detect  faces  – Canon,  Sony,  Fuji,  …    

Source: S. Seitz

Page 23: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Face  RecogniTon  

http://developers.face.com/tools/

Page 24: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Face  recogniTon  

Who is she? Source: S. Seitz

Page 25: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Vision-­‐based  biometrics  

“How the Afghan Girl was Identified by Her Iris Patterns” Read the story

Source: S. Seitz

Page 26: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Login  without  a  password…  

Fingerprint scanners on many new laptops,

other devices

Face recognition systems now beginning to appear more widely

http://www.sensiblevision.com/

Source: S. Seitz

Page 27: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Object  recogniTon  (in  supermarkets)  

LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “

Source: S. Seitz

Page 28: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Google  Goggles  

Page 29: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching
Page 30: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Google  Search  by  Image  

Page 31: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Leaf  RecogniTon  

Page 32: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

The  Matrix  movies,  ESC  Entertainment,  XYZRGB,  NRC  

Special  effects:    shape  capture  

Source: S. Seitz

Page 33: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Pirates of the Carribean, Industrial Light and Magic

Special  effects:    moTon  capture  

Source: S. Seitz

Page 34: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Sports  

Sportvision  first  down  line  Nice  explanaTon  on  www.howstuffworks.com  

Source: S. Seitz

Page 35: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Vision-­‐based  interac/on  (and  games)  

Kinect  

Page 36: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Smart  cars  

•  Mobileye  – Vision  systems  currently  in  high-­‐end  cars  

Sources: A. Shashua, S. Seitz

Page 37: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Smart  cars  

Page 38: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Vision  in  space  

Vision systems (JPL) uses for several tasks •  Panorama stitching •  3D terrain modeling •  Obstacle detection, position tracking •  For more, read “Computer Vision on Mars” by Matthies et al.

The Heights of Mount Sharp http://www.nasa.gov/mission_pages/msl/multimedia/pia16077.html Panorama captured by Curiosity Rover, August 18, 2012 (Sol 12)

Page 39: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

RoboTcs  

NASA’s  Mars  Curiosity  Rover  (Mars  Science  Laboratory)  h]p://en.wikipedia.org/wiki/Spirit_rover  

Autonomous  RC  Car  h]p://www.cs.cornell.edu/~asaxena/rccar/  

Page 40: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Medical  imaging  

Image  guided  surgery  Grimson  et  al.,  MIT  

3D  imaging  MRI,  CT  

Source: S. Seitz

Page 41: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Large-­‐scale  3D  reconstrucTon  •  AutomaTc  3D  reconstrucTon  from  Internet  photo  collecTons  

“Statue  of  Liberty”  

3D  model  

Flickr  photos  

“Half  Dome,  Yosemite”   “Colosseum,  Rome”  

Page 42: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Photosynth  

Page 43: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Current  state  of  the  art  •  You  just  saw  examples  of  current  systems.  

– Many  of  these  are  less  than  5  years  old  

•  This  is  a  very  acTve  research  area,  and  rapidly  changing  – Many  new  apps  in  the  next  5  years  

•  To  learn  more  about  vision  applicaTons  and  companies  –  David  Lowe  maintains  an  excellent  overview  of  vision  companies  

•  h]p://www.cs.ubc.ca/spider/lowe/vision.html  

Page 44: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Why  is  computer  vision  difficult?  

Viewpoint  variaTon  

IlluminaTon  Scale  

Page 45: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Why  is  computer  vision  difficult?  

Intra-­‐class  variaTon  

Background  clu]er  

MoTon  (Source:  S.  Lazebnik)  

Occlusion  

Page 46: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Challenges:    local  ambiguity  

slide credit: Fei-Fei, Fergus & Torralba

Page 47: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

But  there  are  lots  of  cues  we  can  exploit…  

Source: S. Lazebnik

Page 48: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Bo]om  line  •  PercepTon  is  an  inherently  ambiguous  problem  

– Many  different  3D  scenes  could  have  given  rise  to  a  parTcular  2D  picture  

 – We  oien  need  to  use  prior  knowledge  about  the  structure  of  the  world  

Image source: F. Durand

Page 49: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching
Page 50: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Instructor  

•  Kavita  Bala  ([email protected])    

•  Office  hours:    

     F:  10-­‐11,  or  by  appointment  

 

•  Research  interests:  – Computer  graphics  and  vision    – Rendering,  PercepTon,  Material  modeling  and  recogniTon  

Page 51: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Autodesk  360  Cloud  Render  

Page 52: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching
Page 53: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Current  Interests  

•  Material  PercepTon  •  RecogniTon  

– Visual  Search  •  Scene  ReconstrucTon  

Page 54: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Important  personnel  •  TAs:    

– Senior  TAs  •  Sco]  Wehrwein,  Sean  Bell  

– PhD  TAs  •  Balazs  Kovacs,  Andreas  Veit  

– Meng  TA  •  Alec  Regulinski  

– Undergrad  TAs    •  Akhila  Ananthram,  Daniel  Carpenter,  Sheroze  Sheriffdeen,  Dhruv  Singhal,  Raghav  Subramaniam  

•  Office  hours  TBA  

Page 55: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Other  administraTve  details  

•  Textbook:  Rick  Szeliski,  Computer  Vision:  Algorithms  and  Applica;ons  

online  at:  h]p://szeliski.org/Book/  

•  Course  webpage  (lectures,  assignments):      h]p://www.cs.cornell.edu/courses/cs4670/2015sp/  

•  Announcements/grades  via  Piazza/CMS  Sign  up  on  piazza  h]ps://cms.csuglab.cornell.edu/  

Page 56: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Course  requirements  

•  Prerequisites—these  are  essen;al!  – Data  structures  – A  good  working  knowledge  of  C/C++  programming  – Linear  algebra  – Vector  calculus  

•  Course  does  not  assume  prior  imaging  experience  – computer  vision,  image  processing,  graphics,  etc.  

 

Page 57: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Course  overview  (tentaTve)  1.  Low-­‐level  vision  

–  image  processing,  edge  detecTon,  feature  detecTon,  cameras,  image  formaTon  

2.  Geometry  and  algorithms  –  projecTve  geometry,  stereo,  

structure  from  moTon,  Markov  random  fields  

3.  RecogniTon  –  detecTon  /  recogniTon,  category  

recogniTon,  segmentaTon  

4.  Light,  color,  and  reflectance  

Page 58: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

1.  Low-­‐level  vision  

•  Basic  image  processing  and  image  formaTon  

Filtering,  edge  detecTon  

* =

Feature  extracTon   Image  formaTon  

Page 59: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Project:  Image  Scissors  

Page 60: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Project:  Feature  detecTon  and  matching    

Page 61: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

2.  Geometry  

ProjecTve  geometry  

Stereo  

MulT-­‐view  stereo   Structure  from  moTon  

Page 62: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Project:  CreaTng  panoramas  

Page 63: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

3.  RecogniTon  

Sources: D. Lowe, L. Fei-Fei

Face  detecTon  and  recogniTon  Single  instance  recogniTon  

Category  recogniTon  

Single  instance  recogniTon  

Page 64: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

4.  Light,  color,  and  reflectance  

Light  &  Color   Reflectance  

Page 65: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Grading  

•  Occasional  quizzes  (at  the  beginning  of  class)  •  One  prelim,  one  final  exam  

•  Rough  grade  breakdown:  – Quizzes:  2-­‐5%  – Midterm:  15-­‐20%  

– Homeworks:  10-­‐20%  – Programming  projects:  40-­‐50%  – Final  exam:  15-­‐20%  

Page 66: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Late  policy  

•  Three  “late  days”  will  be  available  for  the  semester  

•  Late  projects  will  be  penalized  by  25%  for  each  day  it  is  late,  and  no  extra  credit  will  be  awarded.  

Page 67: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

Academic  Integrity  

Page 68: lec00 intro web - Cornell University€¦ · Objectrecognion(insupermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching

QuesTons?  


Recommended