+ All Categories
Home > Documents > Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... ·...

Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... ·...

Date post: 13-Oct-2019
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
18
July 27, 2016 Sam Siewert Computer and Machine Vision Lecture Week 15 Part-1 Wrap-Up Take-away
Transcript
Page 1: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

July 27, 2016 Sam Siewert

Computer and Machine Vision

Lecture Week 15 Part-1

Wrap-Up Take-away

Page 2: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Outline of Week 15 What Next? – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive and Prosthetic Vision Systems – CV Turing Test

Final Review – Blackboard Online Final Quiz – Takeaway

Sam Siewert 2

Page 3: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Computer Vision

To Go from Machine to Computer Vision, We Need a

New Computer and New Algorithms

Sam Siewert

3

Page 4: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

MV vs CV vs Video Analytics Machine Vision – Photometers Used in Process Control – Successful History – Industrial Automation and Robotics – Controlled Environments – Inspection, Optical Navigation, Medical

Computer Vision – Emulate Human Vision System – Early Underestimation – Marvin Minsky Summer Project – Challenge of Un-controlled Environments – 50 Years Later, Challenges Better Understood – Vision Prosthetics, General Automation – Recent Breakthroughs – USC, DARPA Artificial Retina,

Google Car – Efficiency and Generalization?

Video Analytics – CV from RT or Stored/Networked Video

Sam Siewert 4

Spitzer – JPL/Caltech

CU-Boulder ECEN 5623

Page 5: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

If Possible, CV => MV Conversion – Cheat!

Practical Solution – Convert CV to MV Problem – Loss of Generalization (Solves One Problem Rather

than Class) – Controlling Environment May Be Difficult – Use Non-Visible Spectrum to Advantage (e.g. FLIR) – Sensor Fusion (Visible + FLIR, RADAR, GPS, …) – Models and Prior-Knowledge of Problem Exploited

Overhead Camera Dark Background Overhead Lighting Game State / Grid Known Shape Database

Sam Siewert 5

CU-Boulder ECEN 5623

Page 6: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Why is Human Vision > Computer?

Sam Siewert 6

Approximately 100+ Mega-Pixel

(Rod/Cone Count)

Cortex=10 Billion Neurons (High fan-out)

Total=100 Billion Neurons

Neuroscience. 2nd edition. Purves D, Augustine GJ, Fitzpatrick D, et al., editors. Sunderland (MA): Sinauer Associates; 2001. http://www.ncbi.nlm.nih.gov/books/NBK10848/

Red Epic 645 63 Mega-Pixel

I/O Bus (x16 5Gbps = 8GB/sec)

Camera Link

Interface Card

Local Bus

CPU CPU

Memory Controller

5 To 7 billion transistors 1. Neuron > Transistor 2. Better Programming? ROM? 3. More Richly Interconnected 4. Storage + Processing

> 1 Trillion Synapses

Page 7: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Biological Vision vs. Machine Vision (Why A Honey Bee is Better than HPC for CV)

Humans - 100 million Photoreceptors

– 10 billion Neurons (Cerebral Cortex) – Brain with 100 billion Neurons – Millisecond Transfer – Massively Parallel Analog + Digital Computation

Synapse Match is a Challenge

– 7000 Connections from 10 Billion Neurons – 3 Year Olds Have 1015 Synapses

CPU to Digital Camera/HDD

– Connects 10’s of millions of pixels – to Several Billion transistors – Through Sequential Logic and I/O Bus

Sam Siewert 7

960K Neurons in flight: Learns locations, complex odors, colors, and shapes; with high efficiency (500 Watt/Kg), 0.218g

Brain plasticity for learning, connectedness, concurrency, integrated sensing, power efficiency, and resiliency

2016 – 16 billion?

NVIDIA GK110 28nm, (7.1 billion)

Pascal – 15 billion

Intel MICA 22nm (5 billion)

http://en.wikipedia.org/wiki/List_of_animals_by_number_of_neurons

http://en.wikipedia.org/wiki/File:Transistor_Count_and_Moore%27s_Law_-_2011.svg

Page 8: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

From Machine to Computer Vision Problem of: – Scaling? Efficiency?

Latency? – Architectural Bottle-

neck? – Algorithm Challenge?

Storage / Networking Bottleneck – Access Time in

Human Scale is 1 month

– Compared to 1 sec for Memory Access

– Network / Bus is still on Order of an Hour

Sam Siewert 8

Network Latency

(microsec)

Network Controller

I/O Bus

Head Seak, Rotate 1 msec – 100’s

Storage Controller

Camera Link

(or HD-SDI)

Local Bus

CPU CPU

Memory Controller

I/O Fabric

Page 9: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Inverse Rendering - Computer Vision Turing Test

Sam Siewert 9

HPC Solutions for Digital Cinema (Slower than RT) Augmented Reality (RT Interactive)

Scene Description

(E.g. RenderMan, OpenGL)

OpenCV and HPC Scaled Processing (ARSC, JANUS)

GPU

CVPU

Scene Analytics

Intelligent Systems

http://en.wikipedia.org/wiki/File:Glasses_800_edit.png

Rendered Scene

Image

Page 10: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Progress from MV to CV Numerous Practical Machine Vision Automation Examples – From Optical Navigation to Sorting Recycling Materials,

Fish Tagging, Inspection, Agriculture

Supervisory Control – Robotic Satellite Servicing Augmented Reality – Heads Up Information Entertainment Systems – Gesture Recognition Neural Prosthetics for Computer Vision

Cameras Designed For Vision (Beyond NTSC, Web and Digital Video)

Sam Siewert 10

Page 11: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Neural Prosthetics for Computer Vision

Carver Meade – Neuromorphics – (VLSI) systems containing electronic analog circuits to mimic

neuro-biological architectures – Generalized to http://ine-web.org/about-ine/about-ine/index.html

U.S. DoE Artificial Retna Project - http://artificialretina.energy.gov/howartificialretinaworks.shtml Argus II – Vision Prosthetic – http://www.youtube.com/watch?v=ZyVjK7sktvw – http://www.doheny.org/PDF/Bringing_Sight_to_the_Blind.pdf

Numerous Additional Projects in Progress

Sam Siewert 11

Page 12: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Photometers Designed For Vision (Beyond NTSC, Web and Digital Video)

Eye Camera – http://www.scientificamerican.com/article.cfm?id=dvs-the-

eye-camera&WT.mc_id=SA_printmag_2012-10

Light Field Cameras – http://www.raytrix.de/index.php/Cameras.html

3D Scanners – http://www.faro.com/focus/us,

http://www.faro.com/site/resources/share/947

Camera Arrays – https://graphics.stanford.edu/papers/highspeedarray/

Sam Siewert 12

Page 13: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Open Source Inference Much of the Future of Computer Vision May Depend on Scene Inference – Statistical Matching of Key Points and Features – Incorporation of Context to Recognition – Chapter 14 in CMV – Section II, Chapter 6 in “Computer Vision: Models, Learning and

Inference”, by Simon Prince

Inference Engines – Prolog Rule-Based

http://www.gprolog.org/ http://www.probp.com/

– Bayesian Inference http://people.csail.mit.edu/milch/blog/index.html

Sam Siewert 13

Page 14: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

CV and Image Processing Libraries

MATLAB – http://www.mathworks.com/products/imaq/ – http://www.mathworks.com/products/image/ – http://www.mathworks.com/products/computer-

vision/ Mathematica – Image Processing – http://reference.wolfram.com/mathematica/guide/I

mageProcessing.html OpenCV - http://docs.opencv.org/ IPP - http://software.intel.com/en-us/intel-ipp

Sam Siewert 14

Page 15: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

How Do We Advance? Fundamental Research – Biological Systems, Physiology, Connectionist Theory and Algorithms (UA-A Program in Complex Systems) Highly Concurrent Perception Processing with Tight Coupling to Photometers Designed for Vision

– Low latency, Motion triggered cameras, multi-sensor low resolution and high resolution channels

– Direct Interface to Concurrent Programmable Parallel Logic and Mixed Analog/Digital Circuitry for Sensory Fusion

Research on a Parallel Perception Interface and Range of Concurrency – Analog and Digital Cameras Ranging from Simple Photometers to CCD/CMOS

Detectors – CPLD, FPGA Concurrent Digital Transformation (Verilog/VHDL -> OpenCL) – GPU SIMD Processing for Parallel (CUDA and StreamProc -> OpenCL) – Multi-Core Processing with OpenCL

Address Challenging Problems that are Cooperative Supervisory Control Between Biology and Machine Vision

– Supervisory Control – What Humans Do Best and What Computing Does Best in Collaboration to solve useful problems of Automation – e.g., Robotic Satellite Servicing

– Human Prosthetics and Hyper-sensory Systems -

Sam Siewert 15

Page 16: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Computational Photometer Project Cyber-Sensory Perception Interface – Computational Photometer

Experimental Hybrid Reconfigurable Logic: and Multi-channel Photometer Interfaces with SIMD/Multi-Core Processing and Scalable SSD See Paper - http://www.cse.uaa.alaska.edu/~ssiewert/papers/Paper-9121-16-CP-SPIE-Tech+Applications.pdf Goal to Build (or Use) Designed for Vision Cameras – Both Analog and Digital Integrated Sensor Fusion – E.g. IMU sensors and Cameras

Leverages and Extends Research at CU-Boulder Funded by Intel – CU-Cam (Analog cameras, CPLD, USB Interface to ARM Coretex)

Sam Siewert 16

Page 17: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

Human Perception More than 30 Human Senses Used for Interaction with the World and Self-Monitoring – [5] Basic 5 Exteroception: Coarse/Fine Vision, Olfactory,

Auditory, Tactile, Taste (Gustatory) – [1] Proprioception: Muscle Memory and motion tracking – [2+] Equilibrioception: IMU Features (vestibular – rotational,

otolithic – accelaration) – [11+] Interoception: Internal Health and Status (homeostatic

thermoception, respiration, suffocation, nausea, thirst, cutaneous, GI, esophigal, gagging, fullness, headache …)

– [1] Chronoception: Sense of Time – [1+] Nociception: Pain receptors – [1+] Thermoception

Human Sensor Fusion – E.g., Flavor combines taste and smell

Sam Siewert 17

Page 18: Computer and Machine Visionecee.colorado.edu/~siewerts/extra/ecen5763/ecen5763_doc/Lectures/... · – Future of Computer and Machine Vision – Merging of CV with Graphics – Interactive

CV Related Open Source Feature Vector Analysis, Pattern Recognition, Classification

– SIFT (Scale Invariant Feature Transform) http://www.cs.ubc.ca/~lowe/keypoints/ http://areshmatlab.blogspot.com/2010/04/video-stabilization.html

– PCA Analysis (related/unrelated dimensions of observed data) - http://en.wikipedia.org/wiki/GNU_Octave, http://www.mathworks.com/products/statistics/

– ICA Analysis (solves cocktail party problem – combined filtering and selective signal enhancement) - http://mialab.mrn.org/software/gift/

– Bayesian Logic - http://people.csail.mit.edu/milch/blog/index.html (Monty Hall Problem - Conditional Probability and Baye’s Rule)

P(A|B) = [P(B|A) * P(A)] / P(B) Basket-1=5R, 5B balls, Basket-2=3R, 7B; If Basket selected randomly and ball drawn is R, what is Probability ball came from Basket-1?

– P(R|B1)=1/2, P(R|B2)=3/10, P(B1)=1/2, P(B2)=1/2 – P(R) = [P(R|B1)*P(B1)] + [P(R|B2)*P(B2)] = [(1/2) * (1/2)] + [(3/10) * (1/2)] = 2/5 – P(B1|R) = [P(R|B1) * P(B1)] / P(R) = (0.5 * 0.5) / (2/5) = 5/8

Prolog – http://www.probp.com/ – http://www.gprolog.org/

MV/CV Libraries

– http://opencv.org/ – http://software.intel.com/en-us/intel-ipp – http://www.mathworks.com/products/computer-vision/index.html

Sam Siewert 18


Recommended