Date post: | 16-Apr-2017 |
Category: |
Technology |
Upload: | mariachapo |
View: | 69 times |
Download: | 1 times |
PowerPoint Presentation
In the past few years we have been witnessing incredible progress in the field of computer vision, mainly due to deep learning.
Tackling challenges incomputer visionAugustin Marty CEO Deepomatic
In the past few years we have been witnessing incredible progress in the field of computer vision, mainly due to deep learning.
Deep learning changed so much in solving image related questions. You need to feed your model with examples. Give your model images, thousands of images so that it learns to differentiate.
Imagenet: Iconic challenge in the world of computer vision with thousands of categories: all the images must be placed by algorithms into the correct category. Democratisation of image recognition error rate dropped from 26% to 3% today. 5% is the error rate of a human.
DEEP LEARNINGIS THE NEW PARADIGM
Deep learning changed so much in solving image related questions. You need to feed your model with examples. Give your model images, thousands of images so that it learns to differentiate.
Imagenet: Iconic challenge in the world of computer vision with thousands of categories: all the images must be placed by algorithms into the correct category. Democratisation of image recognition error rate dropped from 26% to 3% today. 5% is the error rate of a human.
The deep learning progress was also made possible because the tech giants started to massively investing in it, developing bigger models (with more layers and millions of parameters) . This is a heavy process
GoogleNet (2014)22 LAYERS ResNet (2015)152 LAYERS
The deep learning progress was also made possible because the tech giants started to massively investing in it, developing bigger models (with more layers and millions of parameters) . This is a heavy process
With bigger and more complex models and algorithms there is a need for great computing power.
Here NVIDIAs CEO, Jen-Hsun Huang, is presenting and Open AI supercomputer to Elon Musk.This is a relatively small box, aligned with GPU processing calculations amazingly fast. This type of supercomputer is now affordable and accessible, especially since it is in the Cloud.
This progress and computing power has led to interesting applications in image recognition:
OPEN-AI SUPERCOMPUTER
With bigger and more complex models and algorithms there is a need for great computing power.
Here NVIDIAs CEO, Jen-Hsun Huang, is presenting and Open AI supercomputer to Elon Musk.This is a relatively small box, aligned with GPU processing calculations amazingly fast. This type of supercomputer is now affordable and accessible, especially since it is in the Cloud.
This progress and computing power has led to interesting applications in image recognition:
google show and tell : Googles image captioning model. The model is able to describe the scene with a collection of verbs, adjectives like a human does.
Googles Show and Tell
google show and tell : Googles image captioning model. The model is able to describe the scene with a collection of verbs, adjectives like a human does.
Style transfer is another example: Deep learning algorithms can understand style of a painting and reproduce it. Here it understands Van Goghs expressionist painting and - coupled with a picture of houses along a river - reproduces the style, creating a new picture.
Style Transfer
Style transfer is another example: Deep learning algorithms can understand style of a painting and reproduce it. Here it understands Van Goghs expressionist painting and - coupled with a picture of houses along a river - reproduces the style, creating a new picture.
Video Coloration: Artificial intelligence colours the video turning the black and white video into a coloured one. How was this achieved?: 1.Engineers took thousands of coloured videos and made them black and white. 2. They then trained the algorithm to understand the correlation between the B&W videos and the respective coloured ones. 3. Then the algorithm was able to colour new, initially B&W videos.
Video Colouration
Video Coloration: Artificial intelligence colours the video turning the black and white video into a coloured one. How was this achieved?: 1.Engineers took thousands of coloured videos and made them black and white. 2. They then trained the algorithm to understand the correlation between the B&W videos and the respective coloured ones. 3. Then the algorithm was able to colour new, initially B&W videos.
In specific industry problems those models dont necessarily work. Here the following 3 images are taken from Microsofts image recognition platform online. Here an automotive part is mistaken for a close up of a plane - not quite!
Close up of a plane
In specific industry problems those models dont necessarily work. Here the following 3 images are taken from Microsofts image recognition platform online. Here an automotive part is mistaken for a close up of a plane - not quite!
A terrorist is labeled as man looking at the ocean' . Of course there's an ocean and a man but first of all hes clearly looking away from the ocean. The machine doesnt recognise the dark knife in his hands, and cant properly identify the face because of the mask.
Man looking at the ocean
A terrorist is labeled as man looking at the ocean' . Of course there's an ocean and a man but first of all hes clearly looking away from the ocean. The machine doesnt recognise the dark knife in his hands, and cant properly identify the face because of the mask.
a group of men standing on a dirt field : its not wrong: you have a group of men and yes its a dirt road.But we humans, understand the context of this picture better , unlike the algorithm: these group of men are fighters they have weapons, they are fighters, and are probably engaging in an act of war
All this goes to show that progress is undeniable but there is much to still do. When you want to apply these technologies to your industry or company specific challenges it might not work. You may think that therefore its not for you, that AI and computer vision doesnt solve your need
But
Group of men standing on top of a dirt field
a group of men standing on a dirt field : its not wrong: you have a group of men and yes its a dirt road.But we humans, understand the context of this picture better , unlike the algorithm: these group of men are fighters they have weapons, they are fighters, and are probably engaging in an act of war
All this goes to show that progress is undeniable but there is much to still do. When you want to apply these technologies to your industry or company specific challenges it might not work. You may think that therefore its not for you, that AI and computer vision doesnt solve your need
But
Artificial Intelligence is for everyone, for every company. The following examples show industry specific problems that were solved thanks to computer visions
AI IS FOR EVERY COMPANY
Artificial Intelligence is for everyone, for every company. The following examples show industry specific problems that were solved thanks to computer visions
SADAKO TECHNOLOGIES- a Spanish firm- has developed a waste sorting device by combining robotics and computer vision. They are therefore able to automatically distinguish plastic from other waste on a conveyer belt. This can have incredibly promising applications for the future waste sorting systems and management,, and other cleaning applications
CASE STUDY: SADAKOWaste Sorting
SADAKO TECHNOLOGIES- a Spanish firm- has developed a waste sorting device by combining robotics and computer vision. They are therefore able to automatically distinguish plastic from other waste on a conveyer belt. This can have incredibly promising applications for the future waste sorting systems and management,, and other cleaning applications
Regaind is a startup that is able to qualify the image aesthetics. Selecting best pictures (amongst thousands of pictures taken during a vacation) it creates photo albums automatically by analysing the quality of the picture.CASE STUDY: REGAIND Image Aesthetics
Regaind is a startup that is able to qualify the image aesthetics. Selecting best pictures (amongst thousands of pictures taken during a vacation) it creates photo albums automatically by analysing the quality of the picture.
Coming back to the image of the fighters. Its possible, with todays technology, to develop weapon detection for images and in videos. This has great use and is of great importance to military and intelligence.
CASE STUDY: SECURITY Weapon Detection
Coming back to the image of the fighters. Its possible, with todays technology, to develop weapon detection for images and in videos. This has great use and is of great importance to military and intelligence.
The three previous application have been developed by small companies. If they were able to do this than so can you, if you follow the right methodology There is a secret sauce to tackle image recognition challenges specific to your industry:
THE SECRET AI SAUCE TO SOLVEYOUR PROBLEMS
The three previous application have been developed by small companies. If they were able to do this than so can you, if you follow the right methodology There is a secret sauce to tackle image recognition challenges specific to your industry:
First, you need a deep learning framework. These are available as they are open source. You just need an engineer to use them.A framework
First, you need a deep learning framework. These are available as they are open source. You just need an engineer to use them.
Second ingredient: annotated images. These must be relevant to the task you are tackling The images differ for each use-case and problem need to develop a dataset.
ANNOTATED DATA
Second ingredient: annotated images. These must be relevant to the task you are tackling The images differ for each use-case and problem need to develop a dataset.
Annotators human in the loop
HUMANS-IN-THE-LOOP
Annotators human in the loop
Assemble the 3 ingredients; You trained an algorithm thanks to your dataset and framework.Your first algorithm is applied to never-before-seen images (it is never perfect at first). Your algorithm wont give an answer on all new images provides: you need humans-in-the-loop to keep annotating those that werent (when the machine lacked confidence), completing the task. AI + annotators who complete the job and also keeps building the dataset, creating a better algorithm
DatasetNeural network modelsHumans in the loopTrainingAnnotationCalling humans when the model is not sureTHE LOOP
Assemble the 3 ingredients; You trained an algorithm thanks to your dataset and framework.Your first algorithm is applied to never-before-seen images (it is never perfect at first). Your algorithm wont give an answer on all new images provides: you need humans-in-the-loop to keep annotating those that werent (when the machine lacked confidence), completing the task. AI + annotators who complete the job and also keeps building the dataset, creating a better algorithm
This may all seem relatively simple but theres a catch: you need to have a very good dataset for it to work well: a huge amount of perfectly annotated images Creating these datasets takes lots of time.
BUTTHERES A CATCH
This may all seem relatively simple but theres a catch: you need to have a very good dataset for it to work well: a huge amount of perfectly annotated images Creating these datasets takes lots of time.
Heres an example of furniture detectionTo develop and algorithm that detects furniture in images you need a dataset with boxes around every single item in the image.Consequently, you need to do this manually at first, making sure the boxes are perfectly around the item, and that no object is missed. This takes 10 minutes for 1 image! Sadako Technology, mentioned earlier, needed to do this to train their technology: they put millions of boxes around plastic bottles to create their dataset.
FURNITURE DETECTION(10 min)
Heres an example of furniture detectionTo develop and algorithm that detects furniture in images you need a dataset with boxes around every single item in the image.Consequently, you need to do this manually at first, making sure the boxes are perfectly around the item, and that no object is missed. This takes 10 minutes for 1 image! Sadako Technology, mentioned earlier, needed to do this to train their technology: they put millions of boxes around plastic bottles to create their dataset.
Some tasks are even more time consuming. If you want to develop algorithms for robotics (automated cars, robots, drones etc.) they need to understand their entire environment. So in this case, to train algorithms you need to determine what each pixel represents in the image.
This segmentation task takes over an hour
URBAN SEGMENTATION(70min)
Some tasks are even more time consuming. If you want to develop algorithms for robotics (automated cars, robots, drones etc.) they need to understand their entire environment. So in this case, to train algorithms you need to determine what each pixel represents in the image.
This segmentation task takes over an hour
The real bottleneck is now the dataset creation
DATA IS AI BOTTLENECK
The real bottleneck is now the dataset creation
Good dataset creation is crucial to speed up the pace of AI progress
LACK OF DATA IS SLOWING DOWN AI EXPANSION
Good dataset creation is crucial to speed up the pace of AI progress
To make datasets today there are 2 ways of doing it for now: 1) done internally by data scientists2) Use crowdsourcing, such s Amazons Mechanical Turk this isnt too bad but is time consuming and you need to do many quality reviews and check to ensure satisfactory results
Every data scientist has, at least once, thrown out a dataset due to its poor quality.
Amazon mechanical turkTime consuming, poor qualityDo it internallyMake your data scientists want to quitorSolutions
To make datasets today there are 2 ways of doing it for now: 1) done internally by data scientists2) Use crowdsourcing, such s Amazons Mechanical Turk this isnt too bad but is time consuming and you need to do many quality reviews and check to ensure satisfactory results
Every data scientist has, at least once, thrown out a dataset due to its poor quality.
Real need for Industrialising the dataset creation process is the true solution to move forward to solve image related challenges for each company. There are a few elements that may help the industrialisation and democratisation of the dataset creation:
INDUSTRIALISING THE ANNOTATION PROCESS
Real need for Industrialising the dataset creation process is the true solution to move forward to solve image related challenges for each company. There are a few elements that may help the industrialisation and democratisation of the dataset creation:
1.Improve the UXof annotation toolsWe need to have a dedicated software: today there is no software to produce datasets. Its crazy to think that each company develops their own small software. A big leap in productivity can be achieved by simply improving the design and the annotation experience
We need to have a dedicated software: today there is no software to produce datasets. Its crazy to think that each company develops their own small software. A big leap in productivity can be achieved by simply improving the design and the annotation experience
Second element to increase pace fo AI production is to work on active learning. You dont want to annotate millions of images. Active learning is science that helps select the most informative image to build AI with as few images as possible
2.Active Learning & HITL
Second element to increase pace of AI production is to work on active learning. You dont want to annotate millions of images. Active learning is science that helps select the most informative image to build AI with as few images as possible
Improve software with machine learning: if software knows what youre doing it can really improve the ease of the task.
3.Improve tools with AI
Improve software with machine learning: if software knows what youre doing it can really improve the ease of the task.
We intend to reduce the time from 70 to 5 minutes increasing the productivity by 10x
We intend to reduce the time from 70 to 5 minutes in 10 months
We intend to reduce the time from 70 to 5 minutes increasing the productivity by 10x
Here machine learning helps software to annotate images. This video shows that if you are looking for a face the box will automatically adjust around the head. The same goes for when annotating objects pixel-wise speeding up the completion of the task.
Here machine learning helps software to annotate images. This video shows that if you are looking for a face the box will automatically adjust around the head. The same goes for when annotating objects pixel-wise speeding up the completion of the task.
AI really is for everyone and can solve any companies challenges. Algorithms are becoming commodities and datasets are the bottle neck of AI.Democratising and industrialising the process of dataset creation will allow for all of us, all companies to move forward with their AI. Applications and goals.
THANK YOU
AI really is for everyone and can solve any companies challenges. Algorithms are becoming commodities and datasets are the bottle neck of AI.Democratising and industrialising the process of dataset creation will allow for all of us, all companies to move forward with their AI. Applications and goals.
THANK YOU