+ All Categories
Home > Documents > ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer...

ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer...

Date post: 14-Jul-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
46
ML & AI: A SWOT Analysis Volkan Cevher, Associate Professor EPFL ML & AI | Volkan Cevher | hps://lions.epfl.ch
Transcript
Page 1: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

ML & AI: A SWOT AnalysisVolkan Cevher, Associate Professor EPFL

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 2: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

ML & AI | Volkan Cevher | https://lions.epfl.ch

Preface

My research:Machine Learning (ML) OptimizationSignal ProcessingInformation TheoryStatistics

My courses (2019-20):Mathematics of DataReinforcement Learning Advanced Topics in ML

2

This talk

Page 3: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Group: 14 PhDs & 3 postdocs (14 nationalities)

Faculty@Rice, NUS, Umea, Zhejiang, UNC, Linkoping, AIMS, UoCB, VNU, NTU, TechnionPostdoc@ETHZ (3), MIT (2), McGill, TuringOthers@Kandou bus, SwissRE, TUM

ML & AI | Volkan Cevher | https://lions.epfl.ch

Preface

My research:Machine Learning (ML) OptimizationSignal ProcessingInformation TheoryStatistics

My courses (2019-20):Mathematics of DataReinforcement Learning Advanced Topics in ML

3

Page 4: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Strengths

A SWOT Analysis

4

Machine Learning (ML)

Neural Networks

(NN)

Multilayer NNs

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 5: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Machine Learning (ML)

5

● ML is an interdisciplinary study of algorithms, statistical models, and error functions jointly with computer systems to perform specific tasks

“Only a fool learns from his own mistakes. The wise man learns from the mistakes of others” - Otto von Bismarck

● ML makes you wiser

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 6: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

● ML is an interdisciplinary study of algorithms, statistical models, and error functions jointly with computer systems to perform specific tasks

The ingredients via a simplified supervised learning example

6

Task: Learn a mapping from image to disease

ML & AI | Volkan Cevher | https://lions.epfl.ch

Task: Learn a mapping from control inputs to walking

Page 7: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

● ML is an interdisciplinary study of algorithms, statistical models, and error functions jointly with computer systems to perform specific tasks

The ingredients via a simplified supervised learning example

7ML & AI | Volkan Cevher | https://lions.epfl.ch

Supervised ML: Use algorithms to learn “model”Gradient Descent Algorithm

Con

vex

optim

izat

ion

Page 8: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Academic theory vs industrial practice

8ML & AI | Volkan Cevher | https://lions.epfl.ch

Conventional wisdom in ML until 2010:

Simple models + simple errors

optimization landscapes

Page 9: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Enter neural networks: Universal approximation

9ML & AI | Volkan Cevher | https://lions.epfl.ch

Challenges: 1. too big to optimize! 2. did not have enough data 3. could not find the optimum via algorithms

[Cybenko 1989] NN is a universal approximant of any continuous function on a hypercubereal function optimization landscape

Page 10: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Multilayer neural networks: Tractable & nearly universal

10ML & AI | Volkan Cevher | https://lions.epfl.ch

real function optimization landscape

: First-order method(Gradient descent...)

: Appropriate initializationFor weight matrices W

: Sufficiently large network(Overparameterization)

Neural NetworkAnalysis

NetworkStructure

Algorithm Initialization

Subquatratic overparametrization for shallow neural networksSong et al.

Page 11: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Weaknesses

A SWOT Analysis

11

Robustness

Bias

Interpretability

Reproducibility

Malicious data

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 12: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Robustness

12ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 13: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Robustness is an active research area

13

● He, K., Zhang, X., Ren, S., and Sun, J. (2015). Deep Residual Learning for Image Recognition. arXiv e-prints, page arXiv:1512.03385.

● Huang, G., Liu, Z., van der Maaten, L., and Weinberger, K. Q. (2016). Densely Connected Convolutional Networks. arXiv e-prints, page arXiv:1608.06993.

● Miyato, T., Kataoka, T., Koyama, M., and Yoshida, Y. (2018). Spectral normalization for generative adversarial networks. In International Conference on Learning Representations.

● Raghunathan, A., Steinhardt, J., and Liang, P. S. (2018). Semidefinite relaxations for certifying robustness to adversarial examples. Neurips.

● Wong, E. and Kolter, Z. (2018). Provable defenses against adversarial examples via the convex outer adversarial polytope. ICML.

● Madry, Aleksander and Makelov, Aleksandar and Schmidt, Ludwig and Tsipras, Dimitris and Vladu, Adrian. Towards Deep Learning Models Resistant to Adversarial Attacks. ICLR.

● Huang, X., Kwiatkowska, M., Wang, S., and Wu, M. (2017). Safety verification of deep neural networks. Computer Aided Verification.

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 14: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

14

Adversarial examples are inevitable!

● Concentration-of-measure phenomenon

[Shafahi et al. ICLR 2019]

● Lipschitz constant is important

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 15: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Progress towards robustness

15

Lipschitz Constant Estimation of Neural Network via Sparse Polynomial Optimization. Latorre, Fabian and Rolland, Paul and Cevher, Volkan. ICLR 2020.

NP-hard for NNs [Scaman et al. NeurIPS 2018]

< Lasserre hierarchy

< Krivine hierarchy

Generalization:

[Bartlett et al. NeurIPS 2017]

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 16: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Interpretability

16ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 17: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Interpretability in ML is an active research field

17

● Baehrens, David and Schroeter, Timon and Harmeling, Stefan and Kawanabe, Motoaki and Hansen, Katja and Mueller, Klaus-Robert. Simonyan, Karen and Vedaldi, Andrea and Zisserman, Andrew. How to Explain Individual Classification Decisions. JMLR 2010.

● Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps. arXiv e-prints. arXiv:1312.6034. 2013.

● Ribeiro, Marco and Singh, Sameer and Guestrin, Carlos. “Why Should I Trust You?”: Explaining the Predictions of Any Classifier. KDD 2016.

● Sundararajan, Mukund and Taly, Ankur and Yan, Qiqi. Axiomatic Attribution for Deep Networks. ICML'17.

● Shrikumar, Avanti and Greenside, Peyton and Kundaje, Anshul. Learning Important Features Through Propagating Activation Differences. ICML'17.

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 18: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

A robustness & interpretability result

18

On Certifying Non-Uniform Bounds against Adversarial Attacks. Liu, Chen and Tomioka, Ryota and Cevher, Volkan. ICML’19.

non-robust robustinput

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 19: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Further evidence: Robust training <> interpretability

19ML & AI | Volkan Cevher | https://lions.epfl.ch

Robust fundus classification & dataset bootstrapping via interpretable featuresKrawczuk et al.

Page 20: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Reproducibility

20ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 21: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Reproducibility challenge: Non-convexity

21ML & AI | Volkan Cevher | https://lions.epfl.ch

Lagrangian perspective: New theory for nonlinear optimization with nonlinear constraints

iAL Sahin M. F. et. al. [NeurIPS 2019]

AL^2 Eftekhari A. et. al. [Under review]

ADMM Latorre F. et. al. [NeurIPS 2019]

Page 22: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Extending reproducibility via universality in convex optimization

22

Smooth ✓ ✓

Stochastic ✓

Nonsmooth ✓ ✓ ✓

Strongly convex ✓

✓ Universal primal-dual, Yurtsever et al.✓ UniXGrad, Kavis et al. Accelegrad, Levy et al.✓ Random extrapolation, Alacaoglu et al.

ML & AI | Volkan Cevher | https://lions.epfl.ch

is the iteration counter.

One algorithm to rule them all!

Page 23: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Many other weaknesses

23

1. Bias

2. Malicious data

3. Privacy

4. ...

I am Tay

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 24: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

A geometric perspective on bias

24ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 25: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Opportunities

A SWOT Analysis

25

Generative Adversarial Networks

Automation

Medical

Financial

Design

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 26: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Generative Adversarial Networks

26ML & AI | Volkan Cevher | https://lions.epfl.ch

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Karras et al. [ICLR 2018]

High-Fidelity Image Generation With Fewer LabelsLucic M*, Tschannen M*, Ritter M*, Zhai X, Bachem O,

Sylvain S [2019]

Page 27: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Challenge: Limit cycles (minimax)

27

The limits of min-max optimization algorithms: Convergence to spurious non-critical sets, Hsieh, Mertikopoulos, and Cevher 2020.

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 28: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Training GANs via mixed Nash equilibria (minimax)

28

Finding Mixed Nash Equilibria of Generative Adversarial Networks. Hsieh et al. ICML 2019

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 29: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Minimax formulations and robust RL

29

Robust Reinforcement Learning with Langevin Dynamics. Kamalaruban et al.ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 30: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Minimax formulations and robust BO

30ML & AI | Volkan Cevher | https://lions.epfl.ch

Adversarially robust Gaussian Process Optimization.Bogunovic et al. NeurIPS 2018

Page 31: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

New opportunities via GANs

31ML & AI | Volkan Cevher | https://lions.epfl.ch

Closed loop deep Bayesian inversion: Uncertainty driven acquisition for fast MRI.

Sanchez et al.

Page 32: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

New opportunities in RL

32

Interactive Teaching Algorithms for Inverse Reinforcement Learning. Kamalaruban et al. IJCAI 2019

ML & AI | Volkan Cevher | https://lions.epfl.ch

Interaction-limited Inverse Reinforcement Learning. Troussard et al.

Page 33: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

New opportunities in deep learning

33ML & AI | Volkan Cevher | https://lions.epfl.ch

: First-order methodStochastic gradient

: Appropriate initializationFor weight matrices W

: Sufficiently large network(Overparameterization)

Neural NetworkAnalysis

NetworkStructure

Algorithm Initialization

Generalization <> Robustness

Convergence of SGD for neural networks without heavy overparameterization. Song & Cevher

Efficient proximal mapping of the 1-path-norm of shallow networks. Latorre et al.

Lipschitz controlled polynomial

Page 34: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

New opportunities in scalable optimization

34ML & AI | Volkan Cevher | https://lions.epfl.ch

Randomization <> Scalability

Towards stochastic SDP & LP’s with stochastic constraints

Conditional gradient methods for stochastically constrained convex minimization. Vladarean et al.

Ex: scalable solutions (sparsest) cut problems and their variants

Ex: robustness certifications for NNs

Page 35: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

New opportunities in engineering applications

35ML & AI | Volkan Cevher | https://lions.epfl.ch

Chemical machine learning with kernels: The impact of loss functions. Van Nguyen et al.

[Quantum Chemistry 2019]

EDA Gym. Krawczuk et al.

Page 36: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Threats

A SWOT Analysis

36

Hype

Talent pool

Interpretability

Sustainability

ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 37: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

The AI hype vs the ML revolution

37ML & AI | Volkan Cevher | https://lions.epfl.ch

neural networks < multi-index models

< multilayer NN

Page 38: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Talent pool: Missing the top talent vs the needed talent

38ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 39: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Sustainability:

39ML & AI | Volkan Cevher | https://lions.epfl.ch

Page 40: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Sustainability:

Dennard scaling & Moore’s law vs Growth of data

40ML & AI | Volkan Cevher | https://lions.epfl.ch

Andy Burg

Tim Dettmers

DART Consulting

Page 41: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Sustainability:

Energy constraints / Time constraints

41ML & AI | Volkan Cevher | https://lions.epfl.ch

Learning-based compressive sensing + hardware design. Baldassarre et al., Gozcu et al., Aprile et al. [IEEE TMI, IEEE TSP, IEEE CnS, IEEE TCAS]

IBM Thesis Award 2019

Page 42: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Sustainability:

Energy constraints of recording neural data

> 30 dB quality Stream out Full comp. LBCSAFE + ADC 10 μW 10 μW 10 μWDSP 0 80 μW ~2.5 μWTX 50 μW ~2.5 μW ~3 μW

MethodCompression rate

4 8 16 32

LBCS 38.90 35.77 33.09 30.28

SHS 33.67 31.75 29.21 27.73

BERN 33.57 29.59 26.62 24.03

MCS 34.22 30.82 27.03 23.00

ML & AI | Volkan Cevher | https://lions.epfl.ch 42

Page 43: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

● Accelerate the MRI scan 5 times.

● Pick the most relevant data only for your method.

Theo

ryO

urs

(Sin

gle

Coi

l)

Prac

tice

(Mul

ticoi

l) O

urs

(Mul

ticoi

l)

Learning-based compressive MRI. Gözcü B., et al [IEEE TMI 2018]Rethinking Sampling in Parallel MRI: A Data-Driven Approach. Gözcü B. et al. [EUSIPCO 2019]

x

y

Sustainability:

Time constraints of MRI

ML & AI | Volkan Cevher | https://lions.epfl.ch 43

Page 44: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

● Time drastically increases the dimensionality of data

● Reduce computations by a factor 200: from a month to 4 hours without losing performance.

Scalable learning-based sampling optimization for compressive dynamic MRI. Sanchez T., et al. [IEEE ICASSP 2020]

ML & AI | Volkan Cevher | https://lions.epfl.ch

Sustainability:

Time constraints of MRI

44

Page 45: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

Sustainability: Resource constrained optimization

45ML & AI | Volkan Cevher | https://lions.epfl.ch

Truncated variance reduction: A unified approach to Bayesian optimization and level-set estimation.

Bogunovic et al. NIPS 2017

Page 46: ML & AI: A SWOT Analysis...A SWOT Analysis 4 Machine Learning (ML) Neural Networks (NN) Multilayer NNs ML & AI | Volkan Cevher |  . Machine Learning (ML) 5

ML & AI | Volkan Cevher | https://lions.epfl.ch

Conclusions

● Are you wiser?○ time-data-power and other trade offs

● Existential threats = “Opportunities” ○ talk to me offline

● ML - AI: Mathematical understanding○ Hype protection

[email protected] https://lions.epfl.ch Twitter: @CevherLIONS

46


Recommended