+ All Categories
Home > Documents > Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint...

Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint...

Date post: 08-Jun-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
107
Randomized iterative methods for linear systems Robert Mansel Gower IMA Leslie Fox Prize Meeting, Strathclyde, June 2017
Transcript
Page 1: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized iterative methods for linear systems

Robert Mansel Gower

IMA Leslie Fox Prize Meeting, Strathclyde, June 2017

Page 2: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Motivation

Page 3: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Page 4: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Page 5: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Block Coordinate

Descent

Page 6: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Rademacher Sketch?

Block Coordinate

Descent

Page 7: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Rademacher Sketch?

Block Coordinate

Descent

Good enough

Page 8: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Kernel Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Cheikh S. Toure

Rademacher Sketch?

Block Coordinate

Descent

Good enough

Page 9: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Linear Systems

Page 10: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The Problem

Assumption: The system is consistent (i.e., has a solution)

Page 11: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The Problem

Page 12: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The Problem

B: Symmetric and positive definite

Page 13: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The Problem

B: Symmetric and positive definite

As there are possibly multiple solutions, we compute the solution with the least B-norm.

Page 14: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Methods

Page 15: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Often fits in parallel/distributed architecture

Easy to analyse, good complexity

Easy to implement

Suitable for large scale problems: short recurrence, low iteration cost and low memory

The return of old methodsOld methods (Kaczmarz 1937, Gauss-Seidel

1823) make a randomized return, why?

Stochasticity inherent in problem

Page 16: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Old Methods

Page 17: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

Karczmarz, M. S. (1937). Angenaherte Auflosung von Systemen linearer Gleichungen. Bulletin International de l’Académie Polonaise Des Sciences et Des Lettres, 35, 355–357.

Page 18: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

Karczmarz, M. S. (1937). Angenaherte Auflosung von Systemen linearer Gleichungen. Bulletin International de l’Académie Polonaise Des Sciences et Des Lettres, 35, 355–357.

Page 19: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

Karczmarz, M. S. (1937). Angenaherte Auflosung von Systemen linearer Gleichungen. Bulletin International de l’Académie Polonaise Des Sciences et Des Lettres, 35, 355–357.

Page 20: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

Karczmarz, M. S. (1937). Angenaherte Auflosung von Systemen linearer Gleichungen. Bulletin International de l’Académie Polonaise Des Sciences et Des Lettres, 35, 355–357.

Page 21: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

G.N. Hounsfield. Computerized transverse axial scanning (tomography): Part I. description of the system. British Journal Radiology. 1973

Karczmarz, M. S. (1937). Angenaherte Auflosung von Systemen linearer Gleichungen. Bulletin International de l’Académie Polonaise Des Sciences et Des Lettres, 35, 355–357.

Page 22: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Page 23: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Page 24: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Page 25: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Page 26: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Observation:

Page 27: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Observation:

Page 28: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Observation:

Page 29: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate DescentLeventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Observation:

Block Coord. Descent

Page 30: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Modern Sketching

Page 31: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Sketching

The Sketching Matrix

David P. Woodruff (2014), Foundations and Trends® in Theoretical Computer, Sketching as a Tool for Numerical Linear Algebra.

W. B. Johnson and J. Lindenstrauss (1984). Contemporary Mathematics, 26, Extensions of Lipschitz mappings into a Hilbert space.

Page 32: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Sketching and Projecting

Page 33: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1. Relaxation Viewpoint“Sketch and Project”

Page 34: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1. Relaxation Viewpoint“Sketch and Project”

Page 35: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1. Relaxation Viewpoint“Sketch and Project”

Page 36: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

2. Optimization Viewpoint “Constrain and Approximate”

Page 37: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

2. Optimization Viewpoint “Constrain and Approximate”

Page 38: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

2. Optimization Viewpoint “Constrain and Approximate”

Page 39: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 40: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 41: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 42: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 43: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 44: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

Page 45: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

(1)

Page 46: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

3. Geometric Viewpoint “Random Intersect”

(2)

(1)

Page 47: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

4. Algebraic Viewpoint“Random Update”

Random Update Vector

Page 48: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

4. Algebraic Viewpoint“Random Update”

Moore-Penrose pseudo inverse

Random Update Vector

Fact:

Page 49: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

4. Algebraic Viewpoint“Random Update”

Moore-Penrose pseudo inverse

Random Update Vector

Fact:

Small matrix

Page 50: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

5. Analytic Viewpoint“Random Fixed Point”

Page 51: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

5. Analytic Viewpoint“Random Fixed Point”

Page 52: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

5. Analytic Viewpoint“Random Fixed Point”

Random Iteration Matrix

Page 53: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Theory

Page 54: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Complexity / Convergence

Theorem [GR‘15]

Page 55: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Complexity / Convergence

Theorem [GR‘15]

Smallest nonzero eigenvalue

Page 56: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Page 57: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Special Choice of Parameters

Page 58: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Special Choice of Parameters

Page 59: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Special Choice of Parameters

Page 60: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Special Choice of Parameters

Page 61: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Case study of

Special Choice of Parameters

No zero rows in A is positive definite

Page 62: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The rate: lower and upper bounds

Theorem [RG‘15]

Page 63: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The rate: lower and upper bounds

Theorem [RG‘15]

Insight: The method is a contraction (without any assumptions on S whatsoever). That is, things can not get worse.

Page 64: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The rate: lower and upper bounds

Theorem [RG‘15]

Insight: The method is a contraction (without any assumptions on S whatsoever). That is, things can not get worse.

Page 65: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

The rate: lower and upper bounds

Theorem [RG‘15]

Insight: The method is a contraction (without any assumptions on S whatsoever). That is, things can not get worse.

Insight: lower rank of A and great rank of STA gives better lower bound. In other words, when the dimension of the search space in the “constrain and approximate” viewpoint grows.

Page 66: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Special Case: Randomized

Kaczmarz Method

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

T. Strohmer and R. J. Vershynin, (2009). A Randomized Kaczmarz Algorithm with Exponential Convergence Journal of Fourier Analysis and Applications, 15:262

Page 67: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Page 68: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Page 69: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Page 70: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Page 71: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Page 72: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Page 73: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Kaczmarz: derivation and rate

General Method

Special Choice of Parameters

Complexity Rate.

Page 74: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Special Case: Randomized

Coordinate Descent

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

Leventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Page 75: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Page 76: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

positive definite

Page 77: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

positive definite

Page 78: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

positive definite

Page 79: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

positive definite

Page 80: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

positive definite

Page 81: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Randomized Coordinate Descent: derivation and rateGeneral Method

Special Choice of Parameters

Complexity Rate

positive definite

Page 82: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Theory recovers known and new convergence results

Method Convergence Rate

Randomized CDLeast square

B S

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

*Leventhal, D., & Lewis, A. S. (2010). Randomized Methods for Linear Constraints: Convergence Rates and Conditioning. Mathematics of Operations Research, 35(3), 641–654.

Gaussian psd

Gaussian Kaczmarz

Page 83: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Designing New Methods

Page 84: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

Page 85: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

B

Page 86: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

B

Optimal S

Page 87: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

B

Optimal S

S with fixed range

Page 88: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

B

Optimal S

S with fixed range Optimal pi's

Page 89: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Optimal methodsOptimal choice

B

Optimal S

S with fixed range Optimal pi's Difficult SDP

Page 90: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Practical New MethodsOne Shot Sketches

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

N. Ailon and B. Chazelle (2006). Approximate nearest neighbors and the fast Johnson-Lindenstrauss transform. Mathematics of Operations Research, 35(3), 641–654.

Page 91: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Practical New MethodsOne Shot Sketches

S

Gaussian Matrix

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

N. Ailon and B. Chazelle (2006). Approximate nearest neighbors and the fast Johnson-Lindenstrauss transform. Mathematics of Operations Research, 35(3), 641–654.

Subsampled Hadamard-Welsh

Countmin Sketch

Computing STA

Page 92: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Practical New MethodsOne Shot Sketches

S

Gaussian Matrix

T. Strohmer and R. Vershynin. A Randomized Kaczmarz Algorithm with Exponential Convergence. Journal of Fourier Analysis and Applications 15(2), pp. 262–278, 2009

N. Ailon and B. Chazelle (2006). Approximate nearest neighbors and the fast Johnson-Lindenstrauss transform. Mathematics of Operations Research, 35(3), 641–654.

Subsampled Hadamard-Welsh

Countmin Sketch

Computing STA

Rademacher Sketch

Page 93: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Sub-Rademacher Sketching

Page 94: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1

1

Sub-Rademacher Sketching

Page 95: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1

1

2

2

Sub-Rademacher Sketching

Page 96: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1

1

2

23

3

1

3

2

Sub-Rademacher Sketching

Page 97: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1

1

2

23

3

1

3

2

Sub-Rademacher Sketching Flip the sign with 50% probability

Page 98: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

1

1

2

23

3

1

3

2

Sub-Rademacher Sketching Flip the sign with 50% probability

Page 99: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Experiments

Page 100: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Ridge Regression

Problem: w8a

Conjugate Gradient

Block Coordinate

Descent

Rademacher Sketch?

Origin: LIBSVM

Page 101: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Ridge Regression

Problem: rcv1

Conjugate Gradient

Block Coordinate

Descent

Rademacher Sketch?

Origin: LIBSVM

Page 102: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Ridge Regression

Problem: mnist

Conjugate Gradient

Origin: LIBSVM

Page 103: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Conclusions

Unites many randomized methods under a single framework

Improved convergence New lower bounds, less assumptions, tightest results.

Design new methods S = Guassian, count-sketch, Walsh-Hadamard ...etc

Optimal Sampling We can choose a sampling that optimizes the convergence rate.

Page 104: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Rademacher Sketch

Block Coordinate

Descent

Page 105: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Large scale Ridge Regression

Problem: a9a

Origin: LIBSVM

Conjugate Gradient

Rademacher Sketch

Block Coordinate

Descent

Fast initial sublinear convergence

Page 106: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

RMG and Peter RichtárikStochastic Dual Ascent for Solving Linear SystemsPreprint arXiv:1512.06890, 2015

RMG and Peter RichtárikRandomized quasi-Newton updates are linearly convergent matrix inversion algorithmsPreprint arXiv:1602.01768, 2016

RMG and Peter RichtárikRandomized Iterative Methods for Linear Systems. SIAM. J. Matrix Anal. & Appl., 36(4), 1660–1690, 2015. Most Downloaded SIMAX Paper!

Page 107: Randomized iterative methods for linear systems · 2020-05-13 · Linear Systems Preprint arXiv:1512.06890, 2015 RMG and Peter Richtárik Randomized quasi-Newton updates are linearly

Thank you,Questions?


Recommended