+ All Categories
Home > Documents > Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic...

Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic...

Date post: 23-Jun-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
32
Stochastic Reformulations of Linear Systems: Algorithms and Convergence Theory Peter Richtárik Modern Convex Optimization and Applications Workshop in honour of Arkadi Nemirovski’s 70 th birthday Fields Institute, Toronto July 4-7, 2017
Transcript
Page 1: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

StochasticReformulationsofLinearSystems:

AlgorithmsandConvergenceTheory

PeterRichtárik

ModernConvexOptimizationandApplicationsWorkshopinhonourofArkadi Nemirovski’s 70th birthday

FieldsInstitute,TorontoJuly4-7,2017

Page 2: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear
Page 3: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

HPOPT 2008Tilburg UniversityThe Netherlands

Page 4: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

ASystemofLinearEquations

Ax = b

m

n A 2 Rm⇥n, x 2 Rn

, b 2 Rm

m equations with n unknowns

Assumption:Thesystemisconsistent(i.e.,asolutionexists)

Page 5: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

PartIStochasticReformulations

P.R.andMartinTakáčStochasticReformulationsofLinearSystems:AlgorithmsandConvergenceTheoryarXiv:1706.01108, 2017

Page 6: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

StochasticReformulationsofLinearSystems

Ax = b

1. StochasticOptimization2. StochasticLinearSystem3. StochasticFixedPoint4. ProbabilisticIntersection

Theorema) These4problemshavethesamesolutionsetsb) Necessary&sufficientconditionsforthesolutionset

tobeequalto{x : Ax = b}

B,D

n⇥ n pos def

distribution over m⇥ q matrices

B = identityD = uniform over e1, . . . , em (unit basis vectors in Rm

)

Example:

Page 7: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Reformulation1:StochasticOptimization

Minimize f(x)def= ES⇠D[fS(x)]

H = S(S>AB�1A>S)†S>LS = {x : S

>Ax = S

>b}

fS(x) =12kx�⇧B

LS(x)k2B = 1

2 (Ax� b)>H(Ax� b)

Page 8: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Reformulation2:StochasticLinearSystem

preconditioner

H = S(S>AB�1A>S)†S>InsteadofAx = b

thepreconditionedsystem:

wesolve

Instead of B�1A>E[H]A we have access to B�1A>HA

Unbiasedestimateofthepreconditioner

Solve B�1A>ES⇠D[H]Ax = B�1A>ES⇠D[H]b

Page 9: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Reformulation3:StochasticFixedPointProblem

Projection in B-norm onto LS = {x : S

>Ax = S

>b}

Solve x = ES⇠D⇥⇧

BLS

(x)

Page 10: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Reformulation4:ProbabilisticIntersectionProblem

LS = {x : S

>Ax = S

>b}

Sketchedsystem

Sdiscrete

Find x 2 Rn such that P(x 2 LS) = 1

{x : P(x 2 LS) = 1} =T

S LS

Page 11: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

PartIIRandomizedAlgorithms

Page 12: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Viewpoint1:StochasticOptimization

Page 13: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

StochasticGradientDescent

S ⇠ Dconstantstepsize

x

t+1 = x

t � !rfS(xt)

Akeymethodinmachinelearning

“stochasticgradient”

Page 14: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Stochastic“Newton”Descent

S ⇠ D

B- pseudoinverse ofthe“stochasticHessian”

x

t+1 = x

t � !(r2fS)†BrfS(xt)

“stochasticgradient”Constantstepsize

Page 15: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

StochasticProximalPointMethod

S ⇠ D

x

t+1 = arg minx2Rn

⇢f

S

(x) +! � 1

2!kx� x

tk2B

Page 16: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Viewpoint3:StochasticFixedPoint

Method

Page 17: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

StochasticFixedPointMethod

S ⇠ D

Relaxationparameter

Stochasticfixedpointmapping

x

t+1 = !⇧BLS

(xt) + (1� !)xt

Page 18: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

PartIIIComplexity

Page 19: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

BasicMethod

Page 20: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

BasicMethod:Complexity

H = S(S>AB�1A>S)†S>

E[U>B

1/2(xt � x

⇤)] = (I � !⇤)tU>B

1/2(x0 � x

⇤)

W = B�1/2A>ES⇠D[H]AB�1/2 = U⇤U>

stepsize /relaxationparameter

Page 21: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

BasicMethod:Complexity

t � �max

�+

min

log

✓1

◆kE[xt � x

⇤]k2B ✏

! = 1/�max

! = 1

t � 1

�+min

log

✓1

◆kE[xt � x

⇤]k2B ✏

! = 1

t � 1

�+min

log

✓1

◆E⇥kxt � x

⇤k2B⇤ ✏

ConvergenceofExpectedIterates

L2Convergence

Page 22: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

ParallelMethod

Page 23: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

ParallelMethod

x

t+1 = 1⌧

P⌧i=1 �!(xt

, S

ti )

i.i.d.

“Run 1 step of the basic method from x

t

several times independently,

and average the results.”

One step of the basic method from x

t

Page 24: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

ParallelMethod:Complexity

t � 1

�+min

log

✓1

E⇥kxt � x

⇤k2B⇤ ✏

L2Convergence

⌧ = 1

or

⌧ = +1

t � �max

�+

min

log

✓1

Page 25: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

AcceleratedMethod

Page 26: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

AcceleratedMethod

Accelerationparameter(between1and2)

x

t+1 = ��!(xt, S

t) + (1� �)�!(xt�1, S

t�1)

St, St�1 ⇠ D (independent)

One step of the basic method from x

t�1

One step of the basic method from x

t

Page 27: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

AcceleratedMethod:Complexity

ConvergenceofIterates

kE[xt � x

⇤]k2B ✏

BasicMethoddependson!

t �

s�max

�+

min

log

✓1

�max

�+

min

Page 28: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

DetailedComplexityResults

Page 29: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

PartIVConclusion

Page 30: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

Contributions• 4Equivalentstochasticreformulationsofalinearsystem

– Stochasticoptimization– Stochasticfixedpointproblem– Stochasticlinearsystem– Probabilisticintersection

• 3Algorithms– Basic(SGD,stochasticNewtonmethod,stochasticfixedpointmethod,

stochasticproximalpointmethod,stochasticprojectionmethod,…)– Parallel– Accelerated

• Iterationcomplexityguaranteesforvariousmeasuresofsuccess– Expectediterates(closedform)– L1/L2convergence– Convergenceoff;ergodic…

Page 31: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

RelatedWorkRobertMansel GowerandP.R.RandomizedIterativeMethodsforLinearSystemsSIAMJ.MatrixAnalysis&Applications36(4):1660-1690, 2015

RobertMansel GowerandP.R.StochasticDualAscentforSolvingLinearSystemsarXiv:1512.06890, 2015

• 2017IMAFoxPrize(2nd Prize)inNumericalAnalysis

• MostdownloadedSIMAXpaper

RobertMansel GowerandP.R.RandomizedQuasi-NewtonMethodsareLinearlyConvergentMatrixInversionAlgorithmsarXiv:1602.01768, 2016

RobertMansel Gower,DonaldGoldfarbandP.R.StochasticBlockBFGS:SqueezingMoreCurvatureoutofDataICML2016

Basicmethodwithunitstepsize andfullrankA:

Removaloffullrankassumption+duality:

Invertingmatrices&connectiontoQuasi-Newtonupdates:

Computingthepseudoinverse:

RobertMansel GowerandP.R.LinearlyConvergentRandomizedIterativeMethodsforComputingthePseudoinversearXiv:1612.06255,2016

Applicationinmachinelearning:

Page 32: Stochastic Reformulations of Linear Systems: Algorithms ...prichtar/talks/TALK... · Stochastic Reformulations of Linear Systems Ax = b 1. Stochastic Optimization 2. Stochastic Linear

THEEND


Recommended