+ All Categories
Home > Documents > High Dimensional Learning rather than Computing in Quantum...

High Dimensional Learning rather than Computing in Quantum...

Date post: 22-Jul-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
26
High Dimensional Learning rather than Computing in Quantum Chemistry Matthew Hirn*, Stéphane Mallat*, Nicolas Poilvert** *École normale supérieure **Pennsylvania State University
Transcript
Page 1: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

High Dimensional Learning rather than Computing in

Quantum ChemistryMatthew Hirn*, Stéphane Mallat*, Nicolas Poilvert**

!*École normale supérieure

**Pennsylvania State University

Page 2: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Motivation• Chemists want to build

“Google of molecules”

• Pharmaceutical industry Materials science

• Need to compute energy of each molecule

• Billions of molecules

• Complex, time consuming computation

Page 3: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Energy Computation• Exact:

Schrödinger’s Equation Extremely high dimensional eigenvalue problem Example: Alcohol is ~ dimensional

• Approximate: Coupled cluster methodsDensity functional theory Scales as where

C2H6O 2300

O(Na) 4 a 7

Number of electrons

Page 4: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Regression• High dimensional

• Approximate a functional given sample values

• Many body problems: Energy of a state

x 2 Rd

f(x)

nf(x)

Astronomy Classical Electrostatics

Quantum Chemistry

PositionCharge, mass, etc…

{xi, f(xi)}ni=1

x = {(pk, qk)}k

Page 5: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

• can be approximatedfrom samples by local interpolation if is regular and there are close examples

!

• Need points to cover at a Euclidean distance is always large

00.1

0.20.3

0.40.5

0.60.7

0.80.9

0

0.2

0.4

0.6

0.8

10

20

40

60

80

100

120

Curse of Dimensionalityf(x)

f

[0, 1]dn = ✏�d

✏ =) kx� xik

{xi, f(xi)}ni=1

Page 6: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Sparse Linear Regression• Representation of

• Regression of linear in :

• Interpolates:

• Few samples can only learn small number of coefficients must have a sparse expansion of in to obtain good regression

• Sparsity inherits the properties of must possess the properties of

x : �(x) = {�p(x)}p

f̃(x) f(x) �(x)

{�p}p

=) f̃

{xi, f(xi)}ni=1=) {↵p}p=) f

{�p}p=) {�p}p f

f̃(x) = h↵,�(x)i =X

p

↵p�p(x)

f̃(xi) = f(xi)

Page 7: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

• State:

• Energy:

1. Invariant to actions of the isometry group:

2. Multiscale potential

3. Lipschitz continuous to the action of diffeomorphisms

• Want a representation with these three properties

Energy Properties

f(x)

x = {(pk, qk)}k

E(d) = R

doO(d)

Page 8: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

• Energy of interacting bodies (Coulomb, gravitation) Invariant to isometries Multiscale potential

• Point charges/masses:

• Potential:

• Diagonalized by Fourier modulus:

Classical PhysicsN

x 7! ⇢(u) =NX

k=1

qk�(u� pk)

f(⇢) =

ZbV (!)| {z }↵!

|⇢̂(!)|2| {z }�!(⇢)

d!O(N)coefficients

V (u) = |u|�� =) f(x) = f(⇢) =X

k 6=`

qkq`

|pk � p`|�

Page 9: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Wavelets• Complex wavelet:

• Dilated and rotated:

(u) = g(u)ei⇠·u, u 2 Rd, d = 2, 3

Real parts Imaginary parts

j,r(u) = 2�dj (2�jru), (j, r) 2 Z⇥O(d)

| b j,r(!)|2

Page 10: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Potential Diagonalized by Wavelet Energy Coefficients

• Properly defined wavelet energy coefficients define a representation that is invariant over isometries and gives a multiscale decomposition of the potential.Theorem (H., Mallat, Poilvert; 2014): For any there exists wavelets with

L2

✏ > 0

f(⇢) = (1 + ✏)X

j

↵j

Z

O(3)k⇢ ⇤ j,rk2 dr

| {z }�j(⇢)

O(logN) coefficients

Page 11: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Quantum Chemistry Density Functional Theory

• State: Positions and total protonic charges of the atoms Energy of the molecule

• Invariant to isometriesMultiscale potentialStability to diffeomorphisms

• Electronic density:

• Hohenberg-Kohn, 1964:

!

• Will have to learn representation as where is an approximate electronic density that can be efficiently derived from

f(x) =

x 7! ⇢(u)

f(x) = f(⇢) = min⇢̃

E(⇢̃), ⇢ = argmin⇢̃

E(⇢̃)

x = {(pk, qk)}k =

Complexvariationalproblem

x 7! �(⇢̃) ⇢̃x

Page 12: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Electronic DensitySolution to variational problem:

⇢ = argmin⇢̃

E(⇢̃)

E(⇢) =

T (⇢)

+

Z⇢(u)V (u)

+1

2

ZZ⇢(u)⇢(v)

|u� v|

+Exc

(⇢)

Kinetic energy

Electron-nucleiattraction

Electron-electronCoulomb repulsion

Exchange correlation energy

Page 13: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Electronic Density

Locally Kato Cusp Condition:

⇢(u) ⇠ e�2qk|u�pk|

Page 14: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Approximate Density

⇢̃(u) =X

k

q4ke�2qk|u�pk|

Page 15: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Stability to Diffeomorphisms

• Diffeomorphism

• Amplitude of diffeomorphism:

• Want Lipschitz stability to diffeomorphisms:

kr⌧k1

⇢̃ = D⌧⇢ =) k�(⇢)� �(D⌧⇢)k Ckr⌧k1k⇢k

1� ⌧ :

D⌧⇢(u) = ⇢(u� ⌧(u))

Page 16: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Fourier Unstable to Diffeomorphisms

• Fourier modulus representation:

• Fourier: Unstable to small diffeomorphisms is large at high frequencies

|⇢̂(!)|

|⇢̂(!)| |⇢̂⌧ (!)|

|⇢̂⌧ (!)|| |

�(⇢) = {�!(⇢)}! = {|⇢̂(!)|}!

⇢⌧ (u) = ⇢(u� ⌧(u))�

=) k�(⇢)� �(⇢⌧ )k � kr⌧k1k⇢k

Page 17: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Scattering RepresentationLayer 0p = ;

⇢̃

�;(⇢̃) =

Z

Rd

⇢̃(u) du

Page 18: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Scattering RepresentationLayer 0 Layer 1p = ;

⇢̃

p = j

⇢̃j(u, r) = |⇢̃ ⇤ j,r(u)|

⇢̃j0(u, r)

⇢̃j00(u, r)

�;(⇢̃) =

Z

Rd

⇢̃(u) du �j(⇢̃) =

Z

E(d)⇢̃j(u, r) du dr

Page 19: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Scattering RepresentationLayer 0 Layer 1 Layer 2p = ;

⇢̃

p = j

⇢̃j(u, r) = |⇢̃ ⇤ j,r(u)|

⇢̃j0(u, r)

⇢̃j00(u, r)

p = (j,�2)

|⇢̃j ~ �002(u, r)|

|⇢̃j ~ �02(u, r)|

|⇢̃j ~ �2(u, r)|

�;(⇢̃) =

Z

Rd

⇢̃(u) du �j(⇢̃) =

Z

E(d)⇢̃j(u, r) du dr �j,�2(⇢̃) =

Z

E(d)|⇢̃j ~ �2(u, r)| du dr

Page 20: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Scattering RepresentationLayer 0 Layer 1 Layer 2

�(⇢̃) = {�p(⇢̃)}p

p = ;

⇢̃

p = j

⇢̃j(u, r) = |⇢̃ ⇤ j,r(u)|

⇢̃j0(u, r)

⇢̃j00(u, r)

p = (j,�2)

|⇢̃j ~ �002(u, r)|

|⇢̃j ~ �02(u, r)|

|⇢̃j ~ �2(u, r)|

�;(⇢̃) =

Z

Rd

⇢̃(u) du �j(⇢̃) =

Z

E(d)⇢̃j(u, r) du dr �j,�2(⇢̃) =

Z

E(d)|⇢̃j ~ �2(u, r)| du dr

Page 21: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Scattering Properties!

!

• Invariant to isometries… yes

• Multiscale family of functions… yes

• Lipschitz stable to diffeomorphisms… yes Mallat, 2012

�(⇢̃) =

0

@k⇢̃kL1(Rd)

k⇢̃ ⇤ j,·kL1(E(d))

k|⇢̃ ⇤ j,·|~ �2kL1(E(d))

1

A

j,�2

Page 22: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Augment Scattering• From classical physics, we know energy

coefficients are needed to expand the Coulomb potential, which is also present in the quantum chemistry molecular energy.

L2

�(⇢̃) =

0

B@k⇢̃kpLp(Rd)

k⇢̃ ⇤ j,·kpLp(E(d))

k|⇢̃ ⇤ j,·|~ �2kpLp(E(d))

1

CA

j,�2; p=1,2

Page 23: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Quantum Chemistry Regression

• Two data bases of planar, organic molecules with up to 20 atoms

• Regression on Fourier and scattering coefficients:

!

• M-term sparse regression with greedy Orthogonal Least Squares computed on a training set:

{xi, f(xi)}i

{�p}p =

8<

:

L1/L2Fourier modulus coe�cients

or

L1/L2Scattering coe�cients

fM (x) =MX

k=1

↵k�pk(x)

Page 24: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

M-term Error

log2 M

log2 E(|f(X)� fM (X)|)

FourierScattering 1 Layer (Wavelets)

Scattering 2 Layers

Coulomb

Page 25: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Numerical Results• Mean absolute error in kcal/mol:

• Scattering expansion terms:

• First term:

• Selected scales: Important geometric scales

E(|f(X)� fM (X)|)

�n1(⇢̃) =

Z⇢̃(u) =

X

k

qk = total charge

Fourier Coulomb Scattering

400 molecules 21.40 13.09 6.61

4000 molecules 18.61 4.16 2.05

Page 26: High Dimensional Learning rather than Computing in Quantum …mate.dm.uba.ar/~hafg/FoCM14/talks/Hirn_FoCM2014.pdf · • Chemists want to build “Google of molecules” • Pharmaceutical

Conclusion• The scattering transform defines a representation

that captures the fundamental properties of molecular energy.

• One can learn the energy through data and compute it fast.

• Can we learn other physical functionals?

http://www.di.ens.fr/⇠hirn/


Recommended