Molecular Dynamics Valerie Daggett Bioengineering Department University of Washington...everything...

Post on 30-Dec-2015

213 views 0 download

Tags:

transcript

Molecular Dynamics

Valerie DaggettBioengineering DepartmentUniversity of Washington

...everything that living things do can be understood in terms of the jigglings and wigglings of atoms.

Richard Feynman

Protein Dynamics

• Proteins are not static• Motion is an incontrovertible consequence of existing @

room temperature (or any T > 0 K)• Kinetic energy per atom is ~ 1 kcal/mole @ 298K (25°C)

several Å/ps• Motion recognized to be important early on. Kendrew

(1950s) solved crystal structure of myoglobin (Perutz, phasing)

Myoglobin

No pathway for

O2 heme!

Protein DynamicsKendrew: “Perhaps the most remarkable features

of the molecule are its complexity and its lack of

symmetry. The arrangement seems to be almost

totally lacking in the kind of regularities which one

instinctively anticipates, and it is more complicated

than has been predicted by any theory of protein structure.”

Situation gets worse when you consider dynamics.

But proteins are dynamic and dynamic behavior critical for function. So, static, average structures are only part of the story.

Function from static structure?

Dynamics necessary for function

Snapshots

Static and/or average structures may not be representative of

conformations critical to function

Theory

• Experiment clearly demonstrates that proteins are mobile, but no single experiment or combination of experiments can provide an all-inclusive view of the dynamic behavior of all atoms in a protein.

• Computer simulations can however– ea. atom as a function of time

Why Molecular Dynamics?

• Most realistic simulation method available

• Can provide structural and dynamic information

unobtainable by experiment, but is experimentally

testable

• Native and nonnative interactions apparent

But• Sampling is limited, the goal is to sample

experimentally relevant regions of conformational

space, not all of conformational space

Molecular Dynamics

• Potential function for MD1,2 [sum of following terms]

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

1. Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: 215-231

2. Levitt M. et al. J. Phys. Chem. B (1997) 101: 5051-5061

Molecular Dynamics

• Potential function for MD

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

Molecular Dynamics

• Potential function for MD

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

b0

Molecular Dynamics

• Potential function for MD

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

θ0

Molecular Dynamics

• Potential function for MD

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

Φ0

Molecular Dynamics

• Potential function for MD

U = Bond + Angle + Dihedral + van der Waals + Electrostatic

Molecular Dynamics

• Non-bonded components of potential function

Unb = van der Waals + Electrostatic

• To a large degree, protein structure is dependent on non-bonded atomic interactions

Molecular Dynamics• Non-bonded components of potential function

Unb = van der Waals + Electrostatic

Molecular Dynamics• Non-bonded components of potential function

Unb = van der Waals + Electrostatic

Molecular Dynamics• Non-bonded components of potential

function

+ -

Molecular Dynamics• Non-bonded components of potential

function

+ +

Molecular Dynamics

• Non-bonded components of potential function

NOTE:Sum over all pairs of N atoms, or

pairs

2

1NN

N is often between 5x105 to 5x106

For 5x105 that is 1.25x1011 pairs

THAT IS A LOT OF POSSIBLE PAIRS!

What can you do with a force field?

Generation of ‘experimental’ structuresRefinement of ‘experimental’ structuresMonte CarloScoring functionsEnergy minimizationAnalysisPerform MD simulationsetc.

Molecular Dynamics• Time dependent integration of classical

equations of motion

Molecular Dynamics

• Time dependent integration

Molecular Dynamics

• Time dependent integration

Molecular Dynamics

• Time dependent integration

Molecular Dynamics

• Time dependent integration

Molecular Dynamics

• Time dependent integration

Molecular Dynamics

• Time dependent integration

Molecular Dynamics• Time dependent integration

Evaluate forces and perform integration for every atom

Each picosecond of simulation time requires 500 iterations of cycle

E.g. w/ 50,000 atoms, each ps (10-12 s) involves 25,000,000 evaluations

8)]()(6)(3[)()(

8)]()(5[)()()(

2

tttatattatvttv

tttatattvtxttx

iiiii

iiiii

Molecular Dynamics

Actual integration the equations of motion

Conserves energySmooth, robust

b

ii

Nk

vmT

3

2

Molecular Dynamics

Determination of temperature

Methods• Molecular dynamics (MD)

– Brooks-Beeman integration algorithm– Microcanonical ensemble (NVE)

• Number of atoms, box volume, & energy are conserved

• Energy conservation is naturally satisfied with classical equations of motion

• Energy conservation is an inherent check on the implementation

• Free from coupling the microscopic system to macroscopic variables as do NVT and NPT

Molecular dynamics

Microcanonical ensemble, all atoms, solvent,

fully flexible molecules, continuous trajectories,

no restraints/biases

predictive MD---expt to check

No Ewald --- artificial periodicity, altered conformational and dynamical properties

No fictitious bonds between H atoms of water

No Shake

Correct masses

Good simple, flexible water correct D and RDF

Methods• Molecular dynamics (MD)

– Temperature in NVE• Mean T over hundreds of steps• Energy drift in ilmm is primarily kinetic

resulting from numerical round-off– Over thousands of steps mean T can be monitored

for energy conservation– Velocity rescales once per 10ns

Implementation• Written in C

– Ubiquitous, standardized, optimized language• 64 bit math• Software design

– Kernel• Compiles user’s molecular mechanics programs• Schedules execution across processor and machines

– Modules, e.g.• Energy minimization• Molecular Dynamics• Monte Carlo• Analysis• REMD• RDCs • + others

Implementation• Dual mode parallelization

– Standardized tools available on modern platforms• POSIX threads

– Distribute computations across multiple CPUs in a single computer

• Message Passing Interface (MPI)– Distribute computations across multiple

computers on a high speed network

– Benefit is scalability

State of the Art MD

What can be done with PCs?

Environment:• Possible to characterize solvent-dependent conformational behavior • Proteins in membranes

Size: •≤ 500 residues (more possible if willing to dedicate resources to it, our record is 2519 residues in solvated membrane)

Timescale:• Multiple 20-100 ns simulations fairly routine for proteins• s possible if willing to dedicate resources to it

Molecular Dynamics• MD provides atomic resolution of native

dynamics

PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics• MD provides atomic resolution of native

dynamics

PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics• MD provides atomic resolution of native

dynamics

3chy, hydrogens added

Molecular Dynamics• MD provides atomic resolution of native

dynamics

3chy, waters added (i.e. solvated)

Molecular Dynamics• MD provides atomic resolution of native

dynamics

3chy, waters and hydrogens hidden

Molecular Dynamics• MD provides atomic resolution of native

dynamics

native state simulation of 3chy at 298 Kelvin, waters and hydrogens hidden

Molecular Dynamics• MD provides atomic resolution of native

dynamics

native state simulation of 3chy at 298 Kelvin, waters and hydrogens hidden

Average may not be representive

Storch et al., Biochem, 1995, 1999a,b, 2000

Dynamic cleft discovered through MD

Cytochrome b5

Storch et al., Biochem,1999

Bill Atkins, PatriciaCampbell

S18 R47

Construction of mutants to testwhether cleftforms

Construction of cyt c – cyt b5 complexes

Changes in cyt b5 upon binding cyt c

= Predicted binding surface

= Change in chemical shift

Hom et al., Biochem, 2000

Cleft allows for electron transfer through the protein in channel lined with aromatics

= Nonpolar

= Polar

Validation

• Validation, how do you know if a simulation is correct?

• How do you know it is done?

Starting a Molecular Dynamics Simulation

Crystal orNMR Structure

Solvate with water or other solvent 8 - 14 Åfrom protein

Heat to desiredtemperature andallow motion to evolve over time

T = 298 K r = 0.997 gm/ml

T = 498 Kr = 0.829 gm/ml

All atoms presentFully flexible waterNVE

Native Dynamics at 25 ºC

All C atoms

C R

MS

D (

Å)

Time (ns)

Active site loop and N-terminus removed

3

0.5

2

0

2.5

1.5

1

5 15 4035302520 5045100

<RMSD> = 1.7 Å

<RMSD> = 0.7 Å

Crystal structure

NOEsreproduced

Crystal Structure After 50 ns of MD

Structural Changes to Native State During MD

= Xtal Structure= 50 ns MD

Active Site Loop

N-terminus

Turn

Chymotrypsin Inhibitor 2

• Expt = Shaw et al (1995) Biochem 34:2225

• MD = Li & Daggett (1995) Prot. Eng. 8(11)

N15–H

no mobility

high

Temperature

(K)

Cutoff range(Å)

Timea

(ns)NOEs satisfiedb

(% of 603)

Xtal 91.00

298 8 100 98.18

298 8 50 98.51

298 8 50 98.00

298 8 20 97.18

298 8 20 97.84

298 8 20 98.18

298 10 100 98.84

298 10 50 97.68

298 10 50 97.51

298 10 20 97.84

298 10 20 97.54

298 10 20 97.35

310 8 100 98.51

310 8 50 98.01

310 8 50 98.51

310 10 100 98.68

310 10 50 98.01

310 10 50 98.51

323 8 100 98.34

323 8 50 97.35

323 10 100 98.34

323 10 50 98.01

No restraints22 simulations>1.2 s

NOEs courtesy of Stefan Freund& Trevor Rutherford, ARF

Pushing to high temperature

• Taking excursions farther from the native state, will the force fields and methods hold?

• Thermal unfolding of proteins

Thermal Denaturation of CI2 at 498 K

Ca

RM

SD

)

Time (ns)

0

2

4

6

8

10

12

14

16

18

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 2.4 2.6 2.8 3

N298

MD1

MD2

MD3

MD4

Li & Daggett, 1994, PNAS; JMB, 1996

N

TS?

D

•TS not localized to single bond, distributed and ensemble•From MD cannot calculate DG along reaction coordinate•Structure-based definition of TS:

•Kinetically, protein will not succeed in every attempt to cross TS but will change rapidly afterwards•A process with a large change in energy but small change in entropy large change in free energy

NTS

D

Reaction Coordinate

H

G

-TS

Identifying Transition States in MD Trajectories

TS

D

Projection of Trajectory in RMSD Space

Calculate the RMSD between all structures---15,000 x 15,000 dimensional spaceReduce to 3-dimensions, distance between points RMSD between structures

Clusters indicate similar conformations, conformational states

Li & Daggett, 1994, 1996

N

Main-chain Fold Preserved in Transition State

a

Crystal Structure Average TS1 Structure~4 Å, 43 % native H-Bonds

b1

b2

b3

Packing is Disrupted in Transition State

Crystal StructureWT TS132 % SASA

Structure of TS from Experiment

= 1 site of mutation native-like in TS = 0 site of mutation unfolded in TSFractional values partial structure in TS

= GTS-D / GN-D = 1

N

TS

D

GN-D

GTS-D

GN-D

= GTS-D / GN-D = 0

N

TS

D

GN-D

GTS-D

GN-D

N TS D

Matouschek, Kellis, Serrano, Fersht, 1989, Nature, Fersht, Leatherbarrow, Wells, 1986, Nature; 1987 Biochem, +++

Calculation of S Values for Comparison with Experimental F Values

S = structure index = (S2º ) (S3º )

% native secondary structure ()

% tertiarystructure,contacts

For each residue calculate:

Daggett & Li, 1994, PNAS; Daggett, Li, Itzhaki, Otzen & Fersht, 1996, JMB

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

1.2

1.3

1.4

0 5 10 15 20 25 30 35 40 45 50 55 60 65

Phi ValueS Value

Residue Number

Ph

i or

S

Valu

e

S V

alu

ePhi Value

Comparison of Calculated S Values and Experimental Values

Otzen et al., 1994, PNAS; Itzhaki et al., 1995, JMB; Li & Daggett, 1994,1996, JMB; Daggett et al., 1996, JMB

R = 0.90

0

0.2

0.4

0.6

0.8

1

1.2

1.4

0 0.2 0.4 0.6 0.8 1 1.2 1.4

Overall TS Structure and Unfolding Pathway are Independent of Temperature

373 K

398 K

448 K

473 K

498 K

(21 ns)

(0.3 ns)

(0.57 ns)

(1.44 ns)

(8.26 ns)

TS1

TS2

TS3

TS4

498 K

(0.225 ns)

(0.335 ns)

(0.1 ns)

(0.07 ns)

T

Rotate 90

to right

Conformational Heterogeneity of TS

= Crystal Structure= TS1-4, 498 K= TS5-9, T

<RMSD>XTAL<RMSD>MD 4.5 Å

N TS

D

Free Energy Calculations for Direct Determination of

WT

Mutant

N TS D

N' TS' D'

GNTS GTSD

G'TSDG'NTS

GN GTS GD

GND

G'ND

GN-TS = GN-TS - G'N-TS = GN - GTS

GTS-D = GTS-D - G'TS-D = GTS - GD

GN-D = GN-D - G’N-D = GN - GD

F = GTS-D / GN-D

GN-D GN-TS GTS-D

F Mut GN GTS

GD Calc Expt Calc Expt Calc Expt Calc Expt A16G 2.28 2.17 1.10 1.18 1.09 0.11 -0.20 1.07 1.15 0.91 1.06 V19A 1.68 1.03 1.09 0.59 0.49 0.77 0.86 -0.06 -0.13 -0.10 -0.26 I20V 2.64 1.71 1.42 1.22 1.30 0.93 0.57 0.29 0.52 0.24 0.40 I29V 2.45 1.75 1.04 1.41 1.11 0.70 0.69 0.71 0.19 0.50 0.37 V47A 2.60 0.87 0.74 1.86 4.93 0.73 3.81 1.13 1.02 0.61 0.21 L49A 2.64 0.78 0.63 2.01 3.80 0.86 1.45 1.15 2.11 0.57 0.53 V51A 4.24 2.77 1.92 2.32 1.98 1.47 1.00 0.85 0.49 0.37 0.25 I57A 5.42 2.60 1.80 3.62 4.29 2.82 3.46 0.80 0.36 0.22 0.08

R = 0.85 R = 0.91 (no V47A)

FEP Calculations for Hydrophobic Core Mutants

(kcal/mole)

Pan & Daggett, Biochem, 2001

Extended peptide NOT a good model of D

DA 23 TS

A23

K2

E7

WT TS RF48 TS

K2

E7

D23

R48

R62

F50F48

R62

F50

Ladurner, Itzhaki, Daggett, Fersht, 1998, PNAS

Designing Faster Folding Forms of CI2Based on MD-Generated TS Models

Protein kf

H2O (s-1) GTS-D (kcal/mole)

ku

H2O (s-1) GTS-N (kcal/mole)

Wild type

56.5

1.2

DA23 83.9 -0.23 3.8 -0.69

RF48 2300 -2.18 nd -0.75

Removal of Unfavorable Interactions Identified in TS Models Accelerates Folding

Removal of charge repulsion and improvement of packingin the TS yields fastest-folding form of CI2.

Thermal Denaturation of CI2 at 498 K

C R

MS

D (

Å)

Time (ns)

0

2

4

6

8

10

12

14

16

18

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 2.4 2.6 2.8 3

N298

MD1

MD2

MD3

MD4

D40,000structures

The Denatured State of CI2

Distances in N:W5-V14 15 ÅI30-Y42 13 ÅP33-I37 11 Å

V14

Y42

I37

I57

I30

L49

P33

Kazmirski et al., PNAS, 2001

Experimental Results

<3JNH-CH>expt = 7.2 Hz

<3JNH-CH>MD = 7.0 Hz

V19-L21, I30-T36

Nearly random coil

(res. 17-21)

hydrophobic clustering

W5

Summary of CI2 Simulations• N is well behaved and in good agreement with experiment.

•TS is an expanded version of N with disrupted core and loops and frayed secondary structure.

Validity of MD-generated TS models tested through indirect comparison with experimental values, direct comparison of Gs, behavior when T is quenched, and design of faster folding mutants.

•WT D is very disrupted with only minor amounts of hydrophobic clustering and fluctuating helical structure. Nearly random coil.

25 50 75 100

10-9

10-8

10-7

10-6

10-5

10-4

103

104

105

106

107

108

Temperature ( oC)t 1

/2

(s)

k ob

s (s

-1)

kunf

kf

En-HD unfolds at 348 & 373 K on the same timescale by simulation and experiment

47,000 s-1

Time to reachTS in MD simulation

But, it is notenough to get the timescaleright, must get pathway too!

10 °C

Mayor et al., Nature 2003

Development of information-rich property space

Low information content property:Main-chain non-polar SASANo discrimination between native & non-native states

Native

Nonnative

Development of information-rich property spaceHigh information content property:

CONGENEAL structural dissimilarity score1

Excellent discrimination between native & non-native states

1. Yee and Dill, Protein Science, 1993.

Native

Nonnative

• Mean distance in PS (32 -> 10 properties) for a given conformation to the folded or native state ‘cluster’ is acceptable reaction coordinate– Value increases with distance from native cluster– Native cluster is bounded

Development of a reaction coordinate from property space

PSr

• Foldedness ↔ location along folding reaction coordinate