+ All Categories
Home > Documents > FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain...

FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain...

Date post: 13-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
14
Pula April 21, 2008 FlexCryst: A tool for molecular simulations FlexCryst: A tool for molecular simulations in biochemistry and crystallography in biochemistry and crystallography
Transcript
Page 1: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

Pula

April 21, 2008

FlexCryst: A tool for molecular simulations FlexCryst: A tool for molecular simulations

in biochemistry and crystallographyin biochemistry and crystallography

Page 2: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

FlexCryst: A tool for molecular simulations in biochemistry and FlexCryst: A tool for molecular simulations in biochemistry and crystallography (D.W.M. Hofmann, 21 April 2008, 10:30)crystallography (D.W.M. Hofmann, 21 April 2008, 10:30)

Abstract: In this talk we present FlexCryst: Flexcryst is a program suite with a graphical user interface (GUI). Presently it contains 6 modules, ligand receptor docking, crystal structure prediction, crystal structure determination, sublimation energy of crystal structures, and comparison of powder diagrams. Main feature of the modules is the high velocity. This is connected with the selected approach: all modules are based on classical mechanics and take only short range interaction into account. The energy function has been optimized by data on the largest existing database of crystal structures, the Cambridge Structure Database (CSD). The mean error of the energy function has been tested by prediction of the sublimation energy of more than 100 crystal structures and has been found to be around 7 KJ/mole. The energy function was applied to the problem of ligand-receptor docking in collaboration with BiosolveIt. It shows better scoring for small shifts of the ligands, however it is less tolerant to large shifts. A further advantage of the trained energy function is that it gives real bonding energies rather than arbitrary scores.

Page 3: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Main features of FlexCryst

consists of 6 independent modulesbases on classical mechanicsscoring function evaluates very quicklymolecules are treated as rigidscoring function predicts well sublimation energiesa graphical user interface is available common input formats are supported

OverviewOverview

Page 4: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Converting of Data FormatsConverting of Data Formats

Supported input formats:

• CIF (.cif) Crystallographic Information File (CIF) format -- International Union of Crystallography format for crystal structure data.

• CSSR (.cssr) SERC Daresbury Laboratory's Cambridge Structure Search and Retrieval (CSSR) file format.

• PDB (.pdb) Protein Data Bank format for 3D molecules.

• Mol2 (.mol2) Tripos Mol2 File Format

Page 5: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.comEnergy functionEnergy function

Pula, 21 April 2008

EIJ [KJ /mole]=∑i=1

nI

∑j=1

n J

type i ,type j , r ij=∑

i=1

nI

∑j=1

n J

mnr ij

The energy function:

fast to evaluate potentials are tabled at auxiliary

points cut off at 5.77 Å

pair potentials are steadypotentials have optimum shape

parameters are optimized by data mining

short term potentials contain long term interactionshydrogen is differentiated in various types

energy function:

auxiliary points (descriptors): K= {kmnl :kmnl=100/ l2 with 3≤l≤60}

pair potential (weights):

mnr ij=f −l mnl l1−f mnl1 with l=⌊f ⌋ and f=100/r ij

2

cutoff: r cutoff=k3=100/32≈5.77 A

example: the H(N)...O potential

Page 6: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Parametrization by data miningParametrization by data mining

Each experimental structure should be in local minimum of energy (if the structure is correct). Therefore any slightly distorted structure (decoy) must be higher in energy.

2

1

minimum condition: En≤Enme

The error function J (perceptron) measures the distorted structures, which violate the minimum condition.

error function: J = ∑n=1

400000

∑m=1

10

max [0,En−Enme2 ]

Aim of the minimization with neuronal networks or simplex algorithm is to separate as good as possible the decoys and the experimental structures by an hyperplane. The perpendicular vector to this plane contains the parameters of the force field (weights).

frequency of contact type 1(descriptor 1)

freq

uency

of

con

tact

type 2

(desc

rip

tor

2)

The figure shows one experimental structure with ten decoys. The structures are described by two selected types of contacts. Two of the decoys are outliers and can not be separated by the hyperplane (in the two dimensional case it reduces to a line).

Page 7: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Prediction of sublimation energiesPrediction of sublimation energies

A wide variety of solid substances occurs in crystalline or microcrystalline state (e.g. drugs). One of the most important properties are the sublimation energy. Further properties can be derived from this energy.

The sublimation energy can be estimated with an accuracy of 8.9 KJ/mole, which is correlated to the experimental accuracy. In the database some structures (same color) occur manifoldly with slightly different coordinates. The resulting energy difference determines the mean error.

mean error:

8.9 KJ/mole

Page 8: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Crystal Structure PredictionCrystal Structure Prediction

experimental structure

simulated structure (rank 2)

Thermodynamical properties can be derived very well, because all high ranking crystal structures have similar energy. The conformation has to be highly accurate All atoms must have ordinary environment The experimental crystal structure can be determined by help of additional information (e.g. powder diagram), because it is found among the first hundred predicted crystal structures.

example: Blind test at Cambridge 2007, Molecule XIV

One unique substance (drug, pigment) can occur in several crystal structures (polymorphs). These different polymorphs have different kinetic properties, as the rate of solubility, and can be patented separately

Page 9: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Comparing of Crystal StructuresComparing of Crystal Structures

similarity

0.005

Crystal structures and powder diagrams can be compared automatically. The results are visualized in dendrograms. Main applications of it are:

Screening off manifold predictions Search for similar crystal structures Systematizing of powder diagrams Cleaning of data bases

Page 10: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Crystal Structure DeterminationCrystal Structure Determination

Pula, 21 April 2008

conformation generation • rigid molecules e.g. drugs and pigments• programs• intuitive

crystal structures generation

conformation optimization DMOL3, Gaussian, and other quantum chemical programs

crystal structure minimization

remove double predicted strutures

scoring and sorting

automatic refinement

Page 11: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

FlexibilityFlexibility

Drugs are very often rigid. Flexible drugs loose during docking degrees of freedom, which comes along with a decrease of the entropy. In result the binding affinity of flexible molecules is lower than for rigid molecules

caffeine (30mg/Espresso)

nicotine (1mg/Cigarette)

progesterone contraception

Clopidogrel antiplatelet

(annual sales 6,057,000 USD)

HproteinHdrug−TSdrug⇔Hcomplex

However, flexible molecules show much more often biological activity and are more practical to find leading structures for drug design. Due to the flexibility they can easier conform themselves to a given pocket of a protein.

Page 12: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

cis-4-octeneC. Schauerte, C. Buchsbaum,L. Fink, D. W. M. Hofmann,M.U. Schmidt, J. Knipping, and R. Boese Acta Cryst. (2005). A61, C290

Pigment Yellow 111

Examples of determined crystal structures from X-ray powder diagrams

Pigment Red 181

D.W.M.Hofmann and L.N.Kuleshova, J. Appl. Cryst. (2005) 38, 861-866.

Crystal Structure DeterminationCrystal Structure Determination

Page 13: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

DockingDocking

The presented scoring function gives the energy for a given ligand-protein complex. This gives access to thermodynamical properties of the complex

In the figure we show a complex generated by FlexX. The receptor is repositioned and the energy of the complex is calculated

Page 14: FlexCryst: A tool for molecular simulations in ... · data mining short term potentials contain long term interactions hydrogen is differentiated in various types energy function:

FlexC

ryst

– h

ttp:/

/ww

w.f

lexc

ryst

.com

Pula, 21 April 2008

Docking: present stateDocking: present state

The trained force field scores more accurate as common scoring functions and gives the free energy for ligand protein complex rather than an arbitrary score.

In this figure the trained force field is compared to the scoring function implemented in FlexX. For small rmsd the number of correctly predicted complexes is always higher (blue chart). For large rmsd FlexX scores better (violet chart)

nFl

exX-n

FlexC

ryst

number of ranks taken into account

The high accuracy becomes even more obvious, if the experimental structure is added to the poses. In hits example around 90-times the experimental structure obtains the lowest energy and the rmsd is zero.

rmsd [Å]

num

ber

of

ligand-p

rote

in c

om

ple

x

•trained force field, experimental structure included

•FlexX, experimental structure included•trained force field without experimental structure

•FlexX, without experimental structure•best structure generated

Home


Recommended