+ All Categories
Home > Documents > Chemoinformatics

Chemoinformatics

Date post: 06-Jan-2016
Category:
Upload: shima
View: 46 times
Download: 0 times
Share this document with a friend
Description:
Chemoinformatics. P. Baldi, J. Chen, and S. J. Swamidass School of Information and Computer Sciences Institute for Genomics and Bioinformatics University of California, Irvine. Overall Outline. Introduction Molecular Representations Chemical Data and Databases Molecular Similarity - PowerPoint PPT Presentation
Popular Tags:
30
Chemoinformatics P. Baldi, J. Chen, and S. J. Swamidass School of Information and Computer Sciences Institute for Genomics and Bioinformatics University of California, Irvine
Transcript
Page 1: Chemoinformatics

Chemoinformatics

P. Baldi, J. Chen, and S. J. SwamidassSchool of Information and Computer Sciences

Institute for Genomics and BioinformaticsUniversity of California, Irvine

Page 2: Chemoinformatics

2

Overall Outline

1. Introduction2. Molecular Representations 3. Chemical Data and Databases4. Molecular Similarity5. Chemical Reactions6. Machine Learning and Other Predictive

Methods7. Molecular Docking and Drug Discovery

Page 3: Chemoinformatics

3

1. Introduction

• What is Chemoinformatics

• Resources

• Brief Historical Perspective

• Chemical Space: Small Molecules

• Overview of Problems and Methods

Page 4: Chemoinformatics

4

What is Chemoinformatics?

• chemoinformatics encompasses the design, creation, organisation, management, retrieval, analysis, dissemination, visualization and use of chemical information

Page 5: Chemoinformatics

5

What is Chemoinformatics?

• "the mixing of information resources to transform data into information and information into knowledge, for the intended purpose of making better decisions faster in the arena of drug lead identification and optimizaton"

Page 6: Chemoinformatics

6

What is Chemoinformatics?

• “the set of computer algorithms and tools to store and analyse chemical data in the context of drug discovery and design projects”

• However: drug design/discovery is to chemoinformatics like DNA/RNA/ protein sequencing is to bioinformatics

Page 7: Chemoinformatics

7

Resources

Books:

J. Gasteiger, T. E. and Engel, T. (Editors) (2003). Chemoinformatics: A Textbook. Wiley.

A.R. Leach and V. J. Gillet (2005). An Introduction to Chemoinformatics. Springer.

Journal:

Journal of Chemical Information and Modeling

Web:

http://cdb.ics.uci.edu

and many more………

Page 8: Chemoinformatics

8

Brief Historical Perspective

• Historical perspective: physics, chemistry and biology

• Theorem:

computers/biology or computers/physics>> computers/chemistry

• Proof:

Genbank, Swissprot, PDB, Web (CERN), etc..

Page 9: Chemoinformatics

9

Caveat: Long Tradition

• Quantum Mechanics• Docking• Beilstein• ACS• Etc…

Gasteiger, J. (2006). "Chemoinformatics: a new field with a long tradition." Anal Bioanal Chem(384): 57-64.

Page 10: Chemoinformatics

10

Possible Causes

• Alchemy• Industrial age and early commercial

applications of chemistry• Concurrent development of modern

computers and modern biology• Scientific differences (theory/process)• Psychological perceptions (life/inert)• ACM

Page 11: Chemoinformatics

11

Chemical Space: Small Molecules in Organic Chemistry

• Understanding chemical space• Small molecules:

– chemical synthesis– drug design – chemical genomics,– systems biology– nanotechnology– etc

Page 12: Chemoinformatics

12

“A mathematician is a machine that converts coffee into theorems” P. Erdos

Page 13: Chemoinformatics

13

Cholesterol

Page 14: Chemoinformatics

14

Aspirin

Page 15: Chemoinformatics

15

“A chemoinformatician is a machine …..…”

Page 16: Chemoinformatics

16

Chemical Space

Stars Small Mol.

Existing 1022 107

Virtual 0 1060 (?)

Mode Real Virtual

Access Difficult “Easy”

Page 17: Chemoinformatics

17

Chemoinformatics

• Historical perspective: physics, chemistry and biology• Understanding chemical space• Small molecules (chemical synthesis, drug design,

chemical genomics, systems biology, nanotechnology)• Predict physical, chemical, biological properties

(classification/regression)• Build filters/tools to efficiently navigate chemical space to

discover new drugs, new reactions, new “galaxies”, etc.

Page 18: Chemoinformatics

18

Chemo/Bio Informatics

Two Key Ingredients

1. Data

2. Similarity Measures

Bioinformatics analogy and differences:– Data (GenBank, Swissprot, PDB)– Similarity (BLAST)

Page 19: Chemoinformatics

19

Computational/Predictive Methods

• Spetrum of methods:

– Quantum Mechanics– ….– Molecular Mechanics– ….– Machine Learning

Page 20: Chemoinformatics

20

Quantum Mechanics

Schrodinger’s Equation (time independent)

Hψ=EψH=(-h2/8π2m)∂2+V = Hamiltonian Operator

E= Energy

V =external potential (time independent)ψ= ψ(x,t) =(complex) wave function = ψ(x)T(t)

(time independent case) Ψ2 = Ψ* Ψ =probability density function (particle at

position x)

Page 21: Chemoinformatics

21

Schrodinger Equation

• Partial differential eigenvalue equation• Where are the electrons and nuclei of a molecule in

space?• Uncer a given set of conditions, what are their energies?• Difficult to solve exactly as number of particle grows

(electron-electron interactions, etc)• Approximate methods

– Ab initio– Semi empirical

• 3D structures• Reaction mechanisms, rates

Page 22: Chemoinformatics

22

Ab Initio

• Limited to tens of atoms and best performed using a cluster or supercomputer

• Can be applied to organics, organo-metallics, and molecular fragments (e.g. catalytic components of an enzyme)

• Vacuum or implicit solvent environment• Can be used to study ground, transition, and

excited states (certain methods)• Specific implementations include: GAMESS,

GAUSSIAN, etc.

Page 23: Chemoinformatics

23

Semiempirical Methods• Semiempirical methods use parameters that

compensate for neglecting some of the time consuming mathematical terms in Schrodinger's equation, whereas ab initio methods include all such terms.

• The parameters used by semiempirical methods can be derived from experimental measurements or by performing ab initio calculations on model systems.Limited to hundreds of atoms

• Can be applied to organics, organo-metallics, and small oligomers (peptide, nucleotide, saccharide)

• Can be used to study ground, transition, and excited states (certain methods).

• Specific implementations include: AMPAC, MOPAC, and ZINDO.

Page 24: Chemoinformatics

24

Molecular Mechanics

• Force field approximation

• Ignore electrons

• Calculate energy of a system as a function of nuclear positions

Page 25: Chemoinformatics

25

Molecular Mechanics

Energy = Stretching Energy + Bending Energy + Torsion Energy + Non-Bonded Interactions Energy

Page 26: Chemoinformatics

26

Stretching Energy

Page 27: Chemoinformatics

27

Bending Energy

Page 28: Chemoinformatics

28

Torsion Energy

Page 29: Chemoinformatics

29

Non-Bonded Energy

Page 30: Chemoinformatics

30

Statistical/Machine Learning Methods

NNs and recursive NNsGASGsGraphical ModelsKernels………Representations are essential. Must either (1) deal

with non-standard data structures of variable size; or (2) represent the data in a standard vector format.


Recommended