Lecture Series Special Topics in Protein Chemistry (equivalent to a 2- credict course) Lecturer:...

Lecture SeriesSpecial Topics in Protein Chemistry(equivalent to a 2-

credict course)

Lecturer: Alpay Taralp, Materials Science & Engineering Program, Sabancı University, Istanbul 34956; [email protected]; http://people.sabanciuniv.edu/~taralp/

Day 1 (3h)Introduction to protein chemistryStrategies used by enzymes to accelerate reaction ratesDay 2 (3h)Protein stability elucidated and enhanced via protein engineeringProtein folding & unfolding probed via protein engineeringDay 3 (2.5h)Protein folding and unfolding probed via protein engineering (continued)

Day 4 (3h)The combined power of in vitro chemical modification and paper-supported chromatography as a probe of structure and functionDay 5 (3h)The combined power of in vitro chemical modification and paper-supported chromatography as a probe of structure and function (continued)In vitro manipulation of protein monomers or their environment to enhance performanceDay 6 (2.5h)In vitro manipulation of protein monomers or their environment to enhance performance (continued)A closer look at optimizing protein function in non-aqueous environmentsProtein purification and related analytical methodsShort examination scheduling

Lecture Topics by Day

© 2006, Alpay Taralp, Sabanci University

Introduction to Protein Chemistry


References Relevant to this Material1. Lundblad, R.L., Techniques in protein modification, CRC Press, 1995, 0-8493-2606-02. Wong, S.S., Chemistry of protein conjugation and crosslinking, CRC Press, 1991, 0-8493-5886-83. Nagradova, N.K., Lavrik, O.I., Kurganov, B.I., Chemical Modification of Enzymes, Nova Science, Inc., 1995, 1-5607-2238-X4. Brown, W.E., Howard, G.C., Practical Methods in Advanced Protein Chemistry, CRC Press, 2000, 0-8493-9453-85. J.M.Walker, Ed., Protein protocols on CD-ROM, Humana Press, 1998, 0-89603-514-X6. Darbre, A., Practical Protein Chemistry: A Handbook, John Wiley and Sons, 1986. 7. Eyzaguirre, J., Chemical Modification of Enzymes: Active Site Studies, Prentice Hall, 1987, 0-47020-763-98. Methods in Enzymology Series, Academic Press,Vols.11, 25-27, 47-49, 61, 91, 117, 130, 131, 135-137.9. Glazer, A.N, Delange, R.J., Sigman, D.S., Chemical Modification of Proteins, Elsevier Science, 1975, 0-44410-811-410. Feeney, R.E., Whitaker, J.R., American Chemical Society Advances in Chemistry Ser. (No. 160) - Food Proteins: Improvement Through Chemical & Enzymatic Modification, Books on Demand, 0-31710-649-X11. Feeney, R.E., Whitaker, J.R., Modification of Proteins: Food, Nutritional & Pharmacological Aspects, 1982, 0841206104 Advances in Chemistry Ser. (No. 198) American Chemical Society12. Feeney, R.E., Whitaker, J.R., Protein Tailoring & Reagents for Food & Medical Uses, Marcel Dekker Incorporated, 1986, 0-82477-616-X.13. Bailey, J. L., Techniques in Protein Chemistry, Elsevier Publishing Company, 1962, Lib. Congress 62-19691.14. Lundblad, R. L., Chemical Reagents for Protein Modification, 2nd Ed., CRC Press, 1991, 0-8493-5097-2.15. Walsh, G., Headon, D.R., Protein Biotechnology, John Wiley and Sons, 1994, 0-471-94393-2.16. Mean, G., Feeney, R.E., Chemical Modification of Proteins, Holden Day, Inc., 1971, Lib. Congress 74-140785.17. McGrath, K., Kaplan, D., Protein-Based Materials, Birkhauser, 1996, 0-8176-3848.18. Koskinen, A.M.P. and Klibanov, A.M., Enzymatic Reactions in Organic Media, Blackie Academic and Professional, 1996, 0-7514-0259-1.19. Suckling, C.J., Gibson, C.L., Pitt, A.R., Enzyme Chemistry: Impact and Applications, Blackie Academic and Professional, 1998, 0-7514-0362-8.20. Magdassi, S., Surface Activity of Proteins: Chemical and Physicochemical Modifications, Marcel Dekker, Inc., 1996, 0-8247-9532-6. 21. Rawn, J. D., Proteins, Energy and Metabolism, Neil Patterson Publishers, 1989, 0-89278-404-0.22. Fersht, A.R., Structure and Mechanism in Protein Science, W.H. Freeman and Company, 1999, 0-7167-3268-8.23. Oxender, D.L., Fox, F.C., Protein Engineering, Alan R. Liss, Inc., 1987, 0-8451-4300-X.24. Crieghton, T.E., Protein Structure: A Practical Approach, 2nd Edition, Oxford University Press, 1997, 0-19-963618-4.25. Crieghton, T.E., Proteins: Structures and Molecular Properties, 2nd Edition, W.H. Freeman and Company, 1993, 0-7167-2317-4.26. Wüthrich, K., NMR of Proteins and Nucleic Acids, John Wiley and Sons, 1986, 0-471-82893-9.27. Journals focused on the subject of protein chemistry: Journal of Protein Chemistry; Protein Science; Biochemistry; Journal of Biological Chemistry; Biomacromolecules28. Catalogues! Promega Protein Guide: Tips and Techniques; Pierce Products; Biorad Life Science Research Products


The way to operate -True or false? A student of protein chemistry…

1. buys the best possible instrument and then tries to force the problems of protein chemistry to suit the use of the machine.

2. works in a problem-oriented manner in which experience and knowledge are adopted to accommodate available machines.

3. relies first on imagination, then knowledge, then machines (Consider the contrast between H. Noyrath vrs. B. Hartley). What was one of Einstein's quotations?

4. believes that protein investigation is as simple and amusing as watching Indiana Jones running away from a band of sword-wielding bandits (The Okum's Razor argument).

5. should use all his/her time reading primary references and never use his/her own ideas, intuitions or beliefs

6. gives more credit to the ideas of a supervisor than to their own ideas© 2006, Alpay Taralp, Sabanci University

To study proteins is to study diversity! i.e., diversity of structure, function, chemistry, analysis, etc.To emphasize the scope of diversity, let us focus on structural diversity...

• Structure is a shape, sequence, order, orientation, configuration, etc. of an atom or molecule.

• Eg. The electronic structure of carbon is 1s22s22p2.

• Eg. CCl4 has a tetrahedral shape.

• Eg. The primary structure of insulin begins with:

• Eg. The tertiary structure of cytochrome c is globular.

Cl

CCl

ClCl

Diversity of structure: Static vs. dynamic• Structure (an other traits) may be static (fixed)

or dynamic (changing) over time.

• The time frame of structural change may be very long (the half life of 238U is 4.5x109 years) or very brief (a 10 fs chemical interaction)

Ra226U238I-CH3-OH

( shorter) Life-time of event (longer )

http://www.quietlywild.com/qwpix/recentpix/turtle.jpg

http://images.google.com.tr/imgres?imgurl=www.charismac.com/images/rabbit.jpg&imgrefurl=http://www.charismac.com/&h=378&w=405&sz=44&tbnid=f4egH-qP6GIJ:&tbnh=112&tbnw=120&prev=/images%3Fq%3Drabbit%26hl%3Dtr%26lr%3D%26ie%3DUTF-8%26inlang%3Dtr

We must characterize structural diversity to understand proteins

• Question 1: Can you see the 3-D shape of myoglobin with your eyes?

• Question 2: Can you live 4.5 billion years to see ½ of the 238U decay?

• Question 3: Can you react quickly enough to measure a chemical interaction?

The answer to each is NO! >> So we must use machines...Why? Reason one: We are limited by resolving power…If our information carrier is visible light, we are limited to an approximate resolution of 0.2m. Details smaller than 0.2m are lost.

Look Wilma, whata nice smoothsurface!! Light

you see

actual surface

500nm

Reason two: The event is often faster than the speed of the measurement…You obtain a blurred average.

• Eg. Try photographing chicks in a bowl

a.

b.

c.

Nature features an invisible world of details & diversity. Instruments

allow us to see these details…

Balloons pierced with a bullet

Dynamic changes along an aqueous surface: Droplets captured in motion

A Look at Diverse Protein Structures1. Protein structure is not rigid!2. Protein structure bears many aspects

Proteins are generally made from20 types of amino acids, which are:

>> Linked by amide bonds (rarely: ester bonds, Ser/Thr; thioester bonds, Cys)

>> Bridged via –S-S- groups or the desmosine group, a 4-lys crosslinker

>> Enzymatically processed: hydroxylated, formylated, phosphorylated, glycosylated,amidated, sulfonated, acetylated, methylated, hydrolyzed, etc.

>> Associated to non-proteins, e.g., WATER, heme groups, etc.

S S


H2N

O

OH

R HI am dynamic!

The peptide bond (like formamide, below & above right) is stabilized by resonance: 60% amide and 40% hydroxyimine character

What else do we observe?All 6 atoms lie in the same plane, i.e., the peptide bond is planar.-electrons are distributed over the C-O and C-N bonds.The C-N bond is 10% shorter than a normal C-N bond.The peptide bond is trans.

The peptide bond has a permanent dipole ( = 3.7D)

H2N

O

OH

R H

O

CH N

H

H

O

CH N

H

H

O

CH N

H

H

O

CH N

H

H

-

+

O

CH N

H

H

Linking the building blocks: Stereoelectronic properties of the peptide bond

H2N

O

OH

R H

CN C

C

O

H

CN

+C

C

O-

H


Protein diversity is enabled by linking diverse building

blocks! Stereoelectronic differences of common amino acid residues:


Amino Acid Let. Codes MW Surface Ǻ2 Volume Ǻ3 pKa, Side,25°C pI, 25°C Sol., g/100g Crys. g/ml

Alanine ALA A 71.09 115 88.6 - 6.107 16.65 1.401

Arginine ARG R 156.19 225 173.4 ~12 10.76 15 1.1

AsparticAcid ASP D 115.09 150 111.1 4.5 2.98 0.778 1.66

Asparagine ASN N 114.11 160 114.1 - - 3.53 1.54

Cysteine CYS C 103.15 135 108.5 9.1-9.5 5.02 v. high -

GlutamicAcid GLU E 129.12 190 138.4 4.6 3.08 0.864 1.460

Glutamine GLN Q 128.14 180 143.8 - - 2.5 -

Glycine GLY G 57.05 75 60.1 - 6.064 24.99 1.607

Histidine HIS H 137.14 195 153.2 6.2 7.64 4.19 -

Isoleucine ILE I 113.16 175 166.7 - 6.038 4.117 -

Leucine LEU L 113.16 170 166.7 - 6.036 2.426 1.191

Lysine LYS K 128.17 200 168.6 10.4 9.47 v. high -

Methionine MET M 131.19 185 162.9 - 5.74 3.381 1.340

Phenylalanine PHE F 147.18 210 189.9 - 5.91 2.965 -

Proline PRO P 97.12 145 112.7 - 6.3 162.3 -

Serine SER S 87.08 115 89.0 - 5.68 5.023 1.537

Threonine THR T 101.11 140 116.1 - - v. high -

Tryptophan TRP W 186.12 255 227.8 - 5.88 1.136 -

Tyrosine TYR Y 163.18 230 193.6 9.7 5.63 0.0453 1.456

Valine VAL V 99.14 155 140.0 - 6.002 8.85 1.230

Different building blocks have stereoelectronic differences: Some are more similar than others.


Residues joined by solid lines may be replaced with 95% confidence

The “20” amino Acids Non-polar amino acids

Charged acidic amino acids

Charged basic amino acids

Polar uncharged amino acids

Selenocysteine Pyrrolysine Selenomethionine 4-Hydroxyproline -Carboxyglutamic acid

5-Hydroxylysine

Stop codon + special tRNA

Postsynthetic

#21 #22

EE

EEE

E

E E*

E

E*

E*


http://en.wikipedia.org/wiki/Image:Selenomethionine.jpg

http://en.wikipedia.org/wiki/Image:Carboxyglutamic_acid.png

http://en.wikipedia.org/wiki/Image:L-selenocysteine-2D-skeletal.png

http://en.wikipedia.org/wiki/Image:Pyl.gif

http://en.wikipedia.org/wiki/Image:Hydroxyproline.png

Amino Acidsin 55 Proteins

SEA >30 Å2

30 > SEA >10 Å2

SEA <10 Å2

Abs. & % nonpolar surface of residues vs. total Å2

estimatedEffecthydrophobic following residue (L) or side-chain burial (R) [kcal/mol]

Glutamic acid 0.93 0.03 0.04 69 (36%) vs. 190 1.73 0.5

Lysine 0.93 0.05 0.02 122 (61%) vs. 200 3.05 1.9

Arginine 0.84 0.11 0.05 89 (40%) vs. of 225 2.23 1.1

Asparagine 0.82 0.08 0.10 42 (26%) vs. 160 1.05 -0.1

Aspartic acid 0.81 0.10 0.09 45 (30%) vs. 150 1.13 -0.1

Glutamine 0.81 0.09 0.10 66 (37%) vs. 180 1.65 0.5

Proline 0.78 0.09 0.13 124 (86%) vs. 145 3.10 1.9

Threonine 0.71 0.13 0.16 90 (64%) vs. 140 2.25 1.1

Serine 0.70 0.10 0.20 56 (49%) vs. 115 1.40 0.2

Tyrosine 0.67 0.13 0.20 38+116 (67%) vs. 230 2.81 1.6

Histidine 0.66 0.15 0.19 43+86 (66%) vs. 195 2.45 1.3

Glycine 0.51 0.13 0.36 47 (63%) vs. 75 1.18 0.0

Tryptophan 0.49 0.07 0.44 37+199 (93%) vs. 255 4.11 2.9

Alanine 0.48 0.17 0.35 86 (75%) vs. 115 2.15 1.0

Methionine 0.44 0.36 0.20 137 (74%) vs. 185 3.43 2.3

Phenylalanine 0.42 0.16 0.42 39+155 (92%) vs. 210 3.46 2.3

Leucine 0.41 0.10 0.49 164 (96%) vs. 170 4.10 2.9

Valine 0.40 0.10 0.50 135 (87%) vs. 155 3.38 2.2

Isoleucine 0.39 0.14 0.47 155 (89%) vs. 175 3.88 2.7

Cysteine 0.32 0.14 0.54 48 (36%) vs. 135 1.20 0.0

Posttranslational modifications increase protein structural diversity

General:Proteolysis | Racemization | N-O acyl shift | N-S acyl shift | Other enzymatic processing:

N-terminus: Acetylation | Formylation | Myristoylation | Pyroglutamate

C-terminus: Amidation | Glycosyl phosphatidylinositol (GPI)

Lysine:Methylation | Acetylation | Hydroxylation | Ubiquitination | SUMOylation | Desmosine

Cysteine: Disulfide bond | Prenylation | Palmitoylation

Serine/Threonine: Phosphorylation | Glycosylation

Tyrosine: Phosphorylation | Sulfonation

Asparagine: Deamidation | Glycosylation

Aspartate: Succinimide formation

Glutamate: Carboxylation

Arginine: Citrullination | Methylation

Proline: Hydroxylation © 2006, Alpay Taralp, Sabanci University

Bonding Diversity: Factors Determining Protein Structure & Stability

Physico-chemical properties of the amino acid side chains determine the folded conformation

Evidence shows that the amino acid sequence of most proteins contains all the information to arrive at the folded conformation.

Assume each amino acid adopts 2 conformations in a 250-unit chain – We obtain 2250 ≈ 1075 conformations.

Steric constraints reduce the number, however, a very large number of conformations is still possible.

The main factors, which cause a long polypeptide chain to fold into stable conformation are:

Hydrophobic interactions among amino acid side-chains

Hydrogen bonding

Ionic interactions

Dipolar-dipolar interactions and hydrophilic interactions, dipolar interactions, quadrupolar interactions


Diversity of Protein Structural Elements: Basic Structural Hiearchy

1. Primary structure: The exact specification of atomic composition and the chemical bonds connecting those atoms, including stereochemistry. (i.e., L-amino acid sequence, disulphide bridges, other postsynthetic modifications, e.g., insulin A & B chains; chymotrypsin A, B & C chains)

2. Secondary structure: Regular arrangment of the backbone polypeptide without reference to side-chain types or conformation. The secondary structure is usually held by H-bonds (e.g., helix, sheets, random coils)

3. Tertiary structure: 3-D arrangement of polypeptide backbone and amino acid side-chains (e.g., lysozyme). Domain structure: compactly folded units

4. Quaternary structure: Noncovalent association of folded protein subunits (e.g., haemoglobin)

>> Most enzymes: Globular shape, with hydrophobic interior & hydrophilic exterior


So are protein physical traits diverse?Compare keratin versus collagen versus albumin (all from the same 20 amino acid types)

How do we draw protein 3-D structure?Space filling, stick/skeletal (backbone only, sometimes labeled) and ribbon/ schematic models:Show helices (coils), strands (arrows) & random structure

Note: Proteins are made not only using amino acid components – you must also consider water, metal ions, carbohydrates, lipids, porphorin rings, cofactors, etc. © 2006, Alpay Taralp, Sabanci University

http://www.dl.ac.uk/srs/PX/special/daff3.tif



Diversity of protein function

Q: What is protein function?

A: Function describes a signal transduction

a. chemical-mechanical; muscles;

b. chemical-chemical; metabolism;

c. chemical-electrical; nerve transmissions;

d. photochemical; vision & photosynthesis;

e. transport; active & passive transport;

f. defense - antibodies & blood clotting


http://images.google.com.tr/imgres?imgurl=www.j-walk.com/digitalimages/animals/zebra.jpg&imgrefurl=http://www.j-walk.com/digitalimages/animals/left.htm&h=593&w=441&sz=79&tbnid=iucF3sHbhCYJ:&tbnh=131&tbnw=98&start=7&prev=/images%3Fq%3Dzebra%26hl%3Dtr%26lr%3D%26ie%3DUTF-8%26oe%3DISO-8859-9%26inlang%3Dtr

1. Enzymatic proteins: Proteinases, lipases, epimerases, kinases, polymerases...Proteins, which transduce chemical to chemical signals

Note - Proteins are not just enzymes – antibodies, connective tissue (collagen), fluid media, transportation vehicles (Haemoglobin, serum albumin), buffers (serum albumin), signal transducers (rhodopsin), etc.

2. Cytoskeleton – Actin (muscle), Tubulin (cell motility), Intermediate filaments (mechanical protection near membranes and cells subjected to stresses), Spectrin (cytoskeletal protein, particularly found in erythrocytes)

3. Human Plasma – Albumin (osmotic regulation, buffering, transport), -Globulins (transport),‑Globulins (iron transport {transferrin}, histocompatibility antigen {-Microglobulin}), ‑Globulins Antibodies, Fibrinogen (proteolised by thrombin to form fibrin clot), Complement A (11 different protein types working to complement the immune system)

4. Extracellular Matrix – Glycosaminoglycans (hydrated gels), Proteoglycans (long glycosaminoglycans linked to a core protein), Collagen (extracellular matrix; Type I-III tissue supporting fibrils, Type IV laminar network), Elastin (random coil protein gives elasticity to tissues), Fibronectin (cell adhesion), Integrin (integral membrane proteins, also adhesion of cells to extracellular matrix)


Classes of Protein According to Function

5. Digestive Enzymes of Digestive Tract – Amylase (starch to disaccharides), Pepsin, Trypsin, Chymotrypsin (proteins to large peptides), Peptidases (large peptides to small peptides; small peptides to amino acids), Lipases (lipids to fatty acids and glycerol), Ribonuclease (RNA into oligonucleotides), Disaccharidases (disaccharides to monosaccharides)

6. Cytosol Proteins (300-1000 types) – Synthesis of most small molecules, proteins, carbohydrates & lipids of cell

7. Nuclear Proteins – Histones (5, complex to DNA to make chromosomes), Nucleic Acid polymerising enzymes (5-10, used in DNA and RNA synthesis)

8. Mitochondrial & Chloroplast Proteins (300-1000) – Energy production from metabolites or light

9. Endoplasmic Reticulum & Golgi Apparatus Proteins (50-200) - Protein modification, oligosaccharide and lipid synthesis

10. Lysosome & Peroxisome Proteins (300-1000) – Degradation processes of undesirable compounds

11. Plasma Membrane Proteins (100-500) – Transport across membranes, transmission of important metabolic signals across plasma membrane


Diversity of protein physico-chemical traits:>> Diversity among proteins is high but not “random”>> Structure/construction and function are related>> Some 1˚, 2˚ & 3˚ features are retained among proteins of similar function

• Global shape and morphology: Round, tight, loose, fibrous, skinny, crystalline

• Local function-related structures: Active site, receptor site, allosteric regions, catalytic residues

• Solubility: Highly variable• pI: Highly variable• pH stability: Highly variable• Tolerance to other environmental factors: Highly variable

Understanding protein structure, protein function, and their relationships are the central problems of protein science. The rules that govern structure-function relationships are simple

Nature is presumed to provide simple solutions.The challenge is to ask the right questions.


What is protein chemistry?Area of science related to:1. Obtaining/purifying protein,2. Investigating protein structure & function, and3. Controlling and engineering proteins


Classic emphasis

Current emphasis

Protein chemistry contributes to the following subject areas:1. Biochemistry, Biotechniques & Bioengineering2. Analytical Chemistry and Spectroscopy 3. Surface and Colloid Science4. Clinical Chemistry5. Polymer Science6. Medicinal and Pharmaceutical Chemistry7. Organic Chemistry

Why is protein chemistry highly interdisciplinary? Protein chemistry has developed together with analytical methods such as sequencing, X–ray, NMR structure determination and site–directed mutagenesis.

Protein chemistry is useful to whom?Researchers, professionals and students in various areas of specialization:

Protein chemists, molecular biologists, materials scientists, enzymologists, clinicians, analytical chemists, biophysicists and industrial scientists


E.g.: Protein chemists help X-ray crystallographers & genetic engineers:

Protein chemist X-ray crystallographer•Purifies 1g protein •Attempts crystallization•Chemically modifies to aid crystallization •Obtains diffraction patterns

or to form heavy atom derivatives, •Uses heavy atomwhich aid the phase problem derivatives to solve

structures at 3-4Ǻ

Protein chemist Genetic engineer•Purifies 1mg protein •Synthesizes oligonucleotides•Sequences peptides •Screens the gene-bank•Compares peptides & sequence codes •Sequences DNA of insects•Probes posttrans processing by FabMS •Constructs expression vector•Prepares antibodies •Screens using western blots•Develops protocols to purify•Compares properties of wild-type & mutant


One of the most common and often ambitious experiments in protein chemistry is the structure-function study

I.e., How does structure perturb function? How does function define structure?e.g. Consider the pKa of active-site thiols in cysteine proteases

Structure-function experiments: probe the interdependence of structure & function in proteins; generally reflect elements of both structure & function:

Examples along this continuum:1. One end – X-ray; Emphasizes analysis of structure2. Middle ground - pH titration of protein groups, showing hysteresis; Reflects

elements of structure and function substantially3. 2nd end – Bioassay; Weighted toward functional assessment


a S-F study ?

continuumPure S study Pure F study

How do S-F studies work? How would you learn about a system that you cannot see? You interact with the system & note the consequences.

If you walk into an icicle, your “initial state” becomes altered. Your “final state” indicates something sharp. Thus, any change in you during the interaction can probe structure.

stationaryglacier

movingglacier

happyfurry animal+ achorn

panickedfurry animal +achornREGION OF INTERACTION

Interaction 1

Interaction 2

Structure-function studies use:physical measurements (usually spectroscopic) and/or chemical protocols (usually covalent modification)

Physical methods:Generally nonintrusive, require more protein, performed in water or water-free state.

Chemical methods: Generally intrusive, may be destructive. Potentially very sensitive, performed in water, organic solvent or dry state.

Some physical methods to assess structure & functionDiffraction: X-ray, neutron diffraction Spectroscopies: Infrared, ultraviolet, Raman, optical rotary dispersion, circular dichroism, NMR, esr (principle is to infer structure by perturbing light)Thermal analysis: MicrocalorimetrySpectrometry: Mass analysisIn silicio: Computer modelingOther: Electrophoresis, hydrodynamic techniques, chromatographies

Typical outputs:Composition and secondary structure, quantification, folding energies (spectroscopies)Identifying/purifying biological materials by exploiting adsorption, isoelectric point, size/mass, affinity, etc. (chromatography & electrophoresis)Unfolding enthalpies of protein (microcalorimetry)3-D "Static" structure (X-ray, neutron diffraction) 3-D dynamic structure, kinetic folding, association constants, etc. (NMR)Local environment of coordinated metal ions (Mossbauer spec.)


Some chemical methods to assess structure & function

Titration studies: nature & number of ionizable groups.

Proteolysis in vitro: Limited proteolysis to elucidate the structural motifs of protein.

Kinetic studies: Applied to any protein, but mainly enzymes.

Classic chemical modification: Used to identify important residues. E.g., acetylation of chymotrypsinogen vs. chymotrypsin showed the role of Ile16.

Competitive labeling: Very sensitive and powerful. Reports on individual residue pKa values, structural information such as accessibility of groups, and stereoelectronic perturbations of a group. E.g., the surface reactivities and pKa values of the 12S subunit of a native 50-protein ribosome complex was characterized.

Site-directed mutagenesis: Reports on the role of specific groups. All groups can be investigated. SDM is complementary to chemical modification. Using SDM, the role of active site groups of barnase on stability and catalysis were quantified.


In a typical study of a poorly characterized protein...1. Physical & chemical methods to purify protein and to

analyze protein structure (some early examples): Dialysis and gel filtration, column chromatography of proteins Zone electrophoresis of proteins Estimation of protein and amino acid content Paper chromatography of amino acids and peptides High-V paper electrophoresis of amino acids and peptides Ion-exchange chromatography of amino acids and peptides Disulphide bond mapping Urea unfolding and stability tests Selective cleavage of peptide chains N-terminal sequence determination C-terminal sequence determination X-ray and later CD and NMR structures (with/without incipients)

2. Physical & chemical methods to analyze protein function Bioassays (enzyme kinetics, receptor-hormone, protein adsorption, cell

adhesion to protein layers) Comparative studies give insight to the S-F relationship!

continuumPure S study Pure F study


A Review of Protein Structure-Function at Play: Enzyme

Strategies to Accelerate Rates


An enzyme will not “reduce” the activation energy of a pathway! Like all catalysts, an enzyme will permit the reactants to follow an alternate, low–energy pathway.

CO → CO2

The alternative pathway reflects a new mechanism. Here, it proceeds via 2 or more intermediate steps.

2CO + O2

2CO2

Enzymes use similar tricks as non-enzymatic catalysts: E.g., bases, acids, metal surfaces, etc., PLUS some extra tricks, which are unique to its structure

overly simplified more correct

Enzymes in the Protein Family: Properties1. Monomeric or oligomeric or exist as part of a multienzyme complex2. Often require non-protein components (co-factors) for catalytic activity – activator eg. metal ion, co-enzyme, prosthetic group3. Efficient catalysts4. High Specificity5. High Stereospecificity6. Very sensitive to pH, temperature, dielectric (salts, solvent)

Industrial Uses of EnzymesTextile Industry – Cellulase for cottonDetergent Industry – Lipases and Carbohydrases for stainsFood Industry – Isomerase of glucose to fructose; lactase for lactose intolerant peopleOrganic Synthesis – Penicillin acylase; amino acid synthesis

1a. Oxidoreductases (all redox reactions) eg. Alcohol to aldehyde – catalysed by NAD oxidoreductase, aka alcohol dehydrogenase (plus NAD+ cofactor NADH)

1b. Transferases (transfer of methyl groups, glycosyl groups, phosphate groups, etc.) eg. creatine to phosphocreatine – catalysed by creatine phosphotransferase aka creatine kinase (plus ATP ADP)

1c. Hydrolases (hydrolytic cleavage of ester, amide and glycoside bonds by insertion of water) eg. glucose-6-phosphate to glucose plus phosphate – catalysed by glucose-6-phosphate phosphohydrolase, aka glucose-6-phosphatase

1d. Lyases (cleavage of bonds by mechanisms other than hydrolysis or oxidation; carbon-carbon lyases, carbon-oxygen lyases, carbon-sulfur lyases) eg. L-histidine to histamine plus carbon dioxide – catalysed by histidine decarboxylase

1e. Isomerases (racemizations, epimerizations, cis-trans isomerization) eg. D-ribulose-5-phosphate to D-xylulose-5-phosphate – catalysed by D-ribulose-5-phosphate 3-epimerase aka phosphoribuloepimerase

1f. Ligases (condensation of two different molecules at a new C-O or C-S bond, but coupled to the breaking of ATP) eg. L-tyrosine plus tRNA plus ATP to give L-tyrosyl-tRNA plus pyrophosphate – catalysed by L-tyrosyl-tRNA ligase aka lyrosyl-tRNA synthetase


Mechanism & Strategies of Rate Acceleration

in Enzymes General questions 1) Why are enzymes such efficient catalysts? 2) Which factors typically affect enzyme performance?

Binding: Unproductive binding, competing substrates, competing products, competitive inhibition, uncompetitive inhibition, and noncompetitive inhibition

Temperature Ionic strength, pH value and other environmental factorsLocal diffusion and convection

3) Why have proteins been selected as catalysts in biological systems?

4) How large do enzymes have to be?© 2006, Alpay Taralp, Sabanci University

Quantifying enzyme rates

Q: Why do we study enzyme activity?A: Enzyme kinetics probes protein structure and function in

general.Enzymes are proteins evolved with a natural marker of structure &

function.

Q: What are some parameters to characterize enzymes?A: Enzyme Units (historically)

EU/mg protein (specific activity)Ks (Binding constant)

KM (Michaelis constant)

kcat (turnover number/catalytic constant)

kcat/KM (specificity constant, or pH activity for kcat/KM versus pH)

Ki (inhibition constant: competative, uncompetative, noncompetative)© 2006, Alpay Taralp, Sabanci University

Means of Quantification: Measure a change of S→P over time, many techniques

Rate measurements: Rate of formation of product or removal of reactant as amount/time e.g., M/s, mole/s, vol/s, g/ml/s, etc.

Try to measure these slopes!

Q: What do we call these measurements?A: Initial rates! Acquire data within a few minutes & within 1-5 mole% S conversion.

Q: Why measure initial rates?

Forward rate, S → P, has no interference:

1. No product inhibition is possible;

2. No reverse reaction is possible;

3. Enzyme instability is less of a concern; and

4. Be safe - Enzyme reaction models are more complex than ordinary kinetics: Invite errors when extrapolating non-initial rate data

Let us examine how the above theory has originated...

Historically Early studies (1895-1913) on the rates of the enzyme-catalyzed

reactions gave the following observations: 1. At constant substrate concentration, the rate of reaction was directly

proportional to the enzyme concentration. 2. At constant enzyme concentration:

a. The reaction rate was independent of substrate concentration. b. The reaction rate was directly proportional to the substrate concentration. c. The reaction rate was fractional with respect to substrate concentration,

with a value between zero and one.

In 1913, Michaelis & Menten proposed a scheme to account for the above observations: Enzyme only acts upon bound substrate, i.e., E & S must initially form a

complex, held together by physical forces. Assumptions:

E and S are equilibrated with ES, i.e., kcat << k-1

Breakdown of ES is 1st order so rate [ES] i.e. rate = kcat[ES]

Rate of reverse reaction is zero

E + S ES E + PKS

kcat

So rate = (kcat[E]o[S]o)/(Ks+[S]o) © 2006, Alpay Taralp, Sabanci University

Briggs and Haldane revised the mechanismThey assumed that k2 was significant in comparison to k-1

(not an equilibrium, rather a steady-state).They set d[ES]/dt = zero to obtain a rate formula. The “new” M-M equation has the same form as the

original! Why? Equilibrium is a special case of the steady state treatment, k2 << k-1.

How does KM vary amongst the two models? KM is either (k-1+k2)/k1 or KM ≈ KS = k-1/k1 (in the original M-M model).

E + S ES E + Pk1

k-1

k2

E + S ES E + PKS

kcat

rate = (kcat[E]o[S]o)/(Ks+[S]o)

rate = (k2[E]o[S]o)/(KM+[S]o)

as k2 << k-1


Q: What are enzyme assays & how are they performed? SP

The Assay Any method that detects a change of physical property versus time:

Manometry, polarimetry, viscometry, NMR, MS, spectrophotometry, spectrofluoromethry and pH-stat. What is one pre-condition? The physical property should vary in proportion to S or P.

Direct assays Alcohol dehydrogenase can be monitored as a function of NADH

formation. NADH is strongly absorbent at 340nm. Is a buffer used? Hydrolases can be monitored as a function of proton formation (standard

ester cleavage). Is a buffer used here? Coupled Assays

If S & P are similar they cannot be directly used to assay. To get around this problem, a more distinguishable end product is made.

Target: With alanine aminotransferase; alanine + -ketoglutarate → pyruvate + glutamate. Using pyruvate dehydrogenase; pyruvate + NADH → lactate + NAD+ (NAD+ absorbs at 260nm). The coupled reaction should be faster than the principle reaction. WHY??

rate = (kcat[E]o[S]o)/(KM+[S]o)


Sampling AssaysS or P is withdrawn at specific time intervals & quantified, e.g., by colorimetry or radioisotopy.

Experimental Target of a M-M assayTo measure 3 parameters: KM, kcat & kcat/KM. Do these carry a physical meaning?

Q: How do we carry out a typical M-M experiment? A: Measure the initial rates as follows:

With substrate concentration at least 200-500x greater than total enzyme concentration , measure KM& kcat directly. Carry out these measurements at 3-4 different pH values. Measure the specificity (kcat/KM) directly at many pH values, using 0.1pH unit intervals (construct a pH activity curve); In choosing your parameters, S must be at least 20x less than KM. Why? What is the significance of a pH activity curve?Repeat any of the above experiments in the presence of inhibitors, different S, activators, different environments, etc.

Q: How does your experimental scenario compare to the true situation in biological systems? Is there a biological relevance? Why do we conduct experiments in this way? rate = (kcat[E]o[S]o)/(KM+[S]o)

Hanes-Wolf plot

Michaelis-Menten kinetics

Conc of S (mM)

Initial rate (mM/s)

= KM/(kcat[E]o)

A closer look at kinetic scenarios: Probing ionizable groups, which are important for binding and/or catalysis?

(6)

Sample math treatment for 3 (apparent) ionizable groups that are important for binding and/or catalysis

= 0

0

20000

40000

60000

80000

100000

120000

140000

3 4 5 6 7 8 9

kcat/Km

pH

The pH activity profiles of cathepsin B. The substrates are acetyl-Arg-Arg-Arg-AMC (+), acetyl-Val-Arg-Arg-AMC (◊) and benzyloxycarbonyl-Arg-Arg-AMC (―).

Real example!

Thermokinetic background related to protein analysisThermodynamics: G, H, S, equilibrium constant Keq

Kinetics:G≠, H≠, S≠ , kinetic rate constant k, kinetic rate theories

Origin: Position of G, H & S changes as system proceeds along reaction coordinate

Plan: To discuss the interrelation of these parameters and to focus on G≠ and G


Please delinate the relative importance of thermodynamic and kinetic contributions in the following scenarios

1. The right reaction releases energy faster than the left reaction. Q: Which videoclip shows the more exothermic reaction? A: Inconclusive! We cannot compare the molar enthalpy change from the videos.

2. True or false? All exothermic reactions are thermodynamically spontaneous and all endothermic reactions are thermodynamically non-spontaneous.A: False!

3. True or false? All thermodynamically spontaneous reactions yield a reaction & all thermodynamically non-spontaneous reactions fail. A: False!

Put away your weapons of mass destruction...

4. The thermite reaction is highly exothermic, H <<<0, the entropy change, S, is relatively unimportant, and the Gibbs energy change is highly negative, G <<<0. The reaction is thermodynamically spontaneous.Q: Why must you add a fuse to start the reaction?

5. The process H2O(s)→H2O(l) is highly endothermic (H>>0) Below is the evidence. Explain.

Time = 0min

Time = 60min

6. The reactant, nitrogen triiodide-NH3, sits at a high Gibbs energy level. Its products rest at a much lower energy state.

You must apply a physical shock before Nitrogen triiodide-NH3 explodes. Why?

NI3.NH3(crystal) → NH3(g) + ½ N2(g) + 3/2I2(g)

7. Liquid nitrogen evaporates. The process is thermodynamically spontaneous, endothermic & proceeds quickly. Q: How might you explain these comments?

N2(l) → N2(g)

8. The dissolution of ammonium sulfate in water is endothermic and readily proceeds under ambient conditions. Explain.

(NH4)2SO4 + bulk H2Os → 2NH4+

(aq) + SO42-

(aq) + a few less bulk H2Os

Why all the confusion?? Reason 1: Many terms and reactivity modelsReason 2: Misleading termsReason 3: Separate GS & TS concepts in chemical processes

At equilibrium, Ssys + Ssur is max.

Gsys Asys

UH

U PV

TSTS

kinetic Emic

potential Emic

rot vibrtransla

intermol. inter.

intramol. inter.

Let us progress until wearrive at the commonmodel to understandproteins...

Early measurements of U examined the link between enthalpy changes (H ≈ differences of bond energies) and reactivity

Why shouldn’t you predict reactivity using H?

A: H reports on the initial & final Ground States (GS) but not on the pathway (mechanism)

(There are other reasons too)

A

B

H

B

A

Reaction coordinate Reaction coordinate

H < 0 H > 0

Hfinal

HinitialExothermic Endothermic

What are some disadvantages of: the collision model? using Ea to predict reactivity?

Collision model: A kinetic view. Consider a potential barrier, Ea, between A & B. Rate const is kA→B = Ae-Ea/RT. (Later, A = Z)

A

B

U

Reactants collide with speed & good orientation.

In non-gases: PotE ≈ U, as (PotEf - PotEi) ≈ Uf - Ui

U ≈ H, as H = U + (PV)←very small

preexponential steric

Gibbs energy change: A way to explicitly incorporate entropy, S, to account for solvent effects, etc.

H-TSsys = Gsys

bomb calorimeter

most reactions


Gsys Asys

UH

U PV

TSTS

What is misleading by the term spontaneous?

G > 0G < 0

Spontaneity says nothing about energy barriers or chemical rates

Reaction CoordinateReactants Products

GibbsEnergy

Both processes arethermodynamicallyspontaneous

One is kinetically permitted,giving an observable rate,

and one is kinetically prohibitedby a high energy barrier

Reaction CoordinateReactants Products

Both processes arethermodynamicallynon-spontaneous

One is kinetically permitted,giving an observable rate,

and one is kinetically prohibitedby a high energy barrier, so we have 100% reactants

G≠

G

G≠

G

Transition State Theory: A kinetic element completes the Gibbs view.

State B

State A

GState A

State B

G‡

G‡

Reaction Coordinate

G

Changes of any state function are independent of path

State function

Not a state function

Problems with TS theory?

Reactant (A) proceeds through a high-energy transition state or activated complex to become product (B).

What is the weakness of predicting reactivity using only quantum tunneling?

Reaction Coordinate

Reactants Products

GibbsEnergy

Classic kineticbehavior

Tunneling (a 5A wavelength decays exponentially as it penetrates the barrier. If the barrier is not too long, R can reach the product sideof the hill without completely decaying away (i.e.,emerges on the product side with a non-zeroprobability density)

R,eg. +H P

O O-

HThe rate of proton transferoften has a significant tunneling component

5A

H

5A

Quantum tunneling kinetic view: The e- probability distribution of every particle is derived from a wave function

The above terms are related to large populationsCannot use TS theory to calculate the activity of “one” molecule or small groups of molecules, such as membrane proteins

Note: Reactions do tunnel, collision theory could apply

Let us examine a typical enzyme reaction...

G≠, H≠, S≠

G, H, SGA→B = HA→B – TSA→B

kA→B = (kBT/h) x e-G‡/RT

Keq = [B]b/[A]a = e-G/RT

To summarize: In protein systems, we assess thermodynamic & kinetic behavior in terms of G & TS theory (less use of Z or tunneling arguments)

G G

Uncatalyzed reaction Enzyme reaction

Reaction coordinate Reaction coordinate

S X+ P

E + S ES ES E + P=

ES≠

Enzymes lower G‡ (i.e., G‡ - GGS) in comparison to the uncatalyzed reaction

We shall simplify the notation even more...

Overall rxn is diffusion-controlled or rxn-controlled

Microscopic steps may be grouped intoPhysical binding (1st step; E + S → ES), and Chemical catalysis (2nd step; ES → ES‡ → E + P)


G

Reaction Coordinate

Two models:TS lowering & GS elevation

E + P ESE + Sk1

k-1

k2

http://images.google.com.tr/imgres?imgurl=www2.arnes.si/~lnosan/postal/boring.gif&imgrefurl=http://www2.arnes.si/~lnosan/postal/&h=313&w=200&sz=11&tbnid=xFJJI39ol9YJ:&tbnh=112&tbnw=72&start=3&prev=/images%3Fq%3Dboring%26hl%3Dtr%26lr%3D%26ie%3DUTF-8

Dissect G‡ into enthalpic (H‡) and entropic (S‡) components:

k = (kBT/h) x e-G‡/RT can be written as k = (kBT/h) x eS‡/R x e-H‡/RT

where S‡ is the entropy of activation, Stransition state – Sground state and

H‡ is the enthalpy of activation, Htransition state – Hground state.

activationenergy offorwardprocess

GibbsEnergy

ProductsReactants

Reaction Coordinate

Ggs

GQ: How might you predict the free energy of activation, G‡?

Answer: Assess the enthalpic & entropic differences between:

1. reactant (ES at ground state) &

2: the activated complex (ES≠, at the transition state position).

A closer look at changing the position of G

E + P ESE + Sk1

k-1

k2

H

H

H

Reaction Coordinate

activationenthalpy oftwo forwardprocesses

HA

B

The enthalpy of activation , H‡, is always positive because bonds are being broken.

The entropy of activation, S‡, may or may not be favorable. Can you think of some examples?

Both parameters contribute to rate according to k = (kBT/h) x eS‡/R x e-H‡/RT

S

S

S

Reaction Coordinate

activationentropy oftwo forwardprocesses

S

A

Q: How might substrate-surrounding interactions affect the position of H?

(Hint: In solutions & solids, H ≈ U)

Reaction Coordinate

Polar solvent Nonpolar solvent

Enthalpy, H

Oil

Water

Q: If you ignore any entropic contribution, how might a change of H affect the Gibbs free energy position in solutions & solids?

Water

Oil

Gibbs Energy, G

Nonpolar solventPolar solvent

Reaction Coordinate

Reaction Coordinate

Reactants Products

GibbsEnergy

Equilibriumposition, before & after

Increasing theconcentration ofreactant

Q: What happens if you increase the chemical potential (i.e., the potential to do work) of a reactant?

Answer: Reaction is more spontaneous; equilibrium is even closer to the product side; transition state is reached earlier; activation energy is smaller; forward rate is greater

Q: Which principles do enzymes exploit to lower the position of the TS (& how)?

1. General acid catalysis, general base catalysis, electrostatic catalysis and electrophilic catalysis. All modes could stabilise charge accumulation in proceeding

from ground state, GS, to transition state, TS.


Hydroxide

Enzyme

2. Covalent or nucleophilic catalysis. A covalent activated intermediate is formed, e.g., a ping-pong mechanism. The high-energy mechanism is broken into energetically less-demanding steps.

3. Neighboring charges, dipole moments & hydrophobic/dielectric considerations.

The enzyme environment enhances the reactivity of nucleophiles such as serine hydroxyl groups, cysteine thiol groups, etc.

N

O

H

OH

O

HN

H

H2O

N

O

H

O-Enz

HN

H

O

O

Enz

H2O

Enz O-

OH

O

G

Cys

S-

HNNH

His

+

Cys

SH

(aq)

pKa = 9pKa = 3

4. Pre-reduction of ground-state entropies.The basis of this strategy is to minimize the ground state freedom of the ES complex during the chemical transformation phase of a reaction. In this way, the ascent to the TS will not require a major loss of freedom.

Strategy 1. Orbital steering.

Strategy 2. Decrease the number of reaction participants in the chemical transformation phase.

N

O

HAA

N

O

H

N

O

H

A

versus

(aq)

LSS≠ << RSS≠

+ versus +

EnzEnz

≠≠

CH3CH2O- ICH2CH3 versus+

O-

I

Enz

5. Formation of low-barrier H-bonds.A normal H-bond in the GS may become a low-barier H-bond in the TS if the pKa value of the enzyme group & the activated complex (as ES≠) are matched.

1 normal H-bond ≈1-5 kcal/mole1 low-barrier H-bond ≈ 25-40kcal/mole

ONH

H3C

H Enz-O

S

GSTS

S

pKa = 10

pKa = 16

O EnzH

H

H O

+N

H3C

pKa = 10

6. Binding energy considerations.Enzyme binding groups interact non-covalently with substrate at all points along the reaction coordinate. The energy term H is variable along the reaction coordinate!

E.g. 1, GS shape of S perfectly matches enzyme site

E.g. 2, TS shape of S is much more complementary to enzyme than GS shape of S (for enzymes that behave according to the TS stabilization model of catalysis).

10 goodcontacts

+

TS??

ES

E + SG

+

2 goodcontacts

10 goodcontacts

ESE + S

G

ES=

E + P

Conclusion: Ground-state binding shouldn’t be “extremely specific”, as is often assumed

Shape of S transforms along the reaction coordinate

GS TS

EnzS

SEnz

GS

E + P

=ES

G

E + SES

+

2 goodcontacts

3 goodcontacts

5 goodcontacts

8 goodcontacts

10 goodcontacts

6 goodcontacts

TS reached

In a well-evolved enzyme-substrate interaction, we see an increase of binding energy stabilization in proceeding to the TS

SEnz

Every “good” interaction lowers the position of H, etc.

10 good contacts

2

3

6

8

Q: Can you see evidence of binding energy participation in the TS of amide bond hydrolysis?Hint: Look at the changes of KM & kcat



Gsys Asys

UH

U PV

TSTS

kinetic Emic

potential Emic

rot vibrtransla

intermol. inter.

intramol. inter.

= E + PESESE + S EPphysicalbinding

chemicaltransformations

physicaldissociation

rate-determining transition

ES

E + S

G

=

E + PEP

ES (Hypothetical enzyme-catalyzed energy profile when binding energy is not considered, i.e., profile is analogous to non-enzymatic catalysis)

=

S (Uncatalyzed energy profile)=

Reaction Coordinate

2

10 good contacts

8

4

ES (Enzyme-catalyzed energy profile including binding energy contribution)

Summary slide

Increasing [Substrate]

Increasingreactionrate (v)observed

A

B

C

[S]o >> Km[S]o << Km

a ; va = (kcat/Km)[S]o[E]

b

c

Vmax = kcat[E]ova =

va = Vmax/2

[S] = Kma

a

a a

Reaction Coordinate Axes

IncreasingGibbs energyof freesubstrate& substratein enzymecomplexes

Substrate Product

Large [S]

S

ES

ES

forward rate const = kcat

C,D

A,B

A,C

forward rate const = kcat/Km

Small [S]

ProductSubstrate

S

ESa,c

ESb,dP

d

D

c,d

PB,D

kuncat

stabilization before binding energy consideration

(a)

(c)

(b)

ES++

a,bES++

ES++

ES++

kcatKm

a,b( ) kcatB

kcatKm

A,B( )

KmB,D

S->P

ES binding energies can be grouped into three cases:

Case 1 (not shown here): ES has very strong GS binding (shape complementarity is exceptionally good in the ground state). Examples: HormonesCase 2: ES has poor GS binding and strong TS binding. Examples include carbonic anhydrase, acetylcholine esterase and catalase. Case 3: Modest GS binding and modest TS binding. Examples – Metabolic enzymes


Protein engineering to elucidate and improve stability


Protein engineering has been used to investigate structure-activity & molecular recognition relationships to to make better protein productsQ: Why does Mankind wish to use proteins? Proteins accelerate chemical reactions Proteins form commercial products & improve other product properties Proteins enable novel processes

Some typical industrial applications: Bioreactors Textile treatment Medicinal and organic syntheses Protein drugs & drug delivery Biosensors Bioremediation Food preparation industries

Problems? Industrial constraints are often too demanding for native proteins. Consequences: Poor biological activity, short lifespan, limited reaction parameters, etc.

Q: What can we accomplish by using protein engineering? improve existing pharmaceutical proteins create superior high-value proteins with improved half-life create new proteins and pioneer new therapies improve desirable biological activities alter receptor specificity and binding activity reduce harmful side-effects and toxicities.

Improved Proteins

Redesigned Antibodies

Molecular Recognition

'synthesis'

'analysis'Structure-ActivityRelationships

Improved Enzymes



The current focus of protein engineering: Formulating broad-scope protein preparations, which are:

Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes

Locating/purifying thermophiles, etc.

Geneticmanipulation

Low-techchemical strategies

Native

PROTEIN ENGINEERING

focus


The current focus is genetic manipulation

Barnase: Superimposed Xray/NMR, schematic and ribbon sketch

Interacting residues are observed

Q: Can mutations probe the stability of the folded state?

A: All residue interactions contribute to protein stability. By mutating 1 residue of an interacting pair, the Gibbs contribution of that pair to protein stability is assessed.

Eu, E'u

Ef

E'f

wildtype

mutant E'i

Ei

E'

E

G

Reaction Coordinate

=

=

How might you measure the thermodynamic unfolding/folding energy change of barnase?

Trp'sinside Trp fluorescence

quenched

Folded Unfolded

Concentration of denaturant

Fluorescenceof Trp residues

Concentrationcorrespondingto 50% fluorescencequenching

x xx

x

x

xx

x x xwt

mutEu, E'u

Ef

E'f

wildtype

mutant E'i

Ei

E'

E

G

Reaction Coordinate

=

=

In principle, all interactions contribute to protein stability. H2OGUnfolding is the free energy change (calculated), which accompanies barnaseFolded → barnaseUnfolded in water. Here are some examples:


Deleting one H-bonding partner where there is no access of water

Mutant [urea]1/2H2OGU GU

(in M) (kcal/mole)

wt 4.57 8.82 ----

TyrPhe78 3.88 7.68 1.35

SerAla913.58 6.41 1.93

Deleting one H-bonding partner where there is free access of water

Mutant GU

(kcal/mole)

SerAla31-0.14

TyrPhe103 0.00

Introducing a H-bonding residue in a place that contains no partner residue

Mutant Solvent access- GU

ible area (in Ǻ2) (kcal/mole)

ValThr10 0 2.48

ValThr89 0 2.55

ValThr45 43 2.44

ValThr36 70 1.15

ValThr55 93 0.60

Summary: Relative importance of types of interactions towards stability

H-bonds, no access to water → moderate

H-bonds, free access to water→ very small

Hydrophobic-hydrophobic interactions → very important, very abundant

(64 mutations) → >60kcal/mole destabilization energy© 2006, Alpay Taralp, Sabanci University

Destroying parts of buried or solvent-accessible hydrophobic residues

Mutant # methyl(ene) GU

groups < 6Ǻ (kcal/mole)

IleVal55 5 0.30

IleAla55 16 1.15

ValAla10 37 3.39

IleAla88 55 4.01

LeuAla14 62 4.32

Destroying a solvent-exposed ionic interaction between Asp8, Asp12 & Arg110

Mutant GU

(kcal/mole)

AspAla8 0.89

AspAla12 0.31

AspAla8 & AspAla12 0.80

Mutation studies have validated the importance of some interactions. Can we use site-directed mutagenesis to engineer proteins with enhanced stability?

Yes! Bridged mutants show resistance to denaturants & thermal stability!


Mutant [Urea] GU

to unfold 50% (kcal/mole)

wt 8.8 0.0

AlaCys43 (–SH) 7.7 1.1SerCys80

AlaCys43 (–S-S-) 10.0 -1.2SerCys80

SerCys85 (–SH) 8.4 0.4HisCys102

SerCys85 (–S-S-) 12.9 -4.1HisCys102

Stability of Barnase double mutants

SDM may be aided by evolutionary clues left by Nature

With respect to Barnase, Binase has lost one amino acid (Gln2 → ) and has 17 different residues. The structure of Binase is slightly more stable than Barnase.

Hypothesis: Evolution may have selected some of the 17 amino acids because they promoted stability. If these amino acids are mutated into Barnase, the engineered Barnase may have higher stability...


Strategy to improve the stability of Barnase: One by one, re-engineer the primary structure of

Barnase using each of the 17 residues of BinaseMeasure the conformational stability of the mutantDesign a “super” stable mutant using the information.

Mutation 50%Unfold Gu

(For comparison, wt barnase unfolds %50 in 8.8M urea)

Grand Results:


Take-home message Effects of these individual mutations are remarkably

additive Normally cooperativity is observed

Explanation Binase and Barnase are slightly divergent on the

evolutionary tree Mutations are very conservative

Implications for any industrial enzyme such as Xylose Isomerase

Find a closely related thermophile Make individual mutations between them and determine

Gu

Choose the stabilizing mutations and create a multiple mutant, stable at high temperatures


Directed evolution is a technique, which accelerates evolution. Evolution normally takes millions of years to produce an

improvement; accelerating the mutation process yields improvements in weeks.

One type of directed evolution is called molecular breedingDesired genetic trait obtained from a two-step process:

1. Genes are subjected to DNAShuffling, generating a diverse library of novel sequences (one or more genes are fragmented and recombined).

2. “Good” gene products are selected by screening. The good genes are subjected to more “shuffling” & screening until

the desired property is obtained

Other examples protein engineered via genetic manipulation have relied on the principle of directed evolution

Left image: Wild type green fluorescent protein gene in plants

Right image: Maxygen’s DNA shuffled green fluorescent protein gene in plants


Protein Folding


Folding of Barnase - Overview Elucidating rules, which govern the folded conformation of

proteins, is of theoretical interest and practical importance particularly since advances in recombinant DNA methods have enabled the design and synthesis of novel proteins.

Although many physico-chemical approaches have been employed, the mechanism of protein folding remains unclear.

An approach, which combines the technique of site-directed mutagenesis with the more classical physico-chemical techniques, has been employed to address this problem.

By altering specific side chains in a folded protein, it is possible to correlate the contributions of their interactions towards the overall stability of the protein. Thermodynamic relationships, specifically Bronsted relations, are employed in this treatment.

Barnase, a relatively small 110 amino acid, monomeric extracellular ribonuclease of Bacillus amyloquefaciens serves as a model protein for this study.


Protein FoldingProtein folding is a large-scale

continuation of the conformational analysis problem

3 mutual gauche interactions 2 mutual gauche interactions

Less stable More stable

The New Challenge!!


Protein structure is not rigid

1. Some native structures are more flexible and dynamic; some are tight, less dynamic and well protected

2. We note a correlation between protein flexibility and crystallizability

Article “What does it mean to be natively unfolded?”

Implications for NMR and Xray analysis?

Q: Why does a protein fold? A: Balance of enthalpic terms (non-covalent interactions) and

entropic terms (freedom decreases as conformation organizes)

Gibbs energy

Reaction coordinate

Eu = unfolded enzyme

Eı = intermediate

EF = folded enzyme

Typically,

GU→F = -5 to -15kcal/mole


EU

EI

EF

RateDeterminingStep

G = H - TS

Q: Please estimate the conformational possibilities while a protein folds

A: In a 100 amino acid chain8 conformations eachup to 8100 conformations possible

Q: Is protein folding random? If 1011-1013 conformations are randomly adopted per

second → requiring years to fold! In fact, a protein folds while associated with

ChaperonRibosomeAlone

in msec-to-sec time scale A: Folding is clearly a directed process!


Q: How might you define the mystery of protein folding A: How does the amino acid sequence

Direct folding? Determine the final conformation?

Q: How might you address the problem? A:

Obtain amino acid sequence (protein chemistry, DNA, molecular biology) Obtain 3-D structure (X-ray, NMR) Perturb the physico-chemical traits (Chemical modifications and site-directed

mutagenesis)

Q: Why the interest to understand protein folding? A:

Predict 3-D structure of any amino acid chain Novel enzyme design Improved industrial processes, e.g., a better xylose isomerase → € Treatment of protein related diseases, e.g., prion diseases (BSE, fatal familiar

insomnia, etc.)


In the prion class of diseases, why does a misfolded protein lead to disaster?

Globular proteinnon-associatingsolubledegradable

Fibrous proteinaggregatingcrystallizinginsolubleaccumulating

In prion disease?

Crystal nucleation?

NormalState

PathologicalCondition

G

Reaction coordinate

Today’s focus is related to Prof. Alan Fersht’s work on the folding pathway of Barnase Barnase: RNA → ribonucleotides

References:Serrano, Day and Fersht (1993) J. Mol. Biol. 233, 305-312Fersht, Matouschek and Serrano (1992) J. Mol. Biol. 224, 771-782

Serrano, Matouschek and Fersht (1992) J. Mol. Biol. 224, 805-818 © 2006, Alpay Taralp, Sabanci University

O

RO

G

O HO

P

O

-O

O

R'

BH+

B - R'OH+HB

B

O

RO

G

OO

P

O-O

H

OH

B

BH+

O

RO

G

O HO

P

O

-O

O

H

+H2O

HN

N N

N

O

H2N P 0

N -1

P +1

N +1

P +2

N +2

H

Bond cleavage

Base 0 (guanine)

Sugar 0

Q: Why is Fersht’s work interesting?Approach uses powerful protein engineering

Novel proteins can be made, e.g., M. Smith, UBC

Data interpreted via thermodynamic treatmentLinear free energy relationships

Bronsted plots (Bronsted catalysis eqtn)

Q: Advantage of Fersht’s approach?Relates measureable data to specific non-

covalent interactions, which govern protein structure & function

Improved Enzymes

Redesigned Antibodies

Molecular Recognition

'synthesis'

'analysis'Structure-ActivityRelationships

Improved Enzymes


Q: Why barnase as a model?Advantages:

Small monomeric proteinNo disulphide bridges or cis-prolines

No post-translational modificationsExcellent expression systems (wt → express;

mutants → express)© 2006, Alpay Taralp, Sabanci University

Ribbon structure of Barnase


Schematic of Barnase


Q: What was the experimental strategy? Choose a significant non-covalent interaction Make a subtle change to the interaction (e.g., Ser80 → Thr80; Ser85 → Thr85) Perform equilibrium/kinetic un/folding experiments and

compare the wt & mutant Rationale: All interactions contribute to protein stability - Some

form/break before the rate determining step of folding, whereas others form/break afterwards


Superimposed X-ray and NMR backbone positions of Barnase

X-ray is used to identify interacting residues

X-ray, NMR, CD and bioassays are used to check correct folding of mutants

Eu, E'u

Ef

E'f

wildtype

mutant E'i

Ei

E'

E

G

Reaction Coordinate

=

=

To recap, how might you measure the thermodynamic unfolding/folding of barnase?

Concentration of denaturant


Concentrationcorrespondingto 50% fluorescencequenching

x xx

x

x

xx

x x x

Trp'sinside Trp fluorescence

quenched

Folded Unfolded


Time

Rapidly mixedBarase solution isspiked with acid

Shape analysisof curve yieldskinetic unfoldingconstants

Now, how might you measure the kinetic unfolding of barnase?

...and how might you measure the kinetic folding of barnase? Shape analysis

of curve yieldskinetic foldingconstants

Rapidly mixed Barasein 8M urea is diluted10x with aqueous buffer

Time


A closer look at the consequence of a mutation Any measured energy change within the protein is attributed to the mutation

You may compare (one at a time) the interactions of many neigboring groups within the protein

You can map interactions that do/don’t contribute to protein stability/folding

Q: What principle allows you to identify conformational energy changes from measurable mutation studies?

A: GU, GI, GF, G, etc. are changes of free energy upon mutation (where GA

is stateEAwt – stateE’Amutant).

Effect of a mutation varies along the reaction coordinate. E.g. if you compare

stateEAwt – stateE’Amutant, GU is very close to

zero, whereas GI, G & GF are larger.

The change of free energy upon mutation between 2 x-positions, e.g., from unfolded → → → → folded, is GF-GU.


=

=

Reaction Coordinate

G

E

E'

Ei

E'imutantwildtype

E'f

Ef

Eu, E'u

GF

G=

GI

GU ≈ 0

Q: Is there a fundamental problem with the calculation of GF-GU? (GA is stateEA

wt – stateE’Amutant)

A: Yes! All vertical

equilibria are virtual!

Solution: Calculate instead the difference of free energy upon mutation, e.g., for unfolded → folded, we want GF-U, so measure GF-U

wt-GF-Umutant (= GF-GU!).


EU

E'U

EI

E'I E'F

EF

=E' I

E I=

GFGIGU G I=

Q: What is the meaning of GF-U?

A: If X & Y interact and we mutate X → Z, then:

GF-U = GF(X...Y) + GF(X...E) + GF(Y...E) + GF(X...H2O) +

GF(Y...H2O) – G’F(Z...Y) – G’F(Z...E) – G’F(Y...E) – G’F(Z...H2O) –

G’F(Y...H2O) – G’F(E...H2O) – GU(X...H2O) – GU(Y...H2O) –

GU(E...H2O) + G’U(Z...H2O) + G’U(Y...H2O) + G’U(E...H2O) +

GU(reorg) – GF(reorg)

Once all the terms are considered (many cancel), we can say that GF-U ≈ stabilization energy!

Q: Why is this statement significant?

A: Stabilization energies probe transition states & intermediates © 2006, Alpay Taralp, Sabanci University

Q: What is the advantage of GF-U? A: All horizontal equilibria are measureable via a

thermodynamic equilibrium experiment, i.e., G = -RTlnK & a kinetic experiment, i.e., k = (kBT/h)e(-G/RT)


GF-U

GI-U

G I -U=

GU GI GF

E I=

=E' I

EF

E'FE'I

EI

E'U

EU

G I=

E’I

To compare thethermodynamics offolding: GF-U = -RTln(50%urea’/50%urea)Convention: G < 0 if E’F is more stable

Bronsted catalysis equation Please recall logk = logK + C

0 < < 1: So what does describe?

For a “series” of 2 related compounds (e.g., wt & mutant) you may write logk = logK

Another way to write logk = logK + C isG = Geq + D. If we blend this logic, we

obtain: = G/Geq for wt & mutant We may define for each wt/mut pair the following:

Aunfol = GA-F/GU-F NOTE – approximates is not

Afol = GA-U/GF-U but is equated to when = 0 or 1

logk

logK

x

xx x x

xx


We begin by probing the ratedetermining TS of unfolding (easier)

Aunfol = GA-F/GU-F

Each point (x,y) describes an energy change (GU-F, G-F) due to a mutation of barnase. E.g., Val→Ala gives Ala:Phe

Is there a patterned change of stabilization energies

upon mutation of interacting pairs?

G-F; GU-F


Q: Let us quantify the mutant pair interactions as barnase unfolds.

unfol = G-F/GU-F

unfol = G-F/GU-F

= 2.60/2.32 = 1.06 ≈ 1

Therefore, the Val:Phe interaction

in the folded state was broken in the

TS of unfolding!


Equilibrium measurements GU-F = 2.32 kcal/mole

Kinetic measurements

G≠-F = 2.60 kcal/mole

Q: Why must this kinetic data reflect the rate determining transition state of unfolding?A: All subsequent steps are kinetically unimportant

High [Urea]

Unfolding

For each mutation:find G-F using a kineticexperiment (urea/acid pulse) &find GU-F using anequilibrium unfoldingexperiment (urea to denature50%)The rest is easy!unfol = G-F/GU-F© 2006, Alpay Taralp, Sabanci University

The state of interactions in the transition state of Barnase with respect to the folded state


Q: What is the general picture of the TS? Native-like, compact Some tertiary interactions lost Majority of 2˚ structure preserved Core1: weakened, esp. at Nterm 1

Core2: completely disrupted

Core3: fully intact

If we were to evaluate the unfolding pathway: 1st events: 3 of 5 loops unfold, Nterm of helices melt,

core1 weakens and core2 is destroyed; the remaining structure is disrupted later


Low [Urea]

Refolding

Refolding experiments


Rationale:Under refolding conditions:

EU → EI is faster than EI → EF;Thus, your measurements can examine EI → EF

Probing the rate determining TS of folding and an intermediate

For each mutation:Use the appropriate kinetic experiment (dilution of urea) &equilibrium foldingexperiment (urea) to probe the I state and TS of folding!

The state of interactions in the intermediate & transition state of Barnase with respect to the unfolded state; the last column shows the unfolding pathway for comparison


What can be said about the folding process?




Q: What can be said about folding? Correlation between hydrophobic burial and early events All early regions inteact with sheet Early burial is hydrophobic, extensive and nucleated Late processes only interact slightly with sheet; no

core nucleation; some hydrophillic burial is noted

General statements: The refolding pathway is at least partially sequential 2˚ structure formation leads to local hydrophobic burial

and precedes most 3˚ interactions Consolidation of structure is gradual and earliest for 2˚

structure elements


Concluding Remarks. When generalized to small globular proteins, folding proceeds by nucleation, i.e., local hydro-phobic collapse of core elements, followed by consolidation of hydrophilic interactions & 3º structural domains

A Closer Look at Chemical Modification of Protein Monomers in Aqueous (and

now also Organic or Dry-state) Environments and the Use of Paper-

supported Chromatography

HN

N

NHNHC NH2

H2+N

+NH3

SCH3SS

OH

SH

CO

-ONH2Native protein at slightly

alkaline pH Values


FOCUS To contrast chemical modification against enzyme

kinetics & SDM Discuss the applications of chemical modification Discuss non-destructive & destructive chemical

methods to learn about structure & function Look at classic derivatizations Discuss the advantages of competitive labeling


History of chemical modification (not SDM)

• Approximately the same time as enzyme kinetics

• Glutaraldehyde tanning of leathers

• Refinement of foods, e.g., milk proteins

• Protein structure, function and S-F studies

• Insulin primary structure determination, partial and full acid hydrolysis, dansyl method, cyanogen bromide sequence alignments, Edman Degradation, Carboxypeptide-MS methods

Structure perturbs function, function defines structure

SH

Gee, my thiol must be ionized in orderto attack the peptide bond

There you go Mr. Thiolate! I have placedyou near a positive His residue and the +base of a large alpha-helix dipole moment!You may attack at will!

S-

+His

H+

Chemical modification and enzyme kinetics are complementary methods.

Advantage of enzyme kinetics: Enzyme bears an intrinsic probe, which allows you to examine

structure and function pH activity profile shows the pKa of groups that are important for

binding and/or catalysis… Disadvantage of enzyme kinetics:

Not every protein is an enzyme! Other proteins which can be bioassayed fairly conveniently are antibodies and hormone-receptor interactions.

Advantage of chemical modification in comparison to enzyme kinetics? The method can be used on any protein to determine

information related to structure and function! Disadvantage of chemical modification in comparison to

enzyme kinetics: You must insert the probe of structure and function, and you

must do it without defeating the purpose of your experiment


Chemical modification and site-directed mutagenesis are complementary methods.

Rationale: Information obtained from chemical modification forms a base to design appropriate mutants. E.g. Chemical modification shows that 1 of 4 Met residues is

super reactive. To learn more abot the environment, use SDM to replace neigboring groups & observe the results…

Advantage of chemical modification in comparison to SDM? Fast, inexpensive, doesn’t require elaborate setup.

Disadvantage of chemical modification in comparison to SDM? Not very specific, and imposes a chemically reactive

environment. Can lead to drastic modifications. Modifications typically are limited to amino groups, carboxyl

groups, activated aromatic groups, sulfhydryl groups, guanidino groups, & imidazole groups, i.e., N & C termini, glu, asp, tyr, his, cys, met, lys, arg, trp.


Paper methods applied to protein investigation

Historical applications – chromatography and electrophoresis

Amino acid analysis

Peptide mapping

Disulfide bridge analysis

Analysis of other post-synthetic modifications, e.g., phosphorylation

Work-up of chemical modification experiments!

Protein/peptide/amino acid resolution and purification

Current applications

Same! Apparatus has been revised somewhat – e.g., HV tlc is a strong alternative to HV paper methods

Advantages of paper methods over HPLC and MS methods to identify and purify

1.Many samples can be run simultaneously

2.Multidimensional runs are conveniently setup; sample processing is possible in between runs!

3.Resolving power is great

4.Tolerance of potential interferents is high

5.Detection methods are potentially very sensitive and selective

6.Cost efficiency of equipment and experiments

7.Less need of skilled labor

8.MS and HPLC methods can be coupled if desired

The apparatus of paper chromatography

for biological samples

for simple analytes

You may discard the drying accessories; instead clip the paper horizontally at the bottom with zig-zag scissors - allows solvent to drop evenly to the bottom

suspended papers

trough with solvent

The apparatus of high-voltage paper electrophoresis

1. paper; 2. dielectric; 3. glass tank; 4. trough with buffer (top) and base with same buffer (bottom); 5. cathode (-); and 6. anode (+)

Types of volatile solvent systems

•BAWP

•Ammonia – organic

•Pyridine acetate buffer

•Formic acid – acetic acid buffer

Types of papers

•Whatmann 3MM for high loading, large peptides

•Whatmann 1MM for high resolution

Tlc alternatives as stationary phase

•Cellulose tlc plates

•Silica tlc plates

Protein sample preparation prior to spotting on paper

Amino acid analysisHydrolysis (acid typically; base hydrolysis if Trp is needed)Dry sampleReconstitute in a minimum amount of running bufferSpot the sample on paper

Identifying the number of arginine residues in BSAChemically block all lysine residues in 8M ureaDialyze away the ureaDigest all arginyl peptide bonds using trypsinDry sampleSpot the sample on paper

Separating the and chains of insulinIncubate protein in 95:5 formic acid/30%hydrogen peroxideDry sampleSpot the sample on paper

Origin X X X X XX X X X X

(-)

Submergedin buffer at base, pH 2.1

Submergedin bufferedtrough, pH 2.1

Direction of migration40V/cm

e.g. for HVPE

oo

o

o

oo

o

o

o

o

o

o

o

o

o

o

after migration

Identifying and quantifying bands after migration and drying of paper chromatograms

Strategies to identify•Intrinsic fluorescence•Radiolabeling and exposure of X-ray film•Fluorescent derivatization or colorimetric derivatization, e.g., ninhydrin

Strategies to quantify•Densometry of chromatographic images•Liberation from paper and subsequent analysis

How shall we see these?

n-Dimensional runs & the advantage of multi-dimensions

1-D

2-D

multi-D

X X X X X X

Xoo

o

o

rotate90degrees

o o o o

run 2nddimension

o

oo

oo

o o

oo

o

o o

oo

oo

run 1stdimension

2nd & subsequent dimensions may/may not be performed using the initial conditions

oooo

run 2nddimension

o

oo

after 1stdimension

o o o o

o o o o o

you are interestedin these seriessamples

cut out,stitch ontonew paper!

ooooo

o

o

o

o

o

oo

oo

o

now you are interestedin this vertical series cut out,

stitch ontonew paper,run anotherdimension

repeat asnecessary

You may chemically process the sample in between dimensions!

•Spray or Dip

http://images.google.com.tr/imgres?imgurl=http://ngfl.northumberland.gov.uk/dt/dtclip/textiles_alb/images/sewing%2520machine_JPG.jpg&imgrefurl=http://ngfl.northumberland.gov.uk/dt/dtclip/textiles_alb/pages/sewing%2520machine_JPG.htm&h=571&w=705&sz=43&hl=tr&start=9&tbnid=mymLdC-BhCm49M:&tbnh=113&tbnw=140&prev=/images%3Fq%3Dsewing%2Bmachine%26svnum%3D10%26hl%3Dtr%26lr%3D%26sa%3DN

Reaction between protein functional groups & reagents Many protein groups interact with reagents as a function

of their ionization state Groups can be modified most specifically in an optimum

pH range At very high pH values, reaction with hydroxide ion

typically competes


Met & neutral Cys feature substantial nucleophilicity Carboxylic acids react with diazo compounds principally

while in the protonated form. Ring carbon positions on Trp, His & Tyr can be modified,

usually via electrophilic attack. Phe is not sufficiently e--rich to promote electrophilic attack by typical protein reagents.

Typical reactivity of protic moieties as a function of pH


Q: Are there other factors, which affect apparent (macroscopic) reactivities? A: YES! Accessibility and steric considerations refer to the size &

amount of reagent used, whereabouts of reactive group, permeation time of reaction, and the conditions of reaction.

Nucleophilicity considers the base strength, solvation shell, and lone electron density, polarisability & conjugation of centers on protein groups.

Electrophilicity considers the electron density of protein groups and electron deficiency of reagents.

The “hard likes hard, soft likes soft” empirical relation does apply to some degree. Basically, the interaction of reacting orbitals and centers is in part determined by their “harness” or “softness.”


Some typical reagents of protein modification


Some typical reagents of protein modification (contin.)


Structure-function studies generally follow this procedure: Choose to address a particular issue; Envisage a strategy to perturb the protein in order to investigate the

issue; Anticipate the best groups to modify in order to create the desired

perturbation; Choose the best reagent and conditions to specifically modify the

functional groups in question; Compare the change of properties of the modified protein to that of

the control protein; Form conclusions using the Okum’s Razor argument.

Local changes and Global changes: Chemical modifications of protein groups affect pI & pKa values, solubility, surface hydrophilicity & hydrophobicity, bioactivity, folding stability & global organization.


A: One application is to carry out structure-function studies Methods can be benign or destructive A typical protocol:

Label the native protein Determine a change of property, and Extract useful information on the basis of the results.

Data collected may be related to structure: E.g., a tyrosine specific reagent cannot react with a tyrosine residue; the

tyrosine may be buried. Data collected may be analytical.

E.g., Protein X does not react with a tyrosine-specific reagent in 10M urea; the primary sequence does not contain tyrosine.

Data collected may be related to function and/or structure and function. E.g. 1., 10 Tyr residues are solvent-accessible, but 1 predominantly reacts

with trace reagent; this Tyr is unusually reactive in the protein environment. E.g. 2., After Lys-93 is modified chemically, substrate X shows a reduced

Km value; Lys-93 plays a role in binding. If the role is direct, this Lys is likely near the binding site of the active center.

Q: What are the principle applications of protein chemical modification?


Other investigative & analytical uses of protein chemical modifiers: Changing the net charge of the protein

WSC and ethylenediamine Iodoacetic acid, succinic anhydride Cyclohexanedione

Retaining the net charge but modifying the pKa Reductive formylation, amidination, guanidination

Altering groups and testing their importance Destabilizing a protein towards denaturants or reversibly protecting a protein

Citraconic anhydride, maleic anhydride Modifying protein hydrophobicity, hydrophilicity & surface activity

Adducts with different compounds such as PEG2000 Quantifying amino acids, functional groups, disulfide bridges, phosphate and other post-translational

modifications Various chemical reagents and chromatographic methods

Chemical modification is used to re-engineer proteins for improved performance in industry. The protein is characterized as much as possible The improved trait is defined A modification is envisaged to promote the property The protein is derivatized accordingly.

S-F problem?

InappropriateModification/ Conditions

Appropriate Modification/ Conditions

NOTE - Chemical treatment of biologicals need not necessarily be destructive and damaging. When conducting a structure-function study, your choice of reagent and protocol should not defeat the purpose of your study


Judging if the purpose of an investigation has been defeated• The meaning of defeating your experimental

purpose• Cases that are indifferent to over-reaction of

protein groups• Cases that require post-reaction validation

Distinguishing between apparent/macroscopic values and true/microscopic/theoretical values

• Kinetic measurements versus thermodynamic measurements

• Measuring a value via independent methods may give a true value

A time to chemically label and a time to chemically work-up: Distinguishing between the two steps

In a protein labeling experiment and its workup, we typically note:

1. Reactions conducted on native proteins are for labeling

2. Reactions conducted on protein derivatives are for labeling or work-up

3. Reactions conducted on unfolded native proteins are for labeling of controls

4. Reactions conducted on unfolded protein derivatives are for work-up

Acetylation using Acetic Anhydride or Acetyl Imidazole at pH 7.5-9 Acetylation blocks free amino termini, lysine side-chains and tyrosine side-

chains. Acetyl tyrosine is hydrolyzed at high pH values or transesterified above pH 6 by a strong nucleophile.

Acetylation of amino groups is usually quantitative only if the reaction is carried out in urea solution. Without urea, usually 60% of the lysine residues are modified.

This modification could have the following effects Decrease of protein solubility (Acetyl BSA is only soluble at pH < 5); Changes in biological activity (by either global structural changes or specific

changes in the active site); Dissociation of multimeric complexes (if the surface charges are required for

association to other biomolecules).

HN

N

NHNHC NH2

H2+N

N

O

HCH3

SCH3SS

OO

H3C

SH

CO

-O

Example modifications


Succinylation using Succinic Anhydride at pH > 7 Succinylation blocks amino groups without modifying other functional

groups. Succinyl tyrosine is hydrolyzed above pH 5 via an intramolecular

cyclization. The effect of succinylation parallels some of those described above for

acetylation. Succinylation is typically used to improve the solubility of poorly

soluble proteins, particularly at pH values above 5. Succinylation induces separation of protein aggregates (eg.

Hemerythrin dissociates into eight subunits). Succinyl proteins usually unfold more easily and demonstrate a shift in

their pH optima (assuming that activity is retained). Succinylation is often used as a linker molecule through which the

protein be attached to a foreign surface.

H N

N

N H N H C N H 2

H 2 + N

N

O

H CH 2 CH 2 CO O -

S CH 3 S S

O H

S H

C O

- O


Maleiylation (left) using maleic anhydride & citraconylation (right) using citraconic anhydride at pH > 8Maleic anhydride and citraconic anhydride block amino groups but

can be removed by incubating the protein at low pH.Maleic anhydride is more difficult to remove & sometimes leads to

a minor side-reaction, in which Cys adds across the double bond.Citraconic anhydride is used to reversibly block amino groups and

to protect the amino groups from other chemical reactions. E.g., citraconylation will protect amino groups from subsequent oxidation by

hydrogen peroxide. Once Cys’s are oxidized, amino groups are regenerated. Citraconylation is the method of choice to temporarily solubilize

poorly soluble proteins.

sometimes alittle bit hereC

O-O

S

O

O

O-

O-

H

OH

SS SCH3

N

O

HCH CHCOO-

NHC NH2

H2+N

N

NH

HNHN

N

NHNHC NH2

H2+N

N

O

HCH CCOO-

CH3

SCH3SS

OH

SH

CO

-O


Polypeptidylation using carboxyanhydrides at pH 7Carboxyanhydrides add onto amino groups, releasing carbon

dioxide and producing new amino groups. In presence of excess reagent, the reaction repeats, building a long

polypeptide chain that extends into solution.Technique can improve solubility of insoluble proteins (R = H) &

conversely, reduce the solubility of soluble proteins (R = i‑propyl). Polyvalylribonuclease, for example, aggregates in solution above 30C

Used to study hydrophobic interactions between proteins.

H N

N

N H N H C N H 2

H 2 + N

N

O

H CH R

N H

O

CH R

N H 2

S CH 3 S S

O H

S H

C O

- O

n

N H O

R O

O


Trifluoroacetylation using Ethyl Thiotrifluoroacetate at pH 10 Ethyl thiotrifluoroacetate is used to reversibly block amino groups as in the case of

maleic anhydride and citraconic anhydride. But the derivative has a net zero charge. Furthermore, the group is removed under basic

conditions using sodium carbonate at pH 10.7 or 1M piperidine. E.g. 1., pancreatic ribonuclease was deactivated completely following

trifluoroacetylation yet there were no measurable structural changes. In carbonate buffer, activity was gradually restored.

E.g. 2., trifluoroacetylation caused structural changes in cytochrome c. Incubation in carbonate buffer restored full electron transfer ability.

Trifluoroacetylation can simplify protein sequencing if trypsin is used to cleave the protein

Trypsin normally cleaves after every free arginine and lysine residue, generating a complicated mixture. If the protein analyte is first trifluoroacetylated, trypsin can only cleave at the arginine bonds.

The fragments are isolated, deblocked and subjected to a 2nd tryptic digestion at the lysine sites

HN

N

NHNHC NH2

H2+N

N

O

HCF3

SCH3SS

OH

SH

CO

-O


Amidination using methylacetimidate (left) or guanidination using o-methylisourea (right) at pH 7-10.Methylacetimidate forms stable derivatives with lysine residues

and amino termini. O-methylisourea reacts with lysine groups.Modification of amino groups increases steric bulk, while retaining the

positive charge. The pKa of derivatives shifts to values well above 11! Most modifications do not give significant structural changes.

Q: Suggest a test to postulate if the active-center lysine is the catalytic nucleophile or a binding group.

Q: Suggest another test to determine if a lysine and a neighboring aspartic acid maintain a crucial salt bridge. Hint: You will analyze the pH activity after reaction.

HN

N

NHNHC NH2

H2+N

N

N+H2

HCH3

SCH3SS

OH

SH

CO

-O

HN

N

NHNHC NH2

H2+N

N

N+H2

HNH2

SCH3SS

OH

SH

CO

-O


Ethoxyformylation using diethylpyrocarbonate (ethoxyformic anhydride) at pH 4 Ethoxyformic anhydride only reacts with imidazole groups at pH 4; it modifies

amino groups at basic pH values. The acyl-imidazole adduct is stable in water, particularly at pH 7, unlike other

acyl His derivatives. The group is removed by the action of H2N-OH at pH 7. EFA is used in molecular biology to battle against RNAse. EFA can rapidly inactivate many enzymes including trypsin and is used by

industry to cold-sterilize food. EFA can aid structure-function studies if histidine is important for bioactivity.

Q: You must decide if a Lys or a His is essential for catalysis. What experiment could you design using the reagents we have discussed thusfar?

HN

N

N

O

H3CH2CONHC NH2

H2+N

N

O

HOCH2CH3

SCH3SS

OH

SH

CO

-O


Reductive dimethylation using H2C=O & NaBH4 (left) at pH 9; Variable methylation using ICH3 (right) at pH 2-10 Reductive methylation mono/dimethylates amino groups, retaining the positive

charge of the amine. Generally, structural changes are not observed following reductive methylation.

Reaction with iodomethane quaternizes amino groups with retention of positive charge. Met and Cys are converted to their sulfonium iodides. Tyr is methylated and His is converted to the dimethylimidazolium iodide.

Reaction selectivity is tuned by appropriate choice of pH & reaction medium. Unlike reductive methylation, iodomethane:

Puts a permanent positive charge on amino, imidazole and sulfide groups Removes the hydrogen bonding ability of tyrosine

HN

N

NHNHC NH2

H2+N

+NHCH3

CH3

SCH3SS

OH

SH

CO

-O

HN

N+

N

H3C

CH3

NHC NH2

H2+N

+NCH3

CH3

CH3

S+(CH3)2SS

OCH3

S+(CH3)2

CO

-O


Carboxyalkylation using iodoacetate (left) & carbamino-alkylation using iodoacetamide (right) at pH 2-10. Iodoacetate & iodoacetamide react similarly to iodomethane

except tyrosines are generally not modified.Derivatives are larger than iodomethaneDerivatives may be considered bulkier.

In the case of iodoacetic acid, a negative moiety is introduced. Iodoacetamide, iodoacetic acid and iodomethane are commercially

available in NMR-active and radioactive forms.

HN

N+

N

-OOCH2C

CH2COO-

NHC NH2

H2+N

+NHCH2COO-

CH2COO-

S+CH2COO-H3C

SS

OH

S+(CH2COO-)2

CO

-O

HN

N+

N

H2NOCH2C

CH2CONH2

NHC NH2

H2+N

+NHCH2CONH2

CH2CONH2

S+CH2CONH2

H3CSS

OH

S+(CH2CONH2)2

CO

-O


Esterification using diazoglycinamide at pH 5 (left) or 0.1M methanolic HCl (right).Reactions are acid catalyzed

With diazoglycinamide, the strongest proton donors (C-terminal carboxylic acids) react first

With methanolic HCl, esterification hastens as [HCl] increases.Esterifıcation time course experiments often correlate to a loss of

biological activity

HN

N

NHNHC NH2

H2+N

NH2

SCH3

OH

SH

CO

H2NCCH2OO

S S HN

N

NHNHC NH2

H2+N

NH2

SCH3

OH

SH

CO

H3CO

S S


Amidation using amine, hydroxysuccinimide, and ethanolamine at pH 4.75Amidation is a general method of converting protein into

many useful forms.By appropriate choice of amine, a positive, zero or negative

charge can be introduced at the carboxylic acid sites.E.g. 1., immobilization of enzyme has been carried out using

carboxyl functions. E.g. 2., chemical modification can be used to probe the active

site carboxylates of pepsin.

HN

N

NHNHC NH2

H2+N

NH2

SCH3

OH

SH

CO

HOCH2CH2NH

S S


Reduction using mercaptoethanol or dithiothreitol (reduced form) at pH 8.Cystine bridges can be cleaved to afford two cysteines by the

action of mercaptoethanol or dithiothreitol. In the case of DTT, a stoichiometric amount of reagent is sufficient to bring

about the modification. The driving force for the reaction is formation of a stable 6-membered ring.

Reduction has been used to study bioactivity & structure. In many cases, reduced proteins exhibit changes of solubility & activity.Reoxidation of reduced cysteines sometimes reforms the correct disulfide

bridges & restores structure and function.

HN

N

NHNHC NH2

H2+N

NH2

SCH3SH

OH

SH

CO

HO

SH


In competitive labeling, a modification ALWAYS reflects the state of the native protein. Why? Reagent is used in trace amounts

At most 1 group/protein is modified Thus, only the reactivity of the native protein is probed

Above: A typical experiment quantifies the statistical distribution of the kinetic reactivity of protein groups the reactivity of every reactable group is probed at different pH values. While the results describe apparent data, they are used to interpolate a group’s

local environment.

Molarradiolabelincorporation(apparentrelativereactivity) with respectto Phe-NH2

pH value of reaction

1-

0.5-

apparent pKaof reactive group

Apparent relative reactivity and pKa values of 3 proteingroups with respect to an external Phe-NH2 standardCompetitive Labeling

Many methods used to modify different functional groups

Generally the modifier is used in excess → proteins modified extensively.Onus is put on the investigator to prove that the results reflect the properties of the native protein.


Steps: 1.Tracer. Incubate a solution containing protein & standard (e.g., Phe-NH2) with a trace amount of 3H-reagent. Tritium incorporation is your probe. Carry out this operation at many pH values.

2.Normalization. Unfold the protein in urea and react the remaining groups completely with 14C labeled reagent. This protocol ensures that all protein and standard derivatives become chemically homogeneous.

3.Quantification. Extract the standard quantitatively into an organic phase and quantify the 3H/14C ratio for each pH value. Digest the protein using enzymes so that only one modified group is present per peptide. Migrate all peptides in an electric field. Identify peptide positions using autoradiography. Collect each band & quantify the 3H/14C ratio. Sequence the peptide in order to identify the derivative.

4. Interpretation. The 3H/14C ratio of protein & standard are compared. The reactivity of the standard reflects its aqueous environment & should parallel its titration curve. The reaction profile reflects the groups’ local environment so you should observe a “titration curve”, which illustrates the effect of the protein environment. This titration curve may deviate significantly from what would be expected in an aqueous environment. The data can thus be used to build a picture of the local environment.


Two data can be extracted for every group: The kinetic pKa of the group & the relative reactivity of the group can be assessed in comparison to Phe-NH2, which has a well-characterized reactivity and pKa that is devoid of any environmental influences. The two data help to assess if any potential steric considerations or stereoelectronic factors are out of the ordinary.

Below are some example results, which allow you to appreciate the power of this method:

e.g. Titration curve of Cys gives a pKa of 3. Perturbed pKa. e.g. Titration of Lys-29 gives poor reaction when reacted from pH 5 to

11. Buried. e.g. Titration of Lys-29 gives a continuous titration curve with an

interpolated pKa of 11 when reacted from pH 5 to 11 (A normal lysine is 10.5). Lysine is buried part of the time or otherwise perturbed.

e.g. Titration of Lys-29 gives a discontinuous curve when reacted from pH 5 to 11. First there is no reaction, then after pH 10.5 the reaction is very high. Conformational change and accessibility.

e.g. The N-terminal Histidine imidazole ring displays a pKa value that is equal to the pKa of its N-terminal amino group. Inductive effects and coupled reactivities.

e.g. Topography of E.coli Ribosomal Protein L12 in situ Eur. J. Biochem 80, 35-41 (1977). Next slide (please).© 2006, Alpay Taralp, Sabanci University


The power of competitive labeling and paper methods

RECAP - Steps of competitive labeling: 1. Label each protein at most once, using a trace of tritiated reagent

His

NH2

LysTyr

SH3C

1000

1000 AcHisNH2

+100 *3-Me-I

95 *3-Me-OH

(aq)(aq)

SH3C

Me-3H*

Tyr Lys

NH2

His

(aq)

SH3C

Tyr Lys

NH2

His

Me-3H*

(aq)

SH3C

Tyr Lys

NH

Me-3H*

His

(aq)

SH3C

Tyr Lys

Me-3H*

NH2

His

AcHisNH2

Me-3H*999

997

111

in a pH "X" solution

AcHisNH21

Standard, with known pKa value & reactivity!

+

+ +

+

++

+

+excess *14-Me-I/12-MeI mixture

99% *14-MeOH/12-MeOH

in 8M urea pH 10 solution

+SH3C

Me

Tyr

Me

+Lys

Me

Me

Me

+N

Me

Me

Me

+His

MeMe

(aq)

+SH3C

Me

Tyr

Me

+Lys

Me

MeMe

+N

Me

Me

Me

+His

Me-3H*Me

(aq)

+SH3C

Me

+Lys

Me Me

Me

+N

Me-3H*

Me

Me

+His

MeMe

(aq)

+SH3C

Me

Tyr

Me

+LysMe

-3H*

Me

Me

+N

Me

Me

Me

+HisMe

Me

Ac+HisNH2

Me-3H*Me

999

997 1

11

Ac+HisNH2

Me Me1

+

++

+(aq)

MeTyr

(aq)

SH3C

Tyr Lys

NH2

His

(aq)

SH3C

Tyr Lys

NH2

His

Me-3H*

(aq)

SH3C

Tyr Lys

NH

Me-3H*

His

(aq)

SH3C

Tyr Lys

Me-3H*

NH2

His

AcHisNH2

Me-3H*

999

997 1

11

AcHisNH21

+

++

+

Step 2: Label all protein completely using excess low specific actitivity 14C-MeI to yield a chemically homoge-neous mixture

Step 3. Repeat steps 1 & 2 using more native protein; label many samples at different pH values.

Step 4. Prepare a positional marker protein; label native protein with high-specific activity 14C-MeI, then excess 12C-MeI in 8M urea

Step 4. Digest all methyl-proteins separately. Use 2-4 proteases.

Step 5. Separate the different peptides in an electric field along tlc or paper; use 1-4 dimensions; spot the marker peptides at the ends of the chromatogram.

Step 6. Expose X-ray film to the chromatogram (RT for 14C, -80˚C for 3H) to find the position of all substantially radioactive peptides.

Step 7. Find the 3H/14C ratio of each peptide band; sequence each to identify the reactive group (e.g., AlaTyr*Gln can only be Tyr18)

Step 8. Plot the reactivity (i.e., 3H/14C ratio) of each reactive group as a function of pH.

Subsequent Steps (for proteins of known 1˚ sequence):

cathode (-)

anode (+)

3kV,30min,pH 2.1

Peptide solutions spottedalong paper at origin

14C-marker peptides

x x x x x xx xx x x x x x x x x7.1 7.2 7.3 7.4 7.5 7.6 7.7 7.8

pH of labeling7.9 8.0 8.1 8.2 8.3 8.4 8.5

Paper chromatogram beneath;X-ray film superimposed on top

Recover & measure 3H cpm/14C cpm

e.g.1; peptide 8 @ pH 7.8 = 512/1890

e.g.2; peptide 7 @ pH 7.6 = 450/5680

e.g.3; peptide 6 @ pH 8.1 = 890/1543

. .

..

.

.

= radioactive ink spotted on Whatmann 3MM paper1

23

4

5

6

7

8

9

Peptide 6 was sequenced: AlaTyr*Gln indicates a Tyr reacted. There are 4 tyrosyl residues in this protein but the sequence data indicates Tyr18

Molar 3H/14Cradiolabelincorporation(apparentrelativereactivity) with respectto AcHis-NH2

pH value of reaction(referenced against known pKa of AcHis-NH2)

1-

0.5-

apparent pKaof reactive group

Apparent relative reactivity and pKa values of 5 proteingroups with respect to an external AcHis-NH2 standard

His102

Tyr18

Met1

Lys87

NH2

AcHis-NH2

In vitro manipulatin of protein monomers or their environments to

enhance performance


Goals: To formulate broad-scope protein preparations, which are: Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes


Geneticmanipulation


Native

search for robustness:

Approach 1: use proteins in water-free media

Approach 2: crosslink Approach 2: crosslink proteins togetherproteins together

Approach 3: chemically Approach 3: chemically modifymodify proteinprotein monomersmonomers


As scientists, you will occasionally prepare proteins, which are not in water. Please note the following pharmaceutical example:

Insulin Lysozymeintraperitoneal injections creams & gelstime-released matrix inhalable microcapsulesİnhalable microcapsules drop formulations

Question: What are some biases related to proteins in unusual environments?

Fact 1: Protein structure & function is sensitive to pH, temperature, ionic strength, etc.

Fact 2: Eminent scientists have said “Enzymes need aqueous environments to function because this is how Nature has intended them to function.”

Question: In your profession, you may prepare proteins in unusual environments. Should you worry?


Answer: NO! Recent evidence shows that protein structure and protein function may be very tolerant to strange environments!

Today, we will discuss the structure & function of proteins in reduced-water & water-free environments

Why does mankind wish to use biologically active proteins?Proteins accelerate chemical reactionsProteins improve product propertiesProteins permit novel syntheses

Here are some typical industrial applications:BioreactorsTextile treatmentMedicinal and organic synthesesProtein drugs & drug deliveryBiosensorsBioremediationFood preparation industries

Problem? Industrial conditions are often too harsh for proteins in the native state.Consequences: Poor biological activity, short lifespan, limited reaction parameters, etc.



Example I: Rate comparisons in octaneEnzyme kcat/Km (M-1s‑1) Rate (kENZ) Rate(kNONENZ) Enhancement

Chymotrypsin 0.7 1.1 x 10-11 6.4 x 1010

Subtilisin 1.8 1.1 x 10‑11 1.6 x 1010

N‑Ac‑L‑Phe‑OEt + amyl alcohol → N‑Ac‑L‑Phe‑OAmyl + EtOH in octane

NOTE – Reactions in water are much faster!

The new view: Proteins can maintain their structure & function without an aqueous environment.

Implication? Your protein drug formulations may beperfectly happy in a gel, or a cream or in a sugar-coatedmatrix

Typically kcat/Km water / kcat/Km

octane = 104-107

Q: Can we improve the speed of an enzyme reaction in octane? A: YES! But first we must understand more...


Q: -Momorcharin was crystallized in water (thin lines), then crosslinked, & solvent-exchanged with CH3CN (thick lines). Compare the structures.

Similar results were obtained with: chymotrypsin in hexane subtilisn carlsberg in acetonitrile

Example II: Protein Structure & Integrity

H2O

H2O

H2OH2OH2O

H2O H2O

H2O H2O

H2O H2O

H2OH2O

H2O

o oo

o oo

o oo o oo

o oo

o ooorganics organics organics

organicsorganicsorganics

organics organics

organicsorganics

organicsorganicsorganics

organicsorganics

organics


Example III: Solvent Effect on Catalytic RatesLyophilized (pH 7.8) chymotrypsin + dry organic solvent CT suspensionLyophilized (pH 7.8) subtilisin + dry organic solvent SBL suspension

Solvent Vmax/Km (min-1 x 10-6)Subtilisin Chymotrypsin

Hexadecane 3900 4300Octane 2000 1700Carbon tetrachloride 340 96Toluene 150 120Ethyl Ether 97 48Acetone 810 0.6Acetonitrile 150 0.4Dimethylformamide 19 <0.1Dimethylsulfoxide <0.1 <0.1

N-Ac-L-Phe-OEt + n-propanol N-Ac-L-Phe-n-OPr + EtOH

Q: Is the above data incorrect?A: NO! Products were NEVER formed unless the enzyme was added!


Example IV: Active Site Integrity

SO2 OSer in active site

Q: Chymotrypsin(dispersed in octane) + phenylmethylsulfonyl fluoride yielded inactive enzyme. What is implied?

Question: How do we explain this trend?

Answer: Hydrophilic organic solvents strip essential water from the enzyme.

What is the evidence? Enzyme activity correlates directly with the amount of water retained by

the “dry” enzyme Enzyme activity improves greatly by adding small amounts of water

(1.5%) to the hydrophilic organic solvents.


Example V: Rate Measurements in octane

Q: Rate measurements in organic solvents behaved as if in aqueous solution! What is implied?

Start FinishE EF

A P B Q

E + S ES E + Pk1

k-1

k2

B1

B2B31/v

1/[A]o

v

oE][= o +

oA

A

][

+

oB

B

][


Example VI: pH activity

Q: Chymotrypsin activity in octane depends on the pH of solution from which the enzyme was dried. Why?

A: pH “memory”

pH

kcat/Km

kcat/Km

LpH

Lyophilize

Choice 1: set pH here andlyophilize

Choice 2: lyophilize and thenoptimize pH with organicsoluble buffers

o

o

o

o

o

o

o

o

o

organics

organics

organics

organics

organics

organics

organics

organics

organics

organics

organics


Q: Enzyme dried in the presence of acetyl-L-phenylalanine was more active. Why?

A: Positional imprinting of active site groups!

o

o

o

o

o

o

o

o

o

Example VII: Imprinting protein in the dry state


Example VIII: Competitive inhibition of chymotrypsin in water & in octane

Inhibitor Inhibition Constant KI(mM)In water In octane

Benzene 21 1000Benzoic Acid 140 40Toluene 12 1200Phenylacetic acid 160 25Naphthalene 0.4 11001‑Naphthoic acid 7.2 3

Q: Good competitive inhibitors in water are poor competitive inhibitors in octane! Poor competitive inhibitors in water are excellent competitive inhibitors in octane! WHY?


Example IX: Substrate specificity (kcat/Km) of chymotrypsin & subtilisin (in water or octane)

Chymotrypsin Subtilisin

Substrate hydrolysis transesterification hydrolysis transesterification

N‑Ac‑L‑Phe‑OEt 4.00 x 104 0.72 1.3 x 104 1.7N‑Ac‑L‑His‑OMe 2.00 x 102 1.5 5.5 x 102 3.1N‑Ac‑L‑Ser‑OMe 0.87 x 102 2.5 1.6 x 102 4.5

Q: Hydrophobic groups yield better substrates in water! Hydrophilic groups yield better substrates in octane! WHY?


Initial Rate (mmol h-1)/mg proteinSolvent L-enantiomer D-enantiomer enantioselectivity

(vL/vD)acetonitrile 0.85 0.12 7.1pyridine 0.645 0.15 4.3acetone 0.54 0.41 1.3dichloromethane 0.29 0.33 0.88methyl tert‑butyl 2.2 6.4 0.34octane 2.9 12 0.24tetrachloromethane 1.7 8.9 0.19

Q: Acetyl-L-Phe-OEt is a better substrate in polar organic solvents, whereas acetyl-D-Phe-OEt is a better substrate in very hydrophobic solvents. WHY?

Example X: Rates and enantioselectivities of propanol/N‑Ac‑Phe‑OEt transesterification using Aspergillus oryzae protease in anhydrous solvent


Q: Protein structure & function may suffer in proceeding from water to dry environments. How can we help?

o

o

o

o

o

o

o

o

o


the fraction (f) of the active subtilisin in the CLCs

A linear profile (black) indicates that diffusion is not rate-limiting. A convex profile (red) would be expected if diffusion was an important factor.

o

o

o

o

o

o

o

o

o


In summary, when proteins are put into unusual environments they can survive!

Final Question: What are some advantages of using enzymes in water-free environments?

Novel reactions, which are not feasible in water, become posssible Use of “non-water” nucleophiles, e.g. transesterification and ester ammonolysis; Increased solubility of apolar substrates; andshifting thermodynamic equilibria to favor synthesis over hydrolysis, e.g., esterification and peptide formation.

Suppression of water‑mediated side-reactions. Alteration of substrate specificity. Enhanced thermostability of enzymes. Easy recovery of enzyme from low boiling solvents.


Crosslinking Protein Solids to Enhance Utility


You may prepare protein drug formulations, which are not in water:

We saw some examples just before:Insulin Lysozymeintraperitoneal injections creams & gelstime-released matrix inhalable microcapsulesİnhalable microcapsules drop formulations

New Facts: Protein structure & function can be stable in unusual environments if you use the correct procedure! What Are The Results???? Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes

Question: In your profession, you might wish to prepare “super” proteins, which are very effective in various applications. What are your available strategies? © 2006, Alpay Taralp, Sabanci University

Strategies to engineer better proteins


Geneticmanipulation


NativeApproach 1: use proteins Approach 1: use proteins in water-free mediain water-free media

Approach 2: crosslink proteins together

Approach 3: chemically Approach 3: chemically modifymodify proteinprotein monomersmonomers



Approach 2: Crosslinking of protein

Crosslinking of protein solids

Solution phase crosslinked enzymesCrosslinked enzyme crystals

Native

Q: Why crosslink protein?

Improved structural stability

Potentially altered function


Established crosslinking media:A. In a solubilizing environment

Q: What are the characteristics of method A?Solution-phase crosslinking often leads to polydispersityEvents: Soluble monomers → Dimerization → Oligomerization →→→ Insolubilization (typically gelation)


http://images.google.com.tr/imgres?imgurl=http://www.shirleyqliquor.com/mpc/docs/images/jello.jpg&imgrefurl=http://www.shirleyqliquor.com/mpc/docs/images/&h=166&w=200&sz=6&tbnid=MIfh9BCx4awjbM:&tbnh=82&tbnw=99&hl=tr&start=15&prev=/images%3Fq%3Djello%26svnum%3D10%26hl%3Dtr%26lr%3D

B: In the crystalline state

Q: What are the characteristics of method B?Crosslinked enzyme crystals are biologically active catalysts with well-defined poresEvents: Soluble monomers → Crystallization → Chemical crosslinking →→→ Insolubilization


http://images.google.com.tr/imgres?imgurl=http://liftoff.msfc.nasa.gov/shared/news2003/protein/protein1.jpg&imgrefurl=http://liftoff.msfc.nasa.gov/news/2003/news-protein.asp&h=215&w=200&sz=7&tbnid=qZsHzjfpHqOEHM:&tbnh=101&tbnw=93&hl=tr&start=15&prev=/images%3Fq%3Dprotein%2Bcrystal%26svnum%3D10%26hl%3Dtr%26lr%3D%26sa%3DN

C: In the lyophilized state

Q: What are the characteristics of method C?Crosslinking of lyophilisates leads to polydispersityEvents: soluble monomers → lyophilization at desired pH value → vacuum, heat and/or chemical crosslinker → oligomerization →→ insolubilization (powder resists swelling in solvents)


D: As a precipitate retaining the native structure

Q: What are the characteristics of method D?Events: Soluble monomers → Salt-out or insolubilize with miscible organics → Add chemical crosslinker → Oligomerization →→ Insolubilization (powder resists swelling in solvents)


E. Supported enzyme technologies

Surface-immobilized enzymes

Crosslinked andintertwined enzymes

Surface-immobilizedenzyme aggregates

Q: What are the characteristics of method E?


Q: How do the proteins eventually combine together?Method A: Classic solution-phase crosslinking of dissolved proteins


Q: Why is crosslinking solution-phase proteins (Method A) not equivalent to crosslinking of “pre-solidified” proteins (Methods B-

D)?

When reactive chemical crosslinkers are employed, the outer enzymes preferentially react at time t = 0 → teq.

Enzyme crystal

Amorphous solid (enzyme aggregate or lyophilisate)

Method A: In the solution phase, all enzymes have equivalent chances of reacting at time t = 0.

Solution phase

Method B

Methods C & D


Q: How are proteins bonded together?A: Chemical (& thermal) strategies

Common reagents: glutaraldehyde, glyoxal, glycolaldehyde, formaldehyde, WS carbodiimide, bifunctionals such as alkyldiimidates, diisoureas, diketohalides, disulfonates & ditresylates, cyanuric Cl

Reagent-free, thermal crosslink induction: www.proteovak.com

The physico-chemical properties of the interprotein bond is variable:

I. Length: Crosslink varies from zero length to multi-carbon units

II. Bonding is generally mediated by the protein amino groups and carboxyl groups. Cystine bonds are also important.

III. Some reactions proceed with charge retention of the bonded groups, whereas other reactions proceed with a change.

CN

O

H

NH (CH2)n NH

N+H2 N

+H2

NH NH

OO

OO

N+H2N

+H2

S ?

SS

s?

N+H2 N

+H2

(CH2)n

N+H2 NH N

+NH

H

NN

[ ]nO O

NN

NO2O2N

NN

H H


Q: Let us please summarize approach 2: protocols, reagents & products

A. In aqueous solution

B: In the crystalline state

C: In the lyophilized state

D: As a precipitate retaining the native structure

Your choice of reagent? Prior history; Surface accessibility and steric constraints; & effective pH during the reaction conditions

A: glutaraldehyde, other simple aldehydes,

EDC.HCl, diimidates, difluorodinitrobenzene

B: glutaraldehyde, other simple aldehydes

C: volatile or organic soluble carbodiimides, acyl group activators, disulfide exchange, thermal induction of amide bonding, bifunctional acyl halides and similar reagents

D: glutaraldehyde, dextran polyaldehyde© 2006, Alpay Taralp, Sabanci University

Re-engineering Protein Monomers via Chemical Modification in Aqueous,

Organic or Dry Environments

HN

N

NHNHC NH2

H2+N

+NH3

SCH3SS

OH

SH

CO

-ONH2Native protein at slightly

alkaline pH Values


Q: What are some chemicals to alter protein charge? Ac2O, MeI, succinic anhydride, H2NCH2CH2NH2/ carbodiimide, iodoacetic acid

Q: What are some chemicals to alter protein hydrophobicity? Octadecyliodide, PEG-tresylate

Q: What are some reaction environments to alter protein groups?

Aqueous reagents acting of dissolved protein Organic-phase reagents acting on protein powder Vapor phase reagents acting on protein powder

Let us examine the fate of one enzyme, whose charges were altered in water using chemical reagents

modify the enzymein specific areas new enzyzme is faster or

new enzyme is stabler ornew enzyme has different pH activity

Review of Approach 3: Chemical modification of proteins to alter properties


NH3+

example

inactive

COO-

COO--

OOC

-OOC

-OH

-OOC

-OOC COO

-

COO-

active

NH2

pH

catalyticrate

pH 7.5

Q: In this hypothetical example, is the enzyme active at low or higher pH values?

Q: How do you rationalize the difference?


ethylene diamine + carbodiimide

-OH

+H2NCH2CH2NC

O

H

OC+ED

CO

ED+inactive

NH3+

CED+

O

NH2

active

COO-

COO--

OOC

-OOC

-OH

-OOC

-OOC COO

-

inactive

NH3+

COO-

CED+

O

NH2

activeCO

ED+

OC+ED

+EDC

O

Q: What happens to the charge distribution along the surface?© 2006, Alpay Taralp, Sabanci University

pKa 7.5

catalyticrate

pH

pKa 6.5

Gibbsenergy

reaction coordinate

Eu

EF

EF

EuQ: You may have obtained a desirable change, but many times the change is not free... What price have you paid in this example (right graph)?

Q: In this hypothetical example (left), what has happened to the pH activity of your antimicrobial protein drug (two things)? Please rationalize...

Q: Is this change desirable (local pH of infected regions is lower)?


Q: So where do we stand?

User-friendly protocols to aggregate & crosslink protein and to improve stability

Chemical agents to alterprotein groups & function

Know-how to use proteinsin water-free environments

Thus...© 2006, Alpay Taralp, Sabanci University

My suggestion

Combine the 3 strategies to enhance the performance of proteins as drugs, etc., and as enzymes

Examples of heat-crosslinked protein & analysis


Addition to Notes:Protein Purification &

Related Analytical Methods

Some ReferencesHarris, E.L.V. and Angal, S. eds., Protein Purification Methods: A Practical Approach, 1989, IRL PressHarris, E.L.V. and Angal, S. eds., Protein Purification Applications: A Practical Approach, 1990, IRL PressDeutsher, M.P., ed., Methods in Enzymology, Guide to Protein Purification, 1990, Academic Press


If you wish to obtain new proteins, you must understand how to purify & test these proteins

Three protein types:MembraneIntracellularExtracellular


Disruption

Organelle Isolation

Disruption/Solubilization

Clarification

Centrifugation/flocculation or Liquid 2-Phase Partitioning

Liquid 2-Phase Partitioning

Ammonium Sulphate Precipitation

Organic SolventPrecipitationPrimary Separation Techniques

Chromatography Techniques

Ion Exchange Hydrophobic Interaction

ChromatofocussingIon Exchange

Other AbsorbtionMethods

Gel Permeation

Metal Chelate CovalentHydrophobicInteraction

Affinity

Intracellular Protein



To purify a protein, here are some general rules:

1. The following methods are used to purify protein:

Technique Exploited Protein PropertypH Precipitation Charge, pI valueAmmonium Sulphate Precipitation Intermolecular charge/

hydrophobic interactionsIon exchange chromatography ChargeHydrophobic Interaction chromatography HydrophobicityChromatofocusing Charge, pI valueDye Affinity chromatography Affinity for high MW dyeLigand Affinity chromatography Bioactivity/affinintyGel permeation chromatography Dynamic volume (size)

2. No single method is perfect, so you should use many together

Example manipulation:Starting purity = 10%, yield is 10mg of target proteinAfter AS precipitation, purity = 60%, yield is 8mg of target proteinAfter Chromatography, purity = 95%, yield is 4mg of target protein

3. Protein is characterized after purification: Final yield? MW? N- & C-terminal analysis, disulfide bridge analysis, etc.


Your target is insulinpI = 4.5, MW = 5600

S

S

S

S

NGlycine

NValine

HN

N

N

NH

HO

OH

OHOH


With recombinant Human insulin:Lysis the cells, centrifuge, & collect your sample: A solution of insulin, other proteins, DNA, salts, metal ions, etc. You deterimine 1% insulin, 99% other proteinsand a total of 2g insulin in the batchHow to purify?

SDS gel analysis

Centrifuge lysed insulin

pH pptation

AS pptation

ion exchange

hydrophobic chromatography

insulin receptor column


Protein purity and analysisQ: What is PURE ENOUGH?Q: What is the meaning of Purity? Free of other proteins? Free of ions? Free of DNA? Free of protease activity?

Q: In the case of insulin, what parameters should you analyze (and how)?Size (many)Primary sequence (many)N-terminus, C-terminus (many)Surface-accessibility of various groups (Chem. Mod.)Shelf-life at different humidities (Solubility)Amino group count (Kaiser test)Histidyl group count (Pauli test)pKa of ionizable groups (competative labeling)Secondary sequence (CD)Tertiary sequence (X-ray, NMR, receptor assay)


e.g. Purifying insulin as a drug productRoutes of application:Injection of suspended insulin under skinImplant materialsMicrocapsules containing insulin for lung absorption

What aspects of purity should you examine if you prepare insulin as a drug formulation?

SterilityStorage life and requirmentsBiological activityBioavailabilityMany of the previous criteria!


Date post:	27-Jan-2016
Category:	Documents
Upload:	leslie-mccormick
View:	212 times
Download:	0 times

Lecture Series Special Topics in Protein Chemistry (equivalent to a 2- credict course) Lecturer:...

Documents