Date post: | 27-Jan-2016 |
Category: |
Documents |
Upload: | leslie-mccormick |
View: | 212 times |
Download: | 0 times |
Lecture SeriesSpecial Topics in Protein Chemistry(equivalent to a 2-
credict course)
Lecturer: Alpay Taralp, Materials Science & Engineering Program, Sabancı University, Istanbul 34956; [email protected]; http://people.sabanciuniv.edu/~taralp/
Day 1 (3h)Introduction to protein chemistryStrategies used by enzymes to accelerate reaction ratesDay 2 (3h)Protein stability elucidated and enhanced via protein engineeringProtein folding & unfolding probed via protein engineeringDay 3 (2.5h)Protein folding and unfolding probed via protein engineering (continued)
Day 4 (3h)The combined power of in vitro chemical modification and paper-supported chromatography as a probe of structure and functionDay 5 (3h)The combined power of in vitro chemical modification and paper-supported chromatography as a probe of structure and function (continued)In vitro manipulation of protein monomers or their environment to enhance performanceDay 6 (2.5h)In vitro manipulation of protein monomers or their environment to enhance performance (continued)A closer look at optimizing protein function in non-aqueous environmentsProtein purification and related analytical methodsShort examination scheduling
Lecture Topics by Day
© 2006, Alpay Taralp, Sabanci University
Introduction to Protein Chemistry
© 2006, Alpay Taralp, Sabanci University
References Relevant to this Material1. Lundblad, R.L., Techniques in protein modification, CRC Press, 1995, 0-8493-2606-02. Wong, S.S., Chemistry of protein conjugation and crosslinking, CRC Press, 1991, 0-8493-5886-83. Nagradova, N.K., Lavrik, O.I., Kurganov, B.I., Chemical Modification of Enzymes, Nova Science, Inc., 1995, 1-5607-2238-X4. Brown, W.E., Howard, G.C., Practical Methods in Advanced Protein Chemistry, CRC Press, 2000, 0-8493-9453-85. J.M.Walker, Ed., Protein protocols on CD-ROM, Humana Press, 1998, 0-89603-514-X6. Darbre, A., Practical Protein Chemistry: A Handbook, John Wiley and Sons, 1986. 7. Eyzaguirre, J., Chemical Modification of Enzymes: Active Site Studies, Prentice Hall, 1987, 0-47020-763-98. Methods in Enzymology Series, Academic Press,Vols.11, 25-27, 47-49, 61, 91, 117, 130, 131, 135-137.9. Glazer, A.N, Delange, R.J., Sigman, D.S., Chemical Modification of Proteins, Elsevier Science, 1975, 0-44410-811-410. Feeney, R.E., Whitaker, J.R., American Chemical Society Advances in Chemistry Ser. (No. 160) - Food Proteins: Improvement Through Chemical & Enzymatic Modification, Books on Demand, 0-31710-649-X11. Feeney, R.E., Whitaker, J.R., Modification of Proteins: Food, Nutritional & Pharmacological Aspects, 1982, 0841206104 Advances in Chemistry Ser. (No. 198) American Chemical Society12. Feeney, R.E., Whitaker, J.R., Protein Tailoring & Reagents for Food & Medical Uses, Marcel Dekker Incorporated, 1986, 0-82477-616-X.13. Bailey, J. L., Techniques in Protein Chemistry, Elsevier Publishing Company, 1962, Lib. Congress 62-19691.14. Lundblad, R. L., Chemical Reagents for Protein Modification, 2nd Ed., CRC Press, 1991, 0-8493-5097-2.15. Walsh, G., Headon, D.R., Protein Biotechnology, John Wiley and Sons, 1994, 0-471-94393-2.16. Mean, G., Feeney, R.E., Chemical Modification of Proteins, Holden Day, Inc., 1971, Lib. Congress 74-140785.17. McGrath, K., Kaplan, D., Protein-Based Materials, Birkhauser, 1996, 0-8176-3848.18. Koskinen, A.M.P. and Klibanov, A.M., Enzymatic Reactions in Organic Media, Blackie Academic and Professional, 1996, 0-7514-0259-1.19. Suckling, C.J., Gibson, C.L., Pitt, A.R., Enzyme Chemistry: Impact and Applications, Blackie Academic and Professional, 1998, 0-7514-0362-8.20. Magdassi, S., Surface Activity of Proteins: Chemical and Physicochemical Modifications, Marcel Dekker, Inc., 1996, 0-8247-9532-6. 21. Rawn, J. D., Proteins, Energy and Metabolism, Neil Patterson Publishers, 1989, 0-89278-404-0.22. Fersht, A.R., Structure and Mechanism in Protein Science, W.H. Freeman and Company, 1999, 0-7167-3268-8.23. Oxender, D.L., Fox, F.C., Protein Engineering, Alan R. Liss, Inc., 1987, 0-8451-4300-X.24. Crieghton, T.E., Protein Structure: A Practical Approach, 2nd Edition, Oxford University Press, 1997, 0-19-963618-4.25. Crieghton, T.E., Proteins: Structures and Molecular Properties, 2nd Edition, W.H. Freeman and Company, 1993, 0-7167-2317-4.26. Wüthrich, K., NMR of Proteins and Nucleic Acids, John Wiley and Sons, 1986, 0-471-82893-9.27. Journals focused on the subject of protein chemistry: Journal of Protein Chemistry; Protein Science; Biochemistry; Journal of Biological Chemistry; Biomacromolecules28. Catalogues! Promega Protein Guide: Tips and Techniques; Pierce Products; Biorad Life Science Research Products
© 2006, Alpay Taralp, Sabanci University
The way to operate -True or false? A student of protein chemistry…
1. buys the best possible instrument and then tries to force the problems of protein chemistry to suit the use of the machine.
2. works in a problem-oriented manner in which experience and knowledge are adopted to accommodate available machines.
3. relies first on imagination, then knowledge, then machines (Consider the contrast between H. Noyrath vrs. B. Hartley). What was one of Einstein's quotations?
4. believes that protein investigation is as simple and amusing as watching Indiana Jones running away from a band of sword-wielding bandits (The Okum's Razor argument).
5. should use all his/her time reading primary references and never use his/her own ideas, intuitions or beliefs
6. gives more credit to the ideas of a supervisor than to their own ideas© 2006, Alpay Taralp, Sabanci University
To study proteins is to study diversity! i.e., diversity of structure, function, chemistry, analysis, etc.To emphasize the scope of diversity, let us focus on structural diversity...
• Structure is a shape, sequence, order, orientation, configuration, etc. of an atom or molecule.
• Eg. The electronic structure of carbon is 1s22s22p2.
• Eg. CCl4 has a tetrahedral shape.
• Eg. The primary structure of insulin begins with:
• Eg. The tertiary structure of cytochrome c is globular.
Cl
CCl
ClCl
Diversity of structure: Static vs. dynamic• Structure (an other traits) may be static (fixed)
or dynamic (changing) over time.
• The time frame of structural change may be very long (the half life of 238U is 4.5x109 years) or very brief (a 10 fs chemical interaction)
Ra226U238I-CH3-OH
( shorter) Life-time of event (longer )
We must characterize structural diversity to understand proteins
• Question 1: Can you see the 3-D shape of myoglobin with your eyes?
• Question 2: Can you live 4.5 billion years to see ½ of the 238U decay?
• Question 3: Can you react quickly enough to measure a chemical interaction?
The answer to each is NO! >> So we must use machines...Why? Reason one: We are limited by resolving power…If our information carrier is visible light, we are limited to an approximate resolution of 0.2m. Details smaller than 0.2m are lost.
Look Wilma, whata nice smoothsurface!! Light
you see
actual surface
500nm
Reason two: The event is often faster than the speed of the measurement…You obtain a blurred average.
• Eg. Try photographing chicks in a bowl
a.
b.
c.
Nature features an invisible world of details & diversity. Instruments
allow us to see these details…
Balloons pierced with a bullet
Dynamic changes along an aqueous surface: Droplets captured in motion
A Look at Diverse Protein Structures1. Protein structure is not rigid!2. Protein structure bears many aspects
Proteins are generally made from20 types of amino acids, which are:
>> Linked by amide bonds (rarely: ester bonds, Ser/Thr; thioester bonds, Cys)
>> Bridged via –S-S- groups or the desmosine group, a 4-lys crosslinker
>> Enzymatically processed: hydroxylated, formylated, phosphorylated, glycosylated,amidated, sulfonated, acetylated, methylated, hydrolyzed, etc.
>> Associated to non-proteins, e.g., WATER, heme groups, etc.
S S
© 2006, Alpay Taralp, Sabanci University
H2N
O
OH
R HI am dynamic!
The peptide bond (like formamide, below & above right) is stabilized by resonance: 60% amide and 40% hydroxyimine character
What else do we observe?All 6 atoms lie in the same plane, i.e., the peptide bond is planar.-electrons are distributed over the C-O and C-N bonds.The C-N bond is 10% shorter than a normal C-N bond.The peptide bond is trans.
The peptide bond has a permanent dipole ( = 3.7D)
H2N
O
OH
R H
O
CH N
H
H
O
CH N
H
H
O
CH N
H
H
O
CH N
H
H
-
+
O
CH N
H
H
Linking the building blocks: Stereoelectronic properties of the peptide bond
H2N
O
OH
R H
CN C
C
O
H
CN
+C
C
O-
H
© 2006, Alpay Taralp, Sabanci University
Protein diversity is enabled by linking diverse building
blocks! Stereoelectronic differences of common amino acid residues:
© 2006, Alpay Taralp, Sabanci University
Amino Acid Let. Codes MW Surface Ǻ2 Volume Ǻ3 pKa, Side,25°C pI, 25°C Sol., g/100g Crys. g/ml
Alanine ALA A 71.09 115 88.6 - 6.107 16.65 1.401
Arginine ARG R 156.19 225 173.4 ~12 10.76 15 1.1
AsparticAcid ASP D 115.09 150 111.1 4.5 2.98 0.778 1.66
Asparagine ASN N 114.11 160 114.1 - - 3.53 1.54
Cysteine CYS C 103.15 135 108.5 9.1-9.5 5.02 v. high -
GlutamicAcid GLU E 129.12 190 138.4 4.6 3.08 0.864 1.460
Glutamine GLN Q 128.14 180 143.8 - - 2.5 -
Glycine GLY G 57.05 75 60.1 - 6.064 24.99 1.607
Histidine HIS H 137.14 195 153.2 6.2 7.64 4.19 -
Isoleucine ILE I 113.16 175 166.7 - 6.038 4.117 -
Leucine LEU L 113.16 170 166.7 - 6.036 2.426 1.191
Lysine LYS K 128.17 200 168.6 10.4 9.47 v. high -
Methionine MET M 131.19 185 162.9 - 5.74 3.381 1.340
Phenylalanine PHE F 147.18 210 189.9 - 5.91 2.965 -
Proline PRO P 97.12 145 112.7 - 6.3 162.3 -
Serine SER S 87.08 115 89.0 - 5.68 5.023 1.537
Threonine THR T 101.11 140 116.1 - - v. high -
Tryptophan TRP W 186.12 255 227.8 - 5.88 1.136 -
Tyrosine TYR Y 163.18 230 193.6 9.7 5.63 0.0453 1.456
Valine VAL V 99.14 155 140.0 - 6.002 8.85 1.230
Different building blocks have stereoelectronic differences: Some are more similar than others.
© 2006, Alpay Taralp, Sabanci University
Residues joined by solid lines may be replaced with 95% confidence
The “20” amino Acids Non-polar amino acids
Charged acidic amino acids
Charged basic amino acids
Polar uncharged amino acids
Selenocysteine Pyrrolysine Selenomethionine 4-Hydroxyproline -Carboxyglutamic acid
5-Hydroxylysine
Stop codon + special tRNA
Postsynthetic
#21 #22
EE
EEE
E
E E*
E
E*
E*
© 2006, Alpay Taralp, Sabanci University
Amino Acidsin 55 Proteins
SEA >30 Å2
30 > SEA >10 Å2
SEA <10 Å2
Abs. & % nonpolar surface of residues vs. total Å2
estimatedEffecthydrophobic following residue (L) or side-chain burial (R) [kcal/mol]
Glutamic acid 0.93 0.03 0.04 69 (36%) vs. 190 1.73 0.5
Lysine 0.93 0.05 0.02 122 (61%) vs. 200 3.05 1.9
Arginine 0.84 0.11 0.05 89 (40%) vs. of 225 2.23 1.1
Asparagine 0.82 0.08 0.10 42 (26%) vs. 160 1.05 -0.1
Aspartic acid 0.81 0.10 0.09 45 (30%) vs. 150 1.13 -0.1
Glutamine 0.81 0.09 0.10 66 (37%) vs. 180 1.65 0.5
Proline 0.78 0.09 0.13 124 (86%) vs. 145 3.10 1.9
Threonine 0.71 0.13 0.16 90 (64%) vs. 140 2.25 1.1
Serine 0.70 0.10 0.20 56 (49%) vs. 115 1.40 0.2
Tyrosine 0.67 0.13 0.20 38+116 (67%) vs. 230 2.81 1.6
Histidine 0.66 0.15 0.19 43+86 (66%) vs. 195 2.45 1.3
Glycine 0.51 0.13 0.36 47 (63%) vs. 75 1.18 0.0
Tryptophan 0.49 0.07 0.44 37+199 (93%) vs. 255 4.11 2.9
Alanine 0.48 0.17 0.35 86 (75%) vs. 115 2.15 1.0
Methionine 0.44 0.36 0.20 137 (74%) vs. 185 3.43 2.3
Phenylalanine 0.42 0.16 0.42 39+155 (92%) vs. 210 3.46 2.3
Leucine 0.41 0.10 0.49 164 (96%) vs. 170 4.10 2.9
Valine 0.40 0.10 0.50 135 (87%) vs. 155 3.38 2.2
Isoleucine 0.39 0.14 0.47 155 (89%) vs. 175 3.88 2.7
Cysteine 0.32 0.14 0.54 48 (36%) vs. 135 1.20 0.0
Posttranslational modifications increase protein structural diversity
General:Proteolysis | Racemization | N-O acyl shift | N-S acyl shift | Other enzymatic processing:
N-terminus: Acetylation | Formylation | Myristoylation | Pyroglutamate
C-terminus: Amidation | Glycosyl phosphatidylinositol (GPI)
Lysine:Methylation | Acetylation | Hydroxylation | Ubiquitination | SUMOylation | Desmosine
Cysteine: Disulfide bond | Prenylation | Palmitoylation
Serine/Threonine: Phosphorylation | Glycosylation
Tyrosine: Phosphorylation | Sulfonation
Asparagine: Deamidation | Glycosylation
Aspartate: Succinimide formation
Glutamate: Carboxylation
Arginine: Citrullination | Methylation
Proline: Hydroxylation © 2006, Alpay Taralp, Sabanci University
Bonding Diversity: Factors Determining Protein Structure & Stability
Physico-chemical properties of the amino acid side chains determine the folded conformation
Evidence shows that the amino acid sequence of most proteins contains all the information to arrive at the folded conformation.
Assume each amino acid adopts 2 conformations in a 250-unit chain – We obtain 2250 ≈ 1075 conformations.
Steric constraints reduce the number, however, a very large number of conformations is still possible.
The main factors, which cause a long polypeptide chain to fold into stable conformation are:
Hydrophobic interactions among amino acid side-chains
Hydrogen bonding
Ionic interactions
Dipolar-dipolar interactions and hydrophilic interactions, dipolar interactions, quadrupolar interactions
© 2006, Alpay Taralp, Sabanci University
Diversity of Protein Structural Elements: Basic Structural Hiearchy
1. Primary structure: The exact specification of atomic composition and the chemical bonds connecting those atoms, including stereochemistry. (i.e., L-amino acid sequence, disulphide bridges, other postsynthetic modifications, e.g., insulin A & B chains; chymotrypsin A, B & C chains)
2. Secondary structure: Regular arrangment of the backbone polypeptide without reference to side-chain types or conformation. The secondary structure is usually held by H-bonds (e.g., helix, sheets, random coils)
3. Tertiary structure: 3-D arrangement of polypeptide backbone and amino acid side-chains (e.g., lysozyme). Domain structure: compactly folded units
4. Quaternary structure: Noncovalent association of folded protein subunits (e.g., haemoglobin)
>> Most enzymes: Globular shape, with hydrophobic interior & hydrophilic exterior
© 2006, Alpay Taralp, Sabanci University
So are protein physical traits diverse?Compare keratin versus collagen versus albumin (all from the same 20 amino acid types)
How do we draw protein 3-D structure?Space filling, stick/skeletal (backbone only, sometimes labeled) and ribbon/ schematic models:Show helices (coils), strands (arrows) & random structure
Note: Proteins are made not only using amino acid components – you must also consider water, metal ions, carbohydrates, lipids, porphorin rings, cofactors, etc. © 2006, Alpay Taralp, Sabanci University
Diversity of protein function
Q: What is protein function?
A: Function describes a signal transduction
a. chemical-mechanical; muscles;
b. chemical-chemical; metabolism;
c. chemical-electrical; nerve transmissions;
d. photochemical; vision & photosynthesis;
e. transport; active & passive transport;
f. defense - antibodies & blood clotting
© 2006, Alpay Taralp, Sabanci University
1. Enzymatic proteins: Proteinases, lipases, epimerases, kinases, polymerases...Proteins, which transduce chemical to chemical signals
Note - Proteins are not just enzymes – antibodies, connective tissue (collagen), fluid media, transportation vehicles (Haemoglobin, serum albumin), buffers (serum albumin), signal transducers (rhodopsin), etc.
2. Cytoskeleton – Actin (muscle), Tubulin (cell motility), Intermediate filaments (mechanical protection near membranes and cells subjected to stresses), Spectrin (cytoskeletal protein, particularly found in erythrocytes)
3. Human Plasma – Albumin (osmotic regulation, buffering, transport), -Globulins (transport),‑Globulins (iron transport {transferrin}, histocompatibility antigen {-Microglobulin}), ‑Globulins Antibodies, Fibrinogen (proteolised by thrombin to form fibrin clot), Complement A (11 different protein types working to complement the immune system)
4. Extracellular Matrix – Glycosaminoglycans (hydrated gels), Proteoglycans (long glycosaminoglycans linked to a core protein), Collagen (extracellular matrix; Type I-III tissue supporting fibrils, Type IV laminar network), Elastin (random coil protein gives elasticity to tissues), Fibronectin (cell adhesion), Integrin (integral membrane proteins, also adhesion of cells to extracellular matrix)
© 2006, Alpay Taralp, Sabanci University
Classes of Protein According to Function
5. Digestive Enzymes of Digestive Tract – Amylase (starch to disaccharides), Pepsin, Trypsin, Chymotrypsin (proteins to large peptides), Peptidases (large peptides to small peptides; small peptides to amino acids), Lipases (lipids to fatty acids and glycerol), Ribonuclease (RNA into oligonucleotides), Disaccharidases (disaccharides to monosaccharides)
6. Cytosol Proteins (300-1000 types) – Synthesis of most small molecules, proteins, carbohydrates & lipids of cell
7. Nuclear Proteins – Histones (5, complex to DNA to make chromosomes), Nucleic Acid polymerising enzymes (5-10, used in DNA and RNA synthesis)
8. Mitochondrial & Chloroplast Proteins (300-1000) – Energy production from metabolites or light
9. Endoplasmic Reticulum & Golgi Apparatus Proteins (50-200) - Protein modification, oligosaccharide and lipid synthesis
10. Lysosome & Peroxisome Proteins (300-1000) – Degradation processes of undesirable compounds
11. Plasma Membrane Proteins (100-500) – Transport across membranes, transmission of important metabolic signals across plasma membrane
© 2006, Alpay Taralp, Sabanci University
Diversity of protein physico-chemical traits:>> Diversity among proteins is high but not “random”>> Structure/construction and function are related>> Some 1˚, 2˚ & 3˚ features are retained among proteins of similar function
• Global shape and morphology: Round, tight, loose, fibrous, skinny, crystalline
• Local function-related structures: Active site, receptor site, allosteric regions, catalytic residues
• Solubility: Highly variable• pI: Highly variable• pH stability: Highly variable• Tolerance to other environmental factors: Highly variable
Understanding protein structure, protein function, and their relationships are the central problems of protein science. The rules that govern structure-function relationships are simple
Nature is presumed to provide simple solutions.The challenge is to ask the right questions.
© 2006, Alpay Taralp, Sabanci University
What is protein chemistry?Area of science related to:1. Obtaining/purifying protein,2. Investigating protein structure & function, and3. Controlling and engineering proteins
© 2006, Alpay Taralp, Sabanci University
Classic emphasis
Current emphasis
Protein chemistry contributes to the following subject areas:1. Biochemistry, Biotechniques & Bioengineering2. Analytical Chemistry and Spectroscopy 3. Surface and Colloid Science4. Clinical Chemistry5. Polymer Science6. Medicinal and Pharmaceutical Chemistry7. Organic Chemistry
Why is protein chemistry highly interdisciplinary? Protein chemistry has developed together with analytical methods such as sequencing, X–ray, NMR structure determination and site–directed mutagenesis.
Protein chemistry is useful to whom?Researchers, professionals and students in various areas of specialization:
Protein chemists, molecular biologists, materials scientists, enzymologists, clinicians, analytical chemists, biophysicists and industrial scientists
© 2006, Alpay Taralp, Sabanci University
E.g.: Protein chemists help X-ray crystallographers & genetic engineers:
Protein chemist X-ray crystallographer•Purifies 1g protein •Attempts crystallization•Chemically modifies to aid crystallization •Obtains diffraction patterns
or to form heavy atom derivatives, •Uses heavy atomwhich aid the phase problem derivatives to solve
structures at 3-4Ǻ
Protein chemist Genetic engineer•Purifies 1mg protein •Synthesizes oligonucleotides•Sequences peptides •Screens the gene-bank•Compares peptides & sequence codes •Sequences DNA of insects•Probes posttrans processing by FabMS •Constructs expression vector•Prepares antibodies •Screens using western blots•Develops protocols to purify•Compares properties of wild-type & mutant
© 2006, Alpay Taralp, Sabanci University
One of the most common and often ambitious experiments in protein chemistry is the structure-function study
I.e., How does structure perturb function? How does function define structure?e.g. Consider the pKa of active-site thiols in cysteine proteases
Structure-function experiments: probe the interdependence of structure & function in proteins; generally reflect elements of both structure & function:
Examples along this continuum:1. One end – X-ray; Emphasizes analysis of structure2. Middle ground - pH titration of protein groups, showing hysteresis; Reflects
elements of structure and function substantially3. 2nd end – Bioassay; Weighted toward functional assessment
© 2006, Alpay Taralp, Sabanci University
a S-F study ?
continuumPure S study Pure F study
How do S-F studies work? How would you learn about a system that you cannot see? You interact with the system & note the consequences.
If you walk into an icicle, your “initial state” becomes altered. Your “final state” indicates something sharp. Thus, any change in you during the interaction can probe structure.
stationaryglacier
movingglacier
happyfurry animal+ achorn
panickedfurry animal +achornREGION OF INTERACTION
Interaction 1
Interaction 2
Structure-function studies use:physical measurements (usually spectroscopic) and/or chemical protocols (usually covalent modification)
Physical methods:Generally nonintrusive, require more protein, performed in water or water-free state.
Chemical methods: Generally intrusive, may be destructive. Potentially very sensitive, performed in water, organic solvent or dry state.
Some physical methods to assess structure & functionDiffraction: X-ray, neutron diffraction Spectroscopies: Infrared, ultraviolet, Raman, optical rotary dispersion, circular dichroism, NMR, esr (principle is to infer structure by perturbing light)Thermal analysis: MicrocalorimetrySpectrometry: Mass analysisIn silicio: Computer modelingOther: Electrophoresis, hydrodynamic techniques, chromatographies
Typical outputs:Composition and secondary structure, quantification, folding energies (spectroscopies)Identifying/purifying biological materials by exploiting adsorption, isoelectric point, size/mass, affinity, etc. (chromatography & electrophoresis)Unfolding enthalpies of protein (microcalorimetry)3-D "Static" structure (X-ray, neutron diffraction) 3-D dynamic structure, kinetic folding, association constants, etc. (NMR)Local environment of coordinated metal ions (Mossbauer spec.)
© 2006, Alpay Taralp, Sabanci University
Some chemical methods to assess structure & function
Titration studies: nature & number of ionizable groups.
Proteolysis in vitro: Limited proteolysis to elucidate the structural motifs of protein.
Kinetic studies: Applied to any protein, but mainly enzymes.
Classic chemical modification: Used to identify important residues. E.g., acetylation of chymotrypsinogen vs. chymotrypsin showed the role of Ile16.
Competitive labeling: Very sensitive and powerful. Reports on individual residue pKa values, structural information such as accessibility of groups, and stereoelectronic perturbations of a group. E.g., the surface reactivities and pKa values of the 12S subunit of a native 50-protein ribosome complex was characterized.
Site-directed mutagenesis: Reports on the role of specific groups. All groups can be investigated. SDM is complementary to chemical modification. Using SDM, the role of active site groups of barnase on stability and catalysis were quantified.
© 2006, Alpay Taralp, Sabanci University
In a typical study of a poorly characterized protein...1. Physical & chemical methods to purify protein and to
analyze protein structure (some early examples): Dialysis and gel filtration, column chromatography of proteins Zone electrophoresis of proteins Estimation of protein and amino acid content Paper chromatography of amino acids and peptides High-V paper electrophoresis of amino acids and peptides Ion-exchange chromatography of amino acids and peptides Disulphide bond mapping Urea unfolding and stability tests Selective cleavage of peptide chains N-terminal sequence determination C-terminal sequence determination X-ray and later CD and NMR structures (with/without incipients)
2. Physical & chemical methods to analyze protein function Bioassays (enzyme kinetics, receptor-hormone, protein adsorption, cell
adhesion to protein layers) Comparative studies give insight to the S-F relationship!
continuumPure S study Pure F study
© 2006, Alpay Taralp, Sabanci University
A Review of Protein Structure-Function at Play: Enzyme
Strategies to Accelerate Rates
© 2006, Alpay Taralp, Sabanci University
An enzyme will not “reduce” the activation energy of a pathway! Like all catalysts, an enzyme will permit the reactants to follow an alternate, low–energy pathway.
CO → CO2
The alternative pathway reflects a new mechanism. Here, it proceeds via 2 or more intermediate steps.
2CO + O2
2CO2
Enzymes use similar tricks as non-enzymatic catalysts: E.g., bases, acids, metal surfaces, etc., PLUS some extra tricks, which are unique to its structure
overly simplified more correct
Enzymes in the Protein Family: Properties1. Monomeric or oligomeric or exist as part of a multienzyme complex2. Often require non-protein components (co-factors) for catalytic activity – activator eg. metal ion, co-enzyme, prosthetic group3. Efficient catalysts4. High Specificity5. High Stereospecificity6. Very sensitive to pH, temperature, dielectric (salts, solvent)
Industrial Uses of EnzymesTextile Industry – Cellulase for cottonDetergent Industry – Lipases and Carbohydrases for stainsFood Industry – Isomerase of glucose to fructose; lactase for lactose intolerant peopleOrganic Synthesis – Penicillin acylase; amino acid synthesis
1a. Oxidoreductases (all redox reactions) eg. Alcohol to aldehyde – catalysed by NAD oxidoreductase, aka alcohol dehydrogenase (plus NAD+ cofactor NADH)
1b. Transferases (transfer of methyl groups, glycosyl groups, phosphate groups, etc.) eg. creatine to phosphocreatine – catalysed by creatine phosphotransferase aka creatine kinase (plus ATP ADP)
1c. Hydrolases (hydrolytic cleavage of ester, amide and glycoside bonds by insertion of water) eg. glucose-6-phosphate to glucose plus phosphate – catalysed by glucose-6-phosphate phosphohydrolase, aka glucose-6-phosphatase
1d. Lyases (cleavage of bonds by mechanisms other than hydrolysis or oxidation; carbon-carbon lyases, carbon-oxygen lyases, carbon-sulfur lyases) eg. L-histidine to histamine plus carbon dioxide – catalysed by histidine decarboxylase
1e. Isomerases (racemizations, epimerizations, cis-trans isomerization) eg. D-ribulose-5-phosphate to D-xylulose-5-phosphate – catalysed by D-ribulose-5-phosphate 3-epimerase aka phosphoribuloepimerase
1f. Ligases (condensation of two different molecules at a new C-O or C-S bond, but coupled to the breaking of ATP) eg. L-tyrosine plus tRNA plus ATP to give L-tyrosyl-tRNA plus pyrophosphate – catalysed by L-tyrosyl-tRNA ligase aka lyrosyl-tRNA synthetase
© 2006, Alpay Taralp, Sabanci University
Mechanism & Strategies of Rate Acceleration
in Enzymes General questions 1) Why are enzymes such efficient catalysts? 2) Which factors typically affect enzyme performance?
Binding: Unproductive binding, competing substrates, competing products, competitive inhibition, uncompetitive inhibition, and noncompetitive inhibition
Temperature Ionic strength, pH value and other environmental factorsLocal diffusion and convection
3) Why have proteins been selected as catalysts in biological systems?
4) How large do enzymes have to be?© 2006, Alpay Taralp, Sabanci University
Quantifying enzyme rates
Q: Why do we study enzyme activity?A: Enzyme kinetics probes protein structure and function in
general.Enzymes are proteins evolved with a natural marker of structure &
function.
Q: What are some parameters to characterize enzymes?A: Enzyme Units (historically)
EU/mg protein (specific activity)Ks (Binding constant)
KM (Michaelis constant)
kcat (turnover number/catalytic constant)
kcat/KM (specificity constant, or pH activity for kcat/KM versus pH)
Ki (inhibition constant: competative, uncompetative, noncompetative)© 2006, Alpay Taralp, Sabanci University
Means of Quantification: Measure a change of S→P over time, many techniques
Rate measurements: Rate of formation of product or removal of reactant as amount/time e.g., M/s, mole/s, vol/s, g/ml/s, etc.
Try to measure these slopes!
Q: What do we call these measurements?A: Initial rates! Acquire data within a few minutes & within 1-5 mole% S conversion.
Q: Why measure initial rates?
Forward rate, S → P, has no interference:
1. No product inhibition is possible;
2. No reverse reaction is possible;
3. Enzyme instability is less of a concern; and
4. Be safe - Enzyme reaction models are more complex than ordinary kinetics: Invite errors when extrapolating non-initial rate data
Let us examine how the above theory has originated...
Historically Early studies (1895-1913) on the rates of the enzyme-catalyzed
reactions gave the following observations: 1. At constant substrate concentration, the rate of reaction was directly
proportional to the enzyme concentration. 2. At constant enzyme concentration:
a. The reaction rate was independent of substrate concentration. b. The reaction rate was directly proportional to the substrate concentration. c. The reaction rate was fractional with respect to substrate concentration,
with a value between zero and one.
In 1913, Michaelis & Menten proposed a scheme to account for the above observations: Enzyme only acts upon bound substrate, i.e., E & S must initially form a
complex, held together by physical forces. Assumptions:
E and S are equilibrated with ES, i.e., kcat << k-1
Breakdown of ES is 1st order so rate [ES] i.e. rate = kcat[ES]
Rate of reverse reaction is zero
E + S ES E + PKS
kcat
So rate = (kcat[E]o[S]o)/(Ks+[S]o) © 2006, Alpay Taralp, Sabanci University
Briggs and Haldane revised the mechanismThey assumed that k2 was significant in comparison to k-1
(not an equilibrium, rather a steady-state).They set d[ES]/dt = zero to obtain a rate formula. The “new” M-M equation has the same form as the
original! Why? Equilibrium is a special case of the steady state treatment, k2 << k-1.
How does KM vary amongst the two models? KM is either (k-1+k2)/k1 or KM ≈ KS = k-1/k1 (in the original M-M model).
E + S ES E + Pk1
k-1
k2
E + S ES E + PKS
kcat
rate = (kcat[E]o[S]o)/(Ks+[S]o)
rate = (k2[E]o[S]o)/(KM+[S]o)
as k2 << k-1
© 2006, Alpay Taralp, Sabanci University
Q: What are enzyme assays & how are they performed? SP
The Assay Any method that detects a change of physical property versus time:
Manometry, polarimetry, viscometry, NMR, MS, spectrophotometry, spectrofluoromethry and pH-stat. What is one pre-condition? The physical property should vary in proportion to S or P.
Direct assays Alcohol dehydrogenase can be monitored as a function of NADH
formation. NADH is strongly absorbent at 340nm. Is a buffer used? Hydrolases can be monitored as a function of proton formation (standard
ester cleavage). Is a buffer used here? Coupled Assays
If S & P are similar they cannot be directly used to assay. To get around this problem, a more distinguishable end product is made.
Target: With alanine aminotransferase; alanine + -ketoglutarate → pyruvate + glutamate. Using pyruvate dehydrogenase; pyruvate + NADH → lactate + NAD+ (NAD+ absorbs at 260nm). The coupled reaction should be faster than the principle reaction. WHY??
rate = (kcat[E]o[S]o)/(KM+[S]o)
© 2006, Alpay Taralp, Sabanci University
Sampling AssaysS or P is withdrawn at specific time intervals & quantified, e.g., by colorimetry or radioisotopy.
Experimental Target of a M-M assayTo measure 3 parameters: KM, kcat & kcat/KM. Do these carry a physical meaning?
Q: How do we carry out a typical M-M experiment? A: Measure the initial rates as follows:
With substrate concentration at least 200-500x greater than total enzyme concentration , measure KM& kcat directly. Carry out these measurements at 3-4 different pH values. Measure the specificity (kcat/KM) directly at many pH values, using 0.1pH unit intervals (construct a pH activity curve); In choosing your parameters, S must be at least 20x less than KM. Why? What is the significance of a pH activity curve?Repeat any of the above experiments in the presence of inhibitors, different S, activators, different environments, etc.
Q: How does your experimental scenario compare to the true situation in biological systems? Is there a biological relevance? Why do we conduct experiments in this way? rate = (kcat[E]o[S]o)/(KM+[S]o)
Hanes-Wolf plot
Michaelis-Menten kinetics
Conc of S (mM)
Initial rate (mM/s)
= KM/(kcat[E]o)
A closer look at kinetic scenarios: Probing ionizable groups, which are important for binding and/or catalysis?
(6)
Sample math treatment for 3 (apparent) ionizable groups that are important for binding and/or catalysis
= 0
0
20000
40000
60000
80000
100000
120000
140000
3 4 5 6 7 8 9
kcat/Km
pH
The pH activity profiles of cathepsin B. The substrates are acetyl-Arg-Arg-Arg-AMC (+), acetyl-Val-Arg-Arg-AMC (◊) and benzyloxycarbonyl-Arg-Arg-AMC (―).
Real example!
Thermokinetic background related to protein analysisThermodynamics: G, H, S, equilibrium constant Keq
Kinetics:G≠, H≠, S≠ , kinetic rate constant k, kinetic rate theories
Origin: Position of G, H & S changes as system proceeds along reaction coordinate
Plan: To discuss the interrelation of these parameters and to focus on G≠ and G
© 2006, Alpay Taralp, Sabanci University
Please delinate the relative importance of thermodynamic and kinetic contributions in the following scenarios
1. The right reaction releases energy faster than the left reaction. Q: Which videoclip shows the more exothermic reaction? A: Inconclusive! We cannot compare the molar enthalpy change from the videos.
2. True or false? All exothermic reactions are thermodynamically spontaneous and all endothermic reactions are thermodynamically non-spontaneous.A: False!
3. True or false? All thermodynamically spontaneous reactions yield a reaction & all thermodynamically non-spontaneous reactions fail. A: False!
Put away your weapons of mass destruction...
4. The thermite reaction is highly exothermic, H <<<0, the entropy change, S, is relatively unimportant, and the Gibbs energy change is highly negative, G <<<0. The reaction is thermodynamically spontaneous.Q: Why must you add a fuse to start the reaction?
5. The process H2O(s)→H2O(l) is highly endothermic (H>>0) Below is the evidence. Explain.
Time = 0min
Time = 60min
6. The reactant, nitrogen triiodide-NH3, sits at a high Gibbs energy level. Its products rest at a much lower energy state.
You must apply a physical shock before Nitrogen triiodide-NH3 explodes. Why?
NI3.NH3(crystal) → NH3(g) + ½ N2(g) + 3/2I2(g)
7. Liquid nitrogen evaporates. The process is thermodynamically spontaneous, endothermic & proceeds quickly. Q: How might you explain these comments?
N2(l) → N2(g)
8. The dissolution of ammonium sulfate in water is endothermic and readily proceeds under ambient conditions. Explain.
(NH4)2SO4 + bulk H2Os → 2NH4+
(aq) + SO42-
(aq) + a few less bulk H2Os
Why all the confusion?? Reason 1: Many terms and reactivity modelsReason 2: Misleading termsReason 3: Separate GS & TS concepts in chemical processes
At equilibrium, Ssys + Ssur is max.
Gsys Asys
UH
U PV
TSTS
kinetic Emic
potential Emic
rot vibrtransla
intermol. inter.
intramol. inter.
Let us progress until wearrive at the commonmodel to understandproteins...
Early measurements of U examined the link between enthalpy changes (H ≈ differences of bond energies) and reactivity
Why shouldn’t you predict reactivity using H?
A: H reports on the initial & final Ground States (GS) but not on the pathway (mechanism)
(There are other reasons too)
A
B
H
B
A
Reaction coordinate Reaction coordinate
H < 0 H > 0
Hfinal
HinitialExothermic Endothermic
What are some disadvantages of: the collision model? using Ea to predict reactivity?
Collision model: A kinetic view. Consider a potential barrier, Ea, between A & B. Rate const is kA→B = Ae-Ea/RT. (Later, A = Z)
A
B
U
Reactants collide with speed & good orientation.
In non-gases: PotE ≈ U, as (PotEf - PotEi) ≈ Uf - Ui
U ≈ H, as H = U + (PV)←very small
preexponential steric
Gibbs energy change: A way to explicitly incorporate entropy, S, to account for solvent effects, etc.
H-TSsys = Gsys
bomb calorimeter
most reactions
At equilibrium, Ssys + Ssur is max.
Gsys Asys
UH
U PV
TSTS
What is misleading by the term spontaneous?
G > 0G < 0
Spontaneity says nothing about energy barriers or chemical rates
Reaction CoordinateReactants Products
GibbsEnergy
Both processes arethermodynamicallyspontaneous
One is kinetically permitted,giving an observable rate,
and one is kinetically prohibitedby a high energy barrier
Reaction CoordinateReactants Products
Both processes arethermodynamicallynon-spontaneous
One is kinetically permitted,giving an observable rate,
and one is kinetically prohibitedby a high energy barrier, so we have 100% reactants
G≠
G
G≠
G
Transition State Theory: A kinetic element completes the Gibbs view.
State B
State A
GState A
State B
G‡
G‡
Reaction Coordinate
G
Changes of any state function are independent of path
State function
Not a state function
Problems with TS theory?
Reactant (A) proceeds through a high-energy transition state or activated complex to become product (B).
What is the weakness of predicting reactivity using only quantum tunneling?
Reaction Coordinate
Reactants Products
GibbsEnergy
Classic kineticbehavior
Tunneling (a 5A wavelength decays exponentially as it penetrates the barrier. If the barrier is not too long, R can reach the product sideof the hill without completely decaying away (i.e.,emerges on the product side with a non-zeroprobability density)
R,eg. +H P
O O-
HThe rate of proton transferoften has a significant tunneling component
5A
H
5A
Quantum tunneling kinetic view: The e- probability distribution of every particle is derived from a wave function
The above terms are related to large populationsCannot use TS theory to calculate the activity of “one” molecule or small groups of molecules, such as membrane proteins
Note: Reactions do tunnel, collision theory could apply
Let us examine a typical enzyme reaction...
G≠, H≠, S≠
G, H, SGA→B = HA→B – TSA→B
kA→B = (kBT/h) x e-G‡/RT
Keq = [B]b/[A]a = e-G/RT
To summarize: In protein systems, we assess thermodynamic & kinetic behavior in terms of G & TS theory (less use of Z or tunneling arguments)
G G
Uncatalyzed reaction Enzyme reaction
Reaction coordinate Reaction coordinate
S X+ P
E + S ES ES E + P=
ES≠
Enzymes lower G‡ (i.e., G‡ - GGS) in comparison to the uncatalyzed reaction
We shall simplify the notation even more...
Overall rxn is diffusion-controlled or rxn-controlled
Microscopic steps may be grouped intoPhysical binding (1st step; E + S → ES), and Chemical catalysis (2nd step; ES → ES‡ → E + P)
© 2006, Alpay Taralp, Sabanci University
G
Reaction Coordinate
Two models:TS lowering & GS elevation
E + P ESE + Sk1
k-1
k2
Dissect G‡ into enthalpic (H‡) and entropic (S‡) components:
k = (kBT/h) x e-G‡/RT can be written as k = (kBT/h) x eS‡/R x e-H‡/RT
where S‡ is the entropy of activation, Stransition state – Sground state and
H‡ is the enthalpy of activation, Htransition state – Hground state.
activationenergy offorwardprocess
GibbsEnergy
ProductsReactants
Reaction Coordinate
Ggs
GQ: How might you predict the free energy of activation, G‡?
Answer: Assess the enthalpic & entropic differences between:
1. reactant (ES at ground state) &
2: the activated complex (ES≠, at the transition state position).
A closer look at changing the position of G
E + P ESE + Sk1
k-1
k2
H
H
H
Reaction Coordinate
activationenthalpy oftwo forwardprocesses
HA
B
The enthalpy of activation , H‡, is always positive because bonds are being broken.
The entropy of activation, S‡, may or may not be favorable. Can you think of some examples?
Both parameters contribute to rate according to k = (kBT/h) x eS‡/R x e-H‡/RT
S
S
S
Reaction Coordinate
activationentropy oftwo forwardprocesses
S
A
Q: How might substrate-surrounding interactions affect the position of H?
(Hint: In solutions & solids, H ≈ U)
Reaction Coordinate
Polar solvent Nonpolar solvent
Enthalpy, H
Oil
Water
Q: If you ignore any entropic contribution, how might a change of H affect the Gibbs free energy position in solutions & solids?
Water
Oil
Gibbs Energy, G
Nonpolar solventPolar solvent
Reaction Coordinate
Reaction Coordinate
Reactants Products
GibbsEnergy
Equilibriumposition, before & after
Increasing theconcentration ofreactant
Q: What happens if you increase the chemical potential (i.e., the potential to do work) of a reactant?
Answer: Reaction is more spontaneous; equilibrium is even closer to the product side; transition state is reached earlier; activation energy is smaller; forward rate is greater
Q: Which principles do enzymes exploit to lower the position of the TS (& how)?
1. General acid catalysis, general base catalysis, electrostatic catalysis and electrophilic catalysis. All modes could stabilise charge accumulation in proceeding
from ground state, GS, to transition state, TS.
© 2006, Alpay Taralp, Sabanci University
Hydroxide
Enzyme
2. Covalent or nucleophilic catalysis. A covalent activated intermediate is formed, e.g., a ping-pong mechanism. The high-energy mechanism is broken into energetically less-demanding steps.
3. Neighboring charges, dipole moments & hydrophobic/dielectric considerations.
The enzyme environment enhances the reactivity of nucleophiles such as serine hydroxyl groups, cysteine thiol groups, etc.
N
O
H
OH
O
HN
H
H2O
N
O
H
O-Enz
HN
H
O
O
Enz
H2O
Enz O-
OH
O
G
Cys
S-
HNNH
His
+
Cys
SH
(aq)
pKa = 9pKa = 3
4. Pre-reduction of ground-state entropies.The basis of this strategy is to minimize the ground state freedom of the ES complex during the chemical transformation phase of a reaction. In this way, the ascent to the TS will not require a major loss of freedom.
Strategy 1. Orbital steering.
Strategy 2. Decrease the number of reaction participants in the chemical transformation phase.
N
O
HAA
N
O
H
N
O
H
A
versus
(aq)
LSS≠ << RSS≠
+ versus +
EnzEnz
≠≠
CH3CH2O- ICH2CH3 versus+
O-
I
Enz
5. Formation of low-barrier H-bonds.A normal H-bond in the GS may become a low-barier H-bond in the TS if the pKa value of the enzyme group & the activated complex (as ES≠) are matched.
1 normal H-bond ≈1-5 kcal/mole1 low-barrier H-bond ≈ 25-40kcal/mole
ONH
H3C
H Enz-O
S
GSTS
S
pKa = 10
pKa = 16
O EnzH
H
H O
+N
H3C
pKa = 10
6. Binding energy considerations.Enzyme binding groups interact non-covalently with substrate at all points along the reaction coordinate. The energy term H is variable along the reaction coordinate!
E.g. 1, GS shape of S perfectly matches enzyme site
E.g. 2, TS shape of S is much more complementary to enzyme than GS shape of S (for enzymes that behave according to the TS stabilization model of catalysis).
10 goodcontacts
+
TS??
ES
E + SG
+
2 goodcontacts
10 goodcontacts
ESE + S
G
ES=
E + P
Conclusion: Ground-state binding shouldn’t be “extremely specific”, as is often assumed
Shape of S transforms along the reaction coordinate
GS TS
EnzS
SEnz
GS
E + P
=ES
G
E + SES
+
2 goodcontacts
3 goodcontacts
5 goodcontacts
8 goodcontacts
10 goodcontacts
6 goodcontacts
TS reached
In a well-evolved enzyme-substrate interaction, we see an increase of binding energy stabilization in proceeding to the TS
SEnz
Every “good” interaction lowers the position of H, etc.
10 good contacts
2
3
6
8
Q: Can you see evidence of binding energy participation in the TS of amide bond hydrolysis?Hint: Look at the changes of KM & kcat
© 2006, Alpay Taralp, Sabanci University
At equilibrium, Ssys + Ssur is max.
Gsys Asys
UH
U PV
TSTS
kinetic Emic
potential Emic
rot vibrtransla
intermol. inter.
intramol. inter.
= E + PESESE + S EPphysicalbinding
chemicaltransformations
physicaldissociation
rate-determining transition
ES
E + S
G
=
E + PEP
ES (Hypothetical enzyme-catalyzed energy profile when binding energy is not considered, i.e., profile is analogous to non-enzymatic catalysis)
=
S (Uncatalyzed energy profile)=
Reaction Coordinate
2
10 good contacts
8
4
ES (Enzyme-catalyzed energy profile including binding energy contribution)
Summary slide
Increasing [Substrate]
Increasingreactionrate (v)observed
A
B
C
[S]o >> Km[S]o << Km
a ; va = (kcat/Km)[S]o[E]
b
c
Vmax = kcat[E]ova =
va = Vmax/2
[S] = Kma
a
a a
Reaction Coordinate Axes
IncreasingGibbs energyof freesubstrate& substratein enzymecomplexes
Substrate Product
Large [S]
S
ES
ES
forward rate const = kcat
C,D
A,B
A,C
forward rate const = kcat/Km
Small [S]
ProductSubstrate
S
ESa,c
ESb,dP
d
D
c,d
PB,D
kuncat
stabilization before binding energy consideration
(a)
(c)
(b)
ES++
a,bES++
ES++
ES++
kcatKm
a,b( ) kcatB
kcatKm
A,B( )
KmB,D
S->P
ES binding energies can be grouped into three cases:
Case 1 (not shown here): ES has very strong GS binding (shape complementarity is exceptionally good in the ground state). Examples: HormonesCase 2: ES has poor GS binding and strong TS binding. Examples include carbonic anhydrase, acetylcholine esterase and catalase. Case 3: Modest GS binding and modest TS binding. Examples – Metabolic enzymes
© 2006, Alpay Taralp, Sabanci University
Protein engineering to elucidate and improve stability
© 2006, Alpay Taralp, Sabanci University
Protein engineering has been used to investigate structure-activity & molecular recognition relationships to to make better protein productsQ: Why does Mankind wish to use proteins? Proteins accelerate chemical reactions Proteins form commercial products & improve other product properties Proteins enable novel processes
Some typical industrial applications: Bioreactors Textile treatment Medicinal and organic syntheses Protein drugs & drug delivery Biosensors Bioremediation Food preparation industries
Problems? Industrial constraints are often too demanding for native proteins. Consequences: Poor biological activity, short lifespan, limited reaction parameters, etc.
Q: What can we accomplish by using protein engineering? improve existing pharmaceutical proteins create superior high-value proteins with improved half-life create new proteins and pioneer new therapies improve desirable biological activities alter receptor specificity and binding activity reduce harmful side-effects and toxicities.
Improved Proteins
Redesigned Antibodies
Molecular Recognition
'synthesis'
'analysis'Structure-ActivityRelationships
Improved Enzymes
© 2006, Alpay Taralp, Sabanci University
The current focus of protein engineering: Formulating broad-scope protein preparations, which are:
Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes
Locating/purifying thermophiles, etc.
Geneticmanipulation
Low-techchemical strategies
Native
PROTEIN ENGINEERING
focus
© 2006, Alpay Taralp, Sabanci University
The current focus is genetic manipulation
Barnase: Superimposed Xray/NMR, schematic and ribbon sketch
Interacting residues are observed
Q: Can mutations probe the stability of the folded state?
A: All residue interactions contribute to protein stability. By mutating 1 residue of an interacting pair, the Gibbs contribution of that pair to protein stability is assessed.
Eu, E'u
Ef
E'f
wildtype
mutant E'i
Ei
E'
E
G
Reaction Coordinate
=
=
How might you measure the thermodynamic unfolding/folding energy change of barnase?
Trp'sinside Trp fluorescence
quenched
Folded Unfolded
Concentration of denaturant
Fluorescenceof Trp residues
Concentrationcorrespondingto 50% fluorescencequenching
x xx
x
x
xx
x x xwt
mutEu, E'u
Ef
E'f
wildtype
mutant E'i
Ei
E'
E
G
Reaction Coordinate
=
=
In principle, all interactions contribute to protein stability. H2OGUnfolding is the free energy change (calculated), which accompanies barnaseFolded → barnaseUnfolded in water. Here are some examples:
© 2006, Alpay Taralp, Sabanci University
Deleting one H-bonding partner where there is no access of water
Mutant [urea]1/2H2OGU GU
(in M) (kcal/mole)
wt 4.57 8.82 ----
TyrPhe78 3.88 7.68 1.35
SerAla913.58 6.41 1.93
Deleting one H-bonding partner where there is free access of water
Mutant GU
(kcal/mole)
SerAla31-0.14
TyrPhe103 0.00
Introducing a H-bonding residue in a place that contains no partner residue
Mutant Solvent access- GU
ible area (in Ǻ2) (kcal/mole)
ValThr10 0 2.48
ValThr89 0 2.55
ValThr45 43 2.44
ValThr36 70 1.15
ValThr55 93 0.60
Summary: Relative importance of types of interactions towards stability
H-bonds, no access to water → moderate
H-bonds, free access to water→ very small
Hydrophobic-hydrophobic interactions → very important, very abundant
(64 mutations) → >60kcal/mole destabilization energy© 2006, Alpay Taralp, Sabanci University
Destroying parts of buried or solvent-accessible hydrophobic residues
Mutant # methyl(ene) GU
groups < 6Ǻ (kcal/mole)
IleVal55 5 0.30
IleAla55 16 1.15
ValAla10 37 3.39
IleAla88 55 4.01
LeuAla14 62 4.32
Destroying a solvent-exposed ionic interaction between Asp8, Asp12 & Arg110
Mutant GU
(kcal/mole)
AspAla8 0.89
AspAla12 0.31
AspAla8 & AspAla12 0.80
Mutation studies have validated the importance of some interactions. Can we use site-directed mutagenesis to engineer proteins with enhanced stability?
Yes! Bridged mutants show resistance to denaturants & thermal stability!
© 2006, Alpay Taralp, Sabanci University
Mutant [Urea] GU
to unfold 50% (kcal/mole)
wt 8.8 0.0
AlaCys43 (–SH) 7.7 1.1SerCys80
AlaCys43 (–S-S-) 10.0 -1.2SerCys80
SerCys85 (–SH) 8.4 0.4HisCys102
SerCys85 (–S-S-) 12.9 -4.1HisCys102
Stability of Barnase double mutants
SDM may be aided by evolutionary clues left by Nature
With respect to Barnase, Binase has lost one amino acid (Gln2 → ) and has 17 different residues. The structure of Binase is slightly more stable than Barnase.
Hypothesis: Evolution may have selected some of the 17 amino acids because they promoted stability. If these amino acids are mutated into Barnase, the engineered Barnase may have higher stability...
© 2006, Alpay Taralp, Sabanci University
Strategy to improve the stability of Barnase: One by one, re-engineer the primary structure of
Barnase using each of the 17 residues of BinaseMeasure the conformational stability of the mutantDesign a “super” stable mutant using the information.
Mutation 50%Unfold Gu
(For comparison, wt barnase unfolds %50 in 8.8M urea)
Grand Results:
© 2006, Alpay Taralp, Sabanci University
Take-home message Effects of these individual mutations are remarkably
additive Normally cooperativity is observed
Explanation Binase and Barnase are slightly divergent on the
evolutionary tree Mutations are very conservative
Implications for any industrial enzyme such as Xylose Isomerase
Find a closely related thermophile Make individual mutations between them and determine
Gu
Choose the stabilizing mutations and create a multiple mutant, stable at high temperatures
© 2006, Alpay Taralp, Sabanci University
Directed evolution is a technique, which accelerates evolution. Evolution normally takes millions of years to produce an
improvement; accelerating the mutation process yields improvements in weeks.
One type of directed evolution is called molecular breedingDesired genetic trait obtained from a two-step process:
1. Genes are subjected to DNAShuffling, generating a diverse library of novel sequences (one or more genes are fragmented and recombined).
2. “Good” gene products are selected by screening. The good genes are subjected to more “shuffling” & screening until
the desired property is obtained
Other examples protein engineered via genetic manipulation have relied on the principle of directed evolution
Left image: Wild type green fluorescent protein gene in plants
Right image: Maxygen’s DNA shuffled green fluorescent protein gene in plants
© 2006, Alpay Taralp, Sabanci University
Protein Folding
© 2006, Alpay Taralp, Sabanci University
Folding of Barnase - Overview Elucidating rules, which govern the folded conformation of
proteins, is of theoretical interest and practical importance particularly since advances in recombinant DNA methods have enabled the design and synthesis of novel proteins.
Although many physico-chemical approaches have been employed, the mechanism of protein folding remains unclear.
An approach, which combines the technique of site-directed mutagenesis with the more classical physico-chemical techniques, has been employed to address this problem.
By altering specific side chains in a folded protein, it is possible to correlate the contributions of their interactions towards the overall stability of the protein. Thermodynamic relationships, specifically Bronsted relations, are employed in this treatment.
Barnase, a relatively small 110 amino acid, monomeric extracellular ribonuclease of Bacillus amyloquefaciens serves as a model protein for this study.
© 2006, Alpay Taralp, Sabanci University
Protein FoldingProtein folding is a large-scale
continuation of the conformational analysis problem
3 mutual gauche interactions 2 mutual gauche interactions
Less stable More stable
The New Challenge!!
© 2006, Alpay Taralp, Sabanci University
Protein structure is not rigid
1. Some native structures are more flexible and dynamic; some are tight, less dynamic and well protected
2. We note a correlation between protein flexibility and crystallizability
Article “What does it mean to be natively unfolded?”
Implications for NMR and Xray analysis?
Q: Why does a protein fold? A: Balance of enthalpic terms (non-covalent interactions) and
entropic terms (freedom decreases as conformation organizes)
Gibbs energy
Reaction coordinate
Eu = unfolded enzyme
Eı = intermediate
EF = folded enzyme
Typically,
GU→F = -5 to -15kcal/mole
© 2006, Alpay Taralp, Sabanci University
EU
EI
EF
RateDeterminingStep
G = H - TS
Q: Please estimate the conformational possibilities while a protein folds
A: In a 100 amino acid chain8 conformations eachup to 8100 conformations possible
Q: Is protein folding random? If 1011-1013 conformations are randomly adopted per
second → requiring years to fold! In fact, a protein folds while associated with
ChaperonRibosomeAlone
in msec-to-sec time scale A: Folding is clearly a directed process!
© 2006, Alpay Taralp, Sabanci University
Q: How might you define the mystery of protein folding A: How does the amino acid sequence
Direct folding? Determine the final conformation?
Q: How might you address the problem? A:
Obtain amino acid sequence (protein chemistry, DNA, molecular biology) Obtain 3-D structure (X-ray, NMR) Perturb the physico-chemical traits (Chemical modifications and site-directed
mutagenesis)
Q: Why the interest to understand protein folding? A:
Predict 3-D structure of any amino acid chain Novel enzyme design Improved industrial processes, e.g., a better xylose isomerase → € Treatment of protein related diseases, e.g., prion diseases (BSE, fatal familiar
insomnia, etc.)
© 2006, Alpay Taralp, Sabanci University
In the prion class of diseases, why does a misfolded protein lead to disaster?
Globular proteinnon-associatingsolubledegradable
Fibrous proteinaggregatingcrystallizinginsolubleaccumulating
In prion disease?
Crystal nucleation?
NormalState
PathologicalCondition
G
Reaction coordinate
Today’s focus is related to Prof. Alan Fersht’s work on the folding pathway of Barnase Barnase: RNA → ribonucleotides
References:Serrano, Day and Fersht (1993) J. Mol. Biol. 233, 305-312Fersht, Matouschek and Serrano (1992) J. Mol. Biol. 224, 771-782
Serrano, Matouschek and Fersht (1992) J. Mol. Biol. 224, 805-818 © 2006, Alpay Taralp, Sabanci University
O
RO
G
O HO
P
O
-O
O
R'
BH+
B - R'OH+HB
B
O
RO
G
OO
P
O-O
H
OH
B
BH+
O
RO
G
O HO
P
O
-O
O
H
+H2O
HN
N N
N
O
H2N P 0
N -1
P +1
N +1
P +2
N +2
H
Bond cleavage
Base 0 (guanine)
Sugar 0
Q: Why is Fersht’s work interesting?Approach uses powerful protein engineering
Novel proteins can be made, e.g., M. Smith, UBC
Data interpreted via thermodynamic treatmentLinear free energy relationships
Bronsted plots (Bronsted catalysis eqtn)
Q: Advantage of Fersht’s approach?Relates measureable data to specific non-
covalent interactions, which govern protein structure & function
Improved Enzymes
Redesigned Antibodies
Molecular Recognition
'synthesis'
'analysis'Structure-ActivityRelationships
Improved Enzymes
© 2006, Alpay Taralp, Sabanci University
Q: Why barnase as a model?Advantages:
Small monomeric proteinNo disulphide bridges or cis-prolines
No post-translational modificationsExcellent expression systems (wt → express;
mutants → express)© 2006, Alpay Taralp, Sabanci University
Ribbon structure of Barnase
© 2006, Alpay Taralp, Sabanci University
Schematic of Barnase
© 2006, Alpay Taralp, Sabanci University
Q: What was the experimental strategy? Choose a significant non-covalent interaction Make a subtle change to the interaction (e.g., Ser80 → Thr80; Ser85 → Thr85) Perform equilibrium/kinetic un/folding experiments and
compare the wt & mutant Rationale: All interactions contribute to protein stability - Some
form/break before the rate determining step of folding, whereas others form/break afterwards
© 2006, Alpay Taralp, Sabanci University
Superimposed X-ray and NMR backbone positions of Barnase
X-ray is used to identify interacting residues
X-ray, NMR, CD and bioassays are used to check correct folding of mutants
Eu, E'u
Ef
E'f
wildtype
mutant E'i
Ei
E'
E
G
Reaction Coordinate
=
=
To recap, how might you measure the thermodynamic unfolding/folding of barnase?
Concentration of denaturant
Fluorescenceof Trp residues
Concentrationcorrespondingto 50% fluorescencequenching
x xx
x
x
xx
x x x
Trp'sinside Trp fluorescence
quenched
Folded Unfolded
Fluorescenceof Trp residues
Time
Rapidly mixedBarase solution isspiked with acid
Shape analysisof curve yieldskinetic unfoldingconstants
Now, how might you measure the kinetic unfolding of barnase?
...and how might you measure the kinetic folding of barnase? Shape analysis
of curve yieldskinetic foldingconstants
Rapidly mixed Barasein 8M urea is diluted10x with aqueous buffer
Time
Fluorescenceof Trp residues
A closer look at the consequence of a mutation Any measured energy change within the protein is attributed to the mutation
You may compare (one at a time) the interactions of many neigboring groups within the protein
You can map interactions that do/don’t contribute to protein stability/folding
Q: What principle allows you to identify conformational energy changes from measurable mutation studies?
A: GU, GI, GF, G, etc. are changes of free energy upon mutation (where GA
is stateEAwt – stateE’Amutant).
Effect of a mutation varies along the reaction coordinate. E.g. if you compare
stateEAwt – stateE’Amutant, GU is very close to
zero, whereas GI, G & GF are larger.
The change of free energy upon mutation between 2 x-positions, e.g., from unfolded → → → → folded, is GF-GU.
© 2006, Alpay Taralp, Sabanci University
=
=
Reaction Coordinate
G
E
E'
Ei
E'imutantwildtype
E'f
Ef
Eu, E'u
GF
G=
GI
GU ≈ 0
Q: Is there a fundamental problem with the calculation of GF-GU? (GA is stateEA
wt – stateE’Amutant)
A: Yes! All vertical
equilibria are virtual!
Solution: Calculate instead the difference of free energy upon mutation, e.g., for unfolded → folded, we want GF-U, so measure GF-U
wt-GF-Umutant (= GF-GU!).
© 2006, Alpay Taralp, Sabanci University
EU
E'U
EI
E'I E'F
EF
=E' I
E I=
GFGIGU G I=
Q: What is the meaning of GF-U?
A: If X & Y interact and we mutate X → Z, then:
GF-U = GF(X...Y) + GF(X...E) + GF(Y...E) + GF(X...H2O) +
GF(Y...H2O) – G’F(Z...Y) – G’F(Z...E) – G’F(Y...E) – G’F(Z...H2O) –
G’F(Y...H2O) – G’F(E...H2O) – GU(X...H2O) – GU(Y...H2O) –
GU(E...H2O) + G’U(Z...H2O) + G’U(Y...H2O) + G’U(E...H2O) +
GU(reorg) – GF(reorg)
Once all the terms are considered (many cancel), we can say that GF-U ≈ stabilization energy!
Q: Why is this statement significant?
A: Stabilization energies probe transition states & intermediates © 2006, Alpay Taralp, Sabanci University
Q: What is the advantage of GF-U? A: All horizontal equilibria are measureable via a
thermodynamic equilibrium experiment, i.e., G = -RTlnK & a kinetic experiment, i.e., k = (kBT/h)e(-G/RT)
© 2006, Alpay Taralp, Sabanci University
GF-U
GI-U
G I -U=
GU GI GF
E I=
=E' I
EF
E'FE'I
EI
E'U
EU
G I=
E’I
To compare thethermodynamics offolding: GF-U = -RTln(50%urea’/50%urea)Convention: G < 0 if E’F is more stable
Bronsted catalysis equation Please recall logk = logK + C
0 < < 1: So what does describe?
For a “series” of 2 related compounds (e.g., wt & mutant) you may write logk = logK
Another way to write logk = logK + C isG = Geq + D. If we blend this logic, we
obtain: = G/Geq for wt & mutant We may define for each wt/mut pair the following:
Aunfol = GA-F/GU-F NOTE – approximates is not
Afol = GA-U/GF-U but is equated to when = 0 or 1
logk
logK
x
xx x x
xx
© 2006, Alpay Taralp, Sabanci University
We begin by probing the ratedetermining TS of unfolding (easier)
Aunfol = GA-F/GU-F
Each point (x,y) describes an energy change (GU-F, G-F) due to a mutation of barnase. E.g., Val→Ala gives Ala:Phe
Is there a patterned change of stabilization energies
upon mutation of interacting pairs?
G-F; GU-F
© 2006, Alpay Taralp, Sabanci University
Q: Let us quantify the mutant pair interactions as barnase unfolds.
unfol = G-F/GU-F
unfol = G-F/GU-F
= 2.60/2.32 = 1.06 ≈ 1
Therefore, the Val:Phe interaction
in the folded state was broken in the
TS of unfolding!
© 2006, Alpay Taralp, Sabanci University
Equilibrium measurements GU-F = 2.32 kcal/mole
Kinetic measurements
G≠-F = 2.60 kcal/mole
Q: Why must this kinetic data reflect the rate determining transition state of unfolding?A: All subsequent steps are kinetically unimportant
High [Urea]
Unfolding
For each mutation:find G-F using a kineticexperiment (urea/acid pulse) &find GU-F using anequilibrium unfoldingexperiment (urea to denature50%)The rest is easy!unfol = G-F/GU-F© 2006, Alpay Taralp, Sabanci University
The state of interactions in the transition state of Barnase with respect to the folded state
© 2006, Alpay Taralp, Sabanci University
Q: What is the general picture of the TS? Native-like, compact Some tertiary interactions lost Majority of 2˚ structure preserved Core1: weakened, esp. at Nterm 1
Core2: completely disrupted
Core3: fully intact
If we were to evaluate the unfolding pathway: 1st events: 3 of 5 loops unfold, Nterm of helices melt,
core1 weakens and core2 is destroyed; the remaining structure is disrupted later
© 2006, Alpay Taralp, Sabanci University
Low [Urea]
Refolding
Refolding experiments
© 2006, Alpay Taralp, Sabanci University
Rationale:Under refolding conditions:
EU → EI is faster than EI → EF;Thus, your measurements can examine EI → EF
Probing the rate determining TS of folding and an intermediate
For each mutation:Use the appropriate kinetic experiment (dilution of urea) &equilibrium foldingexperiment (urea) to probe the I state and TS of folding!
The state of interactions in the intermediate & transition state of Barnase with respect to the unfolded state; the last column shows the unfolding pathway for comparison
© 2006, Alpay Taralp, Sabanci University
What can be said about the folding process?
© 2006, Alpay Taralp, Sabanci University
© 2006, Alpay Taralp, Sabanci University
© 2006, Alpay Taralp, Sabanci University
Q: What can be said about folding? Correlation between hydrophobic burial and early events All early regions inteact with sheet Early burial is hydrophobic, extensive and nucleated Late processes only interact slightly with sheet; no
core nucleation; some hydrophillic burial is noted
General statements: The refolding pathway is at least partially sequential 2˚ structure formation leads to local hydrophobic burial
and precedes most 3˚ interactions Consolidation of structure is gradual and earliest for 2˚
structure elements
© 2006, Alpay Taralp, Sabanci University
Concluding Remarks. When generalized to small globular proteins, folding proceeds by nucleation, i.e., local hydro-phobic collapse of core elements, followed by consolidation of hydrophilic interactions & 3º structural domains
A Closer Look at Chemical Modification of Protein Monomers in Aqueous (and
now also Organic or Dry-state) Environments and the Use of Paper-
supported Chromatography
HN
N
NHNHC NH2
H2+N
+NH3
SCH3SS
OH
SH
CO
-ONH2Native protein at slightly
alkaline pH Values
© 2006, Alpay Taralp, Sabanci University
FOCUS To contrast chemical modification against enzyme
kinetics & SDM Discuss the applications of chemical modification Discuss non-destructive & destructive chemical
methods to learn about structure & function Look at classic derivatizations Discuss the advantages of competitive labeling
© 2006, Alpay Taralp, Sabanci University
History of chemical modification (not SDM)
• Approximately the same time as enzyme kinetics
• Glutaraldehyde tanning of leathers
• Refinement of foods, e.g., milk proteins
• Protein structure, function and S-F studies
• Insulin primary structure determination, partial and full acid hydrolysis, dansyl method, cyanogen bromide sequence alignments, Edman Degradation, Carboxypeptide-MS methods
Structure perturbs function, function defines structure
SH
Gee, my thiol must be ionized in orderto attack the peptide bond
There you go Mr. Thiolate! I have placedyou near a positive His residue and the +base of a large alpha-helix dipole moment!You may attack at will!
S-
+His
H+
Chemical modification and enzyme kinetics are complementary methods.
Advantage of enzyme kinetics: Enzyme bears an intrinsic probe, which allows you to examine
structure and function pH activity profile shows the pKa of groups that are important for
binding and/or catalysis… Disadvantage of enzyme kinetics:
Not every protein is an enzyme! Other proteins which can be bioassayed fairly conveniently are antibodies and hormone-receptor interactions.
Advantage of chemical modification in comparison to enzyme kinetics? The method can be used on any protein to determine
information related to structure and function! Disadvantage of chemical modification in comparison to
enzyme kinetics: You must insert the probe of structure and function, and you
must do it without defeating the purpose of your experiment
© 2006, Alpay Taralp, Sabanci University
Chemical modification and site-directed mutagenesis are complementary methods.
Rationale: Information obtained from chemical modification forms a base to design appropriate mutants. E.g. Chemical modification shows that 1 of 4 Met residues is
super reactive. To learn more abot the environment, use SDM to replace neigboring groups & observe the results…
Advantage of chemical modification in comparison to SDM? Fast, inexpensive, doesn’t require elaborate setup.
Disadvantage of chemical modification in comparison to SDM? Not very specific, and imposes a chemically reactive
environment. Can lead to drastic modifications. Modifications typically are limited to amino groups, carboxyl
groups, activated aromatic groups, sulfhydryl groups, guanidino groups, & imidazole groups, i.e., N & C termini, glu, asp, tyr, his, cys, met, lys, arg, trp.
© 2006, Alpay Taralp, Sabanci University
Paper methods applied to protein investigation
Historical applications – chromatography and electrophoresis
Amino acid analysis
Peptide mapping
Disulfide bridge analysis
Analysis of other post-synthetic modifications, e.g., phosphorylation
Work-up of chemical modification experiments!
Protein/peptide/amino acid resolution and purification
Current applications
Same! Apparatus has been revised somewhat – e.g., HV tlc is a strong alternative to HV paper methods
Advantages of paper methods over HPLC and MS methods to identify and purify
1.Many samples can be run simultaneously
2.Multidimensional runs are conveniently setup; sample processing is possible in between runs!
3.Resolving power is great
4.Tolerance of potential interferents is high
5.Detection methods are potentially very sensitive and selective
6.Cost efficiency of equipment and experiments
7.Less need of skilled labor
8.MS and HPLC methods can be coupled if desired
The apparatus of paper chromatography
for biological samples
for simple analytes
You may discard the drying accessories; instead clip the paper horizontally at the bottom with zig-zag scissors - allows solvent to drop evenly to the bottom
suspended papers
trough with solvent
The apparatus of high-voltage paper electrophoresis
1. paper; 2. dielectric; 3. glass tank; 4. trough with buffer (top) and base with same buffer (bottom); 5. cathode (-); and 6. anode (+)
Types of volatile solvent systems
•BAWP
•Ammonia – organic
•Pyridine acetate buffer
•Formic acid – acetic acid buffer
Types of papers
•Whatmann 3MM for high loading, large peptides
•Whatmann 1MM for high resolution
Tlc alternatives as stationary phase
•Cellulose tlc plates
•Silica tlc plates
Protein sample preparation prior to spotting on paper
Amino acid analysisHydrolysis (acid typically; base hydrolysis if Trp is needed)Dry sampleReconstitute in a minimum amount of running bufferSpot the sample on paper
Identifying the number of arginine residues in BSAChemically block all lysine residues in 8M ureaDialyze away the ureaDigest all arginyl peptide bonds using trypsinDry sampleSpot the sample on paper
Separating the and chains of insulinIncubate protein in 95:5 formic acid/30%hydrogen peroxideDry sampleSpot the sample on paper
Origin X X X X XX X X X X
(-)
Submergedin buffer at base, pH 2.1
Submergedin bufferedtrough, pH 2.1
Direction of migration40V/cm
e.g. for HVPE
oo
o
o
oo
o
o
o
o
o
o
o
o
o
o
after migration
Identifying and quantifying bands after migration and drying of paper chromatograms
Strategies to identify•Intrinsic fluorescence•Radiolabeling and exposure of X-ray film•Fluorescent derivatization or colorimetric derivatization, e.g., ninhydrin
Strategies to quantify•Densometry of chromatographic images•Liberation from paper and subsequent analysis
How shall we see these?
n-Dimensional runs & the advantage of multi-dimensions
1-D
2-D
multi-D
X X X X X X
Xoo
o
o
rotate90degrees
o o o o
run 2nddimension
o
oo
oo
o o
oo
o
o o
oo
oo
run 1stdimension
2nd & subsequent dimensions may/may not be performed using the initial conditions
oooo
run 2nddimension
o
oo
after 1stdimension
o o o o
o o o o o
you are interestedin these seriessamples
cut out,stitch ontonew paper!
ooooo
o
o
o
o
o
oo
oo
o
now you are interestedin this vertical series cut out,
stitch ontonew paper,run anotherdimension
repeat asnecessary
You may chemically process the sample in between dimensions!
•Spray or Dip
Reaction between protein functional groups & reagents Many protein groups interact with reagents as a function
of their ionization state Groups can be modified most specifically in an optimum
pH range At very high pH values, reaction with hydroxide ion
typically competes
© 2006, Alpay Taralp, Sabanci University
Met & neutral Cys feature substantial nucleophilicity Carboxylic acids react with diazo compounds principally
while in the protonated form. Ring carbon positions on Trp, His & Tyr can be modified,
usually via electrophilic attack. Phe is not sufficiently e--rich to promote electrophilic attack by typical protein reagents.
Typical reactivity of protic moieties as a function of pH
© 2006, Alpay Taralp, Sabanci University
Q: Are there other factors, which affect apparent (macroscopic) reactivities? A: YES! Accessibility and steric considerations refer to the size &
amount of reagent used, whereabouts of reactive group, permeation time of reaction, and the conditions of reaction.
Nucleophilicity considers the base strength, solvation shell, and lone electron density, polarisability & conjugation of centers on protein groups.
Electrophilicity considers the electron density of protein groups and electron deficiency of reagents.
The “hard likes hard, soft likes soft” empirical relation does apply to some degree. Basically, the interaction of reacting orbitals and centers is in part determined by their “harness” or “softness.”
© 2006, Alpay Taralp, Sabanci University
Some typical reagents of protein modification
© 2006, Alpay Taralp, Sabanci University
Some typical reagents of protein modification (contin.)
© 2006, Alpay Taralp, Sabanci University
Structure-function studies generally follow this procedure: Choose to address a particular issue; Envisage a strategy to perturb the protein in order to investigate the
issue; Anticipate the best groups to modify in order to create the desired
perturbation; Choose the best reagent and conditions to specifically modify the
functional groups in question; Compare the change of properties of the modified protein to that of
the control protein; Form conclusions using the Okum’s Razor argument.
Local changes and Global changes: Chemical modifications of protein groups affect pI & pKa values, solubility, surface hydrophilicity & hydrophobicity, bioactivity, folding stability & global organization.
© 2006, Alpay Taralp, Sabanci University
A: One application is to carry out structure-function studies Methods can be benign or destructive A typical protocol:
Label the native protein Determine a change of property, and Extract useful information on the basis of the results.
Data collected may be related to structure: E.g., a tyrosine specific reagent cannot react with a tyrosine residue; the
tyrosine may be buried. Data collected may be analytical.
E.g., Protein X does not react with a tyrosine-specific reagent in 10M urea; the primary sequence does not contain tyrosine.
Data collected may be related to function and/or structure and function. E.g. 1., 10 Tyr residues are solvent-accessible, but 1 predominantly reacts
with trace reagent; this Tyr is unusually reactive in the protein environment. E.g. 2., After Lys-93 is modified chemically, substrate X shows a reduced
Km value; Lys-93 plays a role in binding. If the role is direct, this Lys is likely near the binding site of the active center.
Q: What are the principle applications of protein chemical modification?
© 2006, Alpay Taralp, Sabanci University
Other investigative & analytical uses of protein chemical modifiers: Changing the net charge of the protein
WSC and ethylenediamine Iodoacetic acid, succinic anhydride Cyclohexanedione
Retaining the net charge but modifying the pKa Reductive formylation, amidination, guanidination
Altering groups and testing their importance Destabilizing a protein towards denaturants or reversibly protecting a protein
Citraconic anhydride, maleic anhydride Modifying protein hydrophobicity, hydrophilicity & surface activity
Adducts with different compounds such as PEG2000 Quantifying amino acids, functional groups, disulfide bridges, phosphate and other post-translational
modifications Various chemical reagents and chromatographic methods
Chemical modification is used to re-engineer proteins for improved performance in industry. The protein is characterized as much as possible The improved trait is defined A modification is envisaged to promote the property The protein is derivatized accordingly.
S-F problem?
InappropriateModification/ Conditions
Appropriate Modification/ Conditions
NOTE - Chemical treatment of biologicals need not necessarily be destructive and damaging. When conducting a structure-function study, your choice of reagent and protocol should not defeat the purpose of your study
© 2006, Alpay Taralp, Sabanci University
Judging if the purpose of an investigation has been defeated• The meaning of defeating your experimental
purpose• Cases that are indifferent to over-reaction of
protein groups• Cases that require post-reaction validation
Distinguishing between apparent/macroscopic values and true/microscopic/theoretical values
• Kinetic measurements versus thermodynamic measurements
• Measuring a value via independent methods may give a true value
A time to chemically label and a time to chemically work-up: Distinguishing between the two steps
In a protein labeling experiment and its workup, we typically note:
1. Reactions conducted on native proteins are for labeling
2. Reactions conducted on protein derivatives are for labeling or work-up
3. Reactions conducted on unfolded native proteins are for labeling of controls
4. Reactions conducted on unfolded protein derivatives are for work-up
Acetylation using Acetic Anhydride or Acetyl Imidazole at pH 7.5-9 Acetylation blocks free amino termini, lysine side-chains and tyrosine side-
chains. Acetyl tyrosine is hydrolyzed at high pH values or transesterified above pH 6 by a strong nucleophile.
Acetylation of amino groups is usually quantitative only if the reaction is carried out in urea solution. Without urea, usually 60% of the lysine residues are modified.
This modification could have the following effects Decrease of protein solubility (Acetyl BSA is only soluble at pH < 5); Changes in biological activity (by either global structural changes or specific
changes in the active site); Dissociation of multimeric complexes (if the surface charges are required for
association to other biomolecules).
HN
N
NHNHC NH2
H2+N
N
O
HCH3
SCH3SS
OO
H3C
SH
CO
-O
Example modifications
© 2006, Alpay Taralp, Sabanci University
Succinylation using Succinic Anhydride at pH > 7 Succinylation blocks amino groups without modifying other functional
groups. Succinyl tyrosine is hydrolyzed above pH 5 via an intramolecular
cyclization. The effect of succinylation parallels some of those described above for
acetylation. Succinylation is typically used to improve the solubility of poorly
soluble proteins, particularly at pH values above 5. Succinylation induces separation of protein aggregates (eg.
Hemerythrin dissociates into eight subunits). Succinyl proteins usually unfold more easily and demonstrate a shift in
their pH optima (assuming that activity is retained). Succinylation is often used as a linker molecule through which the
protein be attached to a foreign surface.
H N
N
N H N H C N H 2
H 2 + N
N
O
H CH 2 CH 2 CO O -
S CH 3 S S
O H
S H
C O
- O
© 2006, Alpay Taralp, Sabanci University
Maleiylation (left) using maleic anhydride & citraconylation (right) using citraconic anhydride at pH > 8Maleic anhydride and citraconic anhydride block amino groups but
can be removed by incubating the protein at low pH.Maleic anhydride is more difficult to remove & sometimes leads to
a minor side-reaction, in which Cys adds across the double bond.Citraconic anhydride is used to reversibly block amino groups and
to protect the amino groups from other chemical reactions. E.g., citraconylation will protect amino groups from subsequent oxidation by
hydrogen peroxide. Once Cys’s are oxidized, amino groups are regenerated. Citraconylation is the method of choice to temporarily solubilize
poorly soluble proteins.
sometimes alittle bit hereC
O-O
S
O
O
O-
O-
H
OH
SS SCH3
N
O
HCH CHCOO-
NHC NH2
H2+N
N
NH
HNHN
N
NHNHC NH2
H2+N
N
O
HCH CCOO-
CH3
SCH3SS
OH
SH
CO
-O
© 2006, Alpay Taralp, Sabanci University
Polypeptidylation using carboxyanhydrides at pH 7Carboxyanhydrides add onto amino groups, releasing carbon
dioxide and producing new amino groups. In presence of excess reagent, the reaction repeats, building a long
polypeptide chain that extends into solution.Technique can improve solubility of insoluble proteins (R = H) &
conversely, reduce the solubility of soluble proteins (R = i‑propyl). Polyvalylribonuclease, for example, aggregates in solution above 30C
Used to study hydrophobic interactions between proteins.
H N
N
N H N H C N H 2
H 2 + N
N
O
H CH R
N H
O
CH R
N H 2
S CH 3 S S
O H
S H
C O
- O
n
N H O
R O
O
© 2006, Alpay Taralp, Sabanci University
Trifluoroacetylation using Ethyl Thiotrifluoroacetate at pH 10 Ethyl thiotrifluoroacetate is used to reversibly block amino groups as in the case of
maleic anhydride and citraconic anhydride. But the derivative has a net zero charge. Furthermore, the group is removed under basic
conditions using sodium carbonate at pH 10.7 or 1M piperidine. E.g. 1., pancreatic ribonuclease was deactivated completely following
trifluoroacetylation yet there were no measurable structural changes. In carbonate buffer, activity was gradually restored.
E.g. 2., trifluoroacetylation caused structural changes in cytochrome c. Incubation in carbonate buffer restored full electron transfer ability.
Trifluoroacetylation can simplify protein sequencing if trypsin is used to cleave the protein
Trypsin normally cleaves after every free arginine and lysine residue, generating a complicated mixture. If the protein analyte is first trifluoroacetylated, trypsin can only cleave at the arginine bonds.
The fragments are isolated, deblocked and subjected to a 2nd tryptic digestion at the lysine sites
HN
N
NHNHC NH2
H2+N
N
O
HCF3
SCH3SS
OH
SH
CO
-O
© 2006, Alpay Taralp, Sabanci University
Amidination using methylacetimidate (left) or guanidination using o-methylisourea (right) at pH 7-10.Methylacetimidate forms stable derivatives with lysine residues
and amino termini. O-methylisourea reacts with lysine groups.Modification of amino groups increases steric bulk, while retaining the
positive charge. The pKa of derivatives shifts to values well above 11! Most modifications do not give significant structural changes.
Q: Suggest a test to postulate if the active-center lysine is the catalytic nucleophile or a binding group.
Q: Suggest another test to determine if a lysine and a neighboring aspartic acid maintain a crucial salt bridge. Hint: You will analyze the pH activity after reaction.
HN
N
NHNHC NH2
H2+N
N
N+H2
HCH3
SCH3SS
OH
SH
CO
-O
HN
N
NHNHC NH2
H2+N
N
N+H2
HNH2
SCH3SS
OH
SH
CO
-O
© 2006, Alpay Taralp, Sabanci University
Ethoxyformylation using diethylpyrocarbonate (ethoxyformic anhydride) at pH 4 Ethoxyformic anhydride only reacts with imidazole groups at pH 4; it modifies
amino groups at basic pH values. The acyl-imidazole adduct is stable in water, particularly at pH 7, unlike other
acyl His derivatives. The group is removed by the action of H2N-OH at pH 7. EFA is used in molecular biology to battle against RNAse. EFA can rapidly inactivate many enzymes including trypsin and is used by
industry to cold-sterilize food. EFA can aid structure-function studies if histidine is important for bioactivity.
Q: You must decide if a Lys or a His is essential for catalysis. What experiment could you design using the reagents we have discussed thusfar?
HN
N
N
O
H3CH2CONHC NH2
H2+N
N
O
HOCH2CH3
SCH3SS
OH
SH
CO
-O
© 2006, Alpay Taralp, Sabanci University
Reductive dimethylation using H2C=O & NaBH4 (left) at pH 9; Variable methylation using ICH3 (right) at pH 2-10 Reductive methylation mono/dimethylates amino groups, retaining the positive
charge of the amine. Generally, structural changes are not observed following reductive methylation.
Reaction with iodomethane quaternizes amino groups with retention of positive charge. Met and Cys are converted to their sulfonium iodides. Tyr is methylated and His is converted to the dimethylimidazolium iodide.
Reaction selectivity is tuned by appropriate choice of pH & reaction medium. Unlike reductive methylation, iodomethane:
Puts a permanent positive charge on amino, imidazole and sulfide groups Removes the hydrogen bonding ability of tyrosine
HN
N
NHNHC NH2
H2+N
+NHCH3
CH3
SCH3SS
OH
SH
CO
-O
HN
N+
N
H3C
CH3
NHC NH2
H2+N
+NCH3
CH3
CH3
S+(CH3)2SS
OCH3
S+(CH3)2
CO
-O
© 2006, Alpay Taralp, Sabanci University
Carboxyalkylation using iodoacetate (left) & carbamino-alkylation using iodoacetamide (right) at pH 2-10. Iodoacetate & iodoacetamide react similarly to iodomethane
except tyrosines are generally not modified.Derivatives are larger than iodomethaneDerivatives may be considered bulkier.
In the case of iodoacetic acid, a negative moiety is introduced. Iodoacetamide, iodoacetic acid and iodomethane are commercially
available in NMR-active and radioactive forms.
HN
N+
N
-OOCH2C
CH2COO-
NHC NH2
H2+N
+NHCH2COO-
CH2COO-
S+CH2COO-H3C
SS
OH
S+(CH2COO-)2
CO
-O
HN
N+
N
H2NOCH2C
CH2CONH2
NHC NH2
H2+N
+NHCH2CONH2
CH2CONH2
S+CH2CONH2
H3CSS
OH
S+(CH2CONH2)2
CO
-O
© 2006, Alpay Taralp, Sabanci University
Esterification using diazoglycinamide at pH 5 (left) or 0.1M methanolic HCl (right).Reactions are acid catalyzed
With diazoglycinamide, the strongest proton donors (C-terminal carboxylic acids) react first
With methanolic HCl, esterification hastens as [HCl] increases.Esterifıcation time course experiments often correlate to a loss of
biological activity
HN
N
NHNHC NH2
H2+N
NH2
SCH3
OH
SH
CO
H2NCCH2OO
S S HN
N
NHNHC NH2
H2+N
NH2
SCH3
OH
SH
CO
H3CO
S S
© 2006, Alpay Taralp, Sabanci University
Amidation using amine, hydroxysuccinimide, and ethanolamine at pH 4.75Amidation is a general method of converting protein into
many useful forms.By appropriate choice of amine, a positive, zero or negative
charge can be introduced at the carboxylic acid sites.E.g. 1., immobilization of enzyme has been carried out using
carboxyl functions. E.g. 2., chemical modification can be used to probe the active
site carboxylates of pepsin.
HN
N
NHNHC NH2
H2+N
NH2
SCH3
OH
SH
CO
HOCH2CH2NH
S S
© 2006, Alpay Taralp, Sabanci University
Reduction using mercaptoethanol or dithiothreitol (reduced form) at pH 8.Cystine bridges can be cleaved to afford two cysteines by the
action of mercaptoethanol or dithiothreitol. In the case of DTT, a stoichiometric amount of reagent is sufficient to bring
about the modification. The driving force for the reaction is formation of a stable 6-membered ring.
Reduction has been used to study bioactivity & structure. In many cases, reduced proteins exhibit changes of solubility & activity.Reoxidation of reduced cysteines sometimes reforms the correct disulfide
bridges & restores structure and function.
HN
N
NHNHC NH2
H2+N
NH2
SCH3SH
OH
SH
CO
HO
SH
© 2006, Alpay Taralp, Sabanci University
In competitive labeling, a modification ALWAYS reflects the state of the native protein. Why? Reagent is used in trace amounts
At most 1 group/protein is modified Thus, only the reactivity of the native protein is probed
Above: A typical experiment quantifies the statistical distribution of the kinetic reactivity of protein groups the reactivity of every reactable group is probed at different pH values. While the results describe apparent data, they are used to interpolate a group’s
local environment.
Molarradiolabelincorporation(apparentrelativereactivity) with respectto Phe-NH2
pH value of reaction
1-
0.5-
apparent pKaof reactive group
Apparent relative reactivity and pKa values of 3 proteingroups with respect to an external Phe-NH2 standardCompetitive Labeling
Many methods used to modify different functional groups
Generally the modifier is used in excess → proteins modified extensively.Onus is put on the investigator to prove that the results reflect the properties of the native protein.
© 2006, Alpay Taralp, Sabanci University
Steps: 1.Tracer. Incubate a solution containing protein & standard (e.g., Phe-NH2) with a trace amount of 3H-reagent. Tritium incorporation is your probe. Carry out this operation at many pH values.
2.Normalization. Unfold the protein in urea and react the remaining groups completely with 14C labeled reagent. This protocol ensures that all protein and standard derivatives become chemically homogeneous.
3.Quantification. Extract the standard quantitatively into an organic phase and quantify the 3H/14C ratio for each pH value. Digest the protein using enzymes so that only one modified group is present per peptide. Migrate all peptides in an electric field. Identify peptide positions using autoradiography. Collect each band & quantify the 3H/14C ratio. Sequence the peptide in order to identify the derivative.
4. Interpretation. The 3H/14C ratio of protein & standard are compared. The reactivity of the standard reflects its aqueous environment & should parallel its titration curve. The reaction profile reflects the groups’ local environment so you should observe a “titration curve”, which illustrates the effect of the protein environment. This titration curve may deviate significantly from what would be expected in an aqueous environment. The data can thus be used to build a picture of the local environment.
© 2006, Alpay Taralp, Sabanci University
Two data can be extracted for every group: The kinetic pKa of the group & the relative reactivity of the group can be assessed in comparison to Phe-NH2, which has a well-characterized reactivity and pKa that is devoid of any environmental influences. The two data help to assess if any potential steric considerations or stereoelectronic factors are out of the ordinary.
Below are some example results, which allow you to appreciate the power of this method:
e.g. Titration curve of Cys gives a pKa of 3. Perturbed pKa. e.g. Titration of Lys-29 gives poor reaction when reacted from pH 5 to
11. Buried. e.g. Titration of Lys-29 gives a continuous titration curve with an
interpolated pKa of 11 when reacted from pH 5 to 11 (A normal lysine is 10.5). Lysine is buried part of the time or otherwise perturbed.
e.g. Titration of Lys-29 gives a discontinuous curve when reacted from pH 5 to 11. First there is no reaction, then after pH 10.5 the reaction is very high. Conformational change and accessibility.
e.g. The N-terminal Histidine imidazole ring displays a pKa value that is equal to the pKa of its N-terminal amino group. Inductive effects and coupled reactivities.
e.g. Topography of E.coli Ribosomal Protein L12 in situ Eur. J. Biochem 80, 35-41 (1977). Next slide (please).© 2006, Alpay Taralp, Sabanci University
© 2006, Alpay Taralp, Sabanci University
The power of competitive labeling and paper methods
RECAP - Steps of competitive labeling: 1. Label each protein at most once, using a trace of tritiated reagent
His
NH2
LysTyr
SH3C
1000
1000 AcHisNH2
+100 *3-Me-I
95 *3-Me-OH
(aq)(aq)
SH3C
Me-3H*
Tyr Lys
NH2
His
(aq)
SH3C
Tyr Lys
NH2
His
Me-3H*
(aq)
SH3C
Tyr Lys
NH
Me-3H*
His
(aq)
SH3C
Tyr Lys
Me-3H*
NH2
His
AcHisNH2
Me-3H*999
997
111
in a pH "X" solution
AcHisNH21
Standard, with known pKa value & reactivity!
+
+ +
+
++
+
+excess *14-Me-I/12-MeI mixture
99% *14-MeOH/12-MeOH
in 8M urea pH 10 solution
+SH3C
Me
Tyr
Me
+Lys
Me
Me
Me
+N
Me
Me
Me
+His
MeMe
(aq)
+SH3C
Me
Tyr
Me
+Lys
Me
MeMe
+N
Me
Me
Me
+His
Me-3H*Me
(aq)
+SH3C
Me
+Lys
Me Me
Me
+N
Me-3H*
Me
Me
+His
MeMe
(aq)
+SH3C
Me
Tyr
Me
+LysMe
-3H*
Me
Me
+N
Me
Me
Me
+HisMe
Me
Ac+HisNH2
Me-3H*Me
999
997 1
11
Ac+HisNH2
Me Me1
+
++
+(aq)
MeTyr
(aq)
SH3C
Tyr Lys
NH2
His
(aq)
SH3C
Tyr Lys
NH2
His
Me-3H*
(aq)
SH3C
Tyr Lys
NH
Me-3H*
His
(aq)
SH3C
Tyr Lys
Me-3H*
NH2
His
AcHisNH2
Me-3H*
999
997 1
11
AcHisNH21
+
++
+
Step 2: Label all protein completely using excess low specific actitivity 14C-MeI to yield a chemically homoge-neous mixture
Step 3. Repeat steps 1 & 2 using more native protein; label many samples at different pH values.
Step 4. Prepare a positional marker protein; label native protein with high-specific activity 14C-MeI, then excess 12C-MeI in 8M urea
Step 4. Digest all methyl-proteins separately. Use 2-4 proteases.
Step 5. Separate the different peptides in an electric field along tlc or paper; use 1-4 dimensions; spot the marker peptides at the ends of the chromatogram.
Step 6. Expose X-ray film to the chromatogram (RT for 14C, -80˚C for 3H) to find the position of all substantially radioactive peptides.
Step 7. Find the 3H/14C ratio of each peptide band; sequence each to identify the reactive group (e.g., AlaTyr*Gln can only be Tyr18)
Step 8. Plot the reactivity (i.e., 3H/14C ratio) of each reactive group as a function of pH.
Subsequent Steps (for proteins of known 1˚ sequence):
cathode (-)
anode (+)
3kV,30min,pH 2.1
Peptide solutions spottedalong paper at origin
14C-marker peptides
x x x x x xx xx x x x x x x x x7.1 7.2 7.3 7.4 7.5 7.6 7.7 7.8
pH of labeling7.9 8.0 8.1 8.2 8.3 8.4 8.5
Paper chromatogram beneath;X-ray film superimposed on top
Recover & measure 3H cpm/14C cpm
e.g.1; peptide 8 @ pH 7.8 = 512/1890
e.g.2; peptide 7 @ pH 7.6 = 450/5680
e.g.3; peptide 6 @ pH 8.1 = 890/1543
. .
..
.
.
= radioactive ink spotted on Whatmann 3MM paper1
23
4
5
6
7
8
9
Peptide 6 was sequenced: AlaTyr*Gln indicates a Tyr reacted. There are 4 tyrosyl residues in this protein but the sequence data indicates Tyr18
Molar 3H/14Cradiolabelincorporation(apparentrelativereactivity) with respectto AcHis-NH2
pH value of reaction(referenced against known pKa of AcHis-NH2)
1-
0.5-
apparent pKaof reactive group
Apparent relative reactivity and pKa values of 5 proteingroups with respect to an external AcHis-NH2 standard
His102
Tyr18
Met1
Lys87
NH2
AcHis-NH2
In vitro manipulatin of protein monomers or their environments to
enhance performance
© 2006, Alpay Taralp, Sabanci University
Goals: To formulate broad-scope protein preparations, which are: Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes
Locating/purifying thermophiles, etc.
Geneticmanipulation
Low-techchemical strategies
Native
search for robustness:
Approach 1: use proteins in water-free media
Approach 2: crosslink Approach 2: crosslink proteins togetherproteins together
Approach 3: chemically Approach 3: chemically modifymodify proteinprotein monomersmonomers
© 2006, Alpay Taralp, Sabanci University
As scientists, you will occasionally prepare proteins, which are not in water. Please note the following pharmaceutical example:
Insulin Lysozymeintraperitoneal injections creams & gelstime-released matrix inhalable microcapsulesİnhalable microcapsules drop formulations
Question: What are some biases related to proteins in unusual environments?
Fact 1: Protein structure & function is sensitive to pH, temperature, ionic strength, etc.
Fact 2: Eminent scientists have said “Enzymes need aqueous environments to function because this is how Nature has intended them to function.”
Question: In your profession, you may prepare proteins in unusual environments. Should you worry?
© 2006, Alpay Taralp, Sabanci University
Answer: NO! Recent evidence shows that protein structure and protein function may be very tolerant to strange environments!
Today, we will discuss the structure & function of proteins in reduced-water & water-free environments
Why does mankind wish to use biologically active proteins?Proteins accelerate chemical reactionsProteins improve product propertiesProteins permit novel syntheses
Here are some typical industrial applications:BioreactorsTextile treatmentMedicinal and organic synthesesProtein drugs & drug deliveryBiosensorsBioremediationFood preparation industries
Problem? Industrial conditions are often too harsh for proteins in the native state.Consequences: Poor biological activity, short lifespan, limited reaction parameters, etc.
© 2006, Alpay Taralp, Sabanci University
Example I: Rate comparisons in octaneEnzyme kcat/Km (M-1s‑1) Rate (kENZ) Rate(kNONENZ) Enhancement
Chymotrypsin 0.7 1.1 x 10-11 6.4 x 1010
Subtilisin 1.8 1.1 x 10‑11 1.6 x 1010
N‑Ac‑L‑Phe‑OEt + amyl alcohol → N‑Ac‑L‑Phe‑OAmyl + EtOH in octane
NOTE – Reactions in water are much faster!
The new view: Proteins can maintain their structure & function without an aqueous environment.
Implication? Your protein drug formulations may beperfectly happy in a gel, or a cream or in a sugar-coatedmatrix
Typically kcat/Km water / kcat/Km
octane = 104-107
Q: Can we improve the speed of an enzyme reaction in octane? A: YES! But first we must understand more...
© 2006, Alpay Taralp, Sabanci University
Q: -Momorcharin was crystallized in water (thin lines), then crosslinked, & solvent-exchanged with CH3CN (thick lines). Compare the structures.
Similar results were obtained with: chymotrypsin in hexane subtilisn carlsberg in acetonitrile
Example II: Protein Structure & Integrity
H2O
H2O
H2OH2OH2O
H2O H2O
H2O H2O
H2O H2O
H2OH2O
H2O
o oo
o oo
o oo o oo
o oo
o ooorganics organics organics
organicsorganicsorganics
organics organics
organicsorganics
organicsorganicsorganics
organicsorganics
organics
© 2006, Alpay Taralp, Sabanci University
Example III: Solvent Effect on Catalytic RatesLyophilized (pH 7.8) chymotrypsin + dry organic solvent CT suspensionLyophilized (pH 7.8) subtilisin + dry organic solvent SBL suspension
Solvent Vmax/Km (min-1 x 10-6)Subtilisin Chymotrypsin
Hexadecane 3900 4300Octane 2000 1700Carbon tetrachloride 340 96Toluene 150 120Ethyl Ether 97 48Acetone 810 0.6Acetonitrile 150 0.4Dimethylformamide 19 <0.1Dimethylsulfoxide <0.1 <0.1
N-Ac-L-Phe-OEt + n-propanol N-Ac-L-Phe-n-OPr + EtOH
Q: Is the above data incorrect?A: NO! Products were NEVER formed unless the enzyme was added!
© 2006, Alpay Taralp, Sabanci University
Example IV: Active Site Integrity
SO2 OSer in active site
Q: Chymotrypsin(dispersed in octane) + phenylmethylsulfonyl fluoride yielded inactive enzyme. What is implied?
Question: How do we explain this trend?
Answer: Hydrophilic organic solvents strip essential water from the enzyme.
What is the evidence? Enzyme activity correlates directly with the amount of water retained by
the “dry” enzyme Enzyme activity improves greatly by adding small amounts of water
(1.5%) to the hydrophilic organic solvents.
© 2006, Alpay Taralp, Sabanci University
Example V: Rate Measurements in octane
Q: Rate measurements in organic solvents behaved as if in aqueous solution! What is implied?
Start FinishE EF
A P B Q
E + S ES E + Pk1
k-1
k2
B1
B2B31/v
1/[A]o
v
oE][= o +
oA
A
][
+
oB
B
][
© 2006, Alpay Taralp, Sabanci University
Example VI: pH activity
Q: Chymotrypsin activity in octane depends on the pH of solution from which the enzyme was dried. Why?
A: pH “memory”
pH
kcat/Km
kcat/Km
LpH
Lyophilize
Choice 1: set pH here andlyophilize
Choice 2: lyophilize and thenoptimize pH with organicsoluble buffers
o
o
o
o
o
o
o
o
o
organics
organics
organics
organics
organics
organics
organics
organics
organics
organics
organics
© 2006, Alpay Taralp, Sabanci University
Q: Enzyme dried in the presence of acetyl-L-phenylalanine was more active. Why?
A: Positional imprinting of active site groups!
o
o
o
o
o
o
o
o
o
Example VII: Imprinting protein in the dry state
© 2006, Alpay Taralp, Sabanci University
Example VIII: Competitive inhibition of chymotrypsin in water & in octane
Inhibitor Inhibition Constant KI(mM)In water In octane
Benzene 21 1000Benzoic Acid 140 40Toluene 12 1200Phenylacetic acid 160 25Naphthalene 0.4 11001‑Naphthoic acid 7.2 3
Q: Good competitive inhibitors in water are poor competitive inhibitors in octane! Poor competitive inhibitors in water are excellent competitive inhibitors in octane! WHY?
© 2006, Alpay Taralp, Sabanci University
Example IX: Substrate specificity (kcat/Km) of chymotrypsin & subtilisin (in water or octane)
Chymotrypsin Subtilisin
Substrate hydrolysis transesterification hydrolysis transesterification
N‑Ac‑L‑Phe‑OEt 4.00 x 104 0.72 1.3 x 104 1.7N‑Ac‑L‑His‑OMe 2.00 x 102 1.5 5.5 x 102 3.1N‑Ac‑L‑Ser‑OMe 0.87 x 102 2.5 1.6 x 102 4.5
Q: Hydrophobic groups yield better substrates in water! Hydrophilic groups yield better substrates in octane! WHY?
© 2006, Alpay Taralp, Sabanci University
Initial Rate (mmol h-1)/mg proteinSolvent L-enantiomer D-enantiomer enantioselectivity
(vL/vD)acetonitrile 0.85 0.12 7.1pyridine 0.645 0.15 4.3acetone 0.54 0.41 1.3dichloromethane 0.29 0.33 0.88methyl tert‑butyl 2.2 6.4 0.34octane 2.9 12 0.24tetrachloromethane 1.7 8.9 0.19
Q: Acetyl-L-Phe-OEt is a better substrate in polar organic solvents, whereas acetyl-D-Phe-OEt is a better substrate in very hydrophobic solvents. WHY?
Example X: Rates and enantioselectivities of propanol/N‑Ac‑Phe‑OEt transesterification using Aspergillus oryzae protease in anhydrous solvent
© 2006, Alpay Taralp, Sabanci University
Q: Protein structure & function may suffer in proceeding from water to dry environments. How can we help?
o
o
o
o
o
o
o
o
o
© 2006, Alpay Taralp, Sabanci University
the fraction (f) of the active subtilisin in the CLCs
A linear profile (black) indicates that diffusion is not rate-limiting. A convex profile (red) would be expected if diffusion was an important factor.
o
o
o
o
o
o
o
o
o
© 2006, Alpay Taralp, Sabanci University
In summary, when proteins are put into unusual environments they can survive!
Final Question: What are some advantages of using enzymes in water-free environments?
Novel reactions, which are not feasible in water, become posssible Use of “non-water” nucleophiles, e.g. transesterification and ester ammonolysis; Increased solubility of apolar substrates; andshifting thermodynamic equilibria to favor synthesis over hydrolysis, e.g., esterification and peptide formation.
Suppression of water‑mediated side-reactions. Alteration of substrate specificity. Enhanced thermostability of enzymes. Easy recovery of enzyme from low boiling solvents.
© 2006, Alpay Taralp, Sabanci University
Crosslinking Protein Solids to Enhance Utility
© 2006, Alpay Taralp, Sabanci University
You may prepare protein drug formulations, which are not in water:
We saw some examples just before:Insulin Lysozymeintraperitoneal injections creams & gelstime-released matrix inhalable microcapsulesİnhalable microcapsules drop formulations
New Facts: Protein structure & function can be stable in unusual environments if you use the correct procedure! What Are The Results???? Cheaper More stable More catalytic Longer-lived More easily stored & transported More active at pH & temperature extremes
Question: In your profession, you might wish to prepare “super” proteins, which are very effective in various applications. What are your available strategies? © 2006, Alpay Taralp, Sabanci University
Strategies to engineer better proteins
Locating/purifying thermophiles, etc.
Geneticmanipulation
Low-techchemical strategies
NativeApproach 1: use proteins Approach 1: use proteins in water-free mediain water-free media
Approach 2: crosslink proteins together
Approach 3: chemically Approach 3: chemically modifymodify proteinprotein monomersmonomers
© 2006, Alpay Taralp, Sabanci University
Approach 2: Crosslinking of protein
Crosslinking of protein solids
Solution phase crosslinked enzymesCrosslinked enzyme crystals
Native
Q: Why crosslink protein?
Improved structural stability
Potentially altered function
© 2006, Alpay Taralp, Sabanci University
Established crosslinking media:A. In a solubilizing environment
Q: What are the characteristics of method A?Solution-phase crosslinking often leads to polydispersityEvents: Soluble monomers → Dimerization → Oligomerization →→→ Insolubilization (typically gelation)
© 2006, Alpay Taralp, Sabanci University
B: In the crystalline state
Q: What are the characteristics of method B?Crosslinked enzyme crystals are biologically active catalysts with well-defined poresEvents: Soluble monomers → Crystallization → Chemical crosslinking →→→ Insolubilization
© 2006, Alpay Taralp, Sabanci University
C: In the lyophilized state
Q: What are the characteristics of method C?Crosslinking of lyophilisates leads to polydispersityEvents: soluble monomers → lyophilization at desired pH value → vacuum, heat and/or chemical crosslinker → oligomerization →→ insolubilization (powder resists swelling in solvents)
© 2006, Alpay Taralp, Sabanci University
D: As a precipitate retaining the native structure
Q: What are the characteristics of method D?Events: Soluble monomers → Salt-out or insolubilize with miscible organics → Add chemical crosslinker → Oligomerization →→ Insolubilization (powder resists swelling in solvents)
© 2006, Alpay Taralp, Sabanci University
E. Supported enzyme technologies
Surface-immobilized enzymes
Crosslinked andintertwined enzymes
Surface-immobilizedenzyme aggregates
Q: What are the characteristics of method E?
© 2006, Alpay Taralp, Sabanci University
Q: How do the proteins eventually combine together?Method A: Classic solution-phase crosslinking of dissolved proteins
© 2006, Alpay Taralp, Sabanci University
Q: Why is crosslinking solution-phase proteins (Method A) not equivalent to crosslinking of “pre-solidified” proteins (Methods B-
D)?
When reactive chemical crosslinkers are employed, the outer enzymes preferentially react at time t = 0 → teq.
Enzyme crystal
Amorphous solid (enzyme aggregate or lyophilisate)
Method A: In the solution phase, all enzymes have equivalent chances of reacting at time t = 0.
Solution phase
Method B
Methods C & D
© 2006, Alpay Taralp, Sabanci University
Q: How are proteins bonded together?A: Chemical (& thermal) strategies
Common reagents: glutaraldehyde, glyoxal, glycolaldehyde, formaldehyde, WS carbodiimide, bifunctionals such as alkyldiimidates, diisoureas, diketohalides, disulfonates & ditresylates, cyanuric Cl
Reagent-free, thermal crosslink induction: www.proteovak.com
The physico-chemical properties of the interprotein bond is variable:
I. Length: Crosslink varies from zero length to multi-carbon units
II. Bonding is generally mediated by the protein amino groups and carboxyl groups. Cystine bonds are also important.
III. Some reactions proceed with charge retention of the bonded groups, whereas other reactions proceed with a change.
CN
O
H
NH (CH2)n NH
N+H2 N
+H2
NH NH
OO
OO
N+H2N
+H2
S ?
SS
s?
N+H2 N
+H2
(CH2)n
N+H2 NH N
+NH
H
NN
[ ]nO O
NN
NO2O2N
NN
H H
© 2006, Alpay Taralp, Sabanci University
Q: Let us please summarize approach 2: protocols, reagents & products
A. In aqueous solution
B: In the crystalline state
C: In the lyophilized state
D: As a precipitate retaining the native structure
Your choice of reagent? Prior history; Surface accessibility and steric constraints; & effective pH during the reaction conditions
A: glutaraldehyde, other simple aldehydes,
EDC.HCl, diimidates, difluorodinitrobenzene
B: glutaraldehyde, other simple aldehydes
C: volatile or organic soluble carbodiimides, acyl group activators, disulfide exchange, thermal induction of amide bonding, bifunctional acyl halides and similar reagents
D: glutaraldehyde, dextran polyaldehyde© 2006, Alpay Taralp, Sabanci University
Re-engineering Protein Monomers via Chemical Modification in Aqueous,
Organic or Dry Environments
HN
N
NHNHC NH2
H2+N
+NH3
SCH3SS
OH
SH
CO
-ONH2Native protein at slightly
alkaline pH Values
© 2006, Alpay Taralp, Sabanci University
Q: What are some chemicals to alter protein charge? Ac2O, MeI, succinic anhydride, H2NCH2CH2NH2/ carbodiimide, iodoacetic acid
Q: What are some chemicals to alter protein hydrophobicity? Octadecyliodide, PEG-tresylate
Q: What are some reaction environments to alter protein groups?
Aqueous reagents acting of dissolved protein Organic-phase reagents acting on protein powder Vapor phase reagents acting on protein powder
Let us examine the fate of one enzyme, whose charges were altered in water using chemical reagents
modify the enzymein specific areas new enzyzme is faster or
new enzyme is stabler ornew enzyme has different pH activity
Review of Approach 3: Chemical modification of proteins to alter properties
© 2006, Alpay Taralp, Sabanci University
NH3+
example
inactive
COO-
COO--
OOC
-OOC
-OH
-OOC
-OOC COO
-
COO-
active
NH2
pH
catalyticrate
pH 7.5
Q: In this hypothetical example, is the enzyme active at low or higher pH values?
Q: How do you rationalize the difference?
© 2006, Alpay Taralp, Sabanci University
ethylene diamine + carbodiimide
-OH
+H2NCH2CH2NC
O
H
OC+ED
CO
ED+inactive
NH3+
CED+
O
NH2
active
COO-
COO--
OOC
-OOC
-OH
-OOC
-OOC COO
-
inactive
NH3+
COO-
CED+
O
NH2
activeCO
ED+
OC+ED
+EDC
O
Q: What happens to the charge distribution along the surface?© 2006, Alpay Taralp, Sabanci University
pKa 7.5
catalyticrate
pH
pKa 6.5
Gibbsenergy
reaction coordinate
Eu
EF
EF
EuQ: You may have obtained a desirable change, but many times the change is not free... What price have you paid in this example (right graph)?
Q: In this hypothetical example (left), what has happened to the pH activity of your antimicrobial protein drug (two things)? Please rationalize...
Q: Is this change desirable (local pH of infected regions is lower)?
© 2006, Alpay Taralp, Sabanci University
Q: So where do we stand?
User-friendly protocols to aggregate & crosslink protein and to improve stability
Chemical agents to alterprotein groups & function
Know-how to use proteinsin water-free environments
Thus...© 2006, Alpay Taralp, Sabanci University
My suggestion
Combine the 3 strategies to enhance the performance of proteins as drugs, etc., and as enzymes
Examples of heat-crosslinked protein & analysis
© 2006, Alpay Taralp, Sabanci University
Addition to Notes:Protein Purification &
Related Analytical Methods
Some ReferencesHarris, E.L.V. and Angal, S. eds., Protein Purification Methods: A Practical Approach, 1989, IRL PressHarris, E.L.V. and Angal, S. eds., Protein Purification Applications: A Practical Approach, 1990, IRL PressDeutsher, M.P., ed., Methods in Enzymology, Guide to Protein Purification, 1990, Academic Press
© 2006, Alpay Taralp, Sabanci University
If you wish to obtain new proteins, you must understand how to purify & test these proteins
Three protein types:MembraneIntracellularExtracellular
© 2006, Alpay Taralp, Sabanci University
Disruption
Organelle Isolation
Disruption/Solubilization
Clarification
Centrifugation/flocculation or Liquid 2-Phase Partitioning
Liquid 2-Phase Partitioning
Ammonium Sulphate Precipitation
Organic SolventPrecipitationPrimary Separation Techniques
Chromatography Techniques
Ion Exchange Hydrophobic Interaction
ChromatofocussingIon Exchange
Other AbsorbtionMethods
Gel Permeation
Metal Chelate CovalentHydrophobicInteraction
Affinity
Intracellular Protein
© 2006, Alpay Taralp, Sabanci University
© 2006, Alpay Taralp, Sabanci University
To purify a protein, here are some general rules:
1. The following methods are used to purify protein:
Technique Exploited Protein PropertypH Precipitation Charge, pI valueAmmonium Sulphate Precipitation Intermolecular charge/
hydrophobic interactionsIon exchange chromatography ChargeHydrophobic Interaction chromatography HydrophobicityChromatofocusing Charge, pI valueDye Affinity chromatography Affinity for high MW dyeLigand Affinity chromatography Bioactivity/affinintyGel permeation chromatography Dynamic volume (size)
2. No single method is perfect, so you should use many together
Example manipulation:Starting purity = 10%, yield is 10mg of target proteinAfter AS precipitation, purity = 60%, yield is 8mg of target proteinAfter Chromatography, purity = 95%, yield is 4mg of target protein
3. Protein is characterized after purification: Final yield? MW? N- & C-terminal analysis, disulfide bridge analysis, etc.
© 2006, Alpay Taralp, Sabanci University
Your target is insulinpI = 4.5, MW = 5600
S
S
S
S
NGlycine
NValine
HN
N
N
NH
HO
OH
OHOH
© 2006, Alpay Taralp, Sabanci University
With recombinant Human insulin:Lysis the cells, centrifuge, & collect your sample: A solution of insulin, other proteins, DNA, salts, metal ions, etc. You deterimine 1% insulin, 99% other proteinsand a total of 2g insulin in the batchHow to purify?
SDS gel analysis
Centrifuge lysed insulin
pH pptation
AS pptation
ion exchange
hydrophobic chromatography
insulin receptor column
© 2006, Alpay Taralp, Sabanci University
Protein purity and analysisQ: What is PURE ENOUGH?Q: What is the meaning of Purity? Free of other proteins? Free of ions? Free of DNA? Free of protease activity?
Q: In the case of insulin, what parameters should you analyze (and how)?Size (many)Primary sequence (many)N-terminus, C-terminus (many)Surface-accessibility of various groups (Chem. Mod.)Shelf-life at different humidities (Solubility)Amino group count (Kaiser test)Histidyl group count (Pauli test)pKa of ionizable groups (competative labeling)Secondary sequence (CD)Tertiary sequence (X-ray, NMR, receptor assay)
© 2006, Alpay Taralp, Sabanci University
e.g. Purifying insulin as a drug productRoutes of application:Injection of suspended insulin under skinImplant materialsMicrocapsules containing insulin for lung absorption
What aspects of purity should you examine if you prepare insulin as a drug formulation?
SterilityStorage life and requirmentsBiological activityBioavailabilityMany of the previous criteria!
© 2006, Alpay Taralp, Sabanci University