NMR Spectroscopy in
Structural Analysis
Rainer Wechselberger
2009
ii
This reader is based on older versions which were written and maintained by a number of people. To the best of my knowledge the following persons were involved in the history of this document: Rob Kaptein, Rolf Boelens, Geerten Vuister and Michael Czisch. The current version was completely revised by me and adopted to my lecture 'NMR Spectroscopy in Structural Analysis' I thank Hugo van Ingen, Marloes Schurink and Hans Wienk for proofreading, discussions, suggestions, and continuing to improve the course material. Rainer Wechselberger, Utrecht in the summer of 2009
Please report errors in the text and/or explanations or any 'unclear' passages to:
or
iii
I Introduction........................................................................................................ 1
1.1 Typical applications of modern NMR................................................................ 2
1.2 Some history of NMR ........................................................................................ 2
1.3 Aim of this course .............................................................................................. 3
1.4 General outline ................................................................................................... 4
II Basic NMR Theory ........................................................................................... 5
III An Ensemble of Nuclear Spins ...................................................................... 12
3.1 Ensemble of spins ............................................................................................. 12
3.2 Effect of the radio frequency (RF) field B1 ..................................................... 13
IV Spin relaxation ................................................................................................. 16
4.1 Molecular basis of spin relaxation................................................................... 17
V Fourier Transform NMR ............................................................................... 22
5.1 From time domain to spectrum ...................................................................... 22
5.2 Aspects of FT-NMR.............................................................................................. 26
VI Spectrometer Hardware.................................................................................. 28
6.1 The magnet........................................................................................................ 28
6.2 The lock ............................................................................................................. 30
6.3 The shim system................................................................................................ 31
6.4 The probe........................................................................................................... 32
6.5 The radio frequency system.............................................................................. 33
6.6 The receiver....................................................................................................... 33
VII NMR Parameters......................................................................................... 36
7.1 Chemical shifts.................................................................................................. 36
7.1.1 Effects influencing the chemical shift ................................................................. 38 7.1.2 Protein chemical shifts ....................................................................................... 39
7.2 J-Coupling......................................................................................................... 40
iv
7.2.1 Equivalent protons ............................................................................................. 42
VIII Nuclear Overhauser Effect (NOE).............................................................. 44
8.1 Dipolar cross relaxation ................................................................................... 44
8.2 NOEs in biomolecules ...................................................................................... 47
IX Relaxation Measurements............................................................................... 50
9.1 T1 relaxation measurements............................................................................. 50
9.1.1 Calculated example for an inversion-recovery experiment................................ 52 9.1.2 Applications of T1 relaxation.............................................................................. 53
9.2 T2 relaxation measurements............................................................................. 53
9.2.1 Applications of T2 relaxation.............................................................................. 55
X Two-Dimensional NMR................................................................................... 56
10.1 The SCOTCH experiment ................................................................................ 56
10.2 2D NOE............................................................................................................. 59
10.3 2D COSY and 2D TOCSY ................................................................................ 62
XI The assignment problem ................................................................................. 66
11.1 Chemical shift ................................................................................................... 66
11.2 Scalar coupling ................................................................................................. 67
11.3 Signal intensities (integrals)............................................................................. 67
11.4 NOE data........................................................................................................... 67
XII Biomolecular NMR............................................................................................ 69
12.1 Peptides and proteins ........................................................................................ 69
12.1.1 Assignment of peptides and proteins ................................................................. 71 12.1.2 Secondary structural elements in peptides and proteins .................................... 75
12.2 Nucleotides and nucleic acid............................................................................ 79
12.2.1 Assignment of oligonucleotide and nucleic acid spectra................................... 85
XIII Structure determination .................................................................................. 86
13.1 Sources of structural information.................................................................... 86
v
13.1.1 NOEs .................................................................................................................. 86 13.1.2 J-couplings ......................................................................................................... 87 13.1.3 Hydrogen bond constraints ................................................................................ 88
13.2 Structure calculations....................................................................................... 89
13.2.1 Distance-Geometry............................................................................................. 89 13.2.2 Restrained molecular dynamics ......................................................................... 91
Appendices.................................................................................................................. 94
Appendix A
a) Typical 1H and 13C chemical shift of common functional groups.............. 94
Appendix A
b) Random coil 1H chemical shifts for the 20 common amino acids............. 95
Appendix B: 1H chemical shift distribution of amino acids................................. 97
Appendix C: Nuclear Overhauser Effect .............................................................. 98
Appendix D: 2D NOESY experiment .................................................................. 101
Appendix E: Expected cross-peaks for COSY, TOCSY and NOESY for the
individual amino acids ................................................................... 103
Appendix F: Typical chemical shift values found in nucleic acid..................... 113
Appendix G: Typical short proton–proton distances for B-DNA....................... 114
1
I Introduction
All spectroscopic techniques are based on the absorption of electromagnetic radiation by
molecules or atoms. This absorption is connected to transitions between states of
different energies. The nature of the electromagnetic radiation varies from hard γ-rays in
Mössbauer spectroscopy to very low energy radio frequency irradiation in NMR
spectroscopy. Other spectroscopic methods, applying electromagnetic radiation of
intermediate energy, are microwave spectroscopy (vibration and/or rotation of dipolar
groups in molecules), IR spectroscopy, where vibration states are excited, UV/vis
spectroscopy, where the electronic orbitals of atoms are involved and x-ray or atom
absorption spectroscopy involving the inner electron shells. In Nuclear Magnetic
Resonance (NMR) transitions occur between the states that nuclear spins adopt in a
magnetic field. Since the energy differences between these spin states are extremely
small, long-wavelength radio-frequency matches these differences. Accordingly, NMR is
a rather insensitive method and more sample-material is usually needed than for most
other spectroscopic methods. On the other hand, however, NMR lines are quite narrow
and therefore the resolution is usually so high that hundreds of lines can be resolved in a
single NMR spectrum. Also, the interaction between different nuclear spins is manifested
in NMR spectra, for instance, in the form of J-coupling or the nuclear Overhauser effect
(NOE). These properties have made NMR quickly an indispensable tool for structural
studies in chemistry and later also in biochemistry. NMR is the only available method to
date to determine the structure of proteins and nucleic acids in solution on an atomic
scale so that it became a well-established method in the field known as "Structural
Biology".
2
1.1 Typical applications of modern NMR
Structure elucidation
Synthetic organic chemistry (often together with MS and IR)
Natural product chemistry (identification of unknown compounds)
Study of dynamic processes
Reaction/binding kinetics
Chemical/conformational exchange
Structural studies of biomacromolecules
Proteins, protein-ligand complexes,
DNA, RNA, protein/DNA complexes,
Oligosaccharides
Drug Design
Structure Activity Relationship (SAR)
Magnetic Resonance Imaging (MRI)
MRI today is a standard diagnostic tool in medicine.
1.2 Some history of NMR
NMR was discovered in 1945 by Bloch at Stanford and Purcell at Harvard University.
For this Bloch and Purcell received the Nobel Prize for physics in 1952. Initially, it
belonged to the realm of physics but after the discovery of the chemical shift (nuclei in
different chemical surroundings have different resonance frequencies) the technique
quickly became very important as an analytical tool in chemistry. The development of
3
stronger magnets (maximum proton frequency is now (2006) about 1000 MHz) and of
multidimensional NMR methods allowed its entry in the field of biology. As a result of
its continuously increasing importance in modern chemistry, biochemistry and medicine,
two more Nobel prices for NMR followed in 1991 (Richard Ernst) and in 2002 (Kurt
Wüthrich).
1.3 Aim of this course
This course will bring the student up-to-date with the principles of modern NMR
methods and provide a basic understanding of how these methods work and how they can
be applied to derive the three-dimensional structures of biomolecules by NMR.
After a brief theoretical introduction of the basic physical principles of NMR, we will
discuss the origin of the parameters that determine the appearance of an NMR spectrum
such as chemical shift, J-coupling and line-width. Spin-relaxation (i.e. how a spin system
returns to equilibrium after excitation) is important as it determines the line widths of the
NMR signals, but also the intensity of the Nuclear Overhauser Effect, which in turn is the
major source of information for the structural analysis of biomolecules.
The modern way of recording NMR spectra is by applying short radio frequency pulses
and analyzing the response by Fourier transformation. This so-called Pulse Fourier
Transform NMR (for which R.R. Ernst received the Nobel prize for chemistry in 1991)
also allows the measurement of two-dimensional (2D) NMR spectra (and even 3D and
4D). An introduction is given to the FT-NMR technique and the principles of multi-
dimensional NMR are reviewed. Exemplary, some basic 2D NMR experiments will be
discussed in more detail. Finally, the important process of assignment of biomolecular
NMR spectra is explained and an overview of the possibilities to extract a (3-
dimensional) structure out of NMR data is given.
4
1.4 General outline
I Introduction (what is NMR and what do you study with it)
II Theory (how does it work)
III Ensemble of spins (from single atom to real samples)
IV Relaxation I (after the experiment: back to equilibrium)
V FT NMR (with a single RF-pulse to a complete spectrum)
VI Hardware (what kind of device do you need for FT NMR)
VII NMR parameters (what you can see in an NMR spectrum and why)
VIII NOE (How does the Nuclear Overhauser Effect work)
IX Relaxation II (experiments to measure relaxation properties)
X 2D NMR (how to add an extra dimension and what's the good to it)
XI Assignment (which signal comes from which atom)
XII Biomolecular NMR (nucleic acids and proteins, spin systems and (structural)
parameters, sequential assignment)
XIII Structure Determination (which parameters to use, how to calculate a structure)
5
II Basic NMR Theory
The energy states, between which transitions are observed during an NMR experiment,
are created only when a nucleus with magnetic properties is brought into an external
magnetic field. These magnetic properties of nuclei can be derived from a quantum
mechanical property, the spin angular momentum, I. Most nuclei have such a spin
angular momentum, which is represented by a corresponding spin quantum number I,
which can be integer or half-integer (I = 0, 1/2, 1, 3/2....) and we may simply speak of a
nucleus with spin I. The magnitude of the spin angular momentum is given by (2.1)
)1( += IIhI (2.1)
where h is h/2π ( h = 6.626 · 10-34 J·s is Planck's constant). Due to its quantum mechanic
nature, any component of I along an arbitrary axis of observation, for instance the z-axis
(which is by definition the direction of the external magnetic field), is quantized:
Iz mh=I (2.2)
mI is the spin quantum number which can adopt values from -I to I in steps of 1 (a total
of 2I +1 values). A combination of equations 2.1 and 2.2 leads to the spin-state diagrams
for nuclei with different spins (e.g. ½, 1, ³/2 ):
6
A combination of their spin angular momentum and their positive charge causes nuclei to
have a magnetic moment (compare the effect of an electric current in a circular wire).
This magnetic moment is directly proportional to the angular momentum:
Iμ γ= (2.3)
γ is called the gyro magnetic ratio. Since I is quantized, accordingly also μ is quantized
and we can express μ in terms of the spin quantum number, I or μz in terms of the
magnetic quantum number, mI:
)1( += IIhγμ (2.4)
and
Iz mμ hγ= (2.5)
Classically, if we bring a bar magnet (compass needle!) in a magnetic field, denoted B,
the magnet will tend to turn and orient itself in the field. This is a consequence of the fact
that its energy, given by
Bμ ⋅−=E (2.6)
will then reach a minimum. For quantum mechanical objects such as nuclear spins the
situation is similar except that now only a limited set of discrete orientations (quantum
states) are available. Expression (2.6) still holds for nuclear spins. With the convention
that B lays along the z-axis we get the energy of a nuclear spin with a magnetic moment
of μ in an external magnetic field, B0 as
0BE zμ−= (2.7)
and with Eq. 2.5 it becomes clear, that the energy of a nuclear spin depends on the
magnetic quantum number mI:
0BmE Ihγ−= (2.8)
For a spin ½ nucleus this results in two states, denoted α for mI = +½ and β for
7
mI = −½. As a consequence, the nuclear spins can not be found ‘turning around’ in order
to orient in an external magnetic field, but they rather can be found only in two different
orientations, corresponding to the two possible energy levels described by the two
magnetic quantum numbers:
In all forms of spectroscopy, transitions between energy levels are induced by
electromagnetic radiation of a particular frequency ν0, provided that the frequency
matches the energy difference between these energy levels:
0νhE =Δ (2.9)
This is sometimes called Einstein's equation and it is a basic relation in spectroscopy.
Based on Eq. 2.8 we find that ΔE = γhB0 = hν0, or
00 2B
πγν = (2.10a)
or even simpler:
00 Bγω = (2.10b)
8
expressed in terms of the angular frequency
ω0 = 2πν0 (2.10c)
In NMR the relation (2.10b) is often called the "resonance condition" i.e. the condition
where the frequency of the radiation field matches the so-called Larmor frequency ωL =
γB0.
For nuclei with spin I larger than ½ we have multilevel energy diagrams. However, the
selection rule ΔmI = ± 1 still holds so that we arrive at the same resonance condition.
Since the energy of the spin states depends on the strength of the external magnetic field
(equation 2.6), we can modify the figure from above and adapt it for different magnetic
fields B0:
The separation ΔE of the energy states α and β and thus the resonance frequency,
depends on the sort of nucleus (γ) and the strength of the external magnetic field B0 or, in
other words, on the strength of the NMR magnet (compare 2.10).
E
B0
E = +½γ hB0 (β-state, m= −½)
E = −½γ hB0 (α-state, m=+½)
ΔE = γ h B0
B0
9
Table 1 shows some properties of nuclei important for applications in organic chemistry
and biochemistry.
Table 1: Properties of selected nuclei
Isotope Nuclear spin
I
Resonance
frequency
(MHz)*
gyro magnetic
ratio γ [T-1 s-1]
Natural
abundance [%]
1H 1/2 600.0 2.6752 ∙ 108 99.985
2H 1 92.1 4.1065 ∙ 107 0.015
12C 0 - - 98.89
13C 1/2 150.9 6.7266 ∙ 107 1.11
14N 1 43.3 1.9325 ∙ 107 99.63
15N 1/2 60.8 -2.7108 ∙ 107 0.37
16O 0 - - 99.76
17O 5/2 81.4 -3.6267 ∙ 107 0.04
19F 1/2 564.5 2.5167 ∙ 108 100.0
31P 1/2 242.9 1.0829 ∙ 108 100.0
*resonance frequency at a magnetic field of 14.092 T (Tesla)
We will focus here on spins with I = ½ because they have only two possible energy states
and accordingly give only a single spectral line and thus are the most popular spins in
high-resolution NMR. For protein studies these are 1H, 13C and 15N (note that 12C has no
magnetic moment, and 14N has a spin 1). For nucleic acids in addition 31P is an important
nucleus.
10
So far we have given a quantum mechanical treatment of the nuclear spin in a magnetic
field considering discrete energy levels. This led to the resonance condition (2.10).
Interestingly, a similar expression can be obtained from a classical description of the
effect of a magnetic field B on a spinning magnet with a magnetic moment μ that is tilted
with respect to the field direction. While a classical non-spinning bar magnet would just
orient itself in the direction of the field (compass), a spinning magnet cannot do this but
instead performs a precession about the direction of the field. In essence this is a
consequence of the conservation of angular momentum. There is a perfect analogy with
the motion of a spinning top (Dutch: "tol") in the gravity field of the earth. The spinning
top will also undergo a precessional motion. Mathematically, the effect of the torque
acting on a spinning magnetic moment is given by the cross product which can be written
as the determinant of a matrix. This is the “equation of motion”:
zyx
zyx
zyx
BBBdtd
μμμγγ
eeeμBμ −=×−= (2.11)
where ex, ey and ez are unit vectors forming an orthogonal
coordinate system along the x-, y- and z-axis, respectively.
Note that the direction of the precession is perpendicular to
both B and μ. With the usual convention that B is along the
z-axis (Bz = B0 and Bx = By = 0) the equations of motion for
the magnetic moment become
11
0
0
0
=
−=
=
dtd
Bdtd
Bdtd
z
xy
yx
μ
μγμ
μγμ
(2.12)
It can be shown that a correct solution is given by
)0()(
)cos()0()sin()0()(
)sin()0()cos()0()(
00
00
zz
yxy
yxx
t
tBtBt
tBtBt
μμγμγμμ
γμγμμ
=
+−=
+=
(2.13)
This indeed describes a precession of μ about the z-axis with an angular frequency ω0
given by
00 Bγω = (2.14)
an expression identical with (2.10b). Thus, we have found that the quantum based
resonance frequency corresponds exactly with the equation for the classical precession of
a spinning magnet in a magnetic field.
In NMR we will often use a classical mechanics analogy for a description of nuclear
spins. Sometimes such an analogy must break down, however, since the spins really are
quantum mechanical in nature.
12
III An Ensemble of Nuclear Spins
3.1 Ensemble of spins
In a sample of identical molecules we are dealing with a large number of nuclear spins.
In the quantum-mechanical picture these are distributed over the spin states according to
Boltzmann's law:
kTE
enn Δ−
=α
β (3.1)
where nα and nβ are the populations of the
α and β state, respectively, and k is
Boltzmann's constant (k=1.3806504(24) ×
10−23 J/K). For I = ½ such a distribution is
shown on the right.
Due to the very small energy difference ΔE = γhB0 the
populations of the α and β states are almost equal. For a field B0
= 14 T (Tesla) (proton frequency 600 MHz) the relative excess
of α spins is only one in 104. This is one of the main reasons for
the low sensitivity of NMR spectroscopy.
In the (classical) vector model an ensemble of spins ½ can be
described as shown on the right.
The individual spin vectors all make an angle with the field B0
and are slightly more aligned parallel to the field than
antiparallel. In equilibrium the "phases" of the individual spins
13
(their xy-components or positions) are randomly distributed. Therefore the resultant
magnetization vector M (the sum of all spin vectors) is aligned along B0. Like all the
individual spins that add up to M, also M has to be imagined
as spinning. So, if M is tipped away from B0 (see figure on the
left) it will perform a precessional motion about B0 with
frequency ω0 = γB0, just the way the individual spins do. This
‘tipping’ can be done by way of a radio frequency pulse - and
involves transitions between the spin states of the nucleus!
3.2 Effect of the radio frequency (RF) field B1
In all FT-NMR spectrometers an additional field B1 can be generated perpendicular to the
static field B0. This B1-field is created by means of radio frequency pulses (it’s basically
the magnetic component of the electromagnetic radio waves). What happens during the
pulse can best be compared to what happens to the spins, when they are brought into the
magnetic field of the spectrometer. We are just dealing with an additional magnetic field,
which tries to orient the spins in a new direction. The result of this additional field is a
rotation of the magnetization vector M around the axis of the additional field. This
rotation lasts only as long, of course, as the additional field is present. In other words:
only during the duration of the radio frequency pulse. In perfect analogy to the precession
around the B0-field, the speed of this rotation (its angular frequency) can be described as
11 Bγω = (3.2)
Acting on the equilibrium magnetization M, the effect of B1 is to tilt the vector away
from the z-axis. How far the magnetization is tilted depends on the sort of nucleus (γ), the
strength of the B1-field and the duration of the pulse. Illustrating the effect of a radio
frequency pulse is a bit tricky. Actually the magnetization vector is still precessing
14
around the B0-field (with ω0) and at the same time, during the RF pulse, is tilted away
from the z-axis (is rotating around the x-axis with ω1)! Fortunately there is a simple way
to describe this: the rotating frame. In a frame of reference in which the x- and y-axes
rotate with ω0 about the z-axis, the B1-field appears stationary (left). In the rotating frame
the axes are denoted x', y' and z'. Now, at resonance, the motion of M becomes very
simple:
Rotating frame Laboratory frame
While in the laboratory frame the M vector performs a complex spiralling motion, in the
rotating frame it simply precesses about the direction of the stationary B1 (in the z'y'-
plane) with an angular frequency ω1 = γB1. Thus, by going over to the rotating frame of
reference we do not need to bother about the precession about B0 anymore. The effect of
an RF pulse can easily be described now: During the duration of a pulse (and only that
long!) the magnetization M is rotating around the axis from which the radio frequency
pulse is applied! The tilting of the magnetization vectors just follows the equation of
motion given in 2.13. If we keep the B1-field on for a short time such that
2 / t 1 πω = (3.3)
15
and then switch it off we have tipped the magnetization along the y'-axis. This is called a
90° pulse. If we keep it on twice as long M is along the negative z-axis (180° pulse):
90° pulse 180° pulse
Remark: The concept of the rotating frame makes the description of NMR experiments
much easier. It is so convenient that we will use the rotating frame throughout the
complete course if not explicitly stated otherwise. For simplicity, we will use the notation
x, y, z instead of x', y', z' for the rotating frame in the following.
16
IV Spin relaxation
After a 90° pulse from the x-axis M lies along the y-axis. If we leave the system now
undisturbed we will reach the equilibrium state again after a while by a process called
spin relaxation. Actually two things will happen:
i) magnetization will grow along the z-axis until the equilibrium value Meq has
been restored,
ii) the Mx and My components decrease to zero (usually faster than the equilibrium
magnetization is restored!).
The characteristic times for these processes are called T1 and T2. Thus we have
T1: longitudinal or spin-lattice relaxation time (z-magnetization),
T2: transverse or spin-spin relaxation time (x,y-magnetization).
In mathematical terms this can be described as follows:
(4.1a)
(4.1b)
(4.1c)
The solution of Eq. 4.1a is: [ ] 1)0()(Tt
eMMMtM eqzeqz
−−=− (4.2)
and of Eq. 4.1b: 2)0()(Tt
eMtM xx
−= (4.3)
2
2
1
)()(
)()(
))(()(
TtM
dttdM
TtM
dttdM
TMtM
dttdM
yy
xx
eqzz
−=
−=
−−=
17
and similarly for My(t). Thus, all components of M return exponentially to their
equilibrium values. An important difference between T1 and T2 processes is that the
former involves changes in z-magnetization and hence, transitions between α and β spin
states that are accompanied by an exchange of energy with the "lattice" (environment). In
contrast, T2 processes involve loss of phase coherence in the xy-plane, no energy is
exchanged with the environment.
Note that spin relaxation is a random process and should not be confused with the
coherent rotations around B0- or B1-fields. For example, T1 relaxation which affects the
z-component does not create x- or y-magnetization (cf. Eq. 4.1).
4.1 Molecular basis of spin relaxation
What causes nuclear spins to relax? The simple answer is: exchange of energy with the
environment. But… the energies of the corresponding processes must match the ΔE
between the involved energy states. In most other spectroscopic techniques, this is
achieved by collisions (with other atoms or molecules). For the small energy differences
in NMR this is not an option. We have to look for processes with comparably small
energies (i.e. comparable frequencies) as the corresponding resonance frequencies in
NMR. We can find these in the interaction of moving magnetic dipoles. The magnetic
dipoles are the NMR nuclei themselves. The movement comes from the diffusional
motion of molecules. This process contributes both to T1 and T2 relaxation. But while
with T1 relaxation energy is exchanged with other molecules (the environment, the
‘lattice’) causing transitions between α and β states, in T2 relaxation the energy is
exchanged with spins of the same molecule, leading to a small variety in the precession
frequency of otherwise identical spins. The net-magnetization vector M is ‘split up’ in
many small components rotating with different frequencies. Eventually these components
are equally distributed in the xy-plane and no measurable transversal magnetization is
left. One speaks about dephasing of magnetization. It is important to note that also a
18
static distribution of Bz-fields causes different frequencies (in different locations of the
sample) and also leads to dephasing i.e. T2 relaxation.
To understand the effect of motion, it is important to consider the time-scale of it. For T1
relaxation only motions with a frequency near the Larmor frequency ω0 are effective.
After all, they have to induce transitions between spin states and therefore must have a
frequency, which coincides with the ΔE between the states (thus, the Larmor frequency).
The time-dependence of the dipolar field comes from the rotational diffusion (the
‘thermal motion’, see figure to the left) of the molecules
which is characterized by a rotational correlation time τc.
For times smaller than τc the orientation of a molecule has
not changed much (leftmost figure below), while for t >>
τc the correlation between different orientations is lost
(rightmost figure):
For small fast tumbling molecules τc is quite short (10-11 - 10-10 s) while for large
biomolecules it is much longer (10-8 - 10-7 s). An approximate relation of τc with the
molecular volume V and the viscosity η is given by
TkV
cητ = (4.4)
For macromolecules of molecular mass Mr in H2O solution at room temperature a useful
approximation is
19
12104.2
−≈ rc
Mτ (4.5)
In a randomly tumbling motion many frequencies are present. The distribution of the
frequencies of the motions is represented by the spectral density function, J(ω).
In this distribution the frequency ω = τc-1 acts much like a cut-off of the spectral density
function, in other words: motions with frequencies ω > τc-1 are quite rare. The area under
the curves is constant. Only the shape of J(ω) differs for molecules of different size. For a
smaller molecule for instance, J(ω) would look like:
τc for smaller molecules is shorter and their frequency distribution extends to higher
values of ω. On the other hand, J(ω) for low values of ω is relatively small. With
increasing size of the molecules J(ω) is getting larger for slow motions but at the same
time the ‘cut-off’ moves to lower frequencies. In other words, slow motions (small values
J(ω)
ω 1/τc
J(ω)
ω 1/τc
20
of ω) are more likely to occur for (big) biomolecules than for small organic molecules.
Not very surprising indeed!
Now what does this mean for T1 relaxation? Here, the fluctuating fields have to induce
transitions between α and β states separated by the Larmor frequency ω0 = γB0.
Therefore, the efficiency of relaxation will depend on how much this frequency is present
in the distribution of frequencies of the molecule, thus on J(ω0). This is maximal for
molecules that have τc-1
= ω0, and the efficiency will drop for both larger and smaller
molecules (longer and shorter τc values). This explains the behavior of T1 versus
correlation time τc as shown in the next figure.
The minimum in T1 (most efficient relaxation) is at ω0τc = 1, which for common NMR
fields occurs for intermediate size molecules of molecular mass Mr ≈ 1000 D. In terms
of the fluctuating fields Bx(t) and By(t) a general expression for the efficiency (or rate) of
T1 relaxation is
21
( ) )(10
222
1
ωγ JBBT yx += (4.6)
where the average of the square of Bx(t) , < Bx2 >, is a measure of the strength of the
fluctuating fields. Assuming that the components in all directions are the same ( < Bx2 >
= < By2 > = < Bz
2 > = < B2 > ) Eq. 4.6 becomes
)(210
22
1
ωγ JBT
= (4.7)
A similar expression for T2 relaxation is
( ))()0(10
22
2
ωγ JJBT
+= (4.8)
This describes the two mechanisms that contribute to T2: the static distribution of Bz-
fields (no or ‘zero’ frequency), J(0), and the effect induced by Bx(t) and By(t), which is
proportional to J(ω0). For larger biomolecules the J(0) term will dominate and becomes
approximately equal to τc. Thus, for slowly tumbling molecules we have the simple
expression
cBT
τγ 22
2
1 ≈ (4.9)
This means that T2 becomes progressively shorter for larger molecules and explains why
T2 unlike T1 does not go through a minimum.
22
V Fourier Transform NMR
5.1 From time domain to spectrum
Nowadays, all modern NMR spectrometers work as so-called Fourier-Transform NMR
spectrometer (FT-NMR). Earlier we have seen (chapter 3.2) that the magnetic component
of an electromagnetic field (RF), applied along the x-axis of the rotating frame on
equilibrium z-magnetization, results in a precession around the x-axis when ωRF = ω0. For
the precession frequency, ω1, we found ω1 = γ B1. If we apply this RF field for a period t
= π/2ω1 we create 'pure' y-magnetization. An RF pulse of this duration was called a 90°
pulse. The RF transmitter of an NMR spectrometer is operated by a pulse-computer,
which can generate a single RF pulse or a series of RF pulses of arbitrary length,
frequency, phase, and amplitude separated by delays of adjustable length. An RF pulse of
length τp excites the frequency-range νRF - 1/(2τp) to νRF + 1/(2τp). To excite a certain
range of frequencies, τp must be adjusted to be sufficiently short. For example, at 600
MHz proton resonance frequency a good excitation of an NMR spectrum of 10 kHz
requires τp << 100 μs. In practice, pulses of τp = 2-20 μs are used.
A radiofrequency pulse of duration τp (left) and the corresponding excitation profile (right). Details in the
text above.
What will happen after a single RF pulse of 90° along the x-axis? Earlier we have seen
that the magnetization vector has been rotated and now is oriented along the y-axis (phase
τp
νrf
νrf-½τp
νrf
νrf+½τp
Δνrf
23
coherence of the spins). The receiver is tuned to the frequency ωRF. If the resonance
frequency of spin j, denoted by ωj, is equal to ωRF the magnetization will remain along
the y-axis (of the frame rotating at speed ωRF). The magnitude of the magnetization will
be decreased by T2 relaxation. In case spin j has a resonance frequency ωj, which is
different from ωRF, then the magnetization
of spin j will precess in the frame rotating
at ωRF with the difference frequency
Ω = ωj- ωRF (5.1)
Needless to say, also in this case the
magnetization decays due to relaxation
processes. The in the xy- plane rotating
magnetization vector induces a current
when it passes the receiver coil. This is the
actual signal recorded by the receiver. In
order to be able to distinguish between
positive and negative frequencies (vectors
which are rotating clockwise or
counterclockwise with the same speed), both the x- and the y-component of the rotating
magnetization are recorded simultaneously. The corresponding signal induced in the
receiver coil has the shape of a decaying harmonics (sine and cosine waves) and thus is
called the free-induction decay (FID). For a number of different spins j (for example of
the Hα, the Hβ, and the HN), each with their own equilibrium magnetization Mjeq,
frequency ωj, and relaxation time T2j, the FID consists of the sum of all magnetizations:
[ ]
[ ] 2
2
sin)0()(
cos)0()(
Tt
Tt
etMtM
etMtM
jj
jx
jj
jy
−
−
′=
′=
∑
∑
ω
ω (5.2)
y
x
y
x
y
x
x component
y component
24
where | Mj(0) | = | Mjeq | in the case of a 90° excitation pulse. The following figure
illustrates the difference in appearance of an FID containing only a single frequency and
an FID containing multiple frequencies:
The FT-NMR signal (the FID) is recorded in the time domain. The signal is digitized by
an analog-to-digital (ADC) converter and stored in the memory of a computer. The
resonance frequencies, ωj', are extracted by Fourier analysis. The Fourier Transformation
(Eq. 5.3) transforms the signals f(t) from the time domain to the frequency domain, g(ω):
dtetfg ti∫∞
∞−
−= ωω )()( (5.3)
where the complex signal f(t) is defined as
2)()()(Tt
ti eeMtMitMtf j
j
eqjxy
−′∑=+= ω . (5.4)
Fourier Transformation results in a sum of complex frequency signals. The real part of
25
which describes an absorption signal:
22
2
2
)(1])(Re[
jj
j
j
eqj T
TMg
ωωω
′−+=∑ (5.5)
The imaginary part describes a dispersion signal:
22
2
22
)(1)(])(Im[jj
jj
j
eqj
TTMg
ωωωωω
′−+′−
=∑ (5.6)
The resonance line shape is a so-called “Lorentz line shape”. Usually we are only
interested in the real part of g(ω), the absorption response, because the flanks of
absorptive line drops with ω−2 , whereas the dispersive line goes with ω−1. This means
that the absorptive line is much narrower.
From Eqn. 5.5 we can calculate the
relationship between T2 and the line
width at half-height, Δν1/2:
2222 2
1115.0
νπ Δ+=
T (5.7)
26
which can be rewritten as:
2
12
1Tπ
ν =Δ (5.8)
Some important Fourier pairs are shown here. Basically these are the ones responsible for
the shapes (or their distortion) of most lines observed on a FT-NMR spectrometer
5.2 Aspects of FT-NMR
It might be that the signal-to-noise ratio is not good enough after a single scan. By co-
adding n successive NMR measurements the signal, S, increases by a factor n. The noise,
FT
t
I
I
FT
FT
f(t) g(ω)
t 0
1
-1
t 0
1
-1
M
27
N, however, fortunately increases with n only. This is due to the random nature of
noise. Hence the signal-to-noise ratio, S/N, improves as
nNS ~ (5.9)
FT-NMR is a very flexible technique. A large number of different experiments can be
done with the FT technique, each aiming at different parameters of the molecules in study
to be extracted, such as relaxation measurements (discussed in chapter 9), multi-
dimensional NMR (discussed in chapter 10), heteronuclear NMR, etc.
28
VI Spectrometer Hardware
6.1 The magnet
The magnet is the core of the NMR
spectrometer. Nowadays mainly
persistent superconducting coils are used
to generate the high magnetic fields
necessary for high resolution NMR
(permanent magnets are only used e.g. in
food sciences or on older, lower field
NMR imaging systems). The coils consist
of NbTi (or NbTi-Nb3Sn, NbTi-
(NbTa)3Sn) wires which are
superconducting at 4 K (–269 °C).
Several layers of coils generate a higher
and higher field towards the innermost
section where the final magnetic field
strength is reached. In this section the
field must be extremely homogeneous over a volume of some cubic centimeters,
otherwise the resonance frequencies would vary at different locations in the sample
leading to broad and unsymmetric lines. The coil wires are also the most expensive part
of the spectrometer. To reach higher field strength, larger (and more complicated
designed) coils have to be used. This makes most of the price difference between e.g. a
700 MHz and a 900 MHz machine, which is approx. a factor of 4 (in 2006). The
necessary low temperature for superconductivity is reached by submerging the coils into
a dewar containing liquid helium (at –269 °C). This inner dewar is surrounded by a
second, outer dewar containing liquid nitrogen (–200 °C). In the course of time, both
29
helium and nitrogen evaporate – therefore the ‘magnet’ (the dewars actually!) has to be
refilled periodically (typically weekly for N2 and monthly for He).
The strength of a magnetic field is normally given in Tesla or Gauss (1 G = 10–4 T). The
strength of an NMR magnet is often described by the corresponding resonance frequency
of hydrogen atoms (‘proton-frequency’). A field of 14 T corresponds to a field strength of
600 MHz. Today (2006) the typical field strength used in biological applications are 700
MHz and 800 MHz. The highest currently available field is about 1000 MHz.
The need for higher and higher fields is explained by the gain in resolution and in
sensitivity. The sensitivity of an NMR experiment is usually described by the signal-to-
noise ratio S/N:
S/N ~ N γ5/2 B03/2 n1/2 T2/T (6.1)
where N is the number of spins (concentration of the sample), γ the gyro magnetic ratio
of the nucleus, B0 the field strength, n the number of individual scans per experiment, T2
is the relaxation time and T the temperature of the detection circuit. How a bigger field
affects the S/N and the resolution (which follows a linear dependence on B0 ) is shown in
the following table.
B0 (T) 11.7 14.1 16.5 17.6 21.1
ν (MHz) 500 600 700 750 900
S/N 1.0 1.3 1.7 1.8 2.4
resolution 1.0 1.2 1.4 1.5 1.8
The Biomolecular NMR laboratory at Utrecht University is housing a 360 MHz, two 500
MHz, two 600 MHz, a 700 MHz, a 750 MHz and one 900 MHz high-resolution NMR
30
spectrometer, one of the 600 MHz spectrometer and the 900 MHz spectrometer are
equipped with cryogenic probe systems for additional sensitivity.
6.2 The lock
Even in a very well designed magnet the magnetic field is not perfectly stable over the
long time a measurement can take (up to one week). Small deviations of the main
magnetic field can be compensated by the ‘lock system’ by applying correction currents
in a coil which is part of the room temperature shim system (see below). The lock system
exploits the NMR phenomenon itself: A reference NMR experiment is continuously
performed on a nucleus different from the one being studied. In most biological
31
experiments deuterium (2H) is used for this purpose. The deuterium spectrum is
continuously acquired and the frequency of the single deuterium line is observed.
Whenever this frequency shifts, small correction currents are applied to the lock coil to
compensate for this change therefore slightly increasing or decreasing the total magnetic
field. In most biological applications deuterium is introduced by dissolving the sample in
a mixture of 5–10% D2O in H2O.
6.3 The shim system
As stated above, the magnetic field experienced by the sample must be very stable and
also very homogeneous to keep the lines as narrow as possible. Since the homogeneity is
not only a matter of the coil design, but is also influenced by the sample itself (filling
height, quality of tube etc.), for each individual sample additional field corrections have
to be applied. This is achieved by a number of correction coils (the shim system) in which
adjustable currents produce field gradients which can compensate field inhomogeneities.
There are two sorts of shim coils: superconducting ('cryo-shims') and room temperature
coils. The currents through the superconducting shim coils are usually only once adjusted
during the installation procedure of the magnet. The room temperature coils are the ones
used by the user for 'shimming' each individual sample. In practice the optimization of
the field homogeneity exploits the ‘lock’ experiment. The D2O in our sample gives a
single line in the NMR spectrum. The integral of this line is constant (as it only depends
on the number of nuclei in our sample, which is constant) but the height of the line is not:
The narrower the line the higher its maximum. The NMR operator can now manually
adjust the different currents in the different shim coils to optimize this value. This was
and still is a very time consuming procedure which requires some experience. Fortunately
a very fast automatic shimming method is available nowadays which employs pulsed
field gradients (PFGs) which reaches very good results within minutes. This so-called
'gradient shimming' is available on all of our machines (except the 360).
32
6.4 The probe
The probe (or probehead) is in many ways the most critical component in the
spectrometer. It has two main functions:
a) to convert the radio frequency power from the amplifiers into oscillating magnetic
fields (B1-fields) and to apply these fields to the sample.
b) To convert the oscillating magnetic fields generated by the precessing nuclear
spins of the sample into a detectable electric signal that can be recorded in the
receiver.
Both points can be achieved by a parallel tuned circuit having a coil surrounding the
sample. The tuning is dependent from the sample (on position, volume, solvent, ionic
strength). The coil has to be carefully tuned to the frequency of the nucleus of interest
(remember: the frequency of the B1-field should match the Larmor frequency). This
adjustment of the circuit is important in several ways: First, we want to transmit the
maximum possible B1-field strength to the sample. This ensures that our pulses are as
short as possible, and therefore ensures a good excitation bandwidth. Second, since NMR
is a very weak phenomenon, we do not want to loose any signal coming from the sample
by picking up only a fraction of the oscillating magnetization.
There is a variety of NMR probes. For 1H spectroscopy typically probes are used which
can hold sample tubes of 5mm diameter (with a sample volume of ~500 μl). Beside the
proton channel there is another coil for the lock system tuned on 2H (sometimes a single
double tuned coil is used for both frequencies). The same setup can be found in probes
for 3mm and 10mm tubes. The sensitivity of the probes is still improvable as reflected by
the increased sensitivity over the past 10 years (nearly a factor of 2). This shows how
critical coil design is for NMR purposes. In a quite recent development, cryogenic probe
systems were introduced, which consist of a probe which can be cooled with cold helium
in order to reduce the amount of electronic noise in the receiver coils (and preamplifiers)
to a minimum. The technological challenge of such a system, among others, is the fact
that the temperature of the sample, of course, must still be adjustable to as high as about
80 ºC without heating the cold part of the probe. Since, on the other hand, the coils of the
33
probe are supposed to be as near as possible to the sample, the difficulties of designing
such a system are obvious. The gains in sensitivity with the installation of such a system
to an existing spectrometer are remarkable and can be more than a factor of 2.2 (compare
the relative sensitivities at different fields in chapter 6.1).
6.5 The radio frequency system
The RF system mainly consists of pulse generation units, the actual transmitters and
subsequent power amplifiers. It is generating the excitation pulses at the frequency of the
nucleus of interest. On modern spectrometers the frequency can be set with a precision of
0.1 Hz across a band many megahertz in width. Since we may want to apply RF pulses
out of several directions in the rotating frame, the phases of the RF waves also must be
adjustable (typically to 0.5 degree). These settings are under extremely fast computer
control with setting times of only some microseconds. The power amplifiers boost the
transmitter output to high levels (from several tens up to hundreds of watts). This assures
that short, non selective pulses can be applied to the sample.
6.6 The receiver
The final stage in an NMR experiments is the detection of the precessing magnetization
(x- and y-components) in the sample. As stated earlier the same coil is used for this
purpose as for excitation. This means that directly before the data acquisition the
transmitter system has to be blanked and the receiver has to be opened (this ensures that
no strong RF pulses are applied while the sensitive receiver system is on). The detection
of the high frequency signal (MHz) is quite involved. First, analog filters are applied to
the signal to reduce it to the relevant frequency range. Then the weak signal is amplified.
The incoming signal is now mixed during several stages with reference frequencies. This
34
mixing reduces the frequency of the signal from several MHz to the audio range (kHz).
Finally, the signal is digitized in real-time and stored. For digitization the following
relation is important: If we want to detect a certain spectral width SW we have to digitize
the signal with a time dw ('dwell time') or faster ('Nyquist theorem'):
SW
dw2
1= (6.2)
An example (see the following figure): Assume we digitize our FID every 5ms,
corresponding to a spectral width of 100 Hz. A frequency of 100 Hz will be sampled
twice per period (solid line), which is enough to characterize this frequency. On the other
hand, a resonance precessing with 120 Hz (dashed line) is sampled less than twice per
period, thus the frequency can not be distinguished from a slower frequency (80 Hz in
this case, dotted line). This means that both frequencies would give a signal at the same
position!
All further operations after the digitization and storing of the signal are performed in the
data processing system (the computer workstation). This includes application of window
functions, zero-filling, Fourier transformation, phase corrections, baseline corrections,
integrations in several dimensions as well as displaying and plotting the final spectrum.
35
The components of a spectrometer at a glance:
RF generator (radio sender): creates the RF signal with a frequency of less than 100
MHz up to about 900 MHz.
Pulse generator: creates RF pulses of a duration of several µs up to several seconds.
RF amplifier: amplifies the pulse signal up to several 100 Watts.
Magnet: generates the B0-field (from about 1T up to about 21T).
Probe: holds the sample and houses the send and receive coils.
RF amplifier: amplifies the received signal from the probe.
Detector: Subtracts the base frequency from the signal, resulting in an audio frequency
(up to several kHz), containing only the differences of the resonance frequencies from the
base frequency.
AF amplifier: amplifies the audio signal.
ADC: Analog-to-digital converter.
Computer: controls all the other electronic parts, receives, stores and processes the NMR
signal.
36
VII NMR Parameters
7.1 Chemical shifts
We have seen that the resonance frequency of a nucleus depends on its gyro magnetic
ratio γ and the magnetic field Bz. If all nuclei of the same kind (e.g. protons) would have
an identical Larmor frequency then NMR would not be a very useful technique for
studying biomolecules – we would observe just one line per sort of nucleus. Fortunately,
this is not the case since in practice different spins, even from the same sort, have a
slightly different Larmor frequency. This is because not all nuclear spins experience the
same effective static magnetic field Beff. Instead they experience the superposition of the
external field Bz and a local field Bloc. The static field Bz induces currents in the electron
clouds surrounding each nuclear spin. These induced
currents result in local magnetic fields. The induced
current will counteract its cause (‘Lenz law’,
electromagnetism), thus the induced field will be
opposed to Bz. The nuclear spins will be ‘shielded’
from the external field. The strength of this shielding
depends on the electron density around each
individual nucleus and the strength of the static field.
zloc BB σ−= (7.1)
where σ is a quantity expressing the amount of shielding. The net field experienced by
the spin becomes
)1( σ−=+= zloczeff BBBB (7.2)
37
and the new Larmor frequency is given by (compare Eq. 2.10a)
π
σγν2
)1( zB−= (7.3)
The shielding σ is different for different types of nuclei in a molecule because the
electron density around a nucleus is very sensitive to the chemical environment of the
nucleus (e.g. chemical bonds and neighbours). The amount of shielding is usually given
as a dimensionless parameter δ, the chemical shift, which expresses the difference in
NMR resonance frequency with respect to a reference signal
)(101
10)(
10 666 σσσ
σσν
ννδ −⋅≈
−−
⋅=−
⋅= refref
ref
ref
ref (7.4)
since .1<<refσ
For the reference of the δ-scale the single line of the methyl-protons of Si(CH3)4 (TMS,
Tetramethylsilane) can be used (δ = 0). For biomolecules, slightly different compounds
(e.g. TSP, (CH3)3SiCD2CD2CO2Na, Sodium-salt of trimethylsilyl-propionic acid) are
used since TMS is not soluble in water; sometimes the water resonance itself is taken.
The chemical shift of water is temperature dependent:
9.96
][83.7)( 2
KelvinTOH −=δ (7.5)
The dimensionless δ-scale (ppm) has the important advantage over a frequency scale
(Hz) that the chemical shift values become independent of the magnetic field Bz.
38
7.1.1 Effects influencing the chemical shift
As was mentioned above, the s-orbital electrons generate a field opposing the static field,
a shielding effect. The p-orbital and other orbital electrons with zero electron-density at
the nucleus result in a weak field reinforcing the static field. The contributions of these
two effects are of well known magnitude for the different functional groups. The table in
appendix Aa gives the common chemical shift values for a number of different functional
groups. A small value of δ corresponds to a shielded proton, or a low resonance
frequency.
In addition to the “constant” effects of s- and p-orbitals to the chemical shift, there are
also variable contributions resulting from the local surrounding of the nucleus (e.g.
solvent effects or interaction with other parts of the same molecule) and the local
conformation.
Aromatic and carbonyl groups have an extensive conjugated π-electron system
comprising delocalized molecular orbitals. Also in these systems the Bz-field induces
currents, the so-called ring-currents, resulting in quite large magnetic moments. Their
effect on δ of a particular nucleus depends strongly on the distance and orientation with
respect to the aromatic system: above and below the aromatic ring-system an opposing
field is generated and Δδ is negative (‘upfield shift’). In the plane of the ring-system a
reinforcing field is generated and Δδ is positive (downfield shift). These effects can be as
large as -2 to 2 ppm and all nearby protons are affected. The effect decreases with 1/r6.
Paramagnetic groups, like Fe3+ in the heme-system of hemoglobin, can have a
pronounced effect on chemical shifts. An unpaired electron influences the proton
chemical shift through spatial interactions, the electron magnetic moment, and by direct
electron-proton hyperfine interaction.
Electric fields resulting from charged groups or electric dipoles polarize the electron
clouds and thus influence the chemical shifts.
H-bond formation strongly influences the chemical shift. The proton in an X-H···Y
hydrogen bond is very little shielded and thus has a large δ value. For example, the imino
39
protons in the Watson-Crick hydrogen-bonded bases of a B-DNA fragment resonate at
14-15 ppm, whereas the non hydrogen-bonded protons resonate at 10-11 ppm.
7.1.2 Protein chemical shifts
From section 7.1.1 it will be clear that also the 1H spectra of proteins will have a 'general'
appearance. An example 1H spectrum of a protein is given below:
Indicated are the typical regions where the different proton resonances are found. It is
even possible to split out the different chemical shift ranges on a per-residue basis. This is
shown in Appendices Ab and B.
Although the theory of chemical shifts is well known, it is quite complicated in practice
to accurately predict the chemical shifts in proteins. Partially this is the result of the
inaccuracy of protein structures and their internal mobility. On the other hand the range
40
of proton chemical shifts is fairly limited (ca. 12 ppm) and the exact geometry is
relatively important.
The range of 13C chemical shifts is much larger (ca. 200 ppm) and the effects of the exact
geometry are less important. 13C chemical shifts are therefore easier to predict and can be
used in a more straightforward fashion for interpretation of spectra.
7.2 J-Coupling
J-coupling is the interaction between two spins transferred through the electrons of the
chemical bonds between them. The resonance frequency of a spin A depends on the spin-
state of a second spin B and vice versa. If spin A is in the α state spin B will resonate at
slightly lower frequency whereas it will resonate at slightly higher frequency when spin
A is in the β-state. In the NMR spectrum we observe a doublet (two lines with equal
intensity) centered on νb, the resonance frequency of spin B without J-interaction. Also
for spin A a doublet is observed, centered around νa. The size of the coupling is JAB (the
41
distance between the components of the doublets) and is called the coupling constant. It
has the dimension Hz.
If there is a third J-coupled spin C the pattern splits again: the result is a doublet of
doublets. This means that the coupling of the third spin is independent of the coupling
between the first two spins. It just introduces another splitting on the first splitting. If the
42
spins B and C are magnetically equivalent the doublet of doublet collapses into a triplet
(three-lines with intensity ratio 1:2:1). Three equivalent neighbours (e.g. the three protons
of a methyl group) result in a quartet (four lines with intensities 1:3:3:1). As you can see
the intensity ratios follow the famous Pascal triangle.
7.2.1 Equivalent protons
In general equivalent protons are protons which are chemically and magnetically
equivalent. Chemical equivalence means that there is a symmetry axis in the molecule for
the two protons under consideration. Nuclei having this type of equivalence resonate at
the same frequency. For magnetic equivalence the nuclei must be a) chemical equivalent
and b) must experience exactly the same J-coupling with all other nuclei in the molecule.
Magnetically equivalent nuclei are a very special case in the coupling network: They do
not couple with each other. This explains why for instance the benzene spectrum shows
only one line. Due to the high symmetry of the molecule all six protons are magnetically
equivalent, thus showing only one frequency and no coupling with each other.
In proteins magnetic equivalence due to symmetry is rare because of the high complexity
of the biomolecules. But also here some protons can be equivalent. If a group of atoms
rotates fast enough they become magnetically equivalent as a result of dynamic
averaging. This is the case for e.g. methyl groups or fast rotating aromatic rings.
43
The magnitude of the J-coupling depends on the number of intervening chemical bonds,
the type of chemical bonds, the local geometry of the molecule, and on the γ values of the
nuclei involved. Proton-proton couplings in biomolecules are observed for protons
separated by two or three chemical bonds. The magnitude of these proton-proton J-
couplings is relatively small, typically 2-14 Hz. Often the patterns resulting from these 2JHH (two bonds) and 3JHH (three bonds) couplings are not resolved because of the large
line width in biomolecules.
The magnitude of heteronuclear one-bond couplings is much larger: the J-coupling
between the amide proton and the directly bond 15N nucleus, 1JNH is ca. 92 Hz. The one-
bond coupling between a proton and its directly attached 13C nucleus, 1JCH is ca. 140 Hz.
J-couplings involving 15N or 13C nuclei also exist between nuclei two or three bonds
apart. For example, there is an interaction between Hα and 15N over three bonds in a
HαC(C=O)15N fragment.
44
VIII Nuclear Overhauser Effect (NOE)
The observable intensity of the signal of a nucleus depends on the intensities of other
nuclei when they are in close spatial proximity. If, for instance, two protons are situated
at a distance of less than 5 Å and the signal of one of them is saturated by selective
irradiation, the other signal will change in intensity. This effect is called the nuclear
Overhauser effect (NOE). It is the result of a relaxation process, caused by a dipole-
dipole interaction (dipolar coupling) between the two nuclei. We are talking here about
cross-relaxation, because the population of the spin-states of one nucleus depends on the
population of the spin-states of another one.
8.1 Dipolar cross relaxation
Let us consider a spin system with two spins
A and B which are dipolar coupled (spatial
proximity!). In the steady-state NOE
experiment one resonance is selectively
saturated by RF irradiation (let's say spin B).
This disturbs its equilibrium magnetization,
therefore spin B tries to re-establish it by
exchanging magnetization with its
environment. This can either be the lattice or
another nucleus close-by. The NOE between
the saturated spin B and another spin A is
defined by the relative change in the intensity
of spin A:
NOE = 1 + η with eqa
eqaa
MMM )( −=η (8.1)
45
The energy level diagram for this two-spin
system is sketched on the right. Each of the
spins can undergo transitions between its α
and β state resulting in the resonance lines
which are observed. The rates of these
transitions are W1a and W1b for the
transitions of the spin A and B, resp. The
dipolar interaction between spin A and B
introduces two more possible transitions: W0 and W2. These transitions involve
simultaneous changes in the spin states of both the A and the B spin.
How this can be understood is schematically shown in the following figure:
The most left diagram represents the situation at equilibrium. A0 and B0 are the relative
population differences in equilibrium (corresponding to WA and WB). A and B are the
relative differences in population of the spin-states of the A and B nucleus after saturation
A
A
A
A
W2 > W0
small molecules
W0 > W2
large molecules
W1A
W1A
W1B
W1B
A0 = B0 = Δ
A = 1.5 Δ
A = 0.5 Δ
W2
W0
A
A
A = A0 = Δ
B = 0
46
(diagram in the middle) and finally after cross-relaxation (diagrams on the right).
Saturation of the transitions of nucleus B leads to the situation in the middle. As a
consequence of the saturation, the population differences for the spins-states of nucleus B
disappear. Now the cross-relaxation comes into effect. Two different cases are
distinguished here: For small molecules, W2 dominates over W0 and the result is shown
in the upper right diagram. The relative population difference for the states of the A
nucleus has increased. Consequently also the observed signal for A will be increased. For
large molecules, W0 dominates over W2 and the result is depicted in the lower right
diagram. The relative population difference for the states of the A nucleus has decreased
and consequently also the observed signal for A will be decreased. One can easily
imagine a situation, where the two cross-relaxation mechanisms just cancel each other.
Indeed a zero-crossing of the NOE is observed when (in water at room-temperature):
ωτc ≈ 1.18
0.5
0.0
-0.5
-1.0
0.01 0.1 1.0 10 100
η
ωoτfast
tumbling
slow
tumbling
47
Suppose for a particular molecule with a particular size on a particular NMR
spectrometer the 'observed' NOE turns out to be zero. Are there any options to still
observe NOEs within this molecule? Well, obviously we can do nothing about the size of
the molecule, but the tumbling speed of the molecule, of course, depends on the viscosity
of the solvent. The viscosity is usually very sensitive to changes in the temperature. So,
when we change the temperature of the sample, the tumbling speed will change and we
probably have a chance to pick up NOEs now. The other parameter which we probably
can change is ω0, which depends on the spectrometer frequency. So we could just repeat
the experiment on a spectrometer with a different field and chances are high that we
moved away from the zero-crossing situation. A more formal derivation of the NOE is
given in appendix C.
8.2 NOEs in biomolecules
T1 relaxation times are rather uniform for biomolecules. In contrast, cross-relaxation rates
vary a lot since the strength of the dipole field is strongly dependent on the distance
between the protons. A good approximation for the cross-relaxation rate, σ, in a
biomolecule is
6~rcτσ (8.2)
where τc is the rotational correlation time of the proton-proton vector, and r the distance
between the protons.
Since T1 values are uniform we can write for the NOE
6~rcτη (8.3)
48
Therefore, the measured NOE can be converted to a distance. We can calibrate the NOE
by comparing it with a NOE of a fixed distance
refc
cref
ref rr
ττ
ηη ⋅= 6
6
(8.4)
If we now assume that there is no internal mobility (a rigid molecule), τc will be uniform
in the entire molecule. We can now directly calculate the distance
6η
η refrefrr = (8.5)
Eq. 8.5 provides the basis of the most important effect for structure determination by
high-resolution NMR spectroscopy: the extraction of NOE-based distances between
proton pairs.
An example: In a protein we observe the following NOEs between the aromatic ring
protons and some other protons. The distance between CδH and CεH is fixed at 2.45 Å.
NOE intensity distance
Tyr CδH - Tyr CεH 0.15 2.45
Tyr CεH - Val CαH 0.02 3.43 r = 2.45 · (0.15 / 0.02)1/6
Tyr CεH - Asp CαH 0.01 3.85 r = 2.45 · (0.15 / 0.01)1/6
Internal motions are often fast (ωτc,intern < 1) and contribute predominantly to W2.
49
Therefore, the NOE in mobile sub-domains will be reduced in intensity since
02 WW −=σ (8.6)
In the case of internal mobility we can not give the exact value of the distance but only an
upper limit. This is because of
6refc
crefrefrr
ττ
ηη
⋅⋅= (8.7)
and 1≤refc
c
ττ
(8.8)
50
IX Relaxation Measurements
9.1 T1 relaxation measurements
The so-called inversion-recovery pulse
sequence, 180°-τ-90°-detection, can be
used for measuring the longitudinal
relaxation time T1. At the start of the
sequence, the equilibrium magnetization
Meq is inverted by a 180° pulse, after which the magnetization Mz(τ=0) = -Meq. The
magnetization will return to its equilibrium value ('recover') with the relaxation time T1,
and after time τ we have:
⎥⎥
⎦
⎤
⎢⎢
⎣
⎡−=
−121)(T
eMM eqz
τ
τ (9.1)
This time dependency is shown in the
figure on the right. We can measure the
value of Mz(τ) by applying a 90°
detection pulse, which will rotate the z-
component into the xy-plane. The FID is
recorded and the spectrum extracted by a
Fourier transformation. For τ < ln(2) T1
the spectrum is inverted. For longer values of τ, Mz(τ) has recovered to positive values
and a positive signal is recorded. By repeating the experiment with increasing values of τ,
the relaxation behavior can be determined and T1 extracted.
The T1 analysis is not limited to a molecule with a single resonance. In a molecule with
more spins, each of the individual spins, j, with frequency ωj and relaxation time T1j has
the starting magnetization:
51
jeqzj MM −=)0( (9.2)
Therefore, we can determine the individual T1j by monitoring the intensities of the
individual resonances j at ωj in the spectrum as a function of τ.
The different time points τ of the inversion recovery sequence are measured one after
another. In case more than one FID is recorded per value of τ, e.g. for S/N improvement
or in multi-dimensional experiments (discussed in chapter 8), care has to be taken that
saturation is avoided and enough time is allowed between the individual experiments for
the magnetization to relax completely. The repetition rate of the experiment should be
adjusted so that at least a time of 5·T1 is waited before the experiment is repeated.
52
9.1.1 Calculated example for an inversion-recovery experiment
We follow the intensity of one resonance and detect signal intensities between τ = 0.001s
and τ = 3s. Note that the inversion was incomplete as a result of an imperfect 180° pulse
( jeqzj MM −=)0( ). Therefore we have to use the more general Eq. 4.2 in place of Eq. 9.1:
[ ] 1
τ
)0()(T
eMMMM eqzeqz
−−=−τ
τ (s) Mz(τ) ΔM(τ)=Mz(τ)-Meq ln{ΔM(τ)/ΔM(0)}
“Mz (0)” 0.001 -12.0 -27.0 -
0.01 -9.4 -25.4 -0.06
0.03 -7.1 -22.1 -0.20
0.1 1.1 -13.9 -0.66
0.2 7.9 -7.1 -1.36
0.3 11.3 -3.7 -2.01
1.0 14.9 -0.1 -5.59
“Meq” 3.0 15.0 0
Since 1)0(
)(lnTM
M ττ −=⎭⎬⎫
⎩⎨⎧
ΔΔ
(9.3)
T1 can be determined by linear regression analysis:
Στ = 1.64; Σln(...) = -9.88;
and thus T1 = Στ / Σln(...) = 0.165 s.
53
9.1.2 Applications of T1 relaxation
T1 tells us how long we have to wait until the equilibrium magnetization is restored. This
is important information for the setup of FT NMR experiments, where a number of
experiments are repeated and the results added up in the computer. It depends on T1 how
fast we can repeat our experiments.
9.2 T2 relaxation measurements
The value of T2 could in principle be extracted from the envelope of the FID or from the
line width at half-height (Eq. 5.8). However, T2 values obtained this way also depend
strongly on the static field inhomogeneities (the “shimming”). If the main magnetic field
is not homogeneous over the whole sample, spins at different locations in the sample tube
experience a slightly different field. This results in slightly different resonance
frequencies. The total peak, which is the sum over all individual spin contributions in the
sample, will be broadened due to this effect. The apparent relaxation time is T2* which is
faster than T2 due to spin-spin
relaxation. Since only T2 depends
on the physical properties of the
molecule, this is what we are
actually interested in. Obviously, the
contribution of a 'bad' shimming to
the relaxation of our molecule is less
interesting for us! 'Pure' T2 times can
be determined by the so- called
'spin-echo' pulse sequence, shown on the right.
The equilibrium magnetization, Meq, is transferred from +z- into y-magnetization, My, by
the 90°x-pulse. The macroscopic vector My can be considered to consist of a sum of
54
macroscopic magnetizations Mj at j different positions in the sample. Now we look at the
rotating frame (which rotates with the average Larmor frequency ω0 of all the spins j): As
a result of an inhomogeneous field, we will find that some of the Mj components will
rotate faster than ω0 and some will rotate slower, depending on the exact position in the
sample. In the rotating frame (ω0), the fast and slow components will start to precess in
the xy-plane with frequency
jjz BMB Δ=− γωγ 0)( (9.4)
This frequency is different at different locations. Hence, the transversal magnetization My
will dephase due to the inhomogeneous field.
It can be shown that the spin-echo sequence eliminates the dephasing that results from
these static field inhomogeneities. In order to explain how this works, we actually should
consider the precession of each of the individual components Mj, but fortunately the
principle can be shown by picking only two spins, rotating with different speed: a slower
black and a faster white one.
55
At a point (a) in the sequence we have pure y-magnetization for all individual spins. At
point (b) some phase coherence is lost because each spin has precessed with its own
frequency. The white one a bit faster, the black one a bit slower. After the delay τ/2 a
180° pulse from the y-direction is applied (point (c)). This pulse will invert the x-
component of mj, but will not affect the y-components. During the second delay τ/2, the
vectors (Mj) precesses again with their individual frequency γΔBj, the white one still a bit
faster and the black one still a bit slower and still in the same direction as before the
180° pulse. Consequently, in point (d) both magnetization vectors have returned to the y-
axis, independent of the static inhomogeneity, creating a so called 'spin-echo'. Thus, we
have eliminated the effect of field inhomogeneity! By repeating the experiment for
several values of τ in the range 0 - 4·T2 we can determine T2 from the decay of the
intensities of the resonances in the spectrum, which now results purely from the
relaxation by random fluctuating fields and is independent from any static field
inhomogeneity.
9.2.1 Applications of T2 relaxation
In chapter 4 we have seen that the T1 and T2 values depend on the motional behavior of
the dipole-dipole vector and thus the rotational correlation time τc of the molecule. Thus,
analogously to the usage of T1, we can use T2 to determine motional parameters.
56
X Two-Dimensional NMR
In the seventies the development of two-dimensional (2D) NMR has revolutionized
NMR spectroscopy and has made the structural studies of biomolecules possible. The
basic idea is to spread the spectral information in a plane defined by two frequency axes
rather than linearly in a conventional one-dimensional spectrum. Clearly, this provides a
large increase in spectral resolution. Also, in a 2D NMR experiment interactions between
many spins in a molecule (whether it be J-coupling or NOE-type interactions) can be
measured simultaneously. This represents an enormous time-saving for large
biomolecules. To illustrate the method we will discuss first a very simple 2D NMR
experiment.
10.1 The SCOTCH experiment
SCOTCH stands for spin coherence transfer in (photo) chemical reactions. If one has, for
instance, a photochemical reaction
BA h⎯→⎯ ν (10.1)
a proton which resonates in molecule
A at ωA will resonate at ωB in
molecule B. The SCOTCH experiment
correlates the resonance frequencies in
A and B for each particular proton. In
other words it enables us to find ωB in
B that corresponds to ωA of the same
proton in A. The pulse sequence is as
57
shown above. The 90° pulse creates xy- magnetization which precesses with ωA during
the so-called evolution period t1. The light pulse changes the precession frequency to ωB
with which it is detected as an FID during the detection period t2. The trick is now to
increment t1 in a regular fashion and collect a large number (typically, say, 500) FIDs
belonging to different t1 values. Thus one records a data set S(t1, t2) depending on both t1
and t2 (the FIDs). A first Fourier transformation 22 Ft → leads to a so-called
interferogram S (t1, F2) and after a second Fourier transformation 11 Ft → we arrive at
the 2D spectrum:
spectrum 2DraminterferogFIDs
),(),(),( 212121 FFSFtSttS FTFT ⎯⎯ →⎯⎯⎯ →⎯ (10.2)
The effect of incrementing t1 is to "sample" the frequencies present during the evolution
period. How this works out in practice is illustrated in Figure 10.1 (next page). After each
t1 time the signal acquired a different phase. The FIDs recorded that way have all the
same frequency (ωB), but their phase (i.e. how far the magnetization vector rotated in the
xy-plane) depends on the evolution time t1. After the first Fourier Transformation this
leads to a peak at the position ωB in the F2-dimension with an intensity that oscillates in
the t1 direction with the frequency ωA. Looking along the t1-axis of the interferogram the
signal actually looks like an FID oscillating with the frequency ωA. Hence, after double
Fourier Transformation this gives a peak at (ωA, ωB) (ωA in the F1- and ωB in the F2-
dimension). This is called a cross-peak, in contrast to peaks on the diagonal which are
called diagonal peaks. So, indeed, the experiment gives the connection (a spectroscopist
says: 'correlation') between the resonance frequencies in A and B. In this case this is
rather trivial but if there are many nuclei this method can prove very useful.
All 2D NMR experiments adhere to the following scheme:
58
In the example above the preparation period would include a relaxation delay and the 90°
pulse. The mixing period is just the light pulse.
Fig. 10.1: The SCOTCH 2D NMR experiment. On the left side the sampling of the ωA frequency is shown at different values of t1. After Fourier Transformation of the FIDs (t2 → F2) the interferogram S(t1, F2) consists of lines at ωB in F2 with intensities dependent on t1 oscillating with ωA. The second Fourier transform (t1 → F1) leads to a 2D spectrum with a single cross-peak at (ωA, ωB). The representation is as a contour plot.
59
10.2 2D NOE
2D NOE or NOESY is one of the
most important 2D NMR
experiments, because it measures
all short inter-proton distances in
a single experiment, for instance
for a protein. The first 90° pulse
belongs to the preparation period. The evolution time t1 is incremented and mixing
consists of two 90° pulses separated by a constant mixing time τm. During τm
magnetization between neighbouring spins is exchanged via cross-relaxation (see
Chapter 8). For biomolecules σAB ≈ W0 and therefore the flip-flop transitions (αβ →
βα) are dominant for the NOE effect. We will now look at the 2D NOE experiment in
more detail. Let us assume that there are two spins, A and B, within NOE distance, and
that the carrier frequency is chosen at the Larmor frequency of spin A, ωRF = ωA. The
vector diagrams at various times in the 2D NOE pulse sequence then look as shown in
Figure 10.2.
After the first 90° pulse the magnetization vectors lie along the y-axis in the rotating
frame. During the evolution time t1 the A-vector precessing at ωRF will stay the same (at
least for short times when relaxation can be neglected), while the B vector precesses with
a frequency ωB – ωRF. The second 90° pulse tips the A-vector to the negative z-axis and
the B-vector into the xz-plane. The z-component of B is shorter than that of A. We see
that the variable t1 time acts to create z-components with different magnitudes depending
on the different Larmor frequencies (i.e. how far a particular vector did rotate in the xy-
plane during t1. This is called frequency labeling and is a common feature of the
evolution period (t1 period) of 2D NMR experiments. Focussing now on these z-
components that correspond to populations of energy levels, we know that W0 transitions
during the mixing time τm will tend to equalize populations of αβ and βα states (because
60
at equilibrium they are equal). Thus, the z-components of the A and B magnetizations
also will become more equal (d). Finally, the third 90° pulse flips the vectors in the xy-
plane where the signal can be observed (e).
Fig. 10.2. Vector diagram of the 2D NOESY experiment.
Now let us see how this leads to cross-peaks in the 2D NOE spectrum. In Fig. 10.3 we
shall look at the magnetization vectors at time d in Fig. 10.2 (after the mixing time τm).
Let us first consider a trivial case where no transfer occurs, for instance because the spins
are too far apart, and then the more interesting case where cross-relaxation occurs
between A and B.
61
The vectors are depicted for various evolution times t1 chosen such that the B-vector has
rotated through 0, 90°, 180°, 270°, and 360°. If no mixing occurs the vectors precess at
their own frequencies ωA and ωB during t1 and continue to do this during the detection
period t2. Thus, this leads to a 2D spectrum after double Fourier transformation with only
diagonal peaks at (ωA, ωA) and (ωB, ωB).
Fig. 10.3. Vector representation of spin A (solid vector) and spin B (dotted vector) at time point (d) in Fig.
10.2 for various values of t1.When no mixing occurs during τm the 2D NOE spectrum consists only of
diagonal peaks. In the case of magnetization transfer during τm cross peaks arise at (ωA, ωB) and (ωB, ωA).
In contrast, when mixing occurs in τm the equalizing effect of the W0 transitions causes
the A-vector to borrow intensity from B and vice versa. Thus, the A-vector is now
modulated with ωB for the different values of t1! Since the vector will continue to precess
at ωA in t2 this will lead to peaks both at (ωA, ωA) and (ωB, ωA) in the 2D spectrum (F1,
F2). These are the diagonal peak that we have seen before at (ωA, ωA) and an off-diagonal
cross-peak at (ωB, ωA) at the upper left half of the spectrum. In the same way, the
62
modulation of the B-vector with ωA leads to a symmetry related cross-peak at (ωA, ωB)
below the diagonal. Because mixing is reversible 2D NOE spectra are always
symmetrical. As all proton pairs within 5 Å will give rise to cross peaks with intensities
inversely proportional to r6, the 2D NOE spectrum provides a map of all short proton-
proton distances in a biomolecule. A more mathematical description of the NOESY
experiment is seen in Appendix D.
10.3 2D COSY and 2D TOCSY
In another important class of 2D
NMR experiments the magnetization
transfer in the mixing period takes
place via the J-coupling. The
simplest is the COSY (correlated
spectroscopy) with the pulse sequence shown
at the upper right. Here the mixing period is
just the second 90° pulse, which transfers
magnetization between A and B spins
whenever there is a J-coupling between them.
This results in the 2D spectrum as shown in
Fig. 10.4.
Fig. 10.4: COSY spectrum of two J-coupled nuclear
spins A and B. The sign of the cross-peaks is indicated.
The regular 1D spectrum is drawn above. It can be seen
that the cross-peaks reflect the fine structure of the 1D
spectrum (doublets in this case) and therefore can be
used to measure J-couplings. Note the typical negative-
positive sign pattern in the cross-peaks (but not in the
diagonal peaks).
63
COSY spectra are often recorded at low resolution so that the fine structure is not visible.
In this way they are used to trace networks of J-coupled nuclei. Such low-resolution
COSY spectra are shown below for the two amino acids alanine and valine:
As measurable J-couplings only arise between nuclei separated by less than four
chemical bonds, the connectivity patterns can be easily predicted. We note that in a
protein a network of J-coupled protons does not extend beyond an amino acid residue
because the CαH of residue i is separated from the NH of residue i+1 by four chemical
bonds. Each of the amino acids forms a separate spin-system. A COSY spectrum is a
valuable tool for the identification of the types of amino acids.
Another important J-coupling based 2D experiment is the TOCSY which stands for total
correlation spectroscopy. The pulse sequence is shown below. The mixing period here
consists of a complicated pulse
train. This has the effect of
transferring magnetization
through a whole network of J-
coupled spins.
64
For instance, in a molecular fragment
we have non-zero J-couplings 3JAB and 3JBC but 4JAC is close to zero. During the TOCSY
mixing period A-magnetization is transferred to B via JAB and then to C via JBC in
multiple transfer steps. Hence a cross-peak will arise between A and C even though there
is no direct J-coupling between these spins. To illustrate this, a comparison between
COSY and TOCSY spectra for the ABC fragment is shown in Fig. 10.5:
Fig. 10.5: COSY and TOCSY spectra of a three proton spin-system where JAB and JBC are non-zero and
JAC=0. Because of the symmetry only the part above the diagonal has been drawn.
It should be clear from this that a TOCSY spectrum always contains the COSY as a sub-
spectrum. Although the information content of COSY and TOCSY spectra is in principal
the same, in complicated spectra with a lot of overlap the TOCSY spectrum is still useful.
65
This is because if B and C in the example of Fig. 10.5 are in crowded spectral regions but
A is not, then the whole spin system can be observed on a vertical line at the A-position,
while this would be difficult for B and C. We leave the sketch of the TOCSY spectrum of
alanine and valine as an exercise (compare Fig. 10.4 and 10.5).
Finally, it should be mentioned that the TOCSY is a sub-spectrum of the NOESY for
almost all cross-peaks. We will come back to this point in the following chapter.
66
XI The assignment problem
The interpretation of an NMR spectrum always starts with the identification of resonance
frequencies and their corresponding nuclei in the molecule. This so-called ‘assignment’
of resonances constitutes an essential step in the structure determination process by high
resolution NMR spectroscopy that always precedes the actual calculation of structures.
While the assignment of smaller organic compounds with only a few 1H nuclei can often
be solved easily by means of a single experiment (e.g. COSY), it is much more
complicated for bigger and more complex molecules like peptides, proteins and nucleic
acids. Not only does the number of resonances increase with increasing size, also the line
width increases as a consequence of the shorter T2 relaxation times. As a result the lines
become broader and the overlap becomes increasingly severe.
We focus here on the basic principles of assignment, i.e. which parameters of the NMR
spectrum can be used. We will come back to the special case of the assignment of spectra
of biomacromolecules in the next chapter.
11.1 Chemical shift
The chemical shift of signals gives a first indication of the surrounding of the
corresponding nucleus in a molecule. We saw before (chap. 7.1.2) on the example of a
protein spectra, that proton chemical shifts usually are grouped according to the
environment in which they are located in the molecule. In our example those were the
amide and aromatic protons in the left half of the spectrum, the Hα protons just right of
the water signal (center of the spectrum), protons of aliphatic side chains still to the right
of them and finally the methyl groups on the right edge of the spectrum. If you look at the
vast variety of organic molecules you can find many more functional groups and
chemical environments a nucleus can be situated in. Most of the protons belonging to
such a group share their individual ranges of chemical shifts. Tables are available for 1H
67
and 13C chemical shifts, which can help to identify the origin of particular resonances in
NMR spectra (see appendix A).
11.2 Scalar coupling
From chapter 7.2 we know that we can identify coupled nuclei with help of their coupling
constant. If we look, for instance, at the signals of the methyl group and the CH2 group of
ethanol, we find them split into multiplets due to scalar coupling. The number of
multiplet components gives us an indication of what the neighbouring group looks like
(i.e. the signal of the methyl group is a triplet due to the coupling with the two equivalent
protons of the neighbouring CH2 group. The signal of the CH2 group is a quartet due to
the coupling with the three equivalent protons of the neighboring methyl group). The
distance of the components of these multiplets (the coupling constant) is exactly the same
in the triplet and in the quartet. This helps to identify partners with a scalar coupling
between them.
11.3 Signal intensities (integrals)
The intensity, or better the integral of a signal tells us to how many equivalent nuclei a
particular signal corresponds to. The integral of the proton signal of a methyl group is just
three times as large as the integral of a single proton in the same molecule.
11.4 NOE data
NOE data can be very helpful for the assignment, especially when we already have an
idea of (basic) structural features of the molecule we are looking at. It is quite straight
68
forward for example to identify neighbouring protons in an aromatic ring system, because
their distance from each other is quite short and well known (~2.45 Å). Once we know
one of them, we can relatively easily identify the others on hand of a NOE spectrum.
Especially in biomacromolecules, where the spin systems of the individual residues
cannot be connected by scalar coupling experiments, NOE data is often the only outcome
to still get a sequential assignment.
69
XII Biomolecular NMR
Some of the most important sorts of biomolecules are nucleotides and amino acids.
Although also lipids and carbohydrates play important roles in biological processes, in
this course we will focus on the first two groups. Nucleic acid is the bearer of the genetic
information and is involved in protein synthesis where it acts as a template containing the
sequential information for all proteins occurring in organisms. Each three consecutive
nucleotides in a gene code for a particular amino acid in a protein. In addition there are
control regions (stop codons and sequences where proteins involved in transcription and
transcription regulation bind). The role of proteins is very diverse. We know them for
example as enzymes and regulators, as building material of cells and their compounds,
and as carrier of information within and between cells. Obviously both nucleic acids and
proteins play a major role in the function of all living organisms and accordingly also
most defective disorders of them can be traced back to the malfunction of proteins or to
defects in nucleic acid. This makes these molecules to very popular subjects of study in a
variety of research disciplines. Their structural and functional understanding is supposed
to give insight into how and why they work and what probably goes wrong in the case of
diseases. Knowing the exact composition and function of a particular virus, e.g. can lead
to the development of anti-viral drugs. The exact knowledge of the structure and function
of a particular enzyme can lead to the development of e.g. inhibitors which can deactivate
the enzyme when needed.
We will have a close look here at the structural properties of nucleotides and peptides and
how their different spin systems translate into different features in NMR spectra.
12.1 Peptides and proteins
Peptides and proteins are mainly build from twenty different naturally occurring amino
acids. They all share the same basic structure and only differ in their side chain R:
70
H2N Cα CO
H
R
|
| N Cα CO
OH
H
R
|
|H|
H2N Cα CO
H
R
|
| N Cα CO
OH
H
R
|
|H|
In peptides, amino acids are linked via a so-called peptide bond. The amino group of one
residue is connected with the carboxyl group of another:
Note that the peptide bond is planar due to its partial double bond character! Amino acids
are usually referred to with either a one-letter or a three-letter code:
H2N Cα CO
OH
Hα
R
|
|Carboxyl group
Side chain
Amino group H2N Cα CO
OH
Hα
R
|
|H2N Cα C
O
OH
Hα
R
|
|Carboxyl group
Side chain
Amino group
MMetMethionineWTrpTryptophane
CCysCysteineYTyrTyrosine
RArgArginineFPhePhenylalanine
KLysLysineTThrThreonine
QGlnGlutamineSSerSerine
NAsnAsparagineIIleIsoleucine
EGluGlutamateLLeuLeucine
DAspAspartateVValValine
PProProlineAAlaAlanine
HHisHistidineGGlyGlycine
MMetMethionineWTrpTryptophane
CCysCysteineYTyrTyrosine
RArgArginineFPhePhenylalanine
KLysLysineTThrThreonine
QGlnGlutamineSSerSerine
NAsnAsparagineIIleIsoleucine
EGluGlutamateLLeuLeucine
DAspAspartateVValValine
PProProlineAAlaAlanine
HHisHistidineGGlyGlycine
71
Amino acids can be classified by the character of their side chain as: aliphatic (A, V, L, I,
(G), aromatic/ring (F, Y, W, H, P), carboxylic (D, E, N, Q), sulfur/hydroxy containing (C,
M, S, T, Y) and charged (K, R). The chemical formulas of the natural occuring amino
acids together with their COSY, TOCSY and NOESY spectra are shown in appendix E.
12.1.1 Assignment of peptides and proteins
A strategy based upon homonuclear 2D experiments (COSY, TOCSY, and NOESY) was
developed in the 80’s (K. Wüthrich, 1986). This approach is discussed in this chapter.
A more recent approach employs uniformly 15N and 15N,13C labeled proteins. The
strategy uses so-called triple-resonance experiments (involving 1H, 15N and 13C) to
transfer magnetization through the polypeptide chain employing the large one-bond
homo- and heteronuclear J-couplings.
For larger proteins several patterns corresponding to residues of a certain type are present
in a COSY, e.g. several alanines give rise to similar patterns. How can we decide which
of the alanines in the protein sequence corresponds to a particular pattern?
The solution involves three steps:
1. Find patterns of coupled interconnected spins (spin-systems) belonging to amino
acid residues using COSY and TOCSY (amino acid identification).
2. Connect neighbouring spin-systems (in the sequence) with sequential NOEs.
3. Match stretches of connected spin-systems with the (known) amino acid sequence
for unique fits.
Let us focus now on the individual steps.
72
Step1: Spin system identification
The identification of spins belonging to the same spin-
system can be performed on the basis of the COSY
and the TOCSY experiment (see the example of an
Ala-Ala peptide fragment on the right). In both
spectra, protons belonging to a certain amino acid can
be identified. A summary of all the expected patterns
for the different amino acid types is given in the
appendix E. Some of the amino acids have a very typical pattern, for instance Gly where
the side chain just consists of two Hα, or prolines where no HN is present. Some other
amino acids share a common pattern, e.g. the so-called AMX spin systems where AMX
represents the Hα and the two Hβ protons of the amino acid. To this group the following
residues belong: Phe, Tyr, Trp, His, Ser, Cys, Asp and Asn. In the COSY and TOCSY
spectra of Phe no J-couplings between Hβ and protons in the ring are observable (4 bonds
involved). This gap can be closed if in addition the NOESY spectrum is used since the
distance between these protons is typically smaller than 5 Å. Again, these cross peaks are
included in appendix E.
For bigger proteins the overlap makes it harder to identify such patterns. Also the lines
are broader in general which is due to the shorter T2 relaxation times. This also results in
a reduced efficiency of the magnetization transfer during the mixing period in TOCSY
since the magnetization is transversal during this time.
Step 2: Identification of neighbouring residues
When more than, for instance, one alanine is present in the protein, it is not a priori clear
which alanine in the primary sequence corresponds to a certain alanine pattern in the 2D
COSY spectrum. In order to make a so-called sequential assignment, i.e. correlating the
COSY patterns to individual amino acids in the primary sequence, we have to correlate
the COSY pattern of the alanine to the COSY pattern of its sequential neighbour.
73
Unfortunately, when using only proton NMR, no 1H-1H
J-couplings of appreciable size exist over the peptide
bond since the shortest connection of two protons in
neighbouring residues involves four bonds.
Consequently, the individual amino acid residues form
isolated spin systems. Fortunately we can employ
another mechanism of magnetization transfer. The short
sequential distances between consecutive residues result
in cross peaks in the NOESY spectrum. The figure on the right summarizes the sequential
assignment approach: The type of the spin system is identified using COSY and/or
TOCSY experiments (bold arrows), whereas the sequential connectivity is established by
sequential NOESY cross peaks (dotted arrows). A cross peak between the Hα proton of a
spin i and the HN proton of the neighbouring spin i+1 results from a short distance
between these two residues, often referred to as dαN. Similar, the distances between Hβ or
HN of spin i to the neighbour HN of spin i+1 are represented by dβN and dNN .
On the other hand the tertiary structure of the protein will also lead to intense signals
from non-sequential cross peaks. How can we be sure to observe a sequential peak?
The statistics of these short distances have been investigated on the basis of thousands of
known (mostly X-ray) structures. Table 12.1 shows for example that 98% of all dαN
distances shorter than 2.4 Å correspond to sequential distances. Naturally, the score drops
with increasing distance limit. Similar values are obtained for the dNN and dβN distances.
This means that for two residues i and i+1 dαN, dNN and dβN cross peaks most likely result
from sequential NOEs. The probability for identifying a sequential connection increases
dramatically when simultaneously two (or more) short distances can be found.
Table 12.1 shows that if simultaneously two NOEs are found between two amino acids,
they most likely result from sequential residues.
74
Table 12.1
Distance (Å) j − i = 1 (%)
dαN (i,j) ≤ 2.4
≤ 3.0
≤ 3.6
98
88
72
dNN (i,j) ≤ 2.4
≤ 3.0
≤ 3.6
94
88
76
dβN (i,j) ≤ 2.4
≤ 3.0
≤ 3.6
79
76
66
dαN (i,j) ≤ 3.6 && dNN (i,j) ≤ 3.0 99
dαN (i,j) ≤ 3.6 && dβN (i,j) ≤ 3.4 95
dNN (i,j) ≤ 3.0 && dβN (i,j) ≤ 3.0 90
Step 3: Matching to the sequence
The next step is to locate this fragment of two (or more) residues in the protein sequence.
For bigger proteins there might still exist several possibilities. Naturally, we can try to
link more patterns together and try to make tri-peptide, tetra-peptide, and even bigger
fragments. The uniqueness of such di-, tri-, and tetra-peptide fragments in proteins with
less than 200 residues has also been investigated. If all residue types could be identified
unambiguously in the fragment from the COSY (or TOCSY) spectra, a given di-peptide
fragment has a probability of uniqueness of 56%, the tri-peptide and tetra-peptide
fragments of 95% and 99%, respectively. As expected, increasing the length of the
fragment increases the uniqueness. A tetra-peptide fragment is usually sufficiently unique
75
to allow for identification in the polypeptide chain.
12.1.2 Secondary structural elements in peptides and proteins
The intensities of sequential NOEs contain some information on the secondary structure
because they depend on the local conformation of the polypeptide backbone. Take, for
instance, the distance between an Hα proton of a residue i to the HN proton of the
following residue (i+1). In an extended strand this distance is short (2.2 Å) whereas it is
3.5 Å in a helical conformation. This information together with analysis of other
characteristic medium range NOEs (between residues which are less than 4 positions
apart in the sequence) is sufficient to specify the secondary structure element. An
overview of important sequential- and medium-range proton-proton distances is given in
the Figure 12.1.
Fig. 12.1: Characteristic sequential and medium range NOE connectivities.
Recognition of secondary structural elements, i.e. α-helices, β-sheets, and turns,
76
constitutes an important element in the structure determination process. Most of the
observable NOE cross peaks between different residues are due to this secondary
structure.
The α-helix, β-sheet, and turn conformation result in characteristic short distances which
in turn result in characteristic NOEs in the 2D NOE spectrum. Also the 3JHNHα coupling
has traditionally been used as a marker for secondary structure as well as the presence of
slowly exchanging amide protons (cf. Chapter 13).
Fig. 12.2 shows the short distances in an anti-parallel β-sheet and in a parallel β-sheet.
Fig. 12.2: Anti-parallel (top) and parallel (bottom) β-sheet. Sequential NOEs are indicated by open arrows, interstrand NOEs by solid arrows. Hydrogen bonds connecting the strands are shown by wavy lines.
77
The anti-parallel β-sheet is characterized by short dαN(i,i+1) and interstrand dαα(i,j)
distances whereas the dαα (i,j) distance is much longer (4.8 Å) in parallel β-sheet. Also
the dNN(i,j) distance in parallel β-sheet is much longer compared to the dNN(i,j) distance in
anti-parallel β-sheet.
In Fig. 12.3 short distances are shown for an α-helix. An α-helix is characterized by a
close proximity of residues i and i+3 and residues i and i+4. The dαN(i,i+3) and dαN(i,i+4)
NOEs are therefore clear markers for this element of secondary structure. In addition to
the aforementioned short distances, the sequential dNN(i,i+1) distance is also short, and
strong sequential dNN NOEs can be found in the spectrum.
Fig. 12.3: α-helix. The sequential dNN is shown (2.8 Å) together with dαN(i,i+1), dαN(i,i+2) and dαN(i,i+3),
Note that the side-chains are not shown.
Turns are characterised by short distances between residues i and i+2; in particular the
dNN(i,i+2) distance. In general, however, since there exists a large number of turns, which
78
all have mirror images as well, e.g. I, I’, II, II’, III, III’, the precise nature of the turn is
hard to establish from NOE and J-coupling data alone. An overview of the characteristic
patterns and short distances is given in Fig. 12.4.
Fig. 12.4: Characteristic NOEs for several secondary structure elements. The thickness of the bars reflects the strength of the NOE. The thicker the bar the stronger the NOE (and the shorter the distance between the protons involved). 3J-coupling constants are also given.
The short distances in secondary structures are also listed in the following table:
Table 12.2: Secondary structure specific atomic distances (in Å)
Distance α-helix 310-helix β βp turn Ia turn IIa
dαN 3.5 3.4 2.2 2.2 3.4
3.2
2.2
3.2
dαN(i,i+2) 4.4 3.8 3.6 3.3
dαN(i,i+3) 3.4 3.3 3.1−4.2 3.8−4.7
dαN(i,i+4) 4.2
dNN 2.8 2.6 4.3 4.2 2.6
2.4
4.5
2.4
dNN(i,i+2) 4.2 4.1 3.8 4.3
dβN 2.5−4.1 2.9−4.4 3.2−4.5 3.7−4.7 2.9−4.4
3.6−4.6
3.6−4.6
3.6−4.6
dαβ(i,i+3) 2.5−4.4 3.1−5.1
a for turns, the two numbers apply for the distances between residues 2, 3 and 3, 4 respectively
79
12.2 Nucleotides and nucleic acid
Nucleic acid is mainly build from five different nucleotides. All of them share a common
general structure: They consist of a nucleobase, a pentose-sugar ring (ribose in the case of
RNA or 2'-deoxy ribose in DNA) and a phosphate group which links the nucleotide units
to each other.
(Nucleo) Base
Phosphate
Sugar
Fig. 12.5: Common structure of nucleotides
There are two sorts of nucleobases: The purine bases adenine and guanine and the
pyrimidine bases uracil (only found in RNA), thymine (only found in DNA) and cytosine.
The base is connected to the sugar moiety via a glycosidic bond at the 1' carbon of the
pentose ring. In the common nucleotides the phosphate group is attached to the 5' carbon
of its sugar. A nucleotide which has no phosphate group is called a nucleoside. In
oligonucleotides and in nucleic acid, the phosphate group is linked between the 5' carbon
of one pentose and the 3' carbon of the next. It should be clear from this, that NMR
spectra of large nucleic acids are much more complex than NMR spectra of peptides of
comparable size. While peptides are build from as many as 20 different building blocks
(the different 'natural' amino acids) nucleic acid is build from only four different
nucleotides. Consequently the number of occurrences for a particular kind of nucleotide
is in average five times as high as for any particular amino acid. This can lead to very
crowded regions in the NMR spectra and can complicate the spectral assignment
considerably.
80
1
2
34
5 6
7
8 9
1
2
34
5 6
7
8 9
1 2
3 4
5
6
Figures 12.6 to 12.9 show the different nucleobases and sugars with their numbering
schemes and the eight different RNA and DNA nucleotides.
Adenine
Uracil
Thymine
Guanine
Cytosine
Fig. 12.6: The five different nucleobases found in nucleic acids
Ribose (RNA) 2'-deoxy ribose (DNA)
Fig. 12.7: The pentose sugar rings of RNA and DNA
12
34
5
6
12
34
5
6
5'
4'
3' 2'
1'
5'
4'
3' 2'
1'
81
AMP GMP Adenosine- Guanosine- 5'-phosphate 5'-phosphate
UMP CMP Uridine- Cytidine- 5'-phosphate 5'-phosphate
Fig. 12.8: Ribonucleotides (RNA)
dAMP dGMP Adenosine- Guanosine- 5'-phosphate 5'-phosphate
dTMP dCMP Thymidine- Cytidine- 5'-phosphate 5'-phosphate
Fig. 12.9: 2'-Deoxy-ribonucleotides (DNA)
82
The figure below illustrates how the nucleotide units are linked by the phosphate groups
in oligo nucleotides and in nucleic acids. If we analyze the spin systems of this molecule,
we find that both the link between sugar and nucleobase and between the individual
nucleotide units (via the phosphate groups) reach further than three bonds before the next
proton can be found. In other words: The bases and the sugars form isolated spin systems.
This is important when it comes to think about an assignment strategy for these
molecules!
Fig. 12.10: The phosphate-sugar backbone of DNA
When we look at the structural properties of nucleic acids, the most prominent feature is
the double-helical conformation it adopts. The two strands of the helix adhere to each
other by means of hydrogen bonding between bases of the adjacent stands. These so-
83
Geometry attribute A-form B-form Z-form
Helix sense right-handed right-handed left-handed
Repeating unit 1 bp 1 bp 2 bp
Rotation/bp 33.6° 35.9° 60°/2
Mean bp/turn 10.7 10.0 12
Inclination of bp to axis +19° -1.2° -9°
Rise/bp along axis 0.23 nm 0.332 nm 0.38 nm
Pitch/turn of helix 2.46 nm 3.32 nm 4.56 nm
Mean propeller twist +18° +16° 0°
Glycosyl angle anti anti C: anti G: syn
Sugar pucker C3'-endo C2'-endo C: C2'-endo G: C2'-exo
Diameter 26 nm 20 nm 18 nm
called base-pairs were found by James Watson and Francis Crick and accordingly named
'Watson-Crick base pairs'. Base pairs are always built from one purine and one
pyrimidine base. Adenine (A) always pairs with Uracil (U) in RNA and with Thymine
(T) in DNA. Guanine (G) always pairs with Cytosine (C). The hydrogen bonds are shown
in the figure below. Note that the G−C pair is more stable than the A−T pair, because
G−C consists of three hydrogen bonds and A−T only of two.
Fig. 12.11: Watson-Crick base pairs. A-T on the left, G-C on the right
There are two major conformation in which the double helices of DNA and RNA usually
are found. The so-called B-form and the A-form (see figure 12.12). Both of them are
right-handed (like ordinary corkscrews) and differ mainly in their width and height of
their turns. A third conformation, the Z-form is left-handed and of minor importance. The
important parameters of the A-, B- and Z-form helix are listed in table 12.3:
84
A-form double helix (RNA) B-form double helix (DNA)
Fig. 12.12: The two major helical conformations
Information about typical chemical shift values of the A- and B-DNA and typical short
distances can be found in appendices F and G.
85
12.2.1 Assignment of oligonucleotide and nucleic acid spectra
The assignment strategy for nucleic acid spectra is very similar to the one discussed for
peptides and proteins. The major difference being that for the identification of a particular
residue always the combination of COSY/TOCSY and NOESY is needed, because the
bases form isolated spin systems and cannot be assigned to their sugars without making
use of NOE data. Tables to be used for this purpose with common shift values and short
distances in nucleic acids can be found in appendix F.
86
XIII Structure determination
In this chapter we will discuss the method of determination of the complete 3D structure
of a protein. Apart from the bond lengths and most bond angles, which are known on the
basis of the amino acid sequence, the most important source of structural information is
the NOE. In particular, the so-called "long-range" NOEs (those between protons more
than four residues apart in the sequence) provide important constraints on the structure.
We will first see which experimental NMR parameters can be translated into structural
constraints and then describe the computational structure calculation procedure.
13.1 Sources of structural information
13.1.1 NOEs
For a ~10kD protein typically between 1000 and 2000 cross peaks can be observed in a
2D NOE spectrum. We have seen in Chapter 7 how NOE intensities can be converted
into proton-proton distances with the aid of a reference distance (for instance the distance
of 2.45 Å between neighbouring protons on an aromatic ring). In principal the r-6 relation
between NOE and distance should give very precise distances. For example, if there is a
10% error on the NOE intensity this translates in only a 1.5% error in the distance!
However, there are two reasons why this high precision cannot be obtained in practice.
The first is local mobility. Remember that the simple relation of Eq. 8.5 was derived on
the assumption of equal τc for the protons of reference and unknown distance. Only for
very rigid proteins this assumption is really valid. Also, if there is a conformational
equilibrium the r-6 dependence gives values much more weighted to the short distances in
the average. Thus, the actual average distances may be longer than they appear in the
case of conformational averaging. The second reason is the so-called "spin-diffusion"
effect. This is the result of multiple transfer steps of magnetization A B C during
87
the mixing time τm that may disturb the intensity of the NOE cross peak between A and
C. Only for very short τc can this indirect transfer path be neglected. For these reasons
the NOE based distance constraints are often used in the form of distance ranges rather
then precise distances:
strong NOE 1.8 − 2.7 Å
medium NOE 1.8 − 3.5 Å
weak NOE 1.8 − 5.0 Å
This procedure works well in practice because it turns out that for a high precision of the
structure a large number of NOE constraints is more important than precise distances.
13.1.2 J-couplings
The magnitude of the three-bond
coupling constant, 3J, is related
to the dihedral angle (between
the two outer bonds) and
therefore provides a constraint
on this angle (θ).
This is expressed in the Karplus relation
CBAJ ++= )cos()(cos2 θθ (13.1)
where A, B and C are parameters that depend on the particular situation. For instance, the
J-coupling between the amide proton and the α-proton, JHNHα depends on the backbone
angle φ as follows:
60.1)60cos(76.1)60(cos51.6 2 +−−−= φφJ (13.1a)
88
The form of the Karplus relation for this case is shown in Fig. 13.1.
Fig. 13.1: The Karplus curve for the protein backbone φ angle as described in the text.
A complication is that several values of φ may belong to a particular J. Also,
conformational averaging may lead to average values of J in the range 6-7 Hz. Therefore,
the most reliable values for JHNHα are 9−10 Hz for extended (β-sheet) structure and 3−4
Hz for α-helical structure (Fig 12.4). Values around 6−7 Hz are often difficult to
interpret. Similar Karplus curves exist for CH-CH J-couplings from which side chain χ
angles can be derived.
13.1.3 Hydrogen bond constraints
When a protein is dissolved in D2O many of the amide protons do exchange rapidly with
deuterium and therefore disappear from the spectrum. However, often some NH signals
remain in the spectrum for some time and exchange only slowly in time. These slowly
exchanging NHs invariably are present in hydrogen-bonds such as occur in α-helices and
89
β-sheets. If the H-bond acceptor is known with certainty, for instance, because we have
several short- and medium-range NOEs defining the secondary structure, we can use
distance constraints corresponding to the H-bonds. Usually these are given the distance
ranges 2.1 − 2.3 Å for the distance between NH and O and 3.1 − 4.3 Å between N and O.
13.2 Structure calculations
Several computer programs exist that are able to calculate the 3D structure of a protein
based on distance and dihedral angle constraints. Often one starts using only geometric
constraints (distances and angles) with the so-called Distance-Geometry (DG) program.
The resulting structures can then be refined with Molecular Dynamics (MD) calculations
which include also energy terms for electrostatic interactions etc. The calculation
procedure will now be briefly discussed.
13.2.1 Distance-Geometry
The structure of a macromolecule containing N atoms can be perfectly described by
specifying the N(N−1)/2 distances between the atoms (except for the chirality). Since this
is a very large number we will never have so many distance constraints in practice. There
is, however, an algorithm called Distance-Geometry (DG) that converts distances into
Cartesian coordinates for a much smaller number of distances even if they are not
precisely known. This algorithm has the following steps:
1. Set up distance matrices for upper bound (u) and lower bound (l) distances between all
atoms in the structure. These include the so-called holonomic distances (from bond
length and bond angles) and the NOE distance constraints. For those elements for which
no information is available the upper bound is set to a large value and the lower bound to
the sum of the van-der-Waals radii (1.8 Å).
90
2. Smoothing
By this we mean the adjustment of upper
and lower bounds using triangular
inequalities. Consider three atoms i, j and k
with upper bounds uij, uik, and ujk, and lower
bounds lij, lik and ljk. The maximum distance
between i and j is when i,j and k are
colinear. Thus we have
jkikij uuu +≤ (13.2)
A similar relation exists for the lower bound lij:
jkikijl ul −≥ (13.3)
These relations are applied to all distances in the set until no further changes are obtained.
3. Select a matrix D with random trial distances between upper and lower bounds.
4. Embedding
This is the procedure of finding the best 3D structure that belongs to the distance matrix
D. We will not explain this further here.
5. Optimization
Because the matrix D, of course, is not perfect, it is usually necessary to regularize the
structure coming from the embedding stage. Optimization involves the minimization of
the coordinates against a distance error function.
Because of the random nature of step 3, the repetition of steps 3−5 will produce slightly
different structures. Ideally, if this is done many times, an ensemble of structures is
91
obtained, which samples the conformational space consistent with the experimental data.
An example of such an ensemble of structures is shown in Fig. 13.2.
Fig. 13.2. Structures of the protein crambin calculated by the DG procedure.
13.2.2 Restrained molecular dynamics
Molecular Dynamics (MD) programs were originally designed to simulate the dynamic
behaviour of atomic or molecular systems. It turns out that a purely classical approach
works well for this purpose. For the initial coordinates and velocities of all atoms i,
Newtons equation of motion is solved:
ii
i dtdm Fr =2
2
(13.4)
where mi is the mass, ri the position vector and Fi the force acting on atom i. These forces
are given by
92
i
iVr
F∂∂−= (13.5)
The empirical potential energy function V (also called the force-field) contains many
terms
⋅⋅⋅⋅+++++= ticelectrostaWaalsdervandihedralanglebondlengthbond VVVVVV (13.6)
that will not be specified here. For a faithful simulation of the dynamics the integration
step should be rather small, say 1fs (femto-second), so that for a dynamic trajectory of
several tens of ps or a ns very many integrations have to be carried out.
For the application of this method for NMR structure refinement we just add another term
in the potential energy function that reflects the NOE distance constraints:
ijijijij
ijijij
ijijijijNOE
lrrlk
urlururkV
<−=
<≤=>−=
if)(
if0if)(
2
2
(13.7)
This function looks as shown at the
right.
The effect of this term is the following:
If the distance rij in the current model is
too large (larger than the upper bound
uij) than a force is acting to decrease it.
Conversely, if rij is too small a force
will tend to increase it until rij lies
between the bounds. As this will happen for all 1000−2000 distance constraints the
structure will usually satisfy the constraints after a MD run better than before. At the
same time it will be close to a minimum with respect to the potential energy function of
Eq. 13.6. Thus, this so-called restrained MD algorithm acts as an efficient minimizer of
the energy. Of course, if the potential energy decreases the kinetic energy would increase.
93
To avoid this 'heating effect' the system is normally coupled to a 'thermal bath' so that
excessive kinetic energy is drained off. An illustration of this procedure is shown in
Figure 13.3. Again, a family of structures is calculated starting with a random starting
structure. It can be seen that the final cluster of structures is better defined than those
from the DG calculations. This is usually expressed in terms of the root-mean-square-
deviation (RMSD) calculated for the ensemble of structures. Typically, RMSD values for
good NMR structures are in the range 0.3 Å to 0.5 Å for the backbone atoms.
Fig. 13.3: Calculation of the structure of dimeric interleukin-8 by a combination of Distance-Geometry and
Molecular Dynamics. The start is a random structure obtaining only the sequence information (only the
backbone is shown). 10 DG structures are calculated, which are the starting structures for the MD
calculation. After cooling down the precession of the structure is greatly increased.
94
Appendices
Appendix A: a) Typical 1H and 13C chemical shift of common functional groups.
a)
95
Appendix A: b) Random coil 1H chemical shifts for the 20 common amino acids
3.28 2.96 4.698.31 C
3.88 4.508.38 S 2.642.13
γ CH 2 ε CH 3
2.15 2.01 4.528.42 M
2.033.683.65
γ CH 2 δ CH 2
2.28 2.02 4.44P
7.156.86
7.307.397.34
1.23
1.480.950.89
1.640.94
0.97
H2,6 H3,5
3.13 2.92 4.608.18 Y
H2,6 H3,5 H4
3.22 2.99 4.668.23 F
CH 3 4.22 4.358.24 T
1.19CH 2 CH 3 CH 3
1.90 4.238.19 I
0.90H γ CH 3 1.65 4.388.42 L
0.94CH 3 2.13 4.188.44 V
1.39 4.358.25 A
3.978.39 G
H β H α NH
3.28 2.96 4.698.31 C
3.88 4.508.38 S 2.642.13
γ CH 2 ε CH 3
2.15 2.01 4.528.42 M
2.033.683.65
γ CH 2 δ CH 2
2.28 2.02 4.44P
7.156.86
7.307.397.34
1.23
1.480.950.89
1.640.94
0.97
H2,6 H3,5
3.13 2.92 4.608.18 Y
H2,6 H3,5 H4
3.22 2.99 4.668.23 F
CH 3 4.22 4.358.24 T
1.19CH 2 CH 3 CH 3
1.90 4.238.19 I
0.90H γ CH 3 1.65 4.388.42 L
0.94CH 3 2.13 4.188.44 V
1.39 4.358.25 A
3.978.39 G
H β H α NH
8.127.14
H2 H4
3.26 3.20 4.638.41H
1.703.327.17
1.451.703.027.52
2.386.87
7.596.91
2.312.28
7.247.657.177.247.5010.2
6.62
γ CH 2 δ CH 2 NH
1.89 1.79 4.388.27R
γ CH 2 δ CH 2 ε CH 2 NH 3 +
1.85 1.76 4.368.41K
7.59CH 2 NH 2
2.13 2.01 4.378.41Q
NH 2 2.83 2.75 4.758.75N
γ CH 2 2.09 1.97 4.298.37E
2.84 2.75 4.768.41D
H2 H4 H5 H6 H7 NH
3.32 2.99 4.708.09W
H β HαNH
8.127.14
H2 H4
3.26 3.20 4.638.41H
1.703.327.17
1.451.703.027.52
2.386.87
7.596.91
2.312.28
7.247.657.177.247.5010.2
6.62
γ CH 2 δ CH 2 NH
1.89 1.79 4.388.27R
γ CH 2 δ CH 2 ε CH 2 NH 3 +
1.85 1.76 4.368.41K
7.59CH 2 NH 2
2.13 2.01 4.378.41Q
NH 2 2.83 2.75 4.758.75N
γ CH 2 2.09 1.97 4.298.37E
2.84 2.75 4.768.41D
H2 H4 H5 H6 H7 NH
3.32 2.99 4.708.09W
H β HαNH
For X in GGXA, pH 7, 35ºC (Bundi and Wüthrich 1979)
96
Random Coil Chemical shifts (in ppm) for the 20 common amino acids in acidic 8 M urea (from Wright,
Dyson et. al., Journal of Biomolecular NMR, 18: 43–48, 2000).
97
Appendix B: 1H chemical shift distribution of amino acids.
98
Appendix C: Nuclear Overhauser Effect
The population change of the αα-state, d/dt nαα, after a disturbance from equilibrium can
be expressed using the W0, W1a, W1b and W2 rates. The rate equation is:
)()(
)())((
11
2211
eqaab
eqaaa
eqeqba
nnWnnW
nnWnnWWWndtd
ββββ
ββββαααα αα
−+−+
−+−++−= (C.1)
Thus, the αα-state looses magnetization (first term), but also gains some magnetization
from the ββ-, the βα- and the αβ-states.
Similar expression can be found for the time dependence of the other three states.
Now we look at the net population differences of spin A and B, na and nb, resp. These are
)()()()(
βββααβαα
ββαββααα
nnnnnnnnnn
b
a
−+−=
−+−= (C.2a,b)
From this we can calculated the time derivate
βββααβαα
ββαββααα
ndtdn
dtdn
dtdn
dtdn
dtd
ndtdn
dtdn
dtdn
dtdn
dtd
b
a
−+−=
−+−= (C.3a,b)
If we now introduce in Eqs. C.3 the results from Eq. C.1 (and the expressions for the
other spin states) we can derive the dependency of the population from the transition
rates:
))(())(2(
))(())(2(
02210
02210
eqaa
eqbbbb
eqbb
eqaaaa
nnWWnnWWWndtd
nnWWnnWWWndtd
−−−−++−=
−−−−++−= (C.4a,b)
99
The first terms of Eqs. C.4a and C.4b describe the T1 relaxation of spins A and B,
respectively. Hence we have
)2(1
)2(1
2101
2101
WWWT
WWWT
bbb
aaa
++==
++==
ρ
ρ (C.5a,b)
The second term of Eqs. C.4 results in transfer of magnetization from A to B. We define
the cross-relaxation σ between A and B as
02 WW −=σ (C.6)
With this definition, from Eqs. C.4 we arrive at the Solomon-Bloembergen equations:
)()(
)()(
eqaa
eqbbbb
eqbb
eqaaaa
nnnnndtd
nnnnndtd
−−−−=
−−−−=
σρ
σρ (C.7a,b)
Now let us return to the steady-state NOE experiment of chapter VIII. Spin B was
selectively saturated (nb = 0). After some time the two-spin system will reach a steady
state with
0=andtd
(C.8)
Eq. C.7a then becomes
)()(0 eqbb
eqaaaa nnnnn
dtd −−−−== σρ (C.9)
100
so that the NOE (Eq. 8.1) becomes
a
b
a
b
a
a
aeq
eq
aeq
eqa
nn
nnn
γγ
ρσ
ρση ⋅=⋅=
−=
)( (C.10)
Here we exploited the fact that the macroscopic magnetization Ma is proportional to the
population na, and that the populations are themselves proportional to the gyromagnetic
ratio (Eqs. 2.8 and 4.1).
For identical nuclei γa = γb, and ρa = ρ, and Eq. C.10 reduces to the simple form
ρση = (C.11)
101
Appendix D: 2D NOESY experiment
In mathematical terms the 2D NOE experiment can be described as follows. During the
evolution period the transversal magnetization of nucleus B can be written as
[ ])sin()cos( 111 titMeMM BB
eqeqB B
BB
tiωω
ω⋅+= = (D.1)
This results at time point c (Fig. 10.2) in a z-component
)cos( 1, tMM Beq
zB Bω−= (D.2)
According to the Solomon equation (Eq.C.7) we have for MA a dependency from MB
)()( eqBBAB
eqAAaA MMMMM
dtd −−−−= σρ (D.3)
For short mixing times 11
−=<< Am T ρτ we can neglect spin-lattice relaxation and Eq. D.3
becomes approximately
)( eqBBABA MMM
dtd −−≈ σ (D.4)
Using Eq. (D.2) this becomes
[ ])cos(1 1tMMdtd
BeqBABA ωσ +≈ (D.5)
For short mixing times τm we can approximate this by
[ ])cos(1 1tMMB
eqBAB
m
A ωστ
+=Δ (D.6)
102
and thus a fraction of the A-magnetization ΔMA is modulated with ωB
[ ])cos(1 1tMM BeqBmABA ωτσ +=Δ (D.7)
During the detection period this evolves with 2ti Ae ω and after Fourier transformation we
will have a cross-peak at (ωB, ωA) in the 2D spectrum (F1, F2), correlating the protons A
and B in the 2D NOESY spectrum. The intensity of this cross-peak is
mABABI τσωω ~),( (D.8)
Since for biomolecules we have
6~AB
cAB r
τσ (D.9)
we get for the cross-peak intensity from Eq. (D.8):
6~),(AB
cmAB r
I ττωω (D.10)
103
Appendix E: Expected cross-peaks for COSY, TOCSY and NOESY for the individual
amino acids. Diagonal peaks should be there, but are not shown!!
104
105
106
107
108
109
110
111
112
113
Appendix F: Typical chemical shift values found in nucleic acid
114
Appendix G: Typical short proton–proton distances for B-DNA
All distances are given in Å. Sequential distances (to its 3' neigbor) below the diagonal,
intraresidual distances above the diagonal.
H6/8 3.8 2.1 3.6 4.1 4.9 3.4 4.4 4.8
3.5 1' 3.0 2.3 3.9 3.6 4.5 3.8
3.8 4.1 2' 1.8 2.4 3.9 3.8 4.0 2.1
2.3 2'' 2.7 4.0 4.9 3.6
4.9 3' 2.7 3.7 2.9 4.1
4.2 4' 2.7 2.3 4.9
1.8 4.3 3.2 4.5 3.6 5' 1.8 3.4
3.3 4.0 4.7 4.1 5'' 4.4
4.8 3.5 3.8 2.3 H6/8