A Computerized System for Storing, Retrieving, and Correlating NMR Data
FEATURE ARTICLE
Herman Skolnik
Hercules Incorporated, Research Center, Wilmington, Delaware •9899
(Received 20 August 1971)
A new indexing concept, using the author's notation system, is described for storing, retrieving, and correlating NMR data via computer processing. INDEX HEADINGS: Computerized data system for NMR; Indexing system; Notation system.
High resolution nuclear magnetic resonance (NMR) spectroscopy is an impor tant tool in organic chemistry for qualitative and quant i ta t ive analyses and in con- firmation of structures, resolution of conformational problems, and s tudy of rate processes. The importance and value of N M R arise from the unique and char- acteristic chemical shifts of protons in organic com- pounds. The resultant N M R chemical shift spectrum, although primarily due to the protons in each proton group, is modified by the presence of other nuclei in the molecule. The connection between the chemical shift value and structure is thus somewhat empirical.
A meaningful correlative index of chemical shift values is essential for the effective use of N M R data. The Sadtler Chemical Index 1 and the Varian Func- tional Group Index 2 have been used extensively for relating basic proton and environmental groups with proton chemical shifts. Because these two methods require manual indexing from as many viewpoints as there are proton groups and because neither method includes more than a few atoms of the full structure, a method was introduced recently tha t eliminated these two disadvantages. 3
The new correlative method was based on a linear notat ion system 4 in which symbols are used to express various combinations of carbon and hydrogen, such as CH3-, -CH2-, > C H - , -CH=, ~CH, =CH2, and other atoms and hydrogen, such as -NH~, > N H , -OH, and -SH. The notat ion of each organic com- pound and the chemical shifts of each proton group in the compound const i tuted the computer input which, in turn, was processed in the computer by a permutat ion or wraparound program tha t yielded as
Table I. Computer output of notation and chemical shifts for propyl alcohol.
Notation and proton group Chemical shift
H QCCA ]2~1-3.58-1.57-0.92 HQ C CA 2.28-13.58]-1.57-0.92
HQC C A 2.28-3.58-]1.571-0.92 HQCC A 2.28-3.58-1.57-10.9__2_ ]
many index entries as there were proton groups (with their chemical shift data) in each molecule. Thus, for propyl alcohol, CH3CH2CH20H or ACCQH by the notat ion system, the output contained four proton group retrieval points from the single input as shown in Table I.
In addition, a computer program was writ ten to yield an output by increasing value of the chemical shifts of any proton group relative to its neighboring groups. Thus, with one input per compound, the method yields as many outputs as there are proton groups and a pr intout by increasing chemical shift values. Applying the system to the 700 spectra in Vols. 1 and 2 of the Varian Spectra Catalog disclosed one major disadvantage: The linewidth for output required an allocation of 80 characters for the notat ion and 120 for the chemical shift data. Thus, even large- size computer paper could not contain the complet e notat ion and all associated chemical shifts for many
Table II. Notation symbols.
Single-bonded
Carbons Fused or
Double-bonded Triple-bonded bridgehead
-CHz A >CH~ C > C H - Y > C < X
Carbonyl
=CH~ E ~-CH U > C H - J =CH- B ~C- V > C < T =C< D =C< R =C= X
Other elements and carbonyl
Halogen Oxygen Nitrogen Other
>C=0 K -CH=O KH
O
-C-~OH KQH
- F F - O - Q - B r G =O Q
O
- I I CO W
-CI L - O - O - Q2 - O H QH
> NH M - H H -NH2 MH - S - S
> N - N - S H S H
~N Z =S S =N- Z >SO SQ -N=O ZQ >SO2 SW =NOH ZQH P P -NO2 ZW
# ® 1 &
L
Character symbols Cyclic moiety
• Atoms between bridgeheads in a cyclic moiety Fused or bridgehead atom other than carbon; placed following notation symbol Ionic form Symbol for condensed ring moieties
-1 Polymer repeating unit Used to denote atoms (other than above) with atomic symbol, e.g., &NA for sodium Cis, beta (rosins and steroids), cxo, and axial designation Trans, alpha (rosins and steroids), endo, and equatorial designation
Volume 26, Number 2, 1972 APPLIED SPECTROSCOPY 173
Table III. Notations for acyclic compounds.
Varian No. Structure Notation
1 CH3OH AQH 14 CH3CH20H ACQH 43 CH~CH2CH~OH AC2QH 44 (CHa) 2CHOH A2YQH
120 CH3CH(OCHa)CH~CH20H AY(AQ)C2QH 282 CH3 (CH2)90H AC9QH 423 (CH3) 3COH A3XQH
6 CHaCHO AKH 8 CH3COOH AKQH 9 HCOOCH3 HKQA
25 CH3CHC1COOH AY(L)KQH 39 HCON (CH3) 2 HKNA2 79 CH3COOCH~CH3 AKQCA 76 CHaCH~COCH3 ACKA
277 CH3COCHCH2COOCH2CH3 AKY (KQCA)CKQCA I COOCH2CIIa
215 CH3COOCH2 AKQC, 2
CH3CO0~H~
compounds in the Varian Spectra Catalog, and we had to resort to a double line per compound or to limit the chemical shift da ta to the indexed pro ton group in the output . But even allowing 80 spaces for the notat ion does not make for a line as readable as we would like. For tuna te ly , work on using the notat ion sys tem for producing a nota t ion symbol index 5 and a chemical f ragment nota t ion index G led to a great ly improved sys tem for N M R d a t a - - t h e subject of this paper.
I. T H E N O T A T I O N SYSTEM
The nota t ion sys tem employs a set of nota t ion symbols, listed in Table I I , t ha t designates the bonding and number of hydrogens associated with carbon in par t icular and with other a toms in general. Allocating 12 nota t ion symbols (single let ters of the alphabet) to carbon (denoting bonding, number of a t t ached hydrogen, and fused or br idgehead a tom) is in ha rmony with the predominance of carbon and hydrogen a toms in organic chemistry. Because of the high occurrence of the carbonyl group, it is assigned a separate nota t ion symbol, viz., K.
Chemical s t ructures are represented with the notat ion sys tem in an a t o m - b y - a t o m correspondence with the s t ructural formula as usually drawn, i.e., for acyclic compounds f rom left to right with functional groups at the r ight-end position, or for cyclic struc- tures f rom the a tom having the highest position number to the a tom in position 1. Thus, the notat ion sys tem is in accord with accepted number ing schemes as well as with the convent ion of drawing s t ructural formulas. This is i l lustrated in Table I I I for acyclic compounds and in Table IV for cyclic compounds.
Cis and trans isomers are differentiated by the characters I for cis a n d - for trans, as i l lustrated in the
Table IV. Notations for cyclic compounds.
Yariaa No. Structure Notation
3 2
32 CH2-CHCH3 . CY(A) Q. \ /
O
3 2 33 CH2-CH~ . C3Q.
4
NH
94 ~ ~ C H O . B3D (KI-I)S.
S
157 CHa . B5D (A).
4
161 CH~OH . B5D (CQH). [
4
124 OH . B4D (QH) D (QH).
4
550 0 . B4R2(~KB2K. 8 II
6 3
O
552 O . B4R2(~KD (A)BQ.
e ~ / ~ - C H 3
487 . i ~ . 32 . B2JB2J: C.
a
272 CH3 . X (A2)JCBD (A)J: C. 21
CH3 ~.~'~
CH3 s
174 Volume 26, Number 2, 1972
Table V. Relating proton groups via structural formula.
Varian Structural formula No. and chemical shifts
1 CH3---OH 3.47 1.43
14 CHa-CH2--OH 1.2 2.5 3.7O
43 CH3-CH2-CH2-OH 0.92 1.57 3.58 2.28
44 (CH3) ~-CH-OH 1.20 4.00 1.60
following notations for cis- and trans-crotonic acids:
C H - C H 3 C H - C H a
II II CH-COOH HOOC-CH
cis- trans-
AB2KQH ( [ ) AB2KQH (_)
These characters thus allow us to differentiate between the methyl chemical shift (2.03) in cis-crotonic acid and (1.90) in trans-crotonic acid.
II. RELATING PROTON GROUPS VIA STRUCTURAL FORMULA
The most meaningful method for relating each proton group in a molecule with its chemical shift is via the structural formula as illustrated in Table V. Writing the structural formula with sufficient space between the proton fragments for assigning the chemical shift under each proton fragment yields a highly readable format. This format can be converted into an index format by using a virgule (or slash) to separate the proton fragments and a double virgule
Table VI. Proton group/chemical shift index for a series of alcohols in increasing chemical shift order of the methyl group.
Varian No. Formula and notation Index
575 CHa(CH2)~CH(OH)CH2NH2 A /C /C6/Y(/QII) /C / M H / / AC7Y(QH)CMtI 0.89/1.28/ /3.50/1.75 /2.51/1.75//
282 CH,(CHgsCtI~OH A /C /C7/C / Q H / / AC9QH 0.91/1.28/ /3.63/2.03//
43 CH3CH2CH20It A /C /C /QH// AC2QH 0.92/1.57/3.58/2.28//
45 CHaCH(OH)CH2CH2OH AY(/QH)C/QH// AY(QH)C2QIt 1.12 /4.28 /4.28//
87 CH3CH(OH)CH(OH)CH3 A /Y(/QH) /Y(/QH) /A / / AY(QH)Y(QH)A 1.13/3.80/3.05 /3.80/3.05 /1.13//
A //A /Y( /QH) /Y( /QH) / 1.13//1.13/3.80/3.05 /3.80/3.05 /
120 CH3Ctt(OCHa)CH~CH2OH A /Y(/AQ) /C /C / Q H / / AY(QA)C2QH 1.18/3.10/3.33/3.10/3.73/3.10//
44 (CHs)~CHOH A /Y( /A) / Q H / / A2YQH 1.20/4.00/1.20/1.60//
14 CHsCH2OH A /C /QH// ACQH 1.22/2.58/3.70//
86 CH~CH(OH)CHsCH~OH A /Y( /QH)/C /C / Q H / / AY (QH)C2QH 1.23/3.80/3.61 /1.68/3.80/3.61//
120 See above AQ) /C /C /QH//A /Y( / 3.33 /3.10/3.73/3.10 //1.18/3.10/
1 CHaOH A / Q H / / AQH 3.47/1.43//
to denote the end of the molecule, as follows for propyl alcohol :
CH3 /CH~ /OH2 /OH / / (1) 0.92 /1.57 /3.58 /2 .28 / / .
Subscripts, however, are not commonly available on most computer print trains. Consequently it is more practical to input the following format:
CH3 /CH2 /CH2 /OH / / (23) 0.92 /1.57 /3.58 /2 .28 / / .
Table VII. Proton group/chemical shift index for the moiety CH3CH2C(=O)- or ACK.
Varian No. Structural formula Index
76 CH3CH2COCH3 A /CK /A // 1.05/2.45/2.13//
40 CHaCH~CONH~
206
£L CH3 0 COCH2CH3
A / C K / M H / / 1.13/2.23/6.42 / /
A /CK)Q. / / . D ( / A ) /B / B D ( / 1.20/2.78 / / /2.38/6.13/7.08 /
163 A /CK)S . / / . B /B / B D ( / ~ L 1.23/2.93//7.60/7.10/7.70/
S COCH~CH3
215 CH3CH~COOCH2 A /CKQ /C, 2 / / I 1.25/4.15 /2.62 / /
CHaCH2COOCH2
547 CH3CONHCH(OCOCH~CtI3)2 A /CKQ)2 / / A K / M /Y( / 1.30/4.28 / /2 .08/6 .60/5 .19/
APPLIED SPECTROSCOPY 175
Table VIII. Proton group/chemical shift index for the moieties CHsCOCH~- or AKC and CH3COCH < or AKY.
Varian No. Structural formula Index
139 (CH3)~CHCH2COCH3 AN //A2Y /C / 2.12//0.93 /2.28/
76 AK //A /C / 2.13//1.05/2.47/
345
CH3CH2COCH3
COCH3 AK). //.C2K /BRJ@C2J@C2T( /A)J@C2Y( / 2.31 / / /5.87 /0.70 /
545 CHCH~COCH3 AK). //.C4D( /B /C / ]l 2.15 //1.67 /5.43 /3.13/
C6HaNHCOCH2COCH~ AK). //.B5D( /MK /C / 2.17 / / /9.34 /3.52/
(C,Hs)~CHCOCH3 AK). //.B5D, 2( /Y / 2.20 //7.25 /5.08/
256
318
277 COOCH~CH3 CIt3COCH(
CH2COOCH~CH3
AK /Y(KQC /A)( /CKQC /A) / / 2.35/3.98 /1.26/2.87 /1.26//
With (23) as input, computer processing, by means of a wraparound program, yields the following in addi- tion to (2a) :
CH2 /CH2 /OH / /CH3 / (2b) 1.57 /3.58 /2.28//0.92 /,
CH2 /OH / /CH3 /CH2 / (2c) 3.58 /2.28//0.92 /1.57 /,
OH / /CH3 /CH2 /CH2 / (2d) 2.28//0.92 /1.57 /3.58 /.
The wraparound program and computer alphabet- ization of the output results in a proton group/ chemical shift index by as many ways as there are proton groups in each molecule from a single input [such as (23)]. This is in contrast to the Sadtler and Varian indexes, which require manual indexing from as many viewpoints as there are proton groups. Furthermore, neither of the two systems shows the full molecule in the index.
It is quite apparent that a computerized system using the above structural formula fragment concept,
Table IX. Proton group/chemical shift index for the moiety CH3COCH= or AKB.
Varian No. Structural formula Index
616 CH=CHCOCH3 AK /B /B)X( /A)( /A)C /C /BD(/A). //.Y( / 2.26/6.06/6.60 /0.86/0.93 /2.05/5.50 /1.57//2.30 /
CH3
~ / J CH~
617 CH=CHCOCH3 AK /B /B). // .X( /A2)C2 /CD( /A)D( / I 2.31/6.11/7.28// /1.07 /2.09 /1.78 /
CH3.. 2
.2" CH3 ~V j
251 CsHsCH=CHCOCHa AK /B /B). //.B5D(9/ 6.35/6.70/7.48// /
176 Volume 26, Number 2, 1972
Table X. Proton group/chemical shift index for the moiety CH3CONH- or AKM.
Varian No. Structural formula Index 265 C6H~CH~CH2NHCOCH~ AK /ivl /C /C). //.B5D( /
1.90/6,50/3.48/2.80//7.25 /
689 AK /M) /C2R2®D3(AQ, 3) /B. //.B /BD( /AQ)K 1.96/8.38/2.43 /6.55//7.37/6.92 /3.67 CH ,O -S '- , / - ---~
III N.ooo.
I / / \\ CH30 \~
CH,O / %0
690 C H 3 0 - ~ ( ~NHCOCH3
CH30 NN~v/~ = 0
/BR2® /Y / /7.63 /4.62/
AK /5{)R2@ /J /J /BD( /AQ)K.//.BD3(AQ, 3)R2@C2 /Y( / 2.07/6.02 /3.63/4.12/6.68 /3.73 //6.49 /4.84/
I OCH~
547 CH~CONHCH(OCOCH2CH~)2 AK /M /Y( / i /CKQ)2 / / 2.08/6.60/5.19/1.30/4.28 //
491 CN AK /M /Y(VZ)KQ /C /A // J 2.12/6.75/5.50. /4.36/1.37//
CH3CONHCH \
COOCH2CH3
NHCOCH~
CHa
NHCOCH3 I
0 OCH2CH3
239
267
AK /M). //.B /BD(/A) /B /BD( / 2.12/7.88 //7.37/7.12 /2.30/7.12/7.37 /
AK /M). //.B /BD( /A /CQ) /B /BD( / 2.12/7.91 //7.41/6.83 /1.38/4.00 /6.83/7.4l /
regardless of how inviting it is, would be awkward and space-consuming for more complex structures. Structural formula fragments are particularly difficult to delineate for cyclic structures without invoking arbitrary rules.
These difficulties, however, disappear when the notation system is used to delineate the fragments in a molecular structure.
III. RELATING PROTON GROUPS VIA THE NOTATION SYSTEM
Because the notation system corresponds essentially atom by atom with the way structural formulas are drawn, uses the same numbering rules as generally accepted, and is based on combinations of carbon and hydrogen, it is particularly suitable for relating proton groups with chemical shifts. Moreover, the notation
system is completely compatible with computer processing operations.
Input by the notation system for propyl alcohol and computer processing of the fragments yield the following :
h /C /C /QH / / (Input) (3a) 0.92/1.57/3.58/2.28// ,
C /C /QH / / i / (3b) 1.57/3.58/2.28//0.92/,
C /QH / / A /C / (3c) 3.58/2.28//0.92/1.57/,
QH / / A /C /C / (3d) 2.28//0.92/1.57/3.58/.
It is quite apparent that (1), (2a), and (3a) are equivalent as are their products from computer
APPLIED SPECTROSCOPY 177
Table XI. Proton group/chemical shift index for the moiety CH.~COO- or AKQ.
Varian No. Structural formula Index
642 "'-\t~-/ XXT /O--'---r-CH~OCOCH3 AKQ /C)Q.). / / . B /BKMKN( / .YY( /AKQ)Y( /AKQ)Y( / _ _ _ _ ) ~ - - k , } OCOCHIa 1.92 /3.80 ~/ % I /3.87 / /
N //7.65/5.81 /5.83 /1.92 /1.92 ii O OCOCH ~
356
261
79
140
182
362
361
353
692
167
242
242
65
440
326
ph-ffO~fA)~OClt 3 ( ~ I - O C O C I I , ~
N:NPh
AKQ)Y(Z2.B5D.)J20)CQ /Y(.B5D.)Q. / / .Q / y ( / A Q ) /Y( / 1.92 /5.58 / / /5.07/3.47 /5.63/
C6HsCH~CH2OCOCI-],~ AKQ). / / .B5D( /C /C / 2.02 //7.29 /4.30/2.93/
CH3COOCH2CI-I3 AKQ /C /A / / 2.03 /4.12/1.25//
CHzCOO(CHD3CH~ AKQ /C3 /A / / 2.03 /4.05/0.95//
CtI3COOCtI (CHa)CH~COOCH~
CIt3 I CH (CH2) 2COOCH~
CH3COO
CHaCOO COCH:
CH3CO0
OCOCH3
0
C H 3 C O O ~ ~ 0
I ~ H : C H - ~ - - o c o c ~ CH3COO
~ O~-'CH2OCOCH 3
CH~COOCH (CH~OCOCH3) 2
See above
CHaCOOCH=CH2
CH~COOC (CH3) =CH,
N \
l If :x- : ' ~ / \ N/II (,
AKQ / Y ( / A ) /CKQ /A / / 2.03 /5.30/1.32/2.60 /3.70//
AKQ)CJT( /A)~C2J2(o)CKT( /A)J(~C2Y(AYC2KQ /A). / / .C2 /Y( / 2.03 /1.03 /1.03 /3 .70/ / /4.70/
AKQ)CJT( /A)(~C2J2(~C /Y( /AKQ)T( /A)J(~C2 /Y( / A K ) . / / . C 2 /Y( / 2.04 /0.92 /5.17/2.04 /0.70 /2.98/2.18 / / /4.73/
AKQ). // .C2KCJT( /A)~C2J2~C2T( /A)J(,~C2 /Y( / 2.05 / / /1.03 /0.80 /4.62/
AKQ)R2@D( /B /B.D /B /BD(/AKQ) /B /B.) /BKQ / / . B D ( / A K Q ) /BD( / 2.06 /7.33/6.89 /7.53/7.15 /2.32 /7.15/7.53 /6.36 //6.83 /2.32 /7.08 /
AKQ /C)Q. / / . B /B /BD( / 2.07 /5.03 //6.38/6.35/7.40 /
AKQ /Y( /AKQ /C)2 / / 2.08 /5.25/2.08 /4 .22/ /
AKQ /C)2 //AKQ /Y( / 2.08 /4.22//2.08 /5.25/
AKQ /B /E / / 2.12 /7.25/4.55//
/4.85//
AKQD(/A) /rE / / 2.12 /1.93/4.69//
AKQ /C)Q.)BZ. // .ZBZD( /MH)R2(~N(.Y /Y( /AKQ) /Y( /AKQ) /Y( / 2.12 /4.43 / / /6.37 /5.95/2.12 /5.70/2.12 /4.43/
178 Volume 26, Number 2, 1972
Table XI. (Continued)
N 324 I'~ N----N
nN~x I) I[ N - O () ~ ( ~-CH,OCOCHa
CHaCOO-L_._J-OCOCH,
~ - ' - - N ~ O--i---CHiOCOCH~ 534 O=\ ___~¢~ ' - - ~ ,___i__ OCOCH,
~ O OCOCH.~
554 CHaCOOCHC~H~ I COOH
AKQ /C)Q.)BZ. / / .ZB /MKR2@N( / .Y /Y( /AKQ) /Y( /AKQ /Y( / 2.12 /4.42 / / /13.1 /6.17/5.88/2.12 /5.62/2.12 /4.42/
AKQ /C)Q.) / / . K M K /BZN( / .Y2( /AKQ)Y( /AKQ)Y / 2.13 /4.28 / / /7.67 /6.14 /2.13 /2.13 /
/4.32 / /
AKQ)). / / .B5D( /Y( /KQH)( / 2.18 //7.41 /5.95/9.88 /
processing Ecompare (2b), (2c), and (2d) with (3b), (3c), and (3d)~. The advantage of the notation system over the structural formula fragments Ee.g., (3a) over (2a) or (1)1 is the use of single letters to denote com- binations of carbon and hydrogen and the bonding associated with the carbon. This advantage is par- ticularly pertinent with highly substituted or complex acyclic compounds and in cyclic structures. For example, the proton group/chemical shift index for 3-diethylamino-2,2-dimethylpropionaldehyde (Varian No. 548) by the notation system is as follows:
0.97 I
(CH3CH2)2NCH2C(CH~)2CHO I I I I 2.50 2.55 1 .07 9.57
~AC, 2NCX(A2)KH, (4)
A /C,2N /CX( /A2) / K H / / (Input) (4') 0.97/2.50 /2.55 /1.07/9.57 / / .
In (4'), A /C,2N / is equivalent to (AC)2N and to (CH3CH2)2N; C X ( / A 2 ) / is equivalent to CH2C(CH3)~; and KH is equivalent to CHO. Thus, even with the high compression of information, (4') discloses every proton group as an index point and, at the same time, every fragment in the molecule and the full molecule. The proton group/chemical shift index for 1,4-naphthoquinone (Varian No. 550) by the notation system is as follows:
8.07 0 B K 7.77 I ~ 6.97 / / ~ /
~ , ~ / ~ / B R B *1 I I
B R B / \ \ / \ /
7.77 I % 6.97 B K 8.07 0 ~ .B4R2@KB2K. -~.BB2BR2@KB2K. (5)
.B /B2 /BR2@K /B2K. / / (Input) (5') 8.07/7.77/8.07 /6.97 //.
Cyclic structures in the notation system are char- acterized by left and right terminal periods, as illustrated in (5'). Both (4') and (5') contain a maxi-
mum amount of information within a minimum space allocation. Each has as many fragments as there are proton groups with chemical shifts, and each frag- ment is associated with its neighboring groups within the full molecule, and this remains true for each of the wraparound products.
Table VI illustrates the proton group/chemical shift index by the notation system for a series of alcohols in increasing chemical shift order of the methyl group protons. Table VI illustrates rather pointedly the correlative power of the indexing con- cept. Thus, the methyl group protons in Varian compound numbers 575, 282, and 43 are very much alike and similarly modified by being removed from the hydroxyl group by three or more methylene groups ; the methyl group protons in Varian compound numbers 45 and 87 show the same chemical shift effect of similar glycol neighboring groups ; the methyl group protons in Varian compound numbers 44, 14, and 86 show the same chemical shift effect of -CHOH- and -CH2OH. In methyl alcohol, in which the methyl group protons are adjacent to the hydroxyl, the chemical shift is unlike that of the other alcohols in this series but somewhat similar to the protons in the methoxy fragment of Varian compound number 120, i.e., the NMR spectrum of methyl alcohol categorizes it as containing a methoxy group (see Table XII) and a hydrogen proton, i.e., as AQ /H / / i n the notation fragment index.
IV. EFFECT OF VARIOUS ENVIRONMENTS ON THE METHYL GROUP
In view of the correlative power of the indexing concept exhibited in Table VI, it was further evalu- ated with other environments that affect the methyl group. The compounds and their chemical shift data were selected from Vols. 1 and 2 of the Varian Spectra Catalog.
Relative to the hydroxyl function (Table VI), the carbonyl function exhibits a slightly greater effect on the methyl group protons (Table VII). Thus, the methyl group shift values in the fragment CH3CH2CO- (or ACK-) range from 1.05 (in burn- none) to 1.30 (in acetamidomethylidene bispropio-
APPLIED SPECTROSCOPY 179
Table XII. Proton group/chemical shift index for the moiety CHsO- or AQ.
Varian No. Structural formula Index
120 CH~CH(OCH3)CH2CH~OH AQ) /C /C /QH //A /Y( / 3.33 /1.72/3.73/3.10//1.18/3.55/
69 CH~OCH2CH2CN
419 BrCH~CH(OCH3)~ AQ)2 / /G /C /Y( / 3.40 / / /3.37/4.54/
446 (CH30) 2CHCOOCH3
I CH3OH A /QH // 3.47/1.43//
28
112
CHaOCH2CN
CH~-CHCOOCH3 \ /
CHs
64 CH~=CHCOOCH3
AQ /C /CVZ / / 3.40/2.62/3.62 / /
AQ /Y( /AQ)K /AQ / / 3.44/4.82/3.44 /3.82//
AQ /CVZ / / 3.47/4.20 / /
AQ). / / .C2 /Y(K / 3.67 //0.93 /1.63 /
AQ / /E /BK / 3.75//5.82/6.20 /
/6.20/
106 NCCH2CH~COOCHa AQ //ZV /C /CK / 3.75// /2.68/2.68 /
I13 CH~=C (CH3) COOCH8
258 CH=CHCH3 l
0 OCH+
HCOOCH3
CH30 H
CsHsOCHa
()CH.~
I OCHa
HC-=C-CH=CH---OCHa
NCCH~COOCH3
(CHsO) 2CHCOOCH3
266
162
608
100
57
446
AQ //ED( /A)K / 3.75//5.57 /1.95 /
/6.10 /
AQ) /B /BD( /B /B /A). / / .B /BD( / 3.75 /6.80/7.23 /6.28/6.08/1.83//7.23/6.80 /
AQ / /HK / 3.77//8.08 /
AQ)B2R2@ /C /C /C /M. / / .BD( / 3 . 7 7 /2.73/2.73/3.22/3.49// /
AQ). / / .B5D( / 3.78 / / /
AQ) /B2D(/AQ) / :C. / / . J /B2 /JR2@D( / 3.79 /6.50 /3.79 /2.22 //4.17/6.83/ /
AQ //UV /B /B / 3.80//3.08/4.52/6.35/
AQ //ZV /CK / 3.82// /3.48 /
AQ //AQ, 2 /YK / 3.82//3.44 /4.82 /
180 Volume 26, Number 2, 1972
Table XII. (Continued)
Varian No. Structural formula Index 260 OCH~ AQ). / / .B /BD( /C /B /E) /BD( /QH)D( /
~ H 3.83 //6.87/6.64 /3.30/5.94/5.05/6.69 /5.59 / -OH
~CH=CH2
172 NH~ AQ)D( /MH(. //.B4D( / I 3.85 /3 .73 //6.79 /
0 -OCH3
249
284
257
576
CI COOCH3
C1- I,----S ~-OCOCH3 / \
CH~COO C1
OCH3 I
CH~=CI-I2NO2
CN /
C6Hs-CH=C \
COOCHa
AQ) /BR2@ /B /B /BZ. / / . B /BD( / 3.87 /7.00 /7.97/7.28/8.71 //7.97/7.35 /
AQ)X(L2) /Y( /AKQ)D(L)D(/AKQ). //.Y(K / 3.87 /6.63/2.20 /2.25 //6.05 /
AQ)D(/AQ). //.B3D( /B /B(NW))D( / 3.90 /3.90 //7.07 /8.20/7.73 /
AQ).//.B /B3 /BD( /BD(VZ)K / 3.96 //8.03/7.55/8.03 /8.27 /
nate). Within this range, Table VII reveals the in- creasing effect on the methyl chemical shift of the amide group, aromaticity, and the carboxyl group, relative to that in butanone.
The effect of the carbonyl group when adjacent to the methyl group is shown in Table VIII for a series of CH3COCH2- (AKC) and CH3COCH<(AKY) compounds, in which the chemical shift of methyl group protons ranges from 2.12 to 2.35. The electronic effects of neighboring groups beyond the carbonyl are quite revealing in Tables VIII-XI.
As revealed in Table XII, oxy and carboxy have a marked effect on the protons in the attached methyl group. Chemical shifts of the methoxy group attached to alicyclic saturated carbons range from 3.33 to 3.47. Methoxy groups attached to unsaturated and cyclic carbons have chemical shifts within the carbomethoxy range, viz., 3.67-3.96, with the methoxy groups falling between 3.75 and 3.90.
V. ALPHABETIZATION OF PERMUTED INDEXES
Whereas the proton group notation fragment index input is easily alphabetized to place together molecules
of similar nuclei, the wraparound indexes cannot be alphabetized meaningfully from a left-to-right com- puter alphabetization. Consequently, the computer program needs to be such that the first term in each index is examined for the right-hand parenthesis and for the double virgule. If either or both are present, the first term is then alphabetized with the final term rather than the second term; if with a double virgule subsequent alphabetization is from right to left; if with a right-hand parenthesis and a single virgule the next alphabetization is with the second term and the next to last term.
Thus, in the case of 3-methoxybutanol (Varian No. 120) in Table XII, alphabetization would place it with
A /
AQY \
C
In the case of carbmethoxycyclopropane (Varian No. 112) in Table XII, alphabetization would place it with
AQKYC2.
APPLIED SPECTROSCOPY 181
Table XIII. Alphabetical proton group/chemical shift index for the Varian No. moiety CHsO- or AQ.
Varian No. Alphabetization Index
100 AQBBVU
69 AQCCVZ
28 AQCVZ
258 AQD(B)B
266 AQD(B)B
249 AQD(]3)B
162 AQDB5
260 AQD(B)D
172 AQD(B)D
608 AQD (B) R
257 AQD(D)D
1 AQH
64 AQKBE
106 AQKCCVZ
57 AQKCVZ
113 AQKD(A)E
576 AQKD(VZ)B
9 AQKH
112 AQKYC2
284 AQKY(D)X
446 AQKYQA
120 AQY(A)C
419 AQY(AQ)CG
446 AQY(AQ)K
AQ //UV /B /B / 3.80//3.08/4.52/6.35/
AQ /C /CVZ / / 3.40/2.62/3.62 / /
AQ /CVZ / / 3.47/4.20 / /
AQ) /B /BD( /B /B /A). / / . B /BD( / 3.75 /6.8O/7.23 /6.28/6.08/1.83 //7.23/6.80 /
AQ)B2R2(~ /C /C /C /M. / / .BD( / 3 . 7 7 /2.73/2.73/3.22/3.49// /
AQ) /BRZ(~ /B /B /BZ. / / . B /BD( / 3.87 /7.00 /7.97/7.28/8.71 //7.97/7.35
AQ). / / .B5D / 3.78 / / /
AQ). / / .B /BD( /C /B /E) /BD( /QH)D( / 3.83 //6.87/6.64 /3.30/5.94/5.05/6.69 /5.59 /
AQ)D(/MH). / / .B4D( / 3.85 /3.73 //6.79 /
AQ) /B2D(/AQ) / :C. / / . J /B2 / JR2~D( / 3.79 /6.50 /3.79 /2.22 //4.17/6.83/ /
AQ)D(/AQ). / / .B3D( /B /B(NW))D( / 3.90 /3.90 //7.07 /8.20/7.73 /
A /QH / / 3.47/1.43//
AQ / / E /BK / 3.75//5.82/6.20 /
6.20
AQ //ZV /C /CK / 3.75// /2.68/2.68 /
AQ / /Z¥ /CK / 3.82// /3.48
AQ / /ED( /A)K / 3.75//5.57 /1.95 /
6.10 /
AQ). / / . B /B3 /BD( /BD(VZ)K / 3.96 //8.03/7.55/8.03 /8.27 /
AQ / /HK / 3.77//8.08 /
AQ). / / .C2 /Y(K / 3.67 //O.93 /1.63 /
AQ)X(L2) /Y( /AKQ)D(L)D(/AKQ). / / .Y (K / 3 . 8 7 /6.63/2.20 /2.25 //6.05 /
AQ //AQ, 2 /YK / 3.82//3.44 /4.82 /
AQ) /C /C /QH / /A /Y( / 3.33 /1.72/3.73/3.10//1.18/3.55/
AQ)2 / /G /C /Y( / 3.40 / / /3.37/4,54/
AQ /Y( /AQ)K /AQ / / 3.44/4.82/3.44 /3.82//
Table XII I illustrates the alphabetical order for the compounds in Table XII.
Alphabetization program logic is more clearly illus- trated with arrows in the following examples from Table XII :
100
Notation fragment alphabctization
A Q / / U V / B / n /
69 A Q / C / C V Z / /
258 [ I
AQ) / S /BD ( /B /B /A) . / / . B / ( /
172 AQ)D(/iV[H). / / . B 4 D
The program needs to be such that in compounds 258 and 172 the second and next to last terms, which are nuclei neighbors of the last term, are examined and placed in alphabetical order.
The alphabetical order in Table XI I I successfully brings together structurally similar compounds, such as compounds 69 and 28; compounds 258, 266, 162, and 249; compounds 106 and 57; compounds 113 and 576; and compounds 112, 284, 446, 120, 446, and 419. Within structurally similar compounds, the alpha- betical arrangement reveals the effect of different nuclei beyond the structural similarity, such as for AQKY-.
VI. CONCLUSIONS
Magnetic variations of the same proton group arises from subtle, and sometimes obvious, differences in shielding. The extent of the differences depend on the magnetic anisotropy of many or all of the other groups in the molecules and their orientations with respect to other nuclei.
The indexing concept presented in this paper pro- vides a new method for relating groups with neigh- boring groups and with every group in each molecule ; for correlating the same proton group with the various environmental groups it is associated with in different molecules; and for computer processing of a single input per molecule to yield a proton group/chemical shift index under as many retrieval points as there are proton groups in each molecule, alphabetically by the proton groups or numerically by the chemical shift data.
1. "Chemical Shift Index Chart," Sadtler Research Laboratories , Inc., Philadelphia, Pa., 1967.
2. N M R Spectra Catalog (Varian, Palo Alto, Calif, 1962 and 1963), Vols. 1 and 2.
3. H. Sholnik, J. Chem. Doc. 10, 216 (1970). 4. H. Skolnik, J. Heterocyclic Chem. 6, 689 (1969). 5. I I . Skolnik, J. Chem. Doc. 11, 120 (1971). 6. H. Skolnik, "A Chemical F r a g m e n t Nota t ion Index , " J.
Chem. Doc. 11, 142 (1971).
182 Volume 26, Number 2, 1972