+ All Categories
Home > Documents > A Computerized System for Storing, Retrieving, and Correlating NMR Data

A Computerized System for Storing, Retrieving, and Correlating NMR Data

Date post: 30-Sep-2016
Category:
Upload: herman
View: 215 times
Download: 0 times
Share this document with a friend
10
A Computerized System for Storing, Retrieving, and Correlating NMR Data FEATURE ARTICLE Herman Skolnik Hercules Incorporated, Research Center, Wilmington, Delaware •9899 (Received 20 August 1971) A new indexing concept, using the author's notation system, is described for storing, retrieving, and correlating NMR data via computer processing. INDEX HEADINGS:Computerized data system for NMR; Indexing system; Notation system. High resolution nuclear magnetic resonance (NMR) spectroscopy is an important tool in organic chemistry for qualitative and quantitative analyses and in con- firmation of structures, resolution of conformational problems, and study of rate processes. The importance and value of NMR arise from the unique and char- acteristic chemical shifts of protons in organic com- pounds. The resultant NMR chemical shift spectrum, although primarily due to the protons in each proton group, is modified by the presence of other nuclei in the molecule. The connection between the chemical shift value and structure is thus somewhat empirical. A meaningful correlative index of chemical shift values is essential for the effective use of NMR data. The Sadtler Chemical Index 1 and the Varian Func- tional Group Index 2 have been used extensively for relating basic proton and environmental groups with proton chemical shifts. Because these two methods require manual indexing from as many viewpoints as there are proton groups and because neither method includes more than a few atoms of the full structure, a method was introduced recently that eliminated these two disadvantages. 3 The new correlative method was based on a linear notation system 4 in which symbols are used to express various combinations of carbon and hydrogen, such as CH3-, -CH2-, >CH-, -CH=, ~CH, =CH2, and other atoms and hydrogen, such as -NH~, >NH, -OH, and -SH. The notation of each organic com- pound and the chemical shifts of each proton group in the compound constituted the computer input which, in turn, was processed in the computer by a permutation or wraparound program that yielded as Table I. Computer output of notation and chemical shifts for propyl alcohol. Notation and proton group Chemical shift H QCCA ]2~1-3.58-1.57-0.92 HQ C CA 2.28-13.58]-1.57-0.92 HQC C A 2.28-3.58-]1.571-0.92 HQCC A 2.28-3.58-1.57-10.9__2_ ] many index entries as there were proton groups (with their chemical shift data) in each molecule. Thus, for propyl alcohol, CH3CH2CH20H or ACCQH by the notation system, the output contained four proton group retrieval points from the single input as shown in Table I. In addition, a computer program was written to yield an output by increasing value of the chemical shifts of any proton group relative to its neighboring groups. Thus, with one input per compound, the method yields as many outputs as there are proton groups and a printout by increasing chemical shift values. Applying the system to the 700 spectra in Vols. 1 and 2 of the Varian Spectra Catalog disclosed one major disadvantage: The linewidth for output required an allocation of 80 characters for the notation and 120 for the chemical shift data. Thus, even large- size computer paper could not contain the complet e notation and all associated chemical shifts for many Table II. Notation symbols. Single-bonded Carbons Fused or Double-bonded Triple-bonded bridgehead -CHz A >CH~ C >CH- Y >C< X Carbonyl =CH~ E ~-CH U >CH- J =CH- B ~C- V > C < T =C< D =C< R =C= X Other elements and carbonyl Halogen Oxygen Nitrogen Other >C=0 K -CH=O KH O -C-~OH KQH -F F -O- Q -Br G =O Q O -I I CO W -CI L -O-O- Q2 -OH QH > NH M -H H -NH2 MH -S- S >N- N -SH SH ~N Z =S S =N- Z >SO SQ -N=O ZQ >SO2 SW =NOH ZQH P P -NO2 ZW # ® 1 & L Character symbols Cyclic moiety Atoms between bridgeheads in a cyclic moiety Fused or bridgehead atom other than carbon; placed following notation symbol Ionic form Symbol for condensed ring moieties -1 Polymer repeating unit Used to denote atoms (other than above) with atomic symbol, e.g., &NA for sodium Cis, beta (rosins and steroids), cxo, and axial designation Trans, alpha (rosins and steroids), endo, and equatorial designation Volume 26, Number 2, 1972 APPLIED SPECTROSCOPY 173
Transcript
Page 1: A Computerized System for Storing, Retrieving, and Correlating NMR Data

A Computerized System for Storing, Retrieving, and Correlating NMR Data

FEATURE ARTICLE

Herman Skolnik

Hercules Incorporated, Research Center, Wilmington, Delaware •9899

(Received 20 August 1971)

A new indexing concept, using the author's notation system, is described for storing, retrieving, and correlating NMR data via computer processing. INDEX HEADINGS: Computerized data system for NMR; Indexing system; Notation system.

High resolution nuclear magnetic resonance (NMR) spectroscopy is an impor tant tool in organic chemistry for qualitative and quant i ta t ive analyses and in con- firmation of structures, resolution of conformational problems, and s tudy of rate processes. The importance and value of N M R arise from the unique and char- acteristic chemical shifts of protons in organic com- pounds. The resultant N M R chemical shift spectrum, although primarily due to the protons in each proton group, is modified by the presence of other nuclei in the molecule. The connection between the chemical shift value and structure is thus somewhat empirical.

A meaningful correlative index of chemical shift values is essential for the effective use of N M R data. The Sadtler Chemical Index 1 and the Varian Func- tional Group Index 2 have been used extensively for relating basic proton and environmental groups with proton chemical shifts. Because these two methods require manual indexing from as many viewpoints as there are proton groups and because neither method includes more than a few atoms of the full structure, a method was introduced recently tha t eliminated these two disadvantages. 3

The new correlative method was based on a linear notat ion system 4 in which symbols are used to express various combinations of carbon and hydrogen, such as CH3-, -CH2-, > C H - , -CH=, ~CH, =CH2, and other atoms and hydrogen, such as -NH~, > N H , -OH, and -SH. The notat ion of each organic com- pound and the chemical shifts of each proton group in the compound const i tuted the computer input which, in turn, was processed in the computer by a permutat ion or wraparound program tha t yielded as

Table I. Computer output of notation and chemical shifts for propyl alcohol.

Notation and proton group Chemical shift

H QCCA ]2~1-3.58-1.57-0.92 HQ C CA 2.28-13.58]-1.57-0.92

HQC C A 2.28-3.58-]1.571-0.92 HQCC A 2.28-3.58-1.57-10.9__2_ ]

many index entries as there were proton groups (with their chemical shift data) in each molecule. Thus, for propyl alcohol, CH3CH2CH20H or ACCQH by the notat ion system, the output contained four proton group retrieval points from the single input as shown in Table I.

In addition, a computer program was writ ten to yield an output by increasing value of the chemical shifts of any proton group relative to its neighboring groups. Thus, with one input per compound, the method yields as many outputs as there are proton groups and a pr intout by increasing chemical shift values. Applying the system to the 700 spectra in Vols. 1 and 2 of the Varian Spectra Catalog disclosed one major disadvantage: The linewidth for output required an allocation of 80 characters for the notat ion and 120 for the chemical shift data. Thus, even large- size computer paper could not contain the complet e notat ion and all associated chemical shifts for many

Table II. Notation symbols.

Single-bonded

Carbons Fused or

Double-bonded Triple-bonded bridgehead

-CHz A >CH~ C > C H - Y > C < X

Carbonyl

=CH~ E ~-CH U > C H - J =CH- B ~C- V > C < T =C< D =C< R =C= X

Other elements and carbonyl

Halogen Oxygen Nitrogen Other

>C=0 K -CH=O KH

O

-C-~OH KQH

- F F - O - Q - B r G =O Q

O

- I I CO W

-CI L - O - O - Q2 - O H QH

> NH M - H H -NH2 MH - S - S

> N - N - S H S H

~N Z =S S =N- Z >SO SQ -N=O ZQ >SO2 SW =NOH ZQH P P -NO2 ZW

# ® 1 &

L

Character symbols Cyclic moiety

• Atoms between bridgeheads in a cyclic moiety Fused or bridgehead atom other than carbon; placed following notation symbol Ionic form Symbol for condensed ring moieties

-1 Polymer repeating unit Used to denote atoms (other than above) with atomic symbol, e.g., &NA for sodium Cis, beta (rosins and steroids), cxo, and axial designation Trans, alpha (rosins and steroids), endo, and equatorial designation

Volume 26, Number 2, 1972 APPLIED SPECTROSCOPY 173

Page 2: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table III. Notations for acyclic compounds.

Varian No. Structure Notation

1 CH3OH AQH 14 CH3CH20H ACQH 43 CH~CH2CH~OH AC2QH 44 (CHa) 2CHOH A2YQH

120 CH3CH(OCHa)CH~CH20H AY(AQ)C2QH 282 CH3 (CH2)90H AC9QH 423 (CH3) 3COH A3XQH

6 CHaCHO AKH 8 CH3COOH AKQH 9 HCOOCH3 HKQA

25 CH3CHC1COOH AY(L)KQH 39 HCON (CH3) 2 HKNA2 79 CH3COOCH~CH3 AKQCA 76 CHaCH~COCH3 ACKA

277 CH3COCHCH2COOCH2CH3 AKY (KQCA)CKQCA I COOCH2CIIa

215 CH3COOCH2 AKQC, 2

CH3CO0~H~

compounds in the Varian Spectra Catalog, and we had to resort to a double line per compound or to limit the chemical shift da ta to the indexed pro ton group in the output . But even allowing 80 spaces for the notat ion does not make for a line as readable as we would like. For tuna te ly , work on using the notat ion sys tem for producing a nota t ion symbol index 5 and a chemical f ragment nota t ion index G led to a great ly improved sys tem for N M R d a t a - - t h e subject of this paper.

I. T H E N O T A T I O N SYSTEM

The nota t ion sys tem employs a set of nota t ion symbols, listed in Table I I , t ha t designates the bonding and number of hydrogens associated with carbon in par t icular and with other a toms in general. Allocating 12 nota t ion symbols (single let ters of the alphabet) to carbon (denoting bonding, number of a t t ached hydrogen, and fused or br idgehead a tom) is in ha rmony with the predominance of carbon and hydrogen a toms in organic chemistry. Because of the high occurrence of the carbonyl group, it is assigned a separate nota t ion symbol, viz., K.

Chemical s t ructures are represented with the notat ion sys tem in an a t o m - b y - a t o m correspondence with the s t ructural formula as usually drawn, i.e., for acyclic compounds f rom left to right with functional groups at the r ight-end position, or for cyclic struc- tures f rom the a tom having the highest position number to the a tom in position 1. Thus, the notat ion sys tem is in accord with accepted number ing schemes as well as with the convent ion of drawing s t ructural formulas. This is i l lustrated in Table I I I for acyclic compounds and in Table IV for cyclic compounds.

Cis and trans isomers are differentiated by the characters I for cis a n d - for trans, as i l lustrated in the

Table IV. Notations for cyclic compounds.

Yariaa No. Structure Notation

3 2

32 CH2-CHCH3 . CY(A) Q. \ /

O

3 2 33 CH2-CH~ . C3Q.

4

NH

94 ~ ~ C H O . B3D (KI-I)S.

S

157 CHa . B5D (A).

4

161 CH~OH . B5D (CQH). [

4

124 OH . B4D (QH) D (QH).

4

550 0 . B4R2(~KB2K. 8 II

6 3

O

552 O . B4R2(~KD (A)BQ.

e ~ / ~ - C H 3

487 . i ~ . 32 . B2JB2J: C.

a

272 CH3 . X (A2)JCBD (A)J: C. 21

CH3 ~.~'~

CH3 s

174 Volume 26, Number 2, 1972

Page 3: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table V. Relating proton groups via structural formula.

Varian Structural formula No. and chemical shifts

1 CH3---OH 3.47 1.43

14 CHa-CH2--OH 1.2 2.5 3.7O

43 CH3-CH2-CH2-OH 0.92 1.57 3.58 2.28

44 (CH3) ~-CH-OH 1.20 4.00 1.60

following notations for cis- and trans-crotonic acids:

C H - C H 3 C H - C H a

II II CH-COOH HOOC-CH

cis- trans-

AB2KQH ( [ ) AB2KQH (_)

These characters thus allow us to differentiate between the methyl chemical shift (2.03) in cis-crotonic acid and (1.90) in trans-crotonic acid.

II. RELATING PROTON GROUPS VIA STRUCTURAL FORMULA

The most meaningful method for relating each proton group in a molecule with its chemical shift is via the structural formula as illustrated in Table V. Writing the structural formula with sufficient space between the proton fragments for assigning the chemical shift under each proton fragment yields a highly readable format. This format can be converted into an index format by using a virgule (or slash) to separate the proton fragments and a double virgule

Table VI. Proton group/chemical shift index for a series of alcohols in increasing chemical shift order of the methyl group.

Varian No. Formula and notation Index

575 CHa(CH2)~CH(OH)CH2NH2 A /C /C6/Y(/QII) /C / M H / / AC7Y(QH)CMtI 0.89/1.28/ /3.50/1.75 /2.51/1.75//

282 CH,(CHgsCtI~OH A /C /C7/C / Q H / / AC9QH 0.91/1.28/ /3.63/2.03//

43 CH3CH2CH20It A /C /C /QH// AC2QH 0.92/1.57/3.58/2.28//

45 CHaCH(OH)CH2CH2OH AY(/QH)C/QH// AY(QH)C2QIt 1.12 /4.28 /4.28//

87 CH3CH(OH)CH(OH)CH3 A /Y(/QH) /Y(/QH) /A / / AY(QH)Y(QH)A 1.13/3.80/3.05 /3.80/3.05 /1.13//

A //A /Y( /QH) /Y( /QH) / 1.13//1.13/3.80/3.05 /3.80/3.05 /

120 CH3Ctt(OCHa)CH~CH2OH A /Y(/AQ) /C /C / Q H / / AY(QA)C2QH 1.18/3.10/3.33/3.10/3.73/3.10//

44 (CHs)~CHOH A /Y( /A) / Q H / / A2YQH 1.20/4.00/1.20/1.60//

14 CHsCH2OH A /C /QH// ACQH 1.22/2.58/3.70//

86 CH~CH(OH)CHsCH~OH A /Y( /QH)/C /C / Q H / / AY (QH)C2QH 1.23/3.80/3.61 /1.68/3.80/3.61//

120 See above AQ) /C /C /QH//A /Y( / 3.33 /3.10/3.73/3.10 //1.18/3.10/

1 CHaOH A / Q H / / AQH 3.47/1.43//

to denote the end of the molecule, as follows for propyl alcohol :

CH3 /CH~ /OH2 /OH / / (1) 0.92 /1.57 /3.58 /2 .28 / / .

Subscripts, however, are not commonly available on most computer print trains. Consequently it is more practical to input the following format:

CH3 /CH2 /CH2 /OH / / (23) 0.92 /1.57 /3.58 /2 .28 / / .

Table VII. Proton group/chemical shift index for the moiety CH3CH2C(=O)- or ACK.

Varian No. Structural formula Index

76 CH3CH2COCH3 A /CK /A // 1.05/2.45/2.13//

40 CHaCH~CONH~

206

£L CH3 0 COCH2CH3

A / C K / M H / / 1.13/2.23/6.42 / /

A /CK)Q. / / . D ( / A ) /B / B D ( / 1.20/2.78 / / /2.38/6.13/7.08 /

163 A /CK)S . / / . B /B / B D ( / ~ L 1.23/2.93//7.60/7.10/7.70/

S COCH~CH3

215 CH3CH~COOCH2 A /CKQ /C, 2 / / I 1.25/4.15 /2.62 / /

CHaCH2COOCH2

547 CH3CONHCH(OCOCH~CtI3)2 A /CKQ)2 / / A K / M /Y( / 1.30/4.28 / /2 .08/6 .60/5 .19/

APPLIED SPECTROSCOPY 175

Page 4: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table VIII. Proton group/chemical shift index for the moieties CHsCOCH~- or AKC and CH3COCH < or AKY.

Varian No. Structural formula Index

139 (CH3)~CHCH2COCH3 AN //A2Y /C / 2.12//0.93 /2.28/

76 AK //A /C / 2.13//1.05/2.47/

345

CH3CH2COCH3

COCH3 AK). //.C2K /BRJ@C2J@C2T( /A)J@C2Y( / 2.31 / / /5.87 /0.70 /

545 CHCH~COCH3 AK). //.C4D( /B /C / ]l 2.15 //1.67 /5.43 /3.13/

C6HaNHCOCH2COCH~ AK). //.B5D( /MK /C / 2.17 / / /9.34 /3.52/

(C,Hs)~CHCOCH3 AK). //.B5D, 2( /Y / 2.20 //7.25 /5.08/

256

318

277 COOCH~CH3 CIt3COCH(

CH2COOCH~CH3

AK /Y(KQC /A)( /CKQC /A) / / 2.35/3.98 /1.26/2.87 /1.26//

With (23) as input, computer processing, by means of a wraparound program, yields the following in addi- tion to (2a) :

CH2 /CH2 /OH / /CH3 / (2b) 1.57 /3.58 /2.28//0.92 /,

CH2 /OH / /CH3 /CH2 / (2c) 3.58 /2.28//0.92 /1.57 /,

OH / /CH3 /CH2 /CH2 / (2d) 2.28//0.92 /1.57 /3.58 /.

The wraparound program and computer alphabet- ization of the output results in a proton group/ chemical shift index by as many ways as there are proton groups in each molecule from a single input [such as (23)]. This is in contrast to the Sadtler and Varian indexes, which require manual indexing from as many viewpoints as there are proton groups. Furthermore, neither of the two systems shows the full molecule in the index.

It is quite apparent that a computerized system using the above structural formula fragment concept,

Table IX. Proton group/chemical shift index for the moiety CH3COCH= or AKB.

Varian No. Structural formula Index

616 CH=CHCOCH3 AK /B /B)X( /A)( /A)C /C /BD(/A). //.Y( / 2.26/6.06/6.60 /0.86/0.93 /2.05/5.50 /1.57//2.30 /

CH3

~ / J CH~

617 CH=CHCOCH3 AK /B /B). // .X( /A2)C2 /CD( /A)D( / I 2.31/6.11/7.28// /1.07 /2.09 /1.78 /

CH3.. 2

.2" CH3 ~V j

251 CsHsCH=CHCOCHa AK /B /B). //.B5D(9/ 6.35/6.70/7.48// /

176 Volume 26, Number 2, 1972

Page 5: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table X. Proton group/chemical shift index for the moiety CH3CONH- or AKM.

Varian No. Structural formula Index 265 C6H~CH~CH2NHCOCH~ AK /ivl /C /C). //.B5D( /

1.90/6,50/3.48/2.80//7.25 /

689 AK /M) /C2R2®D3(AQ, 3) /B. //.B /BD( /AQ)K 1.96/8.38/2.43 /6.55//7.37/6.92 /3.67 CH ,O -S '- , / - ---~

III N.ooo.

I / / \\ CH30 \~

CH,O / %0

690 C H 3 0 - ~ ( ~NHCOCH3

CH30 NN~v/~ = 0

/BR2® /Y / /7.63 /4.62/

AK /5{)R2@ /J /J /BD( /AQ)K.//.BD3(AQ, 3)R2@C2 /Y( / 2.07/6.02 /3.63/4.12/6.68 /3.73 //6.49 /4.84/

I OCH~

547 CH~CONHCH(OCOCH2CH~)2 AK /M /Y( / i /CKQ)2 / / 2.08/6.60/5.19/1.30/4.28 //

491 CN AK /M /Y(VZ)KQ /C /A // J 2.12/6.75/5.50. /4.36/1.37//

CH3CONHCH \

COOCH2CH3

NHCOCH~

CHa

NHCOCH3 I

0 OCH2CH3

239

267

AK /M). //.B /BD(/A) /B /BD( / 2.12/7.88 //7.37/7.12 /2.30/7.12/7.37 /

AK /M). //.B /BD( /A /CQ) /B /BD( / 2.12/7.91 //7.41/6.83 /1.38/4.00 /6.83/7.4l /

regardless of how inviting it is, would be awkward and space-consuming for more complex structures. Structural formula fragments are particularly difficult to delineate for cyclic structures without invoking arbitrary rules.

These difficulties, however, disappear when the notation system is used to delineate the fragments in a molecular structure.

III. RELATING PROTON GROUPS VIA THE NOTATION SYSTEM

Because the notation system corresponds essentially atom by atom with the way structural formulas are drawn, uses the same numbering rules as generally accepted, and is based on combinations of carbon and hydrogen, it is particularly suitable for relating proton groups with chemical shifts. Moreover, the notation

system is completely compatible with computer processing operations.

Input by the notation system for propyl alcohol and computer processing of the fragments yield the following :

h /C /C /QH / / (Input) (3a) 0.92/1.57/3.58/2.28// ,

C /C /QH / / i / (3b) 1.57/3.58/2.28//0.92/,

C /QH / / A /C / (3c) 3.58/2.28//0.92/1.57/,

QH / / A /C /C / (3d) 2.28//0.92/1.57/3.58/.

It is quite apparent that (1), (2a), and (3a) are equivalent as are their products from computer

APPLIED SPECTROSCOPY 177

Page 6: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table XI. Proton group/chemical shift index for the moiety CH.~COO- or AKQ.

Varian No. Structural formula Index

642 "'-\t~-/ XXT /O--'---r-CH~OCOCH3 AKQ /C)Q.). / / . B /BKMKN( / .YY( /AKQ)Y( /AKQ)Y( / _ _ _ _ ) ~ - - k , } OCOCHIa 1.92 /3.80 ~/ % I /3.87 / /

N //7.65/5.81 /5.83 /1.92 /1.92 ii O OCOCH ~

356

261

79

140

182

362

361

353

692

167

242

242

65

440

326

ph-ffO~fA)~OClt 3 ( ~ I - O C O C I I , ~

N:NPh

AKQ)Y(Z2.B5D.)J20)CQ /Y(.B5D.)Q. / / .Q / y ( / A Q ) /Y( / 1.92 /5.58 / / /5.07/3.47 /5.63/

C6HsCH~CH2OCOCI-],~ AKQ). / / .B5D( /C /C / 2.02 //7.29 /4.30/2.93/

CH3COOCH2CI-I3 AKQ /C /A / / 2.03 /4.12/1.25//

CHzCOO(CHD3CH~ AKQ /C3 /A / / 2.03 /4.05/0.95//

CtI3COOCtI (CHa)CH~COOCH~

CIt3 I CH (CH2) 2COOCH~

CH3COO

CHaCOO COCH:

CH3CO0

OCOCH3

0

C H 3 C O O ~ ~ 0

I ~ H : C H - ~ - - o c o c ~ CH3COO

~ O~-'CH2OCOCH 3

CH~COOCH (CH~OCOCH3) 2

See above

CHaCOOCH=CH2

CH~COOC (CH3) =CH,

N \

l If :x- : ' ~ / \ N/II (,

AKQ / Y ( / A ) /CKQ /A / / 2.03 /5.30/1.32/2.60 /3.70//

AKQ)CJT( /A)~C2J2(o)CKT( /A)J(~C2Y(AYC2KQ /A). / / .C2 /Y( / 2.03 /1.03 /1.03 /3 .70/ / /4.70/

AKQ)CJT( /A)(~C2J2(~C /Y( /AKQ)T( /A)J(~C2 /Y( / A K ) . / / . C 2 /Y( / 2.04 /0.92 /5.17/2.04 /0.70 /2.98/2.18 / / /4.73/

AKQ). // .C2KCJT( /A)~C2J2~C2T( /A)J(,~C2 /Y( / 2.05 / / /1.03 /0.80 /4.62/

AKQ)R2@D( /B /B.D /B /BD(/AKQ) /B /B.) /BKQ / / . B D ( / A K Q ) /BD( / 2.06 /7.33/6.89 /7.53/7.15 /2.32 /7.15/7.53 /6.36 //6.83 /2.32 /7.08 /

AKQ /C)Q. / / . B /B /BD( / 2.07 /5.03 //6.38/6.35/7.40 /

AKQ /Y( /AKQ /C)2 / / 2.08 /5.25/2.08 /4 .22/ /

AKQ /C)2 //AKQ /Y( / 2.08 /4.22//2.08 /5.25/

AKQ /B /E / / 2.12 /7.25/4.55//

/4.85//

AKQD(/A) /rE / / 2.12 /1.93/4.69//

AKQ /C)Q.)BZ. // .ZBZD( /MH)R2(~N(.Y /Y( /AKQ) /Y( /AKQ) /Y( / 2.12 /4.43 / / /6.37 /5.95/2.12 /5.70/2.12 /4.43/

178 Volume 26, Number 2, 1972

Page 7: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table XI. (Continued)

N 324 I'~ N----N

nN~x I) I[ N - O () ~ ( ~-CH,OCOCHa

CHaCOO-L_._J-OCOCH,

~ - ' - - N ~ O--i---CHiOCOCH~ 534 O=\ ___~¢~ ' - - ~ ,___i__ OCOCH,

~ O OCOCH.~

554 CHaCOOCHC~H~ I COOH

AKQ /C)Q.)BZ. / / .ZB /MKR2@N( / .Y /Y( /AKQ) /Y( /AKQ /Y( / 2.12 /4.42 / / /13.1 /6.17/5.88/2.12 /5.62/2.12 /4.42/

AKQ /C)Q.) / / . K M K /BZN( / .Y2( /AKQ)Y( /AKQ)Y / 2.13 /4.28 / / /7.67 /6.14 /2.13 /2.13 /

/4.32 / /

AKQ)). / / .B5D( /Y( /KQH)( / 2.18 //7.41 /5.95/9.88 /

processing Ecompare (2b), (2c), and (2d) with (3b), (3c), and (3d)~. The advantage of the notation system over the structural formula fragments Ee.g., (3a) over (2a) or (1)1 is the use of single letters to denote com- binations of carbon and hydrogen and the bonding associated with the carbon. This advantage is par- ticularly pertinent with highly substituted or complex acyclic compounds and in cyclic structures. For example, the proton group/chemical shift index for 3-diethylamino-2,2-dimethylpropionaldehyde (Varian No. 548) by the notation system is as follows:

0.97 I

(CH3CH2)2NCH2C(CH~)2CHO I I I I 2.50 2.55 1 .07 9.57

~AC, 2NCX(A2)KH, (4)

A /C,2N /CX( /A2) / K H / / (Input) (4') 0.97/2.50 /2.55 /1.07/9.57 / / .

In (4'), A /C,2N / is equivalent to (AC)2N and to (CH3CH2)2N; C X ( / A 2 ) / is equivalent to CH2C(CH3)~; and KH is equivalent to CHO. Thus, even with the high compression of information, (4') discloses every proton group as an index point and, at the same time, every fragment in the molecule and the full molecule. The proton group/chemical shift index for 1,4-naphthoquinone (Varian No. 550) by the notation system is as follows:

8.07 0 B K 7.77 I ~ 6.97 / / ~ /

~ , ~ / ~ / B R B *1 I I

B R B / \ \ / \ /

7.77 I % 6.97 B K 8.07 0 ~ .B4R2@KB2K. -~.BB2BR2@KB2K. (5)

.B /B2 /BR2@K /B2K. / / (Input) (5') 8.07/7.77/8.07 /6.97 //.

Cyclic structures in the notation system are char- acterized by left and right terminal periods, as illustrated in (5'). Both (4') and (5') contain a maxi-

mum amount of information within a minimum space allocation. Each has as many fragments as there are proton groups with chemical shifts, and each frag- ment is associated with its neighboring groups within the full molecule, and this remains true for each of the wraparound products.

Table VI illustrates the proton group/chemical shift index by the notation system for a series of alcohols in increasing chemical shift order of the methyl group protons. Table VI illustrates rather pointedly the correlative power of the indexing con- cept. Thus, the methyl group protons in Varian compound numbers 575, 282, and 43 are very much alike and similarly modified by being removed from the hydroxyl group by three or more methylene groups ; the methyl group protons in Varian compound numbers 45 and 87 show the same chemical shift effect of similar glycol neighboring groups ; the methyl group protons in Varian compound numbers 44, 14, and 86 show the same chemical shift effect of -CHOH- and -CH2OH. In methyl alcohol, in which the methyl group protons are adjacent to the hydroxyl, the chemical shift is unlike that of the other alcohols in this series but somewhat similar to the protons in the methoxy fragment of Varian compound number 120, i.e., the NMR spectrum of methyl alcohol categorizes it as containing a methoxy group (see Table XII) and a hydrogen proton, i.e., as AQ /H / / i n the notation fragment index.

IV. EFFECT OF VARIOUS ENVIRONMENTS ON THE METHYL GROUP

In view of the correlative power of the indexing concept exhibited in Table VI, it was further evalu- ated with other environments that affect the methyl group. The compounds and their chemical shift data were selected from Vols. 1 and 2 of the Varian Spectra Catalog.

Relative to the hydroxyl function (Table VI), the carbonyl function exhibits a slightly greater effect on the methyl group protons (Table VII). Thus, the methyl group shift values in the fragment CH3CH2CO- (or ACK-) range from 1.05 (in burn- none) to 1.30 (in acetamidomethylidene bispropio-

APPLIED SPECTROSCOPY 179

Page 8: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table XII. Proton group/chemical shift index for the moiety CHsO- or AQ.

Varian No. Structural formula Index

120 CH~CH(OCH3)CH2CH~OH AQ) /C /C /QH //A /Y( / 3.33 /1.72/3.73/3.10//1.18/3.55/

69 CH~OCH2CH2CN

419 BrCH~CH(OCH3)~ AQ)2 / /G /C /Y( / 3.40 / / /3.37/4.54/

446 (CH30) 2CHCOOCH3

I CH3OH A /QH // 3.47/1.43//

28

112

CHaOCH2CN

CH~-CHCOOCH3 \ /

CHs

64 CH~=CHCOOCH3

AQ /C /CVZ / / 3.40/2.62/3.62 / /

AQ /Y( /AQ)K /AQ / / 3.44/4.82/3.44 /3.82//

AQ /CVZ / / 3.47/4.20 / /

AQ). / / .C2 /Y(K / 3.67 //0.93 /1.63 /

AQ / /E /BK / 3.75//5.82/6.20 /

/6.20/

106 NCCH2CH~COOCHa AQ //ZV /C /CK / 3.75// /2.68/2.68 /

I13 CH~=C (CH3) COOCH8

258 CH=CHCH3 l

0 OCH+

HCOOCH3

CH30 H

CsHsOCHa

()CH.~

I OCHa

HC-=C-CH=CH---OCHa

NCCH~COOCH3

(CHsO) 2CHCOOCH3

266

162

608

100

57

446

AQ //ED( /A)K / 3.75//5.57 /1.95 /

/6.10 /

AQ) /B /BD( /B /B /A). / / .B /BD( / 3.75 /6.80/7.23 /6.28/6.08/1.83//7.23/6.80 /

AQ / /HK / 3.77//8.08 /

AQ)B2R2@ /C /C /C /M. / / .BD( / 3 . 7 7 /2.73/2.73/3.22/3.49// /

AQ). / / .B5D( / 3.78 / / /

AQ) /B2D(/AQ) / :C. / / . J /B2 /JR2@D( / 3.79 /6.50 /3.79 /2.22 //4.17/6.83/ /

AQ //UV /B /B / 3.80//3.08/4.52/6.35/

AQ //ZV /CK / 3.82// /3.48 /

AQ //AQ, 2 /YK / 3.82//3.44 /4.82 /

180 Volume 26, Number 2, 1972

Page 9: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table XII. (Continued)

Varian No. Structural formula Index 260 OCH~ AQ). / / .B /BD( /C /B /E) /BD( /QH)D( /

~ H 3.83 //6.87/6.64 /3.30/5.94/5.05/6.69 /5.59 / -OH

~CH=CH2

172 NH~ AQ)D( /MH(. //.B4D( / I 3.85 /3 .73 //6.79 /

0 -OCH3

249

284

257

576

CI COOCH3

C1- I,----S ~-OCOCH3 / \

CH~COO C1

OCH3 I

CH~=CI-I2NO2

CN /

C6Hs-CH=C \

COOCHa

AQ) /BR2@ /B /B /BZ. / / . B /BD( / 3.87 /7.00 /7.97/7.28/8.71 //7.97/7.35 /

AQ)X(L2) /Y( /AKQ)D(L)D(/AKQ). //.Y(K / 3.87 /6.63/2.20 /2.25 //6.05 /

AQ)D(/AQ). //.B3D( /B /B(NW))D( / 3.90 /3.90 //7.07 /8.20/7.73 /

AQ).//.B /B3 /BD( /BD(VZ)K / 3.96 //8.03/7.55/8.03 /8.27 /

nate). Within this range, Table VII reveals the in- creasing effect on the methyl chemical shift of the amide group, aromaticity, and the carboxyl group, relative to that in butanone.

The effect of the carbonyl group when adjacent to the methyl group is shown in Table VIII for a series of CH3COCH2- (AKC) and CH3COCH<(AKY) compounds, in which the chemical shift of methyl group protons ranges from 2.12 to 2.35. The electronic effects of neighboring groups beyond the carbonyl are quite revealing in Tables VIII-XI.

As revealed in Table XII, oxy and carboxy have a marked effect on the protons in the attached methyl group. Chemical shifts of the methoxy group attached to alicyclic saturated carbons range from 3.33 to 3.47. Methoxy groups attached to unsaturated and cyclic carbons have chemical shifts within the carbomethoxy range, viz., 3.67-3.96, with the methoxy groups falling between 3.75 and 3.90.

V. ALPHABETIZATION OF PERMUTED INDEXES

Whereas the proton group notation fragment index input is easily alphabetized to place together molecules

of similar nuclei, the wraparound indexes cannot be alphabetized meaningfully from a left-to-right com- puter alphabetization. Consequently, the computer program needs to be such that the first term in each index is examined for the right-hand parenthesis and for the double virgule. If either or both are present, the first term is then alphabetized with the final term rather than the second term; if with a double virgule subsequent alphabetization is from right to left; if with a right-hand parenthesis and a single virgule the next alphabetization is with the second term and the next to last term.

Thus, in the case of 3-methoxybutanol (Varian No. 120) in Table XII, alphabetization would place it with

A /

AQY \

C

In the case of carbmethoxycyclopropane (Varian No. 112) in Table XII, alphabetization would place it with

AQKYC2.

APPLIED SPECTROSCOPY 181

Page 10: A Computerized System for Storing, Retrieving, and Correlating NMR Data

Table XIII. Alphabetical proton group/chemical shift index for the Varian No. moiety CHsO- or AQ.

Varian No. Alphabetization Index

100 AQBBVU

69 AQCCVZ

28 AQCVZ

258 AQD(B)B

266 AQD(B)B

249 AQD(]3)B

162 AQDB5

260 AQD(B)D

172 AQD(B)D

608 AQD (B) R

257 AQD(D)D

1 AQH

64 AQKBE

106 AQKCCVZ

57 AQKCVZ

113 AQKD(A)E

576 AQKD(VZ)B

9 AQKH

112 AQKYC2

284 AQKY(D)X

446 AQKYQA

120 AQY(A)C

419 AQY(AQ)CG

446 AQY(AQ)K

AQ //UV /B /B / 3.80//3.08/4.52/6.35/

AQ /C /CVZ / / 3.40/2.62/3.62 / /

AQ /CVZ / / 3.47/4.20 / /

AQ) /B /BD( /B /B /A). / / . B /BD( / 3.75 /6.8O/7.23 /6.28/6.08/1.83 //7.23/6.80 /

AQ)B2R2(~ /C /C /C /M. / / .BD( / 3 . 7 7 /2.73/2.73/3.22/3.49// /

AQ) /BRZ(~ /B /B /BZ. / / . B /BD( / 3.87 /7.00 /7.97/7.28/8.71 //7.97/7.35

AQ). / / .B5D / 3.78 / / /

AQ). / / .B /BD( /C /B /E) /BD( /QH)D( / 3.83 //6.87/6.64 /3.30/5.94/5.05/6.69 /5.59 /

AQ)D(/MH). / / .B4D( / 3.85 /3.73 //6.79 /

AQ) /B2D(/AQ) / :C. / / . J /B2 / JR2~D( / 3.79 /6.50 /3.79 /2.22 //4.17/6.83/ /

AQ)D(/AQ). / / .B3D( /B /B(NW))D( / 3.90 /3.90 //7.07 /8.20/7.73 /

A /QH / / 3.47/1.43//

AQ / / E /BK / 3.75//5.82/6.20 /

6.20

AQ //ZV /C /CK / 3.75// /2.68/2.68 /

AQ / /Z¥ /CK / 3.82// /3.48

AQ / /ED( /A)K / 3.75//5.57 /1.95 /

6.10 /

AQ). / / . B /B3 /BD( /BD(VZ)K / 3.96 //8.03/7.55/8.03 /8.27 /

AQ / /HK / 3.77//8.08 /

AQ). / / .C2 /Y(K / 3.67 //O.93 /1.63 /

AQ)X(L2) /Y( /AKQ)D(L)D(/AKQ). / / .Y (K / 3 . 8 7 /6.63/2.20 /2.25 //6.05 /

AQ //AQ, 2 /YK / 3.82//3.44 /4.82 /

AQ) /C /C /QH / /A /Y( / 3.33 /1.72/3.73/3.10//1.18/3.55/

AQ)2 / /G /C /Y( / 3.40 / / /3.37/4,54/

AQ /Y( /AQ)K /AQ / / 3.44/4.82/3.44 /3.82//

Table XII I illustrates the alphabetical order for the compounds in Table XII.

Alphabetization program logic is more clearly illus- trated with arrows in the following examples from Table XII :

100

Notation fragment alphabctization

A Q / / U V / B / n /

69 A Q / C / C V Z / /

258 [ I

AQ) / S /BD ( /B /B /A) . / / . B / ( /

172 AQ)D(/iV[H). / / . B 4 D

The program needs to be such that in compounds 258 and 172 the second and next to last terms, which are nuclei neighbors of the last term, are examined and placed in alphabetical order.

The alphabetical order in Table XI I I successfully brings together structurally similar compounds, such as compounds 69 and 28; compounds 258, 266, 162, and 249; compounds 106 and 57; compounds 113 and 576; and compounds 112, 284, 446, 120, 446, and 419. Within structurally similar compounds, the alpha- betical arrangement reveals the effect of different nuclei beyond the structural similarity, such as for AQKY-.

VI. CONCLUSIONS

Magnetic variations of the same proton group arises from subtle, and sometimes obvious, differences in shielding. The extent of the differences depend on the magnetic anisotropy of many or all of the other groups in the molecules and their orientations with respect to other nuclei.

The indexing concept presented in this paper pro- vides a new method for relating groups with neigh- boring groups and with every group in each molecule ; for correlating the same proton group with the various environmental groups it is associated with in different molecules; and for computer processing of a single input per molecule to yield a proton group/chemical shift index under as many retrieval points as there are proton groups in each molecule, alphabetically by the proton groups or numerically by the chemical shift data.

1. "Chemical Shift Index Chart," Sadtler Research Laboratories , Inc., Philadelphia, Pa., 1967.

2. N M R Spectra Catalog (Varian, Palo Alto, Calif, 1962 and 1963), Vols. 1 and 2.

3. H. Sholnik, J. Chem. Doc. 10, 216 (1970). 4. H. Skolnik, J. Heterocyclic Chem. 6, 689 (1969). 5. I I . Skolnik, J. Chem. Doc. 11, 120 (1971). 6. H. Skolnik, "A Chemical F r a g m e n t Nota t ion Index , " J.

Chem. Doc. 11, 142 (1971).

182 Volume 26, Number 2, 1972


Recommended