Contrast and the realization of schwa vowels in English Edward Flemming.

Contrast and the realization of schwa vowels in English

Edward Flemming

2

Introduction

The project:• To derive properties of phonetic and phonological

vowel reduction from general constraints related to speech production and perception

Outline of the talk:• Outline a model of vowel reduction.• Explore its application to English vowel reduction

(‘reduction to schwa’).

3

Phonological vowel reduction• Vowel contrasts are neutralized in unstressed syllables.• E.g. Southern Italian (Mistretta dialect, Mazzola 1976)

Primary stressed: Elsewhere:i u i u

e o a

a

stressed vowels unstressed vowels[i] vi¤n˘i ‘he sells’ vin˘i¤mu ‘we sell’[e] ve@ni ‘he comes’ (vini¤mu‘we come’)[a] a@vi ‘he has’ avi¤ti ‘he has’[o] mo@ri ‘he dies’ (muri¤mu ‘we die’)[u] u@Ô˘i ‘he boils’ uÔ˘i¤mu ‘we boil’

4

Phonological vowel reduction

• Common patterns of vowel reduction:(a) i u (b) i u (c) i u

e o e o a/´E ç a

a• (a) reduces to (b), e.g. Standard Italian, B. Portuguese, Slovene

• (b) reduces to (c), e.g. Standard Russian, S. Italian, Catalan

• (a) reduces to (c), e.g. E. Catalan

• Reduction to a single vowel, e.g. English, Dutch, Salish

• Primarily neutralization of height contrasts.

5

Outline of an analysis of vowel reduction

• Vowel reduction is fundamentally motivated by undershoot in short unstressed syllables.

6

Phonetic vowel reduction - Undershoot

Lindblom's V Reduction Model - gVg

300

350

400

450

500

550

600

650

700

5001000150020002500

F2 (Hz)

F1 (Hz)

200ms

125 ms

100 ms

i

e

a

o

u

7

Outline of an analysis of vowel reduction

• Vowel reduction is fundamentally motivated by undershoot in short unstressed syllables.

• Short duration of unstressed vowels increases the effort required to achieve distinct vowel qualities, particularly low vowels (Lindblom 1963).

• Contrasts are subject to distinctiveness constraints, so neutralization occurs where phonetic reduction would otherwise render contrasts insufficiently distinct.

8

Undershoot as a consequence of effort minimization

• Faster movements are more effortful (Nelson 1983, Perkell et al 2002).

• In a CVC sequence, the articulators have to move to and from the position for the vowel.

• Undershoot results from avoiding fast movements.

F2V

F2C

F1C

F1V

9

Formant undershoot as a function of duration and distance

Lindblom’s model:

F2V = k2(F2i –F2t)e-T+F2t

if F1t 375 Hz:

F1V = F1t

if F1t > 375 Hz

F1V = k1(375 –F1t)e-T+F1t

F2i is F2 at C release

F1t, F2t are V formant targets

k, depend on consonant context

0

200

400

600

800

1000

1200

1400

1600

1800

50 100 150 200 250 300

vowel duration (ms)

frequency (Hz)

F1i

F2i

F2t

F1t

F2v

F1v

More undershoot where:• Vowel is shorter• Distance between C and V is greater

10

Formant undershoot as a function of duration and distance

Lindblom’s model:

F2V = k2(F2i –F2t)e-T+F2t

if F1t 375 Hz:

F1V = F1t

if F1t > 375 Hz

F1V = k1(375 –F1t)e-T+F1t

F2i is F2 at C release

F1t, F2t are V formant targets

k, depend on consonant context

More undershoot where:• Vowel is shorter• Distance between C and V is greater

0

200

400

600

800

1000

1200

1400

1600

1800

50 100 150 200 250 300

vowel duration (ms)

frequency (Hz)

F1i

F2i

F2t

F1t

F2v

F1v

11

Implementation of the model of vowel reduction

• Stressed and unstressed inventories of contrasting vowel categories are selected from a space of possible vowels so as to best satisfy constraints on contrasts: Maximize distinctiveness of contrasts. Maximize number of contrasts. Minimize articulatory effort.

• Effort minimization implies undershoot.

12

Model of vowel reduction

• The vowel space is modeled on Liljencrants and Lindblom (1972).

2

3

4

5

6

7

8

68101214

F2(Bark)

F1(Bark)

i u

a

13

i. Maximize the distinctiveness of contrasts

• Distinctiveness of the contrast between Vi and Vj is the (weighted) distance between the vowels in formant space.

2

3

4

5

6

7

8

68101214

F2(Bark)

F1(Bark)

i u

a

V1

V2

d12

€

dij = (a(x i − x j ))2 + (y i − y j )

2

Where xn is F2 of Vn in Barkyn is F1 of Vn in Barka < 1

14

i. Maximize the distinctiveness of contrasts

where

• Overall distinctiveness cost of a vowel system depends on the minimum distance found in either inventory.

€

1

dmin2

€

dmin = mini ≠j

dijCost:

15

ii. Maximize the number of contrasts

• maximize the number of vowels in the stressed and unstressed vowel inventories.

where

€

1

nave2

€

nave =nstressed + nunstressed

2Cost:

16

iii. Articulatory effort

• The space of possible vowels contracts as vowel duration is reduced, following the undershoot functions proposed by Lindblom (1963)

• Consonants are assumed to assimilate partially to the vowel target in F2, but not in F1.

2

2.5

3

3.5

4

4.5

5

5.5

6

68101214

F2(Bark)

F1(Bark)

100 ms

160 ms

17

Overall cost function

• The optimal vowel system is the one that best satisfies these constraints:

€

minimizeV

1

dmin2

+wn

2

nave2

(subject to vowel space constraint)

18

Optimal inventories

a = 0.14, k1 = 1.5, 1 = 0.008, k2 =1.5, 1 = 0.01,c = 0.27, F2l = 1400 Hz, wn = 6

Durations:stressed 160 msUnstressed 100 ms

2

2.5

3

3.5

4

4.5

5

5.5

6

68101214

F2 (Bark)

F1 (Bark)

stressed unstressed

19

Italian vowels

2

2.5

3

3.5

4

4.5

5

5.5

6

6.5

7

68101214F2 (Bark)

F1 (Bark)

stressed unstressed

e

E

a

ç

o

ui

Data from Albano Leoni et al 1995

20

Undershoot and vowel reduction

• Relating phonological vowel reduction to undershoot helps to explain:

i. The tendency to neutralize vowel contrasts in short unstressed syllables.

ii. The generalization that vowel reduction primarily eliminates height contrasts.

iii. The generalization that neutralizing vowel reduction is accompanied by phonetic reduction.

21

English reduction to schwa

• English exhibits a pattern of vowel reduction whereby vowel quality contrasts are neutralized in unstressed syllables.

• The resulting vowel is usually transcribed as schwa [´]

atom »QR´matomic ´»tHAmIk

22

Reduction to ‘schwa’

Predictions of the undershoot model:• Reduction to a single vowel should be most likely

where vowels are very short.• Where there is a single vowel, distinctiveness of

vowel quality contrasts is irrelevant, so effort minimization should dominate.

• So schwa should be a transitional vowel, maximally assimilated to the surrounding context.– ‘targetless schwa’

23

Minimum effort vowels

• Minimal deviation from the narrow constrictions for surrounding consonants results in low F1 (a high vowel) because any constriction above the pharynx lowers F1.

• Minimal deviation from the tongue body and lip positions for surrounding consonants and vowels results in contextually variable F2.

• But schwa is often said to be a mid central vowel.

24

Experiment 1: English schwa vowels( research with Stephanie Johnson)

• Materials final non-final

Rosa rhapsody

Lisa suggest

Russia suspend

asia prejudice

ninja today

sofa begin

vodka report

soda compare

alpha

umbrella

probable

suffocate

25

Experiment 1

• Also recorded full vowels for comparison.

heed [i], hid [I], head [E], had [Q], odd [A], hood [U], who [u]

• Spoken in carrier phrase ‘Say ___ to me’.• 9 female speakers of American English.• Measured first two formants at the mid point of

the vowels.• compare frequently lacked any voiced vowel in

the first syllable, so it was excluded from analysis.

26

Results

Non-final schwa:• Low F1 (mean 425 Hz)

• F2 is contextually variable.

300

400

500

600

700

800

900

1000

1100

50010001500200025003000F2 (Hz)

F1 (Hz)

non-final full vowels (means)

i

I

E

Q

A

U

u

27

Results

Non-final schwa:• Low F1 (mean 425 Hz)

• F2 is contextually variable.

Final schwa:

• F1 shows wide range (mean 665 Hz).

• Much of this is between-speaker variation.

• Central F2 (mean 1772 Hz)

300

400

500

600

700

800

900

1000

1100

50010001500200025003000F2 (Hz)

F1 (Hz)

non-final full vowels (means) final schwa

i

I

E

Q

A

U

u

28

Results

Final schwa:

• F1 shows wide range (mean 665 Hz).

• Much of this is between-speaker variation. vocal tract size quality of final

schwa ([´] - [√])

0

200

400

600

800

1000

1200

1400

0 2 4 6 8 10

Subject

mean F1 (Hz)

final schwa

heed

had

29

Two patterns of vowel reduction

• The difference between final and non-final schwa vowels can interpreted in terms of the undershoot model of vowel reduction.

• There are two degrees of unstressed vowel reduction, depending on characteristic vowel duration.

30

Two patterns of vowel reduction

• Final unstressed vowels are longer than non-final unstressed vowels: 153 ms vs. 64 ms.– Presumably a result of final lengthening.

• Allows for a lower schwa vowel, which in turns allows for the maintenance of contrasts

• The vowels [i, ´, oU] and rhotic [´’] contrast in unstressed syllables contrast in absolute word-final position.

pretty [»p®IRi] letter [»lER ’]beta [»beIR ]motto [»mARoU]

• Non-final schwa does not minimally contrast with other vowel qualities.

• Consequently it is assimilated to context.

31

The correlation between contrast and reduced vowel quality

The same correlation between system of contrasts and reduced vowel quality is observed across languages:

• Contextually variable high vowels only seems to be found where all vowel quality contrasts are neutralized, e.g. Dutch (van Bergem 1994), probably Montana Salish.

• Mid central [´] is found in contrast with higher vowels, e.g. reduced vowel inventories of the form [i, ´, u] occur in unstressed syllables in Russian (Padgett and Tabain 2003), E. Bulgarian (Wood and Pettersson 1988), Catalan (Herrick 2003), and in final unstressed syllables in Brazilian Portuguese (Mattoso Camara 1972).

32

The correlation between contrast and reduced vowel quality

• Central Catalan stressed and unstressed vowels (Herrick 2003).

i

e

E

a

ç

o

u

´

33

Experiment 2: Targetless schwa

• This analysis hypothesizes that the non-final reduced vowel in English is a minimum effort vowel.

• So far based on impressionistic evaluation of a relatively unsystematic sample of medial schwa vowels.

34

Experiment 2: Targetless schwa

• Systematically vary the preceding and following context of medial schwa.

Questions: • Does variability of schwa involve assimilation to the

surrounding context?• Is there any evidence that schwa has a vowel quality

target?Conclusions:• Much of the variability of schwa can be attributed to

assimilation.• Schwa lacks a vowel quality target, but it is not completely

targetless - its target is to be a vowel.

35

Materials

• Nonce words of the form: [»bV1C1´«C2V2t]– All combinations of V1 from {i, œ, u}

Cn from {b, d, g}

V2 from {i, A, u}(81 words)

– Subjects were instructed to model the stress pattern on words like propagate and parakeet.e.g. »bid´«gut, »bQg´«bit, etc.

36

Materials

• All words were read in the sentence frame:

‘X. Do you know what a X is?’• Presented twice in random order (only second run

is analyzed here).• Read by four native speakers of American

English, 2 male, 2 female.

37

Analysis

Measured F1 and F2: • at steady state, extreme values, or midpoint of V1 and

V2• at the offset of V1, and at the onset of voicing in V2.• in schwa: at the onset of voicing, at the offset of

schwa, and half way between these points.

Time (s)26.1806 26.60530

5000

V1 S V2

38

Results

200

300

400

500

600

700

800

1000120014001600180020002200

F2 (Hz)

iu

Q

200

300

400

500

600

700

800

900

1000150020002500

F2 (Hz)

F1 (Hz)

iu

Q

200

300

400

500

600

700

800

900

1000

10001500200025003000

F2 (Hz)

i

Q

u

300

400

500

600

700

800

900

10001500200025003000

F2 (Hz)

F1 (Hz)

ui

Q

formants at midpoint of schwa

39

Results

• Medial schwa is highly variable in quality, particularly in F2.

200

300

400

500

600

700

800

900

1000

10001500200025003000

F2 (Hz)

i

Q

u

300

400

500

600

700

800

900

10001500200025003000

F2 (Hz)

F1 (Hz)

ui

Q

40

Is this variability the result of interpolation between preceding and following context?

• F2 in schwa is correlated with F2 of surrounding vowels

• But it is not the result of simple interpolation between vowels.

1000

1200

1400

1600

1800

2000

2200

2400

2600

V1mid Smid V2mid

F2 (Hz)

i_iu_ai_au_i

41

Is schwa variability the result of interpolation?

• Unsurprisingly, the consonants also have a significant influence on F2 of schwa.

900

1100

1300

1500

1700

1900

2100

2300

2500

V1mid V1off Smid V2ons V2mid

F2 (Hz)

ib_bi id_di ig_gi

900

1100

1300

1500

1700

1900

2100

2300

2500


F2 (Hz)

ud_da ub_ba ug_ga

42


• The F2 trajectory of schwa is more likely to be an interpolation between preceding and following consonants.

• But F2 adjacent to a consonant depends in turn on F2 of the adjacent vowel.

43

Locus equations

• Typically consonant F2 is a linear function of F2 at the midpoint of the adjacent vowel (Lindblom 1963, Klatt 1987, etc).

• The slope and intercept of this function depend on the consonant.

Time (s)1.96743 2.42708

0

5000

Time (s)11.3647 11.8142

0

5000bid bçd

44


• These considerations suggest the following model of F2 at schwa midpoint:

F2Smid = aC1F2V1 + bC1 + cC2F2V2 + dC2

• The effect of the vowels on F2 of schwa is modulated by the intervening consonants.

• This model is quite successful (r2 = 0.73-0.86 for individual subjects)– For comparison, a model based on F2 at the vowel

mid points alone yields r2 of 0.19-0.36

45

Summary for F2

• Medial schwas show wide variation in F2.• This variation is systematically conditioned by

context.

• It is difficult to determine whether schwa F2 is the result of simple interpolation.

46

Is F1 variability the result of interpolation?

• F1 at schwa midpoint also varies substantially according to vowel context, but cannot be the result of interpolation between preceding and following vowels.

0

100

200

300

400

500

600

700

800

900

V1mid Smid V2mid

F1 (Hz)

i_iae_ai_aae_i

47

Is F1 variability the result of interpolation?• The consonants can account for the fact that schwa F1 tends to

be much lower than in adjacent vowels: forming a stop closure lowers F1.

• But if F1 in schwa is governed by the adjacent consonantal constrictions, then we would expect schwa vowels to have very low F1.

0

100

200

300

400

500

600

700

800

900

V1mid Smid V2mid

F1 (Hz)

i_iae_ai_aae_i

48

Is F1 variability the result of interpolation?

• In iC´Ci, F1 at schwa midpoint is higher than in preceding or following vowels.

• Schwa appears to have an F1 (height) target.

• If there are no height contrasts, why is there a height target?

200

300

400

500

600

700

800

900


F1 (Hz)

ib_bi id_di ig_gi

200

300

400

500

600

700

800

900


F1 (Hz)

aed_da aeb_ba aeg_ga

49

Height target or ‘presence’ target?

• While non-final schwa does not contrast with other vowels in quality, it is still important to distinguish presence vs. absence of schwa.

• In some contexts presence vs. absence of schwa is minimally contrastive.

about [´baUt] vs. bout [baUt]parade [pH´®eId] prayed

[pH®eId]support [s´pHç®t] sport [spç®t]

• So schwa is expected to have targets related to signaling the presence of a vowel, e.g. duration intensity peak

50

Height target or ‘presence’ target?

• Producing an intensity peak generally involves increasing F1 (cf. Howitt 2000).

• So the apparent F1 target for non-final schwa may be a byproduct of signaling the presence of a vowel.

• This interpretation predicts that the ‘target’ for F1 is not a specific value, but a minimum value. Accordingly, schwa should assimilate to its context where this would result in high F1 - e.g. adjacent to a low vowel.

51

• Preliminary evidence suggests that schwa fully assimilates to the F1 of an adjacent low vowel. E.g. ‘Mrs Shah adressed the audience’

• If schwa had a specific F1 target (e.g. ~400 Hz), then we should observe movement towards this target.

Time (s)0 0.309392

0

5000

Time (s)5.13834 5.62427

0

5000

Mr Shah dressed... Mrs Shah addressed...

52

Summary of experiment 2

• The quality of medial schwa vowels is highly variable.• Variation in F2 is particularly extensive, and involves

assimilation to the surrounding context, suggesting that medial schwa has no specific backness or rounding targets.

• F1 in schwa also covaries with F1 of surrounding vowels, but systematically deviates from F1 of the context, suggesting that schwa has an F1 target.

• This apparent F1 target may be a byproduct of realizing an intensity peak to cue the presence of a vowel.– it is important to consider contrasts with absence as

well as quality contrasts as sources of constraint on the realization of segments.

53

Conclusion

• English shows two patterns of vowel reduction, both are consistent with the undershoot model of vowel reduction.

• Non-final schwa vowels are short, which would tend to lead to substantial undershoot.

• Vowel quality contrasts are neutralized.• The resulting vowel is substantially assimilated to its context.• But assimilation is limited by the need to provide clear cues to the presence

of a vowel, e.g. an intensity peak.• Final unstressed position has longer vowels, allowing for vowel quality

contrasts.• Schwa is realized as a mid central vowel, distinct from /i, oU/.• The same correlation between system of contrasts and the quality of reduced

vowels is observed across languages– contextually variable vowel ( high in most contexts) in the absence of

quality contrasts.– mid central vowel in contrast with higher vowels.

54

End

55

Schwa presence vs. absence

Time (s)0.0123687 0.291936

0

5000

Time (s)16.9501 17.2303

0

5000

saw n(othing) saw an(other)

Date post:	17-Dec-2015
Category:	Documents
Upload:	agnes-obrien
View:	232 times
Download:	4 times

Contrast and the realization of schwa vowels in English Edward Flemming.

Documents