An Acoustic Analysis of Heritage Spanish Vowels

AN ACOUSTIC ANALYSIS OF HERITAGE SPANISH VOWELS

Rebecca E. Ronquest

Submitted to the faculty of the University Graduate School in partial fulfillment of the requirements

for the degree Doctor of Philosophy

in the Department of Spanish and Portuguese, Indiana University

July 2012

ii

Accepted by the Graduate Faculty, Indiana University, in partial fulfillment of the requirements for the degree of Doctor of Philosophy.

Doctoral Committee _________________________________ Erik W. Willis, Ph.D.

Chair

_________________________________ Kimberly L. Geeslin, Ph.D.

_________________________________ Manuel Díaz-Campos, Ph.D.

_________________________________ Kenneth de Jong, Ph.D.

May 25, 2012

iii

©2012 Rebecca E. Ronquest

iv

ACKNOWLEDMENTS

This dissertation could not have been completed without the help and support of

so many individuals. First and foremost, I would like to thank my committee for their

guidance throughout this process. Thanks to Ken de Jong for sharing his knowledge and

experience with me, and for providing me (and my collection of animal-shaped mittens)

with a sound foundation in acoustic phonetics. I would also like to thank Manuel Díaz-

Campos for his insightful comments on this work, and for inviting me to participate in

collaborative projects at various points throughout my graduate career. Thanks to

Kimberly Geeslin for all the support and advice, quick and thoughtful responses to my

many frantic e-mails, and for offering additional insight into this project. I would like to

express my deepest gratitude to my advisor, Erik Willis, for teaching me the importance

of the viaje; for e-mails in the form of talking cats and crickets; and most of all, for

endless encouragement and patience.

Special thanks to the Latino Studies Program at Indiana University and John

Nieto-Phillips for the fellowship opportunity that provided me with the time and

resources to complete this work. I am very grateful to Kim Potowski for inspiring me to

study heritage speakers, and for assistance in recruiting participants and reserving space

for recording sessions. Special thanks also to Stephanie Dickinson at the Indiana

Statistical Consulting Center for help in designing the statistical model and interpreting

the results. Finally, thanks to Jon Tweedy for creating the Euclidean distance calculator,

and for support and friendship throughout this process.

I am extremely fortunate to have had the opportunity to work with so many

talented people throughout my academic career, all of whom have inspired my love of

v

Spanish and linguistics: Clancy Clements, Jonathan Arries, George Greenia, Silvia

Tandeciarz, and Marliss Arruda, to name a few. I would also like to thank David Pisoni,

Luis Hernández, and Susie Levi – my mentors in the Speech Research Laboratory –

whose instruction and training have made me a stronger and more skilled researcher.

I am thankful to have had so many wonderful friends to help me through the

dissertation writing process with their ánimo and smiles: Erin Mcnulty, Greg Newall,

Catalina Méndez Vallejo, Scott Lamanna, Paul Malovrh, Lauren Schmidt, Marda Rose,

Tanya Flores, and Janet Smith. I would also like to thank my dear friends Jane and

Walter Gantz for their encouragement and wisdom, and for motivating me to write (and

swim) every day. I extend my sincerest thanks to James Moskowitz and Antje

Schwennicke for their incredible friendship and support over the past several years. From

doggie play dates and flag football games, to somehow getting me on a plane to Los

Angeles, they have been there through it all.

Last of all, and perhaps most importantly, special thanks to my mom, Bette

Ronquest, who has always believed that I could achieve this goal and any other that I set

out to accomplish. My achievements would not have been possible without her love and

support.

vi

Rebecca E. Ronquest

AN ACOUSTIC ANALYSIS OF HERITAGE SPANISH VOWELS

The present dissertation has two main goals: 1) to provide a descriptive, acoustic

characterization of the heritage Spanish vowel system and 2) to assess the extent to which

linguistic and extralinguistic variables affect heritage Spanish vowel production. Via

acoustic analysis of the vowel productions of heritage speakers (i.e. early bilinguals), this

investigation contributes to our understanding of bilingual Spanish vowel pronunciation,

offering insight into the types of variation the system can exhibit.

Sixteen heritage speakers of Spanish from the Midwestern United States

completed three speech elicitation tasks, background and cultural sensitivity

questionnaires (Cushner, 1986), and a grammar proficiency test (Geeslin & Gudmestad,

2010). The first and second formant frequencies, Euclidean distance from the center of

the vowel space, and the duration of each speaker’s vowels were calculated. The acoustic

data were further coded for a series of linguistic variables (lexical stress, syllable type,

speech style) and extralinguistic variables (course level, grammar proficiency, travel

frequency, Spanish use, and cultural sensitivity). Mixed linear models were conducted in

order to determine how the linguistic and extralinguistic factors impacted pronunciation.

The heritage Spanish vowel system was found to differ considerably from

traditional descriptions of monolingual Spanish vowel configurations. The /e/ was

produced further back in the acoustic space, the /u/ further front, and atonic vowels were

centralized and significantly shorter in duration than their tonic counterparts. The effect

of syllable type on vowel pronunciation was inconsistent, lending evidence against the

vii

presence of “open/closed” mid vowel allophony in this variety of Spanish. The impact of

speech style, however, was robust: vowels became longer and more greatly dispersed as

speech style became less natural. Finally, frequent travel abroad and Spanish use outside

of the classroom -- two characteristics known to impact the acquisition of L2

pronunciation -- resulted in an expansion of the vowel space that more closely

approximated native speaker norms. Overall, the acoustic and statistical results indicate

that although heritage speakers are often argued to exhibit “native-like” pronunciation,

their vowel system is not identical to that of a monolingual: it is unique and is influenced

by a variety of linguistic and extralinguistic factors.

____________________________________

____________________________________

____________________________________

____________________________________

viii

TABLE OF CONTENTS

LIST OF TABLES ............................................................................................................. xi

LIST OF FIGURES .......................................................................................................... xii

1 INTRODUCTION ...................................................................................................... 1

1.1 Previous linguistic research on heritage populations ........................................... 2

1.2 Spanish vowel systems ......................................................................................... 4

1.3 Goals..................................................................................................................... 6

1.4 Contributions ........................................................................................................ 7

1.5 Dissertation outline .............................................................................................. 9

2 PREVIOUS LITERATURE ..................................................................................... 10

2.1 Heritage speakers ............................................................................................... 10

2.1.1 Who are heritage speakers, and why should we learn more about them? ....... 11

2.1.2 Early bilingual and heritage pronunciation...................................................... 19

2.2 Vowel systems.................................................................................................... 29

2.2.1 General properties of vowels ........................................................................... 30

2.2.2 Impressionistic studies of Spanish vowels ....................................................... 36

2.2.3 Acoustic studies of Spanish vowels ................................................................. 49

2.3 Bilingual and contact vowel systems ................................................................. 72

2.3.1 Bilingual and contact Spanish vowel systems ................................................. 72

2.3.2 English vowel systems of heritage Spanish speakers ...................................... 83

2.4 Individual speaker characteristics and speech production ................................. 96

2.5 Research questions and hypotheses.................................................................. 102

3 METHODOLOGY ................................................................................................. 108

3.1 Instruments ....................................................................................................... 108

3.1.1 Narrative retelling task ................................................................................... 108

3.1.2 Picture identification task .............................................................................. 110

3.1.3 Carrier phrase task ......................................................................................... 114

3.1.4 Language background questionnaire ............................................................. 116

3.1.5 Grammar proficiency test .............................................................................. 116

3.1.6 Cultural sensitivity questionnaire .................................................................. 117

ix

3.2 Participants ....................................................................................................... 117

3.3 Procedure .......................................................................................................... 119

3.4 Data analysis .................................................................................................... 120

3.4.1 Acoustic measures ......................................................................................... 121

3.4.2 Individual variables ........................................................................................ 136

3.4.3 Statistical analyses ......................................................................................... 140

4 RESULTS ............................................................................................................... 145

4.1 Within-subjects variables ................................................................................. 145

4.1.1 Main effects and simple interactions ............................................................. 147

4.1.2 Complex interactions .................................................................................... 182

4.1.3 Summary of within-subjects effects.............................................................. 205

4.2 Between-subjects variables .............................................................................. 210

4.2.1 Course level .................................................................................................. 212

4.2.2 Frequency of travel ....................................................................................... 217

4.2.3 Grammar proficiency .................................................................................... 222

4.2.4 Spanish use.................................................................................................... 227

4.2.5 Cultural sensitivity ........................................................................................ 231

4.2.6 Summary of between-subjects effects .......................................................... 234

4.3 Overall summary of results .................................................................................. 237

5 DISCUSSION ......................................................................................................... 243

5.1 Overall distribution and organization of HS vowels ........................................ 243

5.2 Within-subjects variables ................................................................................. 258

5.2.1 Lexical stress ............................................................................................. 259

5.2.2 Syllable type.............................................................................................. 264

5.2.3 Speech style .............................................................................................. 270

5.3 Between-subjects variables .............................................................................. 277

5.3.1 Course level .............................................................................................. 277

5.3.2 Grammar proficiency ................................................................................ 282

5.3.3 Travel ........................................................................................................ 285

5.3.4 Spanish use................................................................................................ 290

5.3.5 Cultural sensitivity .................................................................................... 292

x

5.4 Interpreting the expansion effect ...................................................................... 295

5.5 Limitations and future directions ..................................................................... 300

5.5.1 Experimental design.................................................................................. 300

5.5.2 Language and identity ............................................................................... 309

5.5.3 Control data ............................................................................................... 312

5.6 Summary of discussion .................................................................................... 315

6 CONCLUSIONS..................................................................................................... 320

REFERENCES ............................................................................................................... 326

Appendix A: List of words presented in the picture identification task (PIT) ............... 340

Appendix B: List of words presented in the carrier phrase task (CPT) ......................... 341

Appendix C: Language background questionnaire ........................................................ 342

Appendix D: Grammar proficiency Test (Spanish) ....................................................... 347

Appendix E: Language attitude and cultural sensitivity questionnaire ......................... 350

Appendix F: Participant demographic information ....................................................... 355

Appendix G: Total vowels analyzed in the analyses ..................................................... 358

Appendix H: Results of statistical analyses conducted with F1 .................................... 367

Appendix I: Results of statistical analyses conducted with F2 ...................................... 376

Appendix J: Results of statistical analyses conducted with Euclidean distance. ........... 386

Appendix K: Results of statistical analyses conducted with duration. .......................... 397

Appendix L: Normalized and non-normalized formant values ..................................... 405

Appendix M: Vowel by task by travel interaction ......................................................... 406

xi

LIST OF TABLES

Table 2-1. Spanish vowel system based on articulatory parameters .............................. 30 Table 2-2. Vowel pronunciation described in Navarro-Tomás (1918) ........................... 39 Table 3-1. Variables included in the Mixed Linear Model ........................................... 144 Table 4-1. Basic characteristics of vowels produced by male HS ................................ 146 Table 4-2. Basic characteristics of vowels produced by female HS ............................. 146 Table 4-3. F1, F2, and Euclidean distance measures of HS vowels ............................. 148 Table 4-4. Overall HS vowel duration .......................................................................... 153 Table 4-5. F1, F2, and Euclidean distance measures according to stress context ........ 157 Table 4-6. Atonic and tonic HS vowel duration ........................................................... 161 Table 4-7. F1, F2, and Euclidean distances of HS vowels by syllable type ................. 164 Table 4-8. Vowel duration in closed and open syllables .............................................. 167 Table 4-9. Formant values and Euclidean distance measures of HS vowels by task ... 171 Table 4-10. Vowel duration in each speech style .......................................................... 179 Table 4-11. Formant values and Euclidean distances by stress type in three tasks ....... 183 Table 4-12. Tonic and atonic vowel duration in three speech styles ............................. 192 Table 4-13. Formant values and Euclidean distances by syllable type in three tasks ... 196 Table 4-14. Vowel duration by syllable type in three tasks ........................................... 203 Table 4-15. Information pertaining to individual characteristics................................... 212 Table 4-16. Formant values and Euclidean distances for two course levels ................. 214 Table 4-17. Formant values and Euclidean distances for three travel groups ............... 219 Table 4-18. Summary of results ..................................................................................... 242

xii

LIST OF FIGURES

Figure 2-1. Figure of formant values reproduced from Quilis and Esgueva (1983). ....... 32 Figure 3-1. Images from the Tortilla Rag, narrative retelling task. ............................... 109 Figure 3-2. Sample training slide from PIT. .................................................................. 111 Figure 3-3. Model slide from actual task phase in the PIT. ........................................... 112 Figure 3-4. Slide of "strange" gift from the PIT. ........................................................... 113 Figure 3-5. Sample slides from CPT.............................................................................. 115 Figure 3-6. Waveform and spectrogram of the word tapa (“lid”). ................................. 123 Figure 3-7. Waveform and spectrogram for the word sistema (“system”). ................... 124 Figure 3-8. Waveform and spectrogram for the word mano (“hand”). ......................... 126 Figure 3-9. Waveform and spectrogram of the word un lápiz (“a pencil”). .................. 127 Figure 3-10. Waveform and spectrogram of the word cucharas (“spoons”). ................ 128 Figure 3-11. Representation of the centroid and distance of the point vowels from the centroid. .......................................................................................................................... 135 Figure 4-1. Overall acoustic distribution of HS vowels. ............................................... 149 Figure 4-2. Overall dispersion of HS vowels................................................................. 151 Figure 4-3. Overall dispersion of HS vowels with adjusted scale. ................................ 153 Figure 4-4. Overall vowel duration for all speakers. ..................................................... 154 Figure 4-5. Vowel duration based on acoustic distribution ........................................... 155 Figure 4-6. Tonic and atonic vowel productions for all speakers .................................. 158 Figure 4-7. Tonic and atonic vowel dispersion for all speakers .................................... 160 Figure 4-8. Tonic and atonic vowel duration for all speakers ....................................... 162 Figure 4-9. Acoustic distribution of vowels produced by all speakers in open and closed syllables........................................................................................................................... 165 Figure 4-10. Vowel dispersion based on syllable type. ................................................. 166 Figure 4-11. Vowel duration by syllable type. .............................................................. 168 Figure 4-12. Average vowel duration based on consonantal context. ........................... 169 Figure 4-13. Overall vowel space based on task (speech style). ................................... 172 Figure 4-14. Vowel dispersion based on task (speech style). ........................................ 177 Figure 4-15. Vowel duration based on task (speech style). ........................................... 180 Figure 4-16. Tonic and atonic vowel space in spontaneous speech (NRT). .................. 184 Figure 4-17. Tonic and atonic dispersion in spontaneous speech (NRT). ..................... 186 Figure 4-18. Tonic and atonic vowel space in semi-spontaneous speech (PIT). ........... 186 Figure 4-19. Tonic and atonic dispersion in semi-spontaneous speech (PIT). .............. 188 Figure 4-20. Tonic and atonic vowel space in controlled speech (CPT). ...................... 189 Figure 4-21. Tonic and atonic vowel dispersion in controlled speech (CPT). .............. 190 Figure 4-22. Tonic and atonic duration in each of the three speech styles, averaged across syllable type. ........................................................................................................ 193 Figure 4-23. Vowel space based on syllable type in spontaneous speech (NRT). ........ 197

xiii

Figure 4-24. Vowel dispersion by syllable type in spontaneous speech (NRT). ........... 198 Figure 4-25. Vowel space based on syllable type in semi-spontaneous speech (PIT). . 199 Figure 4-26. Vowel dispersion by syllable type in semi-spontaneous speech (PIT). .... 200 Figure 4-27. Vowel space based on syllable type in controlled speech (CPT). ............. 201 Figure 4-28. Vowel dispersion by syllable type in controlled speech (CPT). ............... 202 Figure 4-29. Vowel duration in closed and open syllables in each task. ....................... 204 Figure 4-30. HS vowel production by course level. ...................................................... 215 Figure 4-31. Vowel distribution based on frequency of travel abroad.. ........................ 220 Figure 4-32. Vowel distribution based on grammar score: speakers 2f and 10f. .......... 224 Figure 4-33. Vowel distribution based on grammar score: speakers 10f and 7f ........... 225 Figure 4-34. Representation of vowel production based on Spanish use.. .................... 228 Figure 4-35. Vowel distribution based on cultural sensitivity. ...................................... 234 Figure 5-1. Overall vowel space for HS of Spanish ...................................................... 244 Figure 5-2. All vowel tokens produced by female HS ................................................... 247 Figure 5-3. All vowel tokens produced by male HS ...................................................... 247 Figure 5-4. Dispersion of vowel categories for female HS. .......................................... 249 Figure 5-5. Dispersion of vowel categories for male HS............................................... 250 Figure 5-6. Comparison of vowels produced by 13 female HS to vowels produced by 2 monolingual females in S&R (2001). ............................................................................. 252 Figure 5-7. Comparison of vowels produced by 3 male HS to vowels produced by 2 monolingual males in S&R (2001). ................................................................................ 253 Figure 5-8. HS and monolingual vowel duration in controlled speech. ........................ 256 Figure 5-9. Dispersion of normalized male and female vowels in the CPT. ................. 272 Figure 5-10. Dispersion of normalized male and female vowels in the PIT. ................ 272 Figure 5-11. Dispersion of normalized male and female vowels in the NRT. .............. 273 Figure 5-12. Normalized formant values for speakers enrolled in intermediate and advanced-level courses. .................................................................................................. 280 Figure 5-13. Vowel production in three speech tasks for the non-travel group. ............ 296 Figure 5-14. Vowel production in three speech tasks for the infrequent travel group. . 297 Figure 5-15. Vowel production in three speech tasks for the frequent travel group. .... 297

1

1 INTRODUCTION

The primary objective of this dissertation is to provide a descriptive, acoustic

characterization of the heritage Spanish vowel system. Heritage speakers, who are

sometimes referred to as circumstantial or early bilinguals (Valdés, 2005), are a specific

type of bilingual differentiated principally from other bilingual, monolingual, and learner

populations by their 1) early contact with, and acquisition of, a minority language in a

naturalistic context and 2) limited access to formal education in the minority language

until later in life.

Although heritage speaking populations exist in multilingual contexts throughout the

world (e.g., Spanish-Catalan bilinguals in Spain, French-English bilinguals in Canada,

and French-Arabic bilinguals in France), the present investigation focuses on heritage

speakers of Spanish living in the United States. Therefore, for the purpose of this

investigation, a heritage speaker (hereafter HS) of Spanish is defined as a Spanish-

English bilingual who grew up within a Spanish-speaking household and community, and

was thus exposed to the Spanish language from an early age, often from birth (Bolger &

Zapata, 2011; Montrul et al., 2008b; Potowski et al, 2009; Valdés & Geoffrion-Vinci,

1998). The HS of Spanish described in this investigation were raised in the United States

overhearing and speaking Spanish within the home and community with parents,

relatives, and neighbors, but most did not receive any formal literacy education in the

Spanish language until high school or college. Therefore, like monolingual speakers of

Spanish and simultaneous Spanish-English bilinguals, HS acquired the Spanish language

naturalistically from a very early age. Their exposure to and experience with Spanish

during childhood results in their exhibiting advanced communicative and comprehension

2

skills, much like those of monolingual and simultaneous bilingual speakers. In contrast,

however, most HS do not receive formal literacy instruction in Spanish until later in life –

a characteristic they share with L2 learner populations – and are therefore sometimes

described as possessing a “limited ability” in the areas of formal oral and written

discourse (Montrul et al., 2008b).

1.1 Previous linguistic research on heritage populations

The study of the linguistic knowledge of HS of Spanish has received much attention

in the past two decades. Two areas that have garnered considerable attention are the

pedagogy for HS and the morphosyntactic knowledge of HS. The general findings in

these two distinct bodies of literature reveal important differences and similarities

between HS populations and late L2 learners, simultaneous bilinguals, and age-matched

monolingual speakers of Spanish. Research in the field of pedagogy, for example, has

emphasized that because HS acquired Spanish naturalistically during early childhood but

did not receive any explicit instruction during primary school, their instructional needs

are unique. A number of investigations in the fields of heritage and second language

acquisition have focused on 1) testing the effectiveness of instructional methods utilized

in traditional foreign language classrooms, or 2) devising unique instructional strategies

for heritage populations that emphasize the acquisition of the formal oral and written

registers (Kagan, 2005; Lynch, 2003; Montrul et al. 2008; Potowski et al., 2009;

Rothman, 2007). The general findings of this literature indicate that HS benefit most

from communicative methods of language instruction as opposed to form-focused

exercises, even if the gains are not the same as those observed in L2 learner populations

(Potowski et al., 2009).

3

Research in the field of morphosyntax has revealed additional differences, but

also some similarities, between HS and late L2 learners in terms of their production and

judgment of certain morphosyntactic structures. Heritage speakers typically outperform

proficiency-matched L2 learners in speech production tasks and more closely

approximate native-like usage of structures such as clitic pronouns and grammatical

gender (Montrul et al., 2008a; Montrul, 2010). In contrast, L2 learners sometimes

outperform HS on written grammaticality judgment tasks by rejecting ungrammatical

structures more frequently and systematically than HS (Montrul et al., 2008a; Montrul et

al., 2008b). Despite the differences, however, HS share certain aspects of acquisition

with L2 learners such as transfer errors from the majority language, lexical extension, and

syntactic calquing (Lynch, 2003). Also similar to L2 learner populations, HS exhibit a

wide-range of variation in their Spanish abilities that is influenced by a number of factors

(Mikulski, 2010; Mrak, 2011). Extralinguistic factors such as Spanish use outside of the

classroom and frequency of travel, for example, have been found to influence HS

performance on morphosyntactic tasks (Mikulski, 2010; Montrul et al., 2008b).

One of the central arguments presented in the morphosyntactic literature is that HS

experience a “benefit” in pronunciation (but not in morphosyntax) as a result of their

early exposure to the Spanish language, resulting in their “sounding” more “native-like”

than those who acquired the language later in life. Interesting, however, is that very few

studies have systematically examined HS pronunciation. Although a considerable body

of literature has investigated early bilingual pronunciation (Flege, 1991; Flege & Eeftig,

1987; Zampini & Green, 2001; Yavas, 2006, among others), only a handful of studies

have examined HS as they are defined here (Au et al., 2008; Knightly et al., 2003; Oh et

4

al., 2008), most of which have focused primarily on the pronunciation of consonants.1

Combined, the results of the investigations focusing on HS and early bilinguals indicates

that their pronunciation resembles that of a native monolingual in some respects, but

may more accurately be defined as a complex, hybrid system of speech sounds that falls

somewhere between that of an L2 learner and a monolingual speaker.

To summarize briefly, the majority of the investigations which specifically examine

HS of Spanish have focused on knowledge and judgment of morphosyntactic structures,

or have attempted to devise unique instructional strategies for this population. That

considerably fewer studies have examined the pronunciation of this specific subtype of

bilingual motivates the need for a closer examination of the HS phonetic/phonological

system. One area that is deserving of further investigation is the vowel system, due in

part to the fact that HS of Spanish must employ two very different vowel systems (i.e.,

English and Spanish) throughout their daily lives. An additional motivating factor is that

although several recent investigations have examined different aspects of Hispanic

English vowels (Fought, 1999; Godiñez & Maddieson, 1985; Konopka & Pierrehumbert,

2008; Roeder, 2010), no study to date has described or analyzed the Spanish vowel

system of heritage Spanish speakers.

1.2 Spanish vowel systems

The Spanish vowel system is traditionally described as a simple, symmetrical, five-

vowel system that is diatopically stable, especially in comparison to English (Hualde,

2005). Based on the assumption of stability and simplicity, considerably fewer acoustic

studies of Spanish vowels have been conducted as compared to studies of Spanish 1 Notable exceptions are Bullock (2009) for heritage prosody, Chang et al. (2009) for heritage Mandarin vowels, and Godson (2003) for heritage Armenian vowels.

5

consonants. Quilis and Esgueva (1983) provided some of the first acoustic descriptions

of Spanish vowels, including baseline acoustic measures which showed the presence of

small differences in vowel quality and duration based on stress context and talker gender,

but the overarching assumption has remained that there is little variability within the

system in general.

In addition to the assumption of cross-dialectal similarity in vowel productions,

traditional descriptions of the Spanish vowel system argue that 1) vowel quality and

duration are fairly unaffected by lexical stress (Delattre, 1969; Hualde, 2005; Quilis &

Esgueva, 1983; but in contrast, Marín Gálvez, 1995) and 2) the Spanish mid vowels

exhibit allophonic variation based on syllable type (Navarro-Tomás, 1918; but in contrast

Morrison, 2004; Servín & Rodríguez, 2001). Unfortunately, our knowledge of Spanish

vowel production and the assumptions about the system are primarily based on analyses

of Peninsular varieties of Spanish and/or vowels produced in highly controlled speech

styles (Bradlow, 1994; Morrison, 2004; Morrison & Escudero, 2007). More recent

studies of non-Peninsular varieties and vowels produced in less-controlled speech styles

suggest that there may be more dialectal and stylistic variation in the Spanish vowel

system than previously believed (Delforge, 2008; O’Rourke, 2010; Martín Butragueño,

2011; Poch-Olivé et al., 2008; Willis, 2005).

Acoustic studies of bilingual vowel suggest that bilingual speakers of Spanish exhibit

a modified system of pronunciation that differs from monolingual norms within the same

dialect region. O’Rourke (2010) and Menke and Face (2010) reported differences in

overall dispersion in distinct populations of Quechua-Spanish and English-Spanish

bilinguals and L2 learners, respectively. Willis (2005) reported that Spanish-English

6

bilinguals in the Southwestern United States produced /u/ farther front, /o/ lower, and /a/

farther front than the same monolingual Spanish vowels described by Quilis and Esgueva

(1983). Finally, both Menke and Face (2010) and Alvord and Rogers (2011) reported

that bilingual speakers in the Southwestern United States and Miami-Cuban bilinguals

exhibited considerable unstressed vowel centralization not attested in monolingual

varieties.

Taken together, the results of the investigations discussed above motivate the need

for further examination of Spanish vowel systems and bilingual Spanish vowel

production. Recent acoustic studies have brought into question some of the basic

assumptions about Spanish vowels, resulting in considerable disagreement among

researchers as to how the Spanish vowel system is organized acoustically and which

linguistic and stylistic factors affect vowel quality and quantity. Heritage speakers, who

are argued to “sound” native, provide the opportunity to test the robustness of the effects

of linguistic and stylistic variables on Spanish vowel production, while at the same time

offering insight into the bilingual vowel system.

1.3 Goals

The literature discussed above motivates the need to further investigate HS

pronunciation as well as Spanish vowel production. Thus, the goals of the present study

are as follows:

1) To contribute to the general body of literature pertaining to Spanish vowels by systematically examining HS vowel production in a number of contexts.

2) To provide a descriptive, acoustic characterization of the heritage Spanish vowel system by:

7

a) Assessing the extent to which HS vowels resemble monolingual Spanish vowels as they are traditionally described in the literature.

b) Taking detailed, acoustic measurements of vowel formants, duration, and calculating the distance of HS vowels from the center of the vowel space.

3) To examine the role (or lack thereof) of lexical stress, syllable type, and speech style on HS vowel production.

4) To investigate if and how individual characteristics such as course level, grammar proficiency, travel abroad, Spanish use, and cultural sensitivity influence HS vowel pronunciation.

In order to achieve these goals, a series of speech elicitation instruments and

questionnaires were designed and administered to a group of HS residing in Chicago, IL.

Through a combination of acoustic and statistical analyses, the present investigation

provides an in-depth understanding of Spanish-English bilingual vowel systems.

1.4 Contributions

The present research study makes several important contributions to the fields of

Spanish phonetics and bilingual speech pronunciation. First, this dissertation contributes

to the general body of literature pertaining to Spanish vowels. Traditional descriptions of

the Spanish vowel system argue for a simple, stable, symmetrical vowel system that

exhibits some variation based on speech style and syllable type, but is fairly unaffected

by lexical stress (Hualde, 2005; Quilis & Esgueva, 1983). Analyzing the relationship

between HS vowel production and the aforementioned variables may test the robustness

of previous claims, and offer important insight into how the Spanish vowel system can

vary.

Secondly, the present investigation provides the first acoustic description and

analysis of heritage Spanish vowels as produced by speakers of (primarily) Mexican

8

descent living in the Midwestern United States. This study therefore offers an important

first glimpse at the vowel system of this population.

Thirdly, this initial examination of HS vowels is a crucial first step in systematically

addressing the assumption that heritage speakers resemble native speakers in terms of

their pronunciation. Future comparisons between monolingual speakers of Spanish will

reveal the extent to which HS approximate native speaker norms, which characteristics

they share with native speakers, and on which dimensions they differ.

In addition, the present investigation lays the foundation for future work comparing

HS pronunciation to L2 speech, as well as to additional regional HS varieties throughout

the United States. Future comparisons with L2 learner populations will permit further

examination of how early exposure to a language during childhood affects vowel

production in adulthood. Comparisons with heritage populations in other regions

throughout the United States may offer insight into if and how regional varieties of

heritage Spanish differ across the country, and how contact with different dialects of

English may influence their pronunciation of Spanish vowels.

Finally, an examination of the extralinguistic factors argued to influence HS

morphosyntactic knowledge and HS and L2 (consonantal) segmental production will

reveal 1) the relationship between these same variables and the pronunciation of vowels,

and 2) similarities and differences between HS and L2 learners in terms of how they

benefit from certain linguistic practices.

To preview the results of this investigation, the statistical analyses revealed that 1)

HS exhibit a unique distribution of vowels within the acoustic space that is not attested in

monolingual varieties, 2) lexical stress and speech style impact the degree of

9

centralization and total duration of vowels, and 3) the HS who speak Spanish and travel

more frequently produce vowels more peripherally (i.e., expanded) than those who report

less usage and travel. The findings offer additional support for the argument that

bilingual speakers exhibit a modified system of pronunciation. The results also suggest

that some of the same activities known to have a positive impact on L2 phonology also

influence HS vowel production.

1.5 Dissertation outline

The remainder of this dissertation is organized as follows. Chapter two provides a

comprehensive review of the literature describing heritage speakers, vowels, and

bilingual pronunciation. The research questions guiding the present study and the

hypotheses are motivated at the end of chapter two. Chapter three describes the

methodology employed in the present study, including the details of the instruments,

participants, data analysis, and statistical analyses employed to analyze the data. The

results of the statistical analyses conducted on the data are described in chapter four. A

discussion of the results is presented in chapter five, and the overall conclusions in

chapter six.

10

2 PREVIOUS LITERATURE

The goal of this chapter is to discuss the previous literature that has relevance to this

dissertation. Section 2.1 defines the population under investigation—heritage speakers—

and describes some of the current findings pertaining to their linguistic knowledge. The

purpose of section 2.2 is to discuss vowel systems, with specific emphasis on Spanish

vowel systems. The contributions of both impressionistic (section 2.2.2) and acoustic

(2.2.3) studies of Spanish vowels will be discussed, and a separate section is devoted to

reviewing the literature on bilingual and contact vowel systems of both Spanish and

English (sections 2.3.1 and 2.3.2, respectively). The final objective of this chapter is to

discuss the connection between language and an individual’s language background and

use, connection with the heritage culture, and cultural sensitivity and pronunciation

(section 2.4). The chapter will conclude with a synthesis of the previous literature and

how it relates to the goals of the present investigation, as well as the research questions

and hypotheses that are guiding this study in section 2.5.

2.1 Heritage speakers

This portion of the chapter will 1) introduce the population of investigation in the

present study, and 2) describe some of the research that has focused on their linguistic

systems. A definition of heritage speakers will be provided, as well as an overview of the

investigations focusing on differences between this population and L2 learners. The

review of the literature pertaining to the fields of morphosyntax and pedagogy is not

exhaustive, but is sufficient to explain some of the most important characteristics of this

population. The final portion of this section discusses studies of bilingual and heritage

11

speaker pronunciation, again presenting the most important findings reported in the work

on early bilingual and heritage language pronunciation and phonology.

2.1.1 Who are heritage speakers, and why should we learn more about them?

The definition of a “heritage speaker” (hereafter HS) presents a wide-range of

variability. This definition can be viewed as a continuum which includes, at one extreme,

those individuals who feel a connection or association with their heritage but do not

speak or understand the heritage language, to those who have advanced comprehension

and oral communication abilities (Mikulski, 2010; Mrak, 2011). Much of the literature

pertaining to HS of Spanish defines this group as consisting of bilingual speakers of

English and Spanish who grew up speaking Spanish in the home and community, but

who were formally educated almost exclusively in English (Bolger & Zapata, 2011;

Potowski et al., 2009; Montrul et al., 2008). HS were raised in Spanish-speaking

households and acquired the Spanish language naturalistically at a very early age, often

from birth, and often simultaneously with English (Bolger & Zapata, 2011). The early

exposure to Spanish results in their exhibiting communicative and comprehensive

proficiency in the Spanish language even though they often do not receive formal

education in the language until high school or college (Montrul & Bowles, 2010). The

majority language, in this case English, however, is the typical language of social

interaction outside of the home and community with peers and siblings (Lynch, 2003).

Given that HS exhibit variability in use of the heritage language and also differ in

terms of the quantity of input they receive in the home, many researchers have argued

that the HS linguistic system is “incomplete,” “deficient,” or “abbreviated” when

compared to that of a monolingual (Bolger & Zapata, 2011; Potowski et al., 2009;

12

Montrul et al., 2008; Montrul, 2010). In addition, Kagan (2005) argues that HS differ

from late L2 learners (i.e., those who began learning a second language after puberty), as

well as from monolingual native speakers of the heritage language, and should be treated

differently from these populations in terms of instruction. These arguments have

prompted a considerable amount of research in the separate areas of morphosyntax and

pedagogy that examines how HS, L2 learners, and monolinguals differ with respect to 1)

their morphosyntactic knowledge and 2) their respective pedagogical needs.

The majority of the literature pertaining to HS has examined HS knowledge and

judgments of different grammatical structures and analyzed how their performance on

oral and written tasks compares to that of late L2 learners. Although an exhaustive

review of the morphosyntactic literature is not possible here, several notable studies

describe the similarities and differences between learner and heritage populations. The

assumption present in much of the literature is that early exposure to the Spanish

language results in HS benefitting in the domain of pronunciation, but not, or not as

much, in the morphosyntactic, syntactic, or lexical domains (Montrul et al., 2008).

A number of investigations conducted by Montrul (2005; 2010) and Montrul,

Foote, and Perpiñán (2008a; 2008b) seem to confirm that HS do not exhibit superior

grammatical knowledge or performance when compared to L2 learners, but subtle

differences do exist between the groups based on the type of task (oral vs. written) and

especially once proficiency level is taken into account. Montrul (2005), for example,

examined HS, L2 learner, and monolingual judgments of the syntax and semantics of

unaccusative and unergative verbs in Spanish. The HS and L2 learners were grouped

into low, intermediate, and advanced levels based on their scores on a grammar

13

proficiency test. The general results in this study show that L2 learners and HS at

advanced levels are very knowledgeable of the syntax of unaccusativity in Spanish, and

that they show similar patterns in terms of acceptability ratings. The HS at low levels,

however, differed from low-level L2 learners in their acceptance of the semantic reading

of unaccusative and unergative verbs. Whereas the HS showed variable patterns of

acceptance that were less robust than the intermediate and advanced groups, they

outperformed the low level L2 learners who showed no knowledge of the semantic

differences between these verb types. Thus, Montrul’s (2005) results suggest that, while

L2 learners and HS are similar with respect to their syntactic knowledge and judgments,

at the low levels of proficiency, HS early exposure to language may result in a slight

benefit in semantic knowledge.

An additional study by Montrul et al. (2008a) analyzed L2 learner and HS

knowledge of grammatical gender in Spanish. Late English-Spanish bilinguals (L2

learners), early Spanish-English bilinguals (heritage speakers), and monolingual control

subjects completed a grammar proficiency test, two written tasks testing knowledge of

grammatical gender, and an oral production task. The proficiency tests revealed that the

HS and L2 learners did not differ statistically with respect to the performance on this test,

but that they scored significantly lower than the monolingual controls. The results from

the written tasks on gender agreement indicated that L2 learners performed significantly

better than the HS and made fewer, more systematic errors. In contrast, the HS exhibited

fewer errors in the speech production task when compared to the L2 learner groups. The

results of this investigation support the argument that early language exposure does not

result in significant gains in morphosyntactic knowledge, as the HS and L2 learners

14

exhibited comparable proficiency scores and showed similar error patterns. The benefit

was observed in production, however, with the HS producing fewer agreement errors in

the oral task.

Montrul (2010) investigated L2 and HS knowledge of clitics, clitic left

dislocation, and differential object marking. Heritage speakers and L2 learners

participated in production tasks and grammaticality judgments tasks containing these

structures. In terms of production, the HS produced more clitics, behaving similarly to

the monolingual Spanish speakers, whereas the L2 learners produced more NP’s—

evidencing greater transfer effects from English. Heritage speakers also approximated

the native speaker norm in their judgment of sentences containing clitic left dislocation,

and outperformed the L2 learners who assigned lower acceptability ratings to these

sentences. The differences between L2 learners and HS were most notable in the

differential object marking tasks. Although both groups had difficulty marking animate

direct objects in the production tasks, the HS outperformed the L2 learners. In the

grammaticality judgment task, however, the L2 learners were less accepting of

ungrammatical sentences than HS. This result confirms previous findings that HS tend to

perform better on oral and production tasks than L2 learners, but that learners often

outperform HS on written tasks. Overall, this study shows that both groups of bilinguals

have difficulty with object markers, but they manifest themselves differently in each

population.

In a related study, Montrul et al. (2008b) investigated the knowledge of Wh-

movement in Spanish by the same participants examined in their 2008(a) study on

grammatical gender. Their initial analysis of language proficiency and grammaticality

15

judgments of all sentences types indicated that the early (HS) and late (L2) bilinguals

showed no significant differences in proficiency or in judgments. They then divided the

participants into high, intermediate, and low proficiency groups based on the scores from

the proficiency test, and examined performance on all of the different sentence types

separately. This reexamination of the grammaticality judgment task showed that mid and

high level HS accepted ungrammatical sentences without complementizers more than the

mid and high level L2 learners. The low level HS and late bilinguals showed even higher

acceptability ratings of these ungrammatical sentences, but the difference between these

two groups at this level was not statistically different. A final step in this investigation

examined the individual response patterns of the participants, grouping them according to

their preferences. These results showed that HS exhibited patterns that were more similar

to the native speakers than those of the late L2 learners. The results of this investigation

are thus somewhat contradictory. In terms of proficiency, overall judgments, and

acceptance of ungrammatical Spanish sentences without complementizers, HS and L2

learners appear very similar. These results support the argument that early exposure to

the language does not result in a benefit in knowledge of the syntactic structures tested.

On the other hand, when response patterns were analyzed, the HS exhibited patterns that

were more similar (although not statistically) to those of native monolinguals than did the

L2 learners. Taken together, the findings from this study provide evidence that HS and

L2 learners are similar in terms of their grammatical knowledge in some respects, but

differ in others.

The results of the type obtained in the four previous studies mentioned above lead

to several, sometimes contradictory, conclusions about the differences between HS and

16

L2 learners and how overhearing the language during childhood affects grammatical

knowledge and performance. In terms of language proficiency, HS and L2 learners, and

especially those at the advanced levels, perform similarly on morphosyntactic and

syntactic tasks, suggesting little benefit in these domains as a consequence of learning

Spanish early. Both populations have been argued to exhibit similar errors and evidence

of transfer effects from the dominant language, but these “errors” may manifest

themselves in different ways. Subtle differences do exist between these groups of

bilinguals, however, in that HS perform better on oral tasks, L2 on written tasks.

Differences in grammaticality judgments exist, but are most observable in the lowest

level of learners. Finally, analysis of overall response patterns and consistency of

errors/judgments suggests that HS more closely mimic the monolingual control patterns

than do the L2 learners.

In addition to the studies pertaining to HS morphosyntactic knowledge and

grammaticality judgments, research has also questioned the pedagogical needs and types

of language instruction that are most beneficial to them. Kagan (2005) argues strongly

that HS (or in her terms, HLL’s) are neither native monolingual speakers nor L2 learners

and thus need to be treated separately from those two groups. In particular she argues

that HS “1) cannot be dismissed as native speakers who need no instruction, 2) do not

need to be placed in beginning-level language classes, 3) can be tracked and placed

according to their background, and 4) need a curriculum with a structure and a set of

materials that differ considerably from those intended for foreign language students” (p.

216). With respect to her fourth point, she specifically argues that classical literature,

17

film, and poetry, should be included in the curriculum, and that a cross-cultural and

comparative approach be introduced to facilitate higher-level language skills.

Valdés (2005) is in accordance with Kagan (2005), citing even greater distinctions

within HS populations. She argues that not only should instructional strategies differ for

HS and L2 learners, but that also the stage in the HS language continuum in which an

individual falls should determine the type of language instruction he or she needs. For

example, HS who failed to acquire a structure as part of the L1 may benefit from form-

focused language instruction similar to that employed in an L2 classroom. In contrast, in

cases of attrition in which a form has already been acquired but lost, focused instruction

may be inappropriate. The individual who falls into this category may benefit more from

the type of input and interaction more akin to a monolingual linguistic interaction, as

opposed to traditional grammar exercises presented to L2 learners.

As with the investigations pertaining to morphosyntax, an exhaustive review of

pedagogical studies will not be presented here. One study in particular conducted by

Potowski, Jegerski, and Morgan-Short (2009), however, exemplifies how L2 learners in a

traditional foreign language classroom and HS differ in terms of how they benefit from

overt language instruction. The primary goal of their investigation was to examine the

differences and similarities between HS and L2 learners’ acquisition of the subjunctive

mood via two different types of language instruction: processing instruction and

traditional instruction. Their goal was to determine if HS would benefit from overt

grammar instruction of this structure in the same or different way as late L2 learners

enrolled in a traditional language classroom. One hundred and one HS and 22 L2

learners completed a pretest and a posttest, and completed 9 different tasks (either

18

processing instruction or traditional instruction, depending on the group). Ten control

subjects who did not receive treatment were also included. Overall, the results of this

investigation showed that heritage speakers also benefit from overt language instruction

of grammatical structures, but that they differed in certain ways from the L2 learners.

Although the HS and L2 learners both showed improvement in the posttest for the

interpretation and production tasks, only the L2 learners showed improvement in the

grammaticality judgment task. This improvement, coupled with the fact that L2 learners

actually outperformed the HS on the post-test, indicated that HS do not benefit from these

types of language instruction in the same way that L2 learners do. The authors explain

that this difference may be due in part to the fact that L2 learners are more accustomed to

structured grammar activities. In contrast, the HS who were enrolled in upper-level

university courses were more accustomed to writing and production activities. Given a

statistical complication, it was not possible to determine which form of instruction was

significantly more beneficial for either group. The overall conclusion, however, is that

HS do benefit from overt language instruction, but the effect many not be as pronounced

for this population when compared to L2 learner populations. This conclusion supports

the need for different types of instructional strategies for HS of Spanish.

Taken together, the research presented above portrays heritage speakers as a

unique population. In some ways, they share characteristics with late L2 learners who

acquired Spanish in a traditional foreign language classroom such as, overgeneralization

and simplification of variable structures, lexical extension, and syntactic calquing

(Lynch, 2003). In other respects, HS differ from L2 learners in terms of the performance

on oral production tasks and their ability to recognize fine-grained semantic differences

19

between different syntactic and morphosyntactic structures (Montrul, 2005; Montrul et

al., 2008a, 2008b). In this way, HS often more closely resemble monolingual native

speakers of Spanish when compared to age-matched L2 learners who acquired the

language later in life. Additional research in the field of language pedagogy supports the

need for HS to be treated differently from L2 learners, indicating that they do not benefit

in the same way from traditional forms of grammar instruction as the late L2 learner does

(Potowski, et. al, 2009). Lynch (2003:11) specifically argues for content-based,

communicative approaches in HS language classrooms, arguing that their language has

been “dialogic, discursive and absolutely contextual from the beginning.”

One other crucial difference between HS and L2 learners is that for the HS,

language is often a means of establishing, maintaining, or expressing identity and a

connection with the heritage culture. Further explanation and examination of the HS

identity as expressed through language will be presented in section 2.4.

2.1.2 Early bilingual and heritage pronunciation

Although very few studies have examined the pronunciation of heritage speaker

populations as they have been defined here, a considerable body of literature has

examined bilingual pronunciation, as well as the pronunciation of early and late bilingual

populations. These studies have revealed several important characteristics of bilingual

speakers and the factors that influence the pronunciation of the L2 and the L1. The first

important finding is that bilingual speakers’ pronunciation often approximates the native

speaker norm, but their production is not identical to that of a monolingual native (Yavas,

1996; Zampini & Green, 2001). Several studies analyzing the VOT of /ptk/ and /bdg/ in

English and Spanish, for example, indicated that bilinguals produced these sounds with

20

intermediate values that fell somewhere between those of fluent monolinguals of English

and Spanish (Flege & Eefting, 1987, 1988; Yavas, 1996). The intermediate values may

be the result of a bi-directional influence of the L1 and L2 on speech production, lending

evidence to the notion that the bilingual has a modified system of pronunciation and not

two completely separate systems (Flege & Eefting, 1987; Zampini & Green, 2001).

Finally, factors such as the age of learning a second language and the age of arrival to a

country in which the L2 is spoken also influenced the perception of foreign accented-ness

in speech. Bilinguals who acquired a language later in life or arrived to the country in

which the L2 was spoken as adults were rated as sounding less native than those who

learned their L2 as children or arrived in the country as youngsters (Flege et al., 1995;

Munro et al., 1996).

As introduced earlier in this chapter, one of the main claims present throughout

the literature is that as a result of their early exposure to the Spanish language, HS

experience a benefit in the phonological domain and in pronunciation but not necessarily

in morphosyntax (Knightly et al., 2003; Montrul et al., 2008a). In order to test this

assumption, Knightly, Jun, Oh, and Au (2003) conducted a series of experiments

consisting of acoustic analysis as well as grammaticality judgment and written

morphosyntactic tasks.2 In their 2003 investigation, Knightly et al. examined the speech

2 Although not discussed in detail in the text, two additional studies were conducted by this group of researchers that preceded the investigations mentioned here. Au et al. (2002) was a precursor to Knightly et al. (2003). The results of this study were nearly identical to those reported in Knightly et. al (2003): overhearers produced /ptkbdg/ in a more native-like manner than late L2 learners, and there was no statistical difference in morphosyntactic knowledge between the childhood overhearers and the L2 learners. This first study in the series showed, however, that merely overhearing Spanish in childhood is sufficient to provide pronunciation benefits. An additional study by Oh, Jun, Knightly, and Au (2003) examined childhood speakers, childhood overhearers, and novice learners of Korean. These results again showed that childhood speakers benefitted in their pronunciation of VOT, their perception of phonemic contrasts, and had higher accent ratings than overhearers and learners. That the same effects reported for Korean were

21

production and grammatical knowledge of 15 Spanish-English bilinguals (native speakers

of Spanish), fifteen late-learners of Spanish who overheard Spanish during childhood but

were not schooled formally in Spanish until adolescence (childhood overhearers), and

fifteen late Spanish learners who had no exposure to Spanish until high school or college

(“typical” late L2 learners).3 The three groups of Spanish speakers participated in several

oral production tasks (i.e., carrier phrase task that examined VOT, retelling of the Frog

Story), as well as several morphosyntactic tasks that examined grammaticality judgments,

the construction of NP’s with gender and number agreement, and a VP completion task.

The results of the analyses of speech production tasks indicated that childhood

overhearers exhibited shorter VOT values for /ptk/ (i.e., more “Spanish-like”) than the

late L2 learners. These differences were particularly robust in word-initial position.

With respect to voicing, the overhearers produced /bdg/ with voicing characteristics that

were intermediate to the native controls and the L2 learners, but the differences were not

statistically significant. Lenition of /b/ and /g/ also differed between the overhearers and

the L2 learners, with the overhearers outperforming the learners with respect to their

pronunciation of /b/, but the reverse was true for the pronunciation of /g/. Native-ness

ratings of the Frog stories revealed that overhearers were rated as more “native-

sounding” than L2 learners, but less so than the native speakers. Finally, the results of

the morphosyntactic components of the experiment revealed no significant differences in

grammaticality judgments or performance between the overhearers and the L2 learners.

corroborated with Spanish in Au et al. (2008), providing further evidence of the benefits of not only childhood overhearing, but more importantly of childhood speaking. 3 The term “typical” is used by Knightly et al. to describe an L2 learner who acquired Spanish in a traditional foreign language classroom setting. I want to clarify that the word “typical” refers to the type of instruction they received, and not that any learner can be classified with such a term given the wide range of variation observed in L2 learner populations.

22

The results of this investigation lend evidence to the argument that overhearing a

language during childhood results in more native-like pronunciation, but that the same

benefit is not observed in other (i.e., morphosyntactic) domains.

Au, Oh, Knightly, Jun, and Romo (2008) conducted a similar study that not only

examined the production and grammatical knowledge of childhood overhearers, but

included childhood speakers of Spanish as well. These two groups, overhearers and

speakers, were then compared to late L2 learners and native speaker controls in order to

test if speaking, as opposed to simply overhearing, a language during childhood results in

greater gains in pronunciation and grammatical competency. Ten childhood speakers, 20

childhood overhearers, 39 L2 learners who did not speak or overhear Spanish during

childhood, and 25 native speakers of Spanish took part in the experiment. All

participants were first tested on their production and comprehension of slang by

translating orally a series of English slang phrases into Spanish and Spanish slang phrases

into English. The responses were then compared to a list of acceptable translations

compiled by native Spanish speakers from various dialect regions. The results of this

first task showed that native speakers had the highest translation accuracy, followed by

the childhood speakers, the overhearers, and the L2 learners with the littlest knowledge of

slang. The other instruments utilized in this study are identical to those used in Knightly

et al. (2003): production of sentences in a carrier phrase to test VOT, retelling of The

Frog Story, a noun phrase production, a verb phrase production, and a grammaticality

judgment task. They also included a task which examined the perception and production

of sentences in noise. Like the results of Knightly et al. (2003), the findings of this

investigation also showed that early exposure to Spanish during childhood resulted in

23

more native-like pronunciation. The crucial difference, however, is that the speakers

tended to outperform the overhearers in terms of their native-ness ratings and their

repetition and perception of sentences in noise. In addition, the childhood speakers were

found to produce narratives that were more grammatically well-formed and were better at

detecting errors in ill-formed grammatical constructions. Overall they conclude that

although overhearing a minority language during childhood has a positive impact on

pronunciation in adulthood, actually speaking the language during childhood provides

additional gains.

Au et al.’s (2008) research concludes with a discussion of which other factors

may have resulted in the childhood speakers outperforming the other two non-native

groups. The estimated number of hours that each of the groups was exposed to Spanish

was found to correlate with the well-formed-ness of production in the narrative task: the

more exposure, the more well-formed the constructions were in the narrative. An

analysis of daily Spanish use and motivation to speak Spanish was also conducted. These

analyses revealed there were no differences between groups in terms of their daily use of

Spanish and motivation to speak Spanish outside of the classroom, and no significant

correlations between this usage and their pronunciation and grammatical knowledge.

There was some suggestion that motivation to learn and maintain their Spanish abilities

may affect native-ness ratings (pronunciation), but the statistics revealed no reliable

differences between the groups. Even though none of the extralinguistic factors

examined in the subsequent analyses correlated with or explained the differences between

overhears and speakers, Au et al. (2008) argue that quantity of input, Spanish use, and

24

motivation may have a significant impact on pronunciation and grammatical knowledge

if a larger sample were to be analyzed.

In addition to the research conducted by Au et al. (2008), Knightly et al. (2003),

three other notable acoustic studies on heritage language pronunciation have been

conducted, although none of them examined heritage Spanish and only two of them

examined vowels. Bullock (2009) for example, examined the patterns of intonation

exhibited by two heritage speakers of French in Frenchville, Pennsylvania. In particular

she examined penultimate prominence, focus via prominence in situ, and the prosody of

left dislocation. A series of acoustic analyses of the intonation contours produced by

these speakers in French indicated that their pitch accents and use of tonal contours

differed from the patterns generally attested for monolingual varieties of French, but were

similar to those of English. It was argued that their use of intonation as a means to focus

or emphasize certain structures within a phrase was most likely the result of long-term

contact with English in the area, but interestingly, the syntactic characteristics typically

employed to signal information structure (left dislocation and clefts) were still present in

their speech. The integration of English-like focus and prominence strategies via

intonation was determined to be an additional recourse for these speakers. Their contact

variety thus maintained many of the characteristics of the heritage language system, but

also incorporated elements of the majority language. The result was a unique variety

which provided additional opportunities and strategies to express prominence in French.

Chang, Haynes, Yao, and Rhodes (2009) analyzed the vowel pronunciation and

laryngeal characteristics (VOT) of Mandarin native speakers, heritage speakers of

Mandarin, and native English learners of Mandarin. They examined the acoustic

25

characteristics of the rounded vowels /u/, /ou/, and /y/ in English and Chinese. The

vowel articulations and words containing the words with distinct laryngeal categories

were obtained from a list of read phrases in English and Mandarin. The acoustic analyses

of F1, F2, and F3 of these vowels indicated several important differences between the

native, heritage, and learner groups.4 Native speakers of Mandarin produced both

Mandarin and English /u/ with the lowest F2 values (i.e., farther back), the L1 English-

dominant learners of Mandarin produced /u/ with the highest F2 values (i.e., farther

front), and the heritage speakers of Mandarin produced /u/ in both languages with

intermediate values that fell between the productions of the natives and learners. Chang

et al. suggest that this pattern indicates that the native language of the participants (i.e.,

native Mandarin vs. native English L2 learners of Mandarin) influenced the production of

the L2 sounds: Native Mandarin speakers produced English /u/ in a manner more similar

to Chinese /u/, whereas the native English-speaking L2 learners of Mandarin produced

Chinese /u/ with values more similar to English /u/. The heritage speakers were found to

produce /u/ vowels in both languages that were closer to the native productions. A nearly

identical pattern was observed for the Mandarin vowel /ou/, with native Mandarin

speakers producing the /ou/ with the lowest F2 value, the learners the highest F2 value,

and the HS clustering between those values. This same difference between groups was

not observed, however, for the production of the Mandarin rounded vowel /y/.

The results of this experiment revealed two very important conclusions that are

relevant to the present investigation. The first is that the three groups of speakers

produced both Mandarin and English vowels that differed according to distribution

4 Although the F3 value was measured in Chang et al.’s (2009) study, they do not report any analyses conducted on the third formant.

26

within the space. That is, each group’s vowel space was distinct in each language. A

second important finding is that HS of Mandarin produced the high /u/ and the mid /ou/

with values that were intermediate to those of the natives and the learners. They were

also found to produce vowels that were closer to the native speaker norms when

compared to the production of the L2 learners, and maintained the greatest difference

between the categories in the two languages.

The second experiment in this study examined each speaker’s VOT of English

voiced /bdg/, voiceless /ptk/, Mandarin unaspirated /ptk/ and Mandarin aspirated /pʰ, tʰ,

kʰ/. The results of the analyses of VOT for these three speaker groups were less robust

than the findings of vowels, but still revealed important differences between HS of

Mandarin and L2 learners. The most important finding from the second analysis was that

Mandarin native speakers and HS of Mandarin were able to produce differences between

the Mandarin unaspirated and English voiceless stops, but the L2 learners were not. The

authors also report that there was a correlation between speaker rank (i.e., use of

Mandarin) and these productions. This significant correlation indicated that as a

participant’s experience with Mandarin Chinese increased, they exhibit greater degrees of

aspiration of the Mandarin aspirated stops. Taking into consideration the findings of

these two experiments within the study, HS of Mandarin exhibit a modified or

intermediate system of vowel pronunciation. The pronunciation of the aspirated

Mandarin stops was also affected by level of experience with the language, such that

more experience resulted in greater segmental production accuracy.

27

The only other study to date to examine the vowel pronunciation of heritage

speakers was conducted by Godson (2003) for Western-Armenian.5 The goal of this

study was to examine if the onset of acquisition of English (early childhood or adulthood)

would affect the pronunciation of Armenian vowels. The participants were two groups of

bilingual speakers of Armenian and English living in Southern California, and one native

monolingual of Armenian. Eight participants were Armenian-English bilinguals who

began acquiring English before the age of 8 (“interrupted acquirers” of Armenian), ten

were Armenian-English bilinguals who acquired English as adults (“uninterrupted

acquirers” of Armenian), and one speaker served as a monolingual Armenian control.

All participants were female, and all except for the monolingual control indicated that

they were either English dominant or considered themselves to be bilingual. The vowel

production of Armenian vowels was conducted using a carrier phrase task. The vowels

under investigation were embedded within the words used within the task. The vowel

productions were later analyzed acoustically by measuring the first four formants at the

steady-state of each vowel. In order to assess the potential influence of English on

Armenian vowel production of the two groups of bilinguals, formant values of California

English obtained from a previous study were used for comparison.

The results of the acoustic and statistical analysis revealed that the two bilingual

groups (“interrupted” Armenian bilinguals and “uninterrupted” Armenian bilinguals)

differed in terms of the production and acoustic distribution of the vowels /i/, /ɛ/, and /a/.

The uninterrupted bilinguals who had acquired English later in life produced these

vowels that were closer to the Western-Armenian baseline, whereas those bilinguals who

5 To the best of my knowledge, there are no other acoustic studies of Spanish vowels produced by HS from the United States, as the population is defined here. For heritage Spanish vowel perception, see Boomershine (2011). For Spanish-Catalan bilingual vowel production, see Amengual (2011).

28

learned English earlier in life and thus their acquisition of Armenian was interrupted,

more closely approximated the English measures. The production of the /a/ was found to

be farther front for the both groups of bilingual speakers when compared to both

monolingual baselines. The production of the back vowels /o/ and /u/ did not differ

statistically between the two bilingual groups, but both groups productions differed

considerably from the baseline measures, and especially from the English baseline

measure. This difference was attributed to the fact that Armenian back vowels are

rounded, but that previous research suggests that California back vowels are often

unrounded. The distinction in back vowel pronunciation between the monolingual

English variety and the bilingual varieties suggest that bilinguals resisted integrating this

characteristic of English into their Western Armenian pronunciation. Overall, this

investigation showed that the uninterrupted Armenian bilinguals (i.e., those who learned

English later in life) produced vowels that resembled the monolingual baseline for this

language, but were also distinct from the interrupted bilinguals who learned English by

age 8. This result is consistent with previous accounts of a modified vowel system for

heritage speakers of a language. The influence of the second language was not consistent

across all vowels, however. The research concludes by arguing that theories of bilingual

speech production that argue for consistent influence of the L2 across the board are too

general in nature.

In summary, the research presented in this section portrays HS as a population

that is neither a monolingual nor a language learner in the most traditional sense.

Although there is much variation within heritage populations, HS generally exhibit

communicative proficiency in the heritage language, even if their formal grammatical

29

knowledge is incomplete. They share some characteristics with L2 learners, but often

outperform later learners in their ability to detect fine-grained differences in meaning and

structure. Their pronunciation is often judged to be superior to, at least more native-like

than, that of L2 learners, but HS pronunciation is not identical to that of a monolingual

native either. The acoustic studies of speech production of HS and early bilinguals have

shown that their system can often be described as intermediate, falling somewhere

between that of a native and an L2 learner.

This dissertation focuses primarily on describing and exploring the phonological

system of HS of Spanish with specific reference to their Spanish vowel system. The two

studies conducted by Chang et al. (2009) and Godson (2003) indicate that HS of

Mandarin and Armenian, respectively, show patterns of vowel production that differ from

established monolingual norms in both languages. While the present study does not

directly compare the production of HS vowels to monolingual speakers who had

completed the same set of elicitation tasks, it will comment on the nature of the HS vowel

organization and dispersion, and how these patterns compare to traditional descriptions of

the Spanish vowel system. More information pertaining to vowels, and Spanish vowel

systems in particular, is presented next in section 2.2.

2.2 Vowel systems

This portion of the chapter introduces the basic articulatory and acoustic qualities

of vowels, discusses previous research on Spanish monolingual vowel systems, and

comments upon additional research of Spanish bilingual vowel systems and heritage

English vowel systems.

30

2.2.1 General properties of vowels

Unlike consonants which are produced with a blockage or constriction in the

vocal tract that impedes the airflow, vowels are a class of speech sounds that are

produced with a relatively unobstructed airway (Ladefoged, 2001). In Spanish, vowels

are classified as high, mid, or low in terms of their height, and front, central, and back in

terms of their location along the front-back continuum. Some introductory Spanish

textbooks, such as Hualde (2005), suggest that the height and backness of vowels

correspond to the position of the tongue body in the mouth when it produces each vowel,

such that “front” vowels are produced with a fronted tongue position and “back” vowels

with a retracted tongue position. Table 2-1 represents the distribution of the five Spanish

vowels as they are typically presented in introductory textbooks.

Table 2-1

Spanish vowel system based on articulatory parameters

front central back

high i

u mid e

o

low

a

Considerable research in the UCLA Phonetics Laboratory, however, has indicated

that vowels are more accurately classified by taking into account their acoustic properties

and not articulatory parameters (Ladefoged, 1980). The location of the constriction in the

mouth and the resulting size and shape of the vocal cavity result in each vowel being

characterized by its own set of resonant frequencies (Rosner & Pickering, 1994). These

resonant frequencies, known as formants, are characteristic overtones that distinguish

each vowel. Generally speaking, the first three formants are sufficient to describe a

31

vowel’s height, backness, and rounding. The first formant (hereafter F1) is the lowest

resonant frequency and can be equated with the vowel’s height. There is an inverse

relationship between vowel height and F1 frequency, such that low vowels, such as /a/,

have higher F1 frequencies, and high vowels such as /i/ have low F1 frequencies. The

second formant frequency (F2) is correlated with the position of the vowel along the

front-back continuum. Vowels that are categorized as “front” (i.e., palatal, such as /i/)

have higher F2 frequencies than those that are produced with a constriction farther back

(i.e., velar, such as /u/) in the oral cavity. The third formant (F3) is an important correlate

of rounding, as well as r-colored vowels in American English. In Spanish, the two back

vowels /o/ and /u/ are claimed to be redundantly rounded (Hualde, 2005), but as this

rounding is only found in these two Spanish vowels and is redundant with backness, the

F3 is often not analyzed. The third formant is of particular interest in languages like

French that have front rounded vowels.6

In order to better understand the organization and distribution of vowels within

the acoustic space, Figure 2-1 below is a reproduction of the formant values obtained in

an acoustic study of Spanish vowels by Quilis and Esgueva (1983). The values of the

first formant frequency (F1) are presented along the y-axis, and the values for the second

formant frequency (F2) are located on the x-axis.

6 For further reading on the acoustic and articulatory properties of vowels see Ladefoged (1980; 2001; 2003; 2005), Johnson (2003); for Spanish vowel systems, see Hualde (2005).

32

Figure 2-1. Figure of formant values reproduced from Quilis and Esgueva (1983). These values are based off of the F1 and F2 frequency of sixteen male speakers of varying Spanish dialects.

There are several challenges in analyzing the acoustic properties of vowels. The

first is that vowels are highly context sensitive, and their formant frequencies can be

affected substantially based on their immediate consonantal context. The seminal study

by Stevens and House (1963) analyzed the effects of different consonantal contexts on

the formant frequencies of either American English vowels. They first examined the

differences between vowel formants when the vowel was produced in an isolated context

versus when it was produced in a “null” context involving the frame /hVd/ (i.e., “heed,”

“hid,” “had,” etc.). They found that there were no significant differences in formant

values of vowels produced in the isolated and null contexts. For this reason, many

analyses of English vowels often analyze the productions within the hVd consonantal

frame.

The results of their analyses of vowel formants in other consonantal contexts were

very different. They analyzed the production of the same eight English vowels (/i I ε æ ɑ

/i/

/e/

/a/

/o/

/u/

200

300

400

500

600

700

5001000150020002500

F1 H

z

F2 Hz

Spanish formant values (Quilis & Esgueva,1983)

33

ʊ ʌ/) in nonsense words that began with the unstressed syllable /hǝ/ followed by a

stressed syllable with identical consonants on either side of the target vowel (e.g., /hǝbIb/,

/hǝbæb/). The consonants examined were /p b f v θ ð s z t d č ǰ k g/. The results of their

analyses indicated that consonantal context resulted in formant values and patterns of

movement that were different from the isolated and null context vowel formants.

Although the average shifts in F1 were minimal for all vowels, the effect of context on F2

was substantial and dependent upon the consonant and the vowel in question. They

argued that the main trend was for the F2 to of the vowel to shift away from its value in

the null context towards a more centralized position. This means that for the front

vowels, the F2 tended to shift downward (i.e., farther back), and for the back vowels, the

F2 tended to shift upward (i.e., farther front). The front vowels showed the greatest

changes (i.e., greatest lowering of F2) when the surrounding consonants were postdental

or labial and the least effect when in contact with velars. For the back vowels, the

postdental consonants resulted in the largest shift upward of F2, whereas labials had little

effect. They argued that these changes in formant values were the result of how far the

articulators had to move from the consonant to the vowel and then back to the consonant.

The distance and amount of movement resulted in distinct formant values when

compared to a null context.

Stevens and House also analyzed the effect of manner of articulation and voicing

characteristics on the formant values, again describing differences in formant values

based on the characteristics of the consonantal context. These results revealed that when

the surrounding consonants were voiced, the F1 was higher than when the surrounding

consonant was voiceless. Along the F2 dimension, the front vowels showed higher F2

34

values when in contact with voiced consonants than when in contact with voiceless

consonants. The manner of articulation also affected the F2 values of vowels but differed

for front and back vowels. The front vowels /I, ε/ exhibited higher F2 values for stops as

compared to fricatives. Higher F2 values for fricative contexts were observed for the

back vowels when compared to the formant values produced in stop contexts. Overall,

this investigation showed that vowels were affected by the place, manner, and voicing of

the surrounding consonantal contexts. Most studies of monophthong vowels take

formant measurements at the center of the vowel, or try to identify a steady state portion

(Martínez-Celdrán, 1995; Morrison, 2004; Willis, 2005). This helps avoid the influence

of the context and ensures that measurements are not taken at the formant transitions.

Stevens and House’s work, showed, however, that even when the measurements of the

formants were taken at a steady-state midpoint, contextual and coarticulatory effects were

observed.

In addition to coarticulatory effects caused by surrounding consonants, several

other factors need to be considered when analyzing vowels. Although not as relevant to

the current investigation as consonantal context, factors such as vocal tract length,

gender, and regional variety also influence the formant frequencies of vowels. In terms

of vocal tract length, long vocal tracts result in lower formant values, and shorter vocal

tracts result in higher formant values (Rosner & Pickering, 1994). Vocal tract length

varies by age and gender, such that men, who typically have longer vocal tracts, tend to

produce vowels at lower frequency ranges than those of women, and subsequently

children (Hillenbrand et al., 1995; Peterson & Barney, 1952; Rosner & Pickering, 1994).

The differences in vocal tract size often result in the appearance of women having a more

35

expanded vowel space, or producing vowels in more peripheral locations than men

(Rosner & Pickering, 1994). There is some suggestion that the differences in vowel

production observed for males and females may be due in part to sociolinguistic factors

or conscious efforts made by females to modify their vowel articulations. Several studies

argue that women exhibit a wider acoustic space than men either by consciously

modifying the constriction size of their vowels (Goldstein, 1980) or as an “expressive

posture which is socially more appropriate for one sex or the other” (Labov, 1972, p.

304). Despite these claims, Rosner and Pickering conclude that the differences in

formant frequencies are most likely due to anatomical differences.

One final factor which is known to influence vowel formant frequencies deals

with regional variety. It is widely accepted that varieties of American English exhibit

substantial patterns of variation in vowel production based on geographic region (Clopper

et al., 2005; Thomas, 2001). Thus, English speakers from the areas such as Wisconsin

differ greatly from those speakers in the American south with respect to how they

produce certain vowels. For Spanish, however, this regional variation is argued to be

virtually non-existent. Hualde (2005, p. 128) argues that “vowel qualities are

remarkably stable among Spanish dialects. There is nothing like the differences in vowel

quality that we find across geographical and social varieties of English. This is in no

doubt due to the simplicity and symmetry of the system.” He continues on to argue that

the majority of the dialectal variation observed in Spanish tends to occur in the

pronunciation of consonants and not vowels. An acoustic investigation of Peninsular and

Peruvian vowels by Morrison and Escudero (2007) seems to confirm the stability in

vowel pronunciation across Spanish dialects, in that the only vowel that was found to

36

differ across the two dialects was the /o/. Thus, although some languages are argued to

exhibit considerable differences in vowel pronunciation based on geography, Spanish is

argued to not be one of them. Nonetheless, it is important to consider the potential for

variation in vowel production based on geographic region, as few comparative

investigations of Spanish vowel pronunciation have been conducted to date.

To summarize this section, the main acoustic correlates of vowels that will be

discussed in this dissertation are the first (F1) and second (F2) formant frequencies.

These frequencies are associated with vowel height and vowel backness respectively.

Vowels are also highly context sensitive, and the surrounding sounds are known to

influence its acoustic characteristics and location within the acoustic space. Finally,

factors such as vocal tract length, gender, and dialect can influence vowel production.

The same vowel produced by a male speaker and a female speaker will most likely be

characterized by very different formant values. In addition, speakers from different

regions may also pronounce vowels differently, although this is argued to be less

prevalent in the regional varieties of Spanish. The next two sections, 2.2.2 and 2.2.3,

focus specifically on the Spanish vowel system, and review a number of impressionistic

and acoustic studies of Spanish vowels.

2.2.2 Impressionistic studies of Spanish vowels

Prior to the invention of the sound spectrograph in the 1940’s (Koenig et al.,

1946), the majority of the studies pertaining to Spanish vowels had to be conducted on an

impressionistic basis; with the researcher listening and transcribing what he or she heard,

but having no way to quantify or systematically test the claims that were being made.

One of the earliest linguistic descriptions of the Spanish vowel systems was provided by

37

Navarro-Tomás (1918) in his Manual de pronunciación española. Perhaps one of the

most well-known and frequently cited authors in Spanish linguistics, Navarro-Tomás

described the manner in which the Spanish vowels should be pronounced and detailed

how different consonantal and syllabic contexts affected their pronunciation. First and

foremost, his manual is a comprehensive and descriptive guide to Spanish pronunciation

intended for the learner of Spanish. The goal was to present guidelines for the “correct”

pronunciation of the Spanish language; the educated variant spoken in Castile, Spain, at

the beginning of the 20th century.7 The author begins by describing the different

anatomical structures responsible for producing speech sounds and continues on to

describe the parameters in which each of the Spanish vowels and consonants are made.

In his description of Castilian Spanish vowel pronunciation, Navarro-Tomás

(1918, p 40) explained that the Spanish alphabet only distinguishes between 5 vowel

sounds: a, e, i, o, u, and sometimes y when it is pronounced as a vowel and thus has the

same pronunciation as i. Despite the simplicity in orthography, however, he argued that

each of these five vowels has at least three distinct allophones, each surfacing based on

the stress context and surrounding consonantal and vocalic context. The low vowel /a/

was argued to be either media (middle/central), palatal (palatal), velar (velar), or

relajada (relaxed). In the majority of instances, the /a/ is pronounced as a central vowel,

having the greatest degree of aperture of the lips of all the vowels, and the tongue in the

lowest position. In the context of other palatal sounds, such as /ñ/, it is articulated

slightly differently with the tongue more closely situated toward the palate. The /a/ is

said to be a velar variant and articulated farther back in the mouth when it forms part of a

7 Navarro-Tomás clarifies his usage of the term “correct” by arguing that it does not imply that the Castilian variety is superior to any other dialect of Spanish, but rather as a means to distinguish it from other varieties spoken throughout Spain and Latin America.

38

diphthong with /u/ or occurs before a /u/ or an /o/, in a syllable that is closed by an /l/, and

before the velar /x/. A relaxed /a/ is said to occur in final position before a pause, word-

internally or group-internally, and between relatively unaccented syllables (p. 55-57).

The other four vowels were argued to have three distinct variants each: cerrada

(closed), abierta (open), and relajada (relaxed). The relaxed variants of all four vowels

were described as occurring most frequently in rapid and conversational speech when

they are situated between the primary and secondarily stressed vowels and in word-final

position before a pause. The closed and open versions of the high vowels /i/ and /u/

pattern similarly. The closed allophones were described as having slightly more extreme

tongue positions, with the closed /i/ being situated closer to the teeth and alveolar ridge

than the open variant, and the closed /u/ exhibiting a tongue position closer to the velum

than the open articulation of the same vowel. The closed allophones of /i/ and /u/ occur

with both tonic and atonic vowels in open syllables. The open allophones of /i/ and /u/,

with more retracted tongue positions, were described as most common when they occur

in contact with /r/, in closed syllables, and before the velar /x/. The mid vowels /e/ and

/o/ exhibit a similar parallelism. The closed allophones of these vowels have more

advanced tongue positions when compared to their open counterparts. Like the high

vowels, the closed mid vowels are most common in open syllables. The open allophones

of both mid vowels occur when in contact with /r/, before the velar /x/, and for /o/, in

closed syllables. The open /e/ (i.e., [ε]) was argued to occur in closed syllables as well,

as long as the closing consonant was not /d, m, n, s, x, z/. Table 2-2 below provides a

more comprehensive representation of each of the vowel allophones and the contexts in

which Navarro-Tomás argued they occur.

39

Table 2-2

Vowel pronunciation described in Navarro-Tomás (1918)

Orthography Variant

Symbol used by

Navarro-Tomás

Contexts

closed i tonic open syllables; atonic open syllables in slow speech.

i open į

closed syllables, especially those spoken with intensity; in contact with /r/; before /x/.

relaxed ị

conversational and rapid speech; when situated between a primary and secondary accent.

closed e

open syllables with primary or secondary stress; in syllables closed by orthographic m, n, s, d, z; followed by orthographic x and another consonant

e

open ę (ε)

when followed or preceded by /r/; before /x/; in the diphthong /ei/; syllables closed by any consonant other than orthographic d, m, n, s, z; before orthographic x when pronounced as [ɣs].

relaxed ǝ

conversational speech, situated between a primary and secondary accent, word-final position before a pause.

central a

tonic syllables, both open and closed, when not in contact with a palatal or velar consonant.

a palatal a before palatal consonants; in the

diphthong /ai/.

velar ạ

when forming part of the diphthong /au/; before an accented /u/; before an /o/; in a syllable closed by /l/; prior to /x/.

relaxed ᵄ

word final position before a pause; word or group-internally when situated between relatively unaccented syllables.

closed o

open syllables with the primary or secondary accent; more closed at the end of a word when forming a diphthong with a /u/ in the following word.

o

open ǫ (ɔ)

when in contact with /r/; before /x/; in the diphthong /oi/; in syllables closed by any consonant; in accented position between an /a/ and a following /l/ or /r/.

40

relaxed o

conversational speech in word-word final position before a pause; word or group-internally when situated between accented syllables.

closed u rapid speech; tonic open syllables;

atonic open syllables; in slow speech.

u open ų when in contact with /r/; before /x/; in

syllables closed by any consonant.

relaxed u

conversational speech in weak position; between a primary and secondary accent .

Note. The phonetic symbol provided in the third column are those utilized by Navarro-Tomás in text. When applicable, the symbols from the International Phonetic Alphabet (IPA) are included in parentheses.

Despite the detailed descriptions in which each of these allophones surface and

the distinct phonetic symbols assigned to each, Navarro-Tomás (1918) discussed that the

differences between the open, closed, and relaxed versions of Spanish vowels are very

slight. He also emphasized that the difference between the closed and open allophones of

the mid vowels /e/ and /o/ tended to be more noticeable than the same differences in the

high vowels /i/ and /u/. The distinction between the palatal /a/ and the central /a/ was said

to be so slight that they were not assigned separate symbols as the difference was

supposedly nearly imperceptible. The detailed account of vowel timbre within the

system, however, does indicate that there is some variation in terms of quality within the

five vowel system. The author lamented that an acoustic analysis of Spanish vowel

quality had yet to be conducted at the time his text was written.

The discussion of vowel quality is followed by an explanation of differences in

vowel quantity (duration), and how different stress and consonantal contexts result in

vocalic duration of varying lengths. He argued that duration should be considered in

terms of relative duration, in that the length of any given sound varies as a function of

41

speaking rate and emotive and emphatic qualities of speech. Navarro-Tomás (1918)

divided Spanish vowels into three durational categories: largas (long), semilargas (semi-

long), and breves (short). The stressed vowels in final position that are not followed by

/n/ or /l/ were agued to have the longest duration, ranging from 15-20 hundredths of a

second, and were classified as long (largas). Final stressed vowels followed by /l/ and /n/

and stressed vowels in an open penultimate syllable (i.e., grave, or palabras llanas) were

classified as semilong (semilargas), ranging in duration from 11.5-15 hundredths of a

second. The shortest vowels (breves) are those vowels which occur in closed penultimate

and when the vowel occurs in the antepenultimate syllable. The duration of the accented

short vowels was described as falling between 7.5 and 10 hundredths of a second. A final

observation pertaining to the length of stressed vowels is that the following consonant

may influence its overall duration. Vowels in the context of /r,b,d,g,y/ tend to be slightly

longer than when they are in contact with other consonants.

Navarro-Tomás (1918) argued that the unstressed vowels were shorter in duration

than their stressed counterparts, and therefore none could be considered to be long.

According to his impressionistic description, only in emphatic or slow speech does the

duration of an unstressed vowel approximate that of a stressed vowel. The shortest

duration is observed in the posttonic vowel in words with antepenultimate stress (i.e., the

u in the word capítulo “chapter”). Vowels in this position were described as ranging

between 5.7 and 5.8 hundredths of a second. An unstressed vowel in word-initial

position was considered to be slightly longer, often classified as semi-long. The duration

ranged from 10.5 to 12 hundredths of a second. In slow and careful speech, however,

unaccented vowels in any position within the word were argued to lengthen, resulting in

42

their approximating the duration of a stressed vowel. Unlike the stressed vowels, there is

supposedly very little durational differences between atonic vowels that occur in closed

syllables versus those that occur in open syllables.

In addition to providing details about the articulation, timbre, and duration of the

Spanish pure vowels, Navarro-Tomás (1918) made four important generalizations about

the vowel system that will be discussed throughout this dissertation. First, he argued that

the timbre of the Spanish vowels was generally invariable throughout the sound,

exhibiting a great deal of uniformity (p. 43). In certain dialects, in instances of “strong”

pronunciation, and in slow speech he argues that a certain vowel sound may lengthen, but

the general assumption remained that Spanish vowel sounds have a somewhat stable

timbre throughout their articulation. A second important characteristic of Spanish vowels

was that unstressed vowels, especially word-final vowels and those in antepenultimate

position had a tendency to relax, resulting in a timbre that was less defined and precise (p.

44). In careful, slow, or emphatic speech, he argued that these same vowels did not

undergo relaxation and instead maintained their clarity.8 The third general claim about

the Spanish vowel system was that unstressed vowels are shorter than the stressed

vowels, but that the difference between the two is very slight. As will be explained

further below, the general assumption has remained that Spanish vowels do not show

extreme differences in duration based on their stress context, but more recent

investigations have put this claim into question (Marín Gálvez, 1995). A fourth and final

implied assumption about the Spanish vowel system is that both vowel quality and vowel

duration are affected by speech style. Throughout his text Navarro-Tomás argued that

8 As will be described in greater detail in section 2.2.3, the majority of the acoustic investigations of Spanish vowels have analyzed highly controlled speech samples (i.e. careful, slow, and emphatic speech).

43

vowels relaxed and become shorter in conversational speech, and were less relaxed and

longer in duration in careful and emphatic speech. As will be demonstrated in the

sections that follow, very few investigations have directly examined the connection

between speech style and vowel production.

In addition to Navarro-Tomás’s (1918) seminal work on Peninsular Spanish,

impressionistic studies of Spanish vowels extend to other regions in the Spanish-speaking

world. Of particular interest in this dissertation are studies pertaining to vowel

pronunciation in central Mexico, as all but one of the participants in this investigation are

of Mexican heritage. Henriquez Ureña (1921) was the first to comment that the atonic

vowels in Mexican were very short in duration, “reduced,” and often deleted in certain

contexts. This observation resulted in several additional impressionistic studies of

unstressed vowel reduction (hereafter UVR) or deletion that examined the contexts in

which this reduction occurred.

Boyd-Bowman (1952) and Matluck (1952) were the first two studies to address

the pronunciation of unstressed vowels in Mexican Spanish. Boyd-Bowman’s study,

which focused on the Altiplanicie de México and includes Mexico City and Guanajuato,

was based solely on his impressions overhearing the pronunciation of certain words

during his time in the region. He argued that the reduction and loss of atonic vowels was

most frequent when in contact with the sibilant /s/ and another voiceless consonant. This

reduction was even greater when the atonic vowel was in contact with a word-final /s/.

He argued that all vowels except for /a/ underwent reduction in the sample expressions he

heard.

44

Matluck (1952) described the same phenomenon in the Valle de México—a larger

region that encompasses Mexico City, a large portion of the state of Mexico, as well as

portions of the state of Hidalgo. His analysis was much more in-depth than Boyd-

Bowman’s two page study, but was still based on his impressions of the pronunciation of

the vowels and not on acoustic measurements. He first described some of the differences

between the pronunciation of vowels in Mexico and those of Spain, as described by

Navarro-Tomás (1918). He argued that the Mexican pronunciation of /a/ did not exhibit

as much velarization as the /a/ in standard Spanish. He also noted differences in the

articulation of the mid vowel /e/ according to syllable structure. Whereas Navarro-

Tomás argued that the mid vowel /e/ was produced as a closed variant [e] in a syllable

closed by orthographic m, n, s, z, d, x, in the Valle de México the /e/ in syllables closed

by any consonant was produced with the open variant [ε]. In his examination of UVR, he

observed that in the speech of the middle class (i.e., among the semicultas) and the lower

classes, the initial unstressed vowel in a word had a tendency to delete, resulting in the

lengthening of the following consonant (enero n:ero “January”). He also argued that

when a vowel occurred in absolute initial position, or occurred after another word ending

in a vowel, the unstressed vowel could disappear when in contact with /s/ (está bien

stábien “he/she/it’s good”). In contrast to an initial unstressed vowel, vowels in word-

internal position were argued to reduce but rarely delete. Final unstressed vowels were

argued to be greatly reduced and could often disappear. Matluck, like Boyd-Bowman,

argued that a vowel followed by a voiceless consonant, and especially /s/, had the greatest

likelihood of reducing or devoicing. Finally, he commented that a final /e/ in contact

45

with a palatal consonant could often be produced as /i/, as in the word noche nochi

(“night”).

By far the most comprehensive investigation of UVR in Mexican Spanish was

carried out by Lope Blanch (1972). Although no acoustic or statistical analyses were

conducted, he was able to analyze which consonantal contexts and vowel positions were

most subject to reduction by examining the percentage of cases that occurred in each

category. He collected data from 100 speakers residing in central Mexico that he argued

represented all social classes and educational levels. The speech was classified as

spontaneous, and was gathered based on conversations he had with each participant

dealing with topics of daily life, conversations about radio programs, personal interests,

and other spontaneous discussions. All speakers were between the ages of 15 and 60

years, including both men and women. Nearly all of the participants interviewed were

born in Mexico City, and those who were not born in the area had resided in the city for

at least 20 years. Fifty two of the interviews were recorded on magnetic tape so that their

pronunciation could be examined multiple times if necessary. The data from the

remaining 48 speakers was not recorded, but detailed notes pertaining to their

pronunciation were compiled during the time of the interview.

Although 100 speakers were interviewed, Lope Blanch’s (1972) main analysis

was based on the 52 recorded interviews. In total he analyzed 2,284 vowels “by ear” (p.

57), and determined that there were four degrees of vowel weakening ranging from

simple relaxation to complete loss. The criteria with which Lope Blanch determined

these degrees of weakening is not explained in detail, thus it is difficult to determine

exactly how he classified the vowel articulations he heard. Based on his impressionistic

46

analysis, several important characteristics of UVR in Mexican Spanish were described.

Lope Blanch argued that the position of the vowel within the word was not as influential

in weakening as argued by Matluck (1952) and Boyd-Bowman (1952), but rather that it

was determined by the nature of the surrounding consonantal context. Unstressed

vowels in contact with voiced consonants were not found to show reduction or deletion.

He found that among the consonantal contexts that most favored deletion or reduction,

nearly 90% of the cases of reduction occurred when a vowel was in contact with /s/ -- an

observation made by both Matluck and Boyd-Bowman in their investigations. The vast

majority of these cases were observed when /s/ followed the vowel as opposed to

preceding it. He then analyzed the frequency of occurrence of reduction in several

different consonant + /s/ contexts, and found that /s/ + /s/ accounted for 22.9% of cases of

reduction. The groups /p/ + /s/, /t/+ /s/, and /k/ + /s/ also favored reduction, at rates of

9.7%, 13.7%, and 9%, respectively. In the rare cases in which an /s/ preceded a reduced

vowel, the following consonants that most favored reduction were the nasals /m/ and /n/

with 12.8% and 14.3% of the cases, respectively. In his analysis of the open syllables,

vowels in contact with the voiceless stops /p t k/ were reduced more frequently than when

they were in contact with any other consonant.

In terms of the type of reduction (i.e., relaxation or full deletion), he reported that

only 17.5% of the reduced vowels were classified as complete deletions.9 A closer

examination of the tokens in which full deletion occurred, however, revealed that most of

the deletion was isolated to the words pues (“well”) and entonces (“then”). Once these

tokens were excluded, only 9.8% of all reduced vowels were argued to be fully deleted.

9 As described previously, Lope Blanch did not provide well-defined criteria for determining how the vowels were grouped into the four different categories of reduction.

47

In terms of the individual vowels that were most subject to reduction, he reported that

42% of the cases involved the vowel /e/, followed by /o/ with 24%, and /i/ with 5.6%.

The /a/ was found to be resistant to reduction, accounting for 7% of the cases. As /a/ is

the most frequent vowel in Spanish, however, its higher percentage of reduction

compared to /i/ was explained as a frequency effect and not that it was more likely to

reduce than /i/. No information pertaining to the rate of reduction of /u/ was reported.

Lope Blanch’s (1972) analysis is the most complete and most quantitative of all

three investigations focusing on UVR in Mexican Spanish. His analysis indicated that

the consonantal context in which a vowel occurred was the most important characteristic

influencing UVR. Unstressed vowels in contact with /s/, and especially surrounded by

/s/, showed the greatest rate of reduction. The vowel /e/ was also shown to reduce more

than any other vowel, at least in this variety of Spanish. At the end of his manuscript,

Lope Blanch indicated that several of his colleagues had reported similar pronunciation in

regions of Ecuador, Bolivia, and Argentina. He concluded that this phenomenon may be

more common in the Spanish language than previously believed, and that additional

investigations should analyze its presence in other geographic regions.

Despite his detailed account of the Mexican pronunciation, morphology, syntax,

and lexicon, Lipski (1994; 2008) offers little additional information about the

pronunciation of vowels in Mexico. In his 1994 text, he reiterates that central Mexico is

distinguished from other regions within the country and other regions in Latin America

by the tendency of unstressed /e/ to reduce when in contact with /s/. In his description of

Mexican American Spanish, he argues that the vowel /i/ is also subject to this process.

In addition, he indicated that the pronunciation of the Spanish vowel /e/ in Mexican

48

American varieties of Spanish was often produced lower and farther back in the vowel

space, more closely approximating the pronunciation of /ε/ in the English word let.

Taken together, these investigations of Mexican Spanish vowel pronunciation indicated

that UVR is one of the most distinguishing characteristics of this dialect. As the heritage

speakers who participated in this investigation are speakers of Mexican American

Spanish, the acoustic analyses conducted in this dissertation will help determine if they

exhibit UVR, and if so, how is it characterized (i.e., quality reduction and centralization

or devoicing).

The impressionistic studies of vowels discussed in this section were some of the

first to provide descriptions of the Spanish vowel system.10 Navarro-Tomás’s (1918)

detailed account provided that basic description to which other researchers would

compare their work – both impressionistic and acoustic – and contains a number of

claims that may have influenced researchers to avoid the analysis of vowels (i.e., the

implied notion of stability). Additional studies conducted in the mid to late 20th century

showed that Mexican dialects of Spanish are dissimilar from Peninsular varieties,

especially with respect to their reduction of unstressed vowels. Despite the differences

between dialects – distinctions which had been noted as early as 1952 – the assumption

has generally remained that Spanish vowels do not exhibit great degrees of variation

across dialect. Many introductory textbooks on Spanish pronunciation and phonetics cite

Navarro-Tomás as one of the authorities on Spanish vowels, and argue for a simple,

stable, symmetrical five-vowel system. As will be demonstrated in the following

sections, acoustic studies of Spanish vowels have offered additional insight into the

vowel system of several different varieties of the language. Acoustic methodologies have 10 See also Amastae (1982) for a morpho-phonological account of mid vowel raising in Spanish.

49

permitted a more thorough examination of the claims put forth by Navarro-Tomas

(1918), Boyd-Bowman (1952), Matluck (1952) and Lope Blanch (1972).

2.2.3 Acoustic studies of Spanish vowels

As the sound spectrograph and use of spectrograms to measure formant

frequencies became more readily available, impressionistic accounts of Spanish vowels

became less frequent, being replaced by instrumental and experimental acoustic studies

of vowels, and more predominantly, of consonants. The purpose of the following section

is to describe several notable acoustic descriptions of Spanish vowels, beginning with

earliest and progressing semi-chronologically and thematically to the present day. The

findings from the collection of investigations described below will demonstrate the

degree to which researchers differ in how they describe the Spanish vowel system. This

disagreement is one of the main factors guiding the research questions in this dissertation.

The summary of these investigations also demonstrates the importance of acoustic

analysis in the description of phonetic and phonological systems, and how careful,

quantitative measures of speech can reveal details and variation within the system that are

often not perceptible by impressions alone.

One of the earliest and most comprehensive acoustic analyses of Spanish vowels

was conducted by Quilis and Esgueva (1983).11 They analyzed the speech of 22

informants, 16 males and 6 females, from varying regions within Spain, Mexico, and

11 Prior to Quilis and Esgueva´s (1983) study, Skelton (1969) and Guirao and Borzone de Manrique (1975) conducted acoustic studies of Spanish vowels. Skelton´s study will be discussed later on in this chapter with the other investigations of syllable type. Guirao and Borzone de Manrique´s study of Argentinian Spanish showed variation of vowel articulations based on consonantal context, and compared Spanish vowels to the Cardinal English vowels. The methodology and description of the results, however, are difficult to follow, and no clear and describable trend emerged. Also, as this investigation also examines Argentine Spanish vowels, it does not have a direct connection to the present study. For these reasons, these two earlier acoustic investigations will not be discussed in great detail here.

50

Latin America. The participants were asked to read a target word that was embedded

within a carrier phrase, the target word containing the vowel of interest.12 Unlike other

earlier acoustic studies (e.g., Guirao & Borzone de Manrique, 1975) they included both

tonic and atonic occurrences of each vowel and investigated three different consonantal

contexts that included nasal and non-nasal sounds. The corpus of target words consisted

of three examples of each of the five Spanish vowels in tonic and atonic position

embedded within the contexts of p-p, b-b, and m-m. In total, each speaker produced 30

vowels. The first, second, and third formant frequencies, as well as the total duration of

each vowel, were calculated for each speaker using the Sona-Graph 6061B device.13

Analyses for each individual speaker were calculated separately and were plotted within

the traditional format of the vowel triangle as a means and clouds to demonstrate the

location of the vowels in the acoustic space.14 Examination of each individual speaker’s

production revealed several overall trends in terms of vowel quality (formant

frequencies) and vowel duration, which will be discussed below in further detail.

The results of the formant analyses showed little differences between the tonic

and atonic vowels in terms of their position within the acoustic space. For both the male

and female participants, there was very little difference between the tonic and atonic

productions such that no consistent trend could be identified. The male participants had a

slight tendency to produce tonic vowels with a more open articulation, whereas no

correlation between aperture and tonicity was reported for the female speakers. One

12 The carrier phrase task employed by Quilis and Esgueva (1983) represents the most careful and formal speech style, and is the methodology employed in the majority of the acoustic studies of Spanish vowels. 13 It is unclear which portion of the vowels (i.e. onset, midpoint, or offset) were analyzed in this study. The authors describe that once the words were recorded with previously known techniques, they proceeded to the acoustic analysis using the Sono-Graph 6061B, (p. 163, my translation). As the most common point of measurement is the midpoint, one can assume that the formant values were extracted from the center point. 14 See figure 2-1 for the average values produced across speakers.

51

unexpected finding was that the female informants produced vowels that occupied a

larger portion of the acoustic space when compared to the male speakers. The acoustic

space for the male speakers was found to be smaller and more centralized than that of the

females, and also exhibited a slight backing of the vowels /a,o,u/. Finally, the overall

values across all participants were calculated and plotted against the cardinal vowel

triangle in order to depict the “vowel triangle for the Spanish language” (p. 251, my

translation). They report that the Spanish vowels analyzed in this study differ greatly

from the prototypical, synthesized cardinal vowels, with the /i a u/ being farther back in

the space and /e/ and /o/ being pronounced with a more open articulation.15

With respect to vowel duration, the tonic vowels for all participants were found to

be longer than the atonic vowels. The consonantal context also influenced duration, with

all but one of the participants producing the shortest vowels in the context of /p/, slightly

longer vowels in the context of /m/, and the longest vowels in the context of /b/. Finally,

the authors reported that the vowels produced by the female informants were longer

overall than those produced by the male informants.

The findings in the seminal acoustic study by Quilis and Esgueva (1983) provided

some of the earliest acoustic descriptions of the Spanish vowel system, and advanced the

understanding of vowels by reporting numeric values and visual representations of

speech production. Despite its contribution, however, this investigation lacked statistical

analyses, and thus the ability to determine if the differences (or lack thereof) were

consistent or significant. In the following decades, additional acoustic research expanded

15 Unfortunately, although an analysis of F3 was carried out in this study, there is very little description or discussion of these results. The authors describe that the values of F3 varied greatly, and present a table of values for the male and female speakers demonstrating where the values were obtained. There is no further discussion of the F3 values other than presenting the means values averaged across the male and female speakers.

52

upon this investigation by including statistical analyses and more concrete descriptions of

formant values, vowel duration, and the dispersion of vowels within the acoustic space.

The subsequent studies verified the acoustic correlates necessary to distinguish Spanish

vowels, offered further insight into the effect of lexical stress and syllable type on vowel

production, and examined the effects of speech style on the pronunciation of Spanish

vowels.

The goal of Martínez-Celdrán’s (1995) acoustic study was not to describe the

Spanish vowel space per se, but rather to examine the five Spanish pure vowels and

determine what acoustic parameters are sufficient to distinguish them. A second goal of

this study was to identify the limits for each of the spaces (campo de dispersión) in order

to facilitate the possibility of a vowel recognition system that could accurately identify

each vowel based on its F1 and F2. The procedure for the series of experiments seems to

have taken place in two distinct steps, although the description of the tasks is somewhat

unclear. For the formant analysis, five male and female speakers produced each of the

five Spanish vowels embedded within the contexts pamp/bVna, tant/dVna, kank/gVna,

resulting in 30 tokens per person and 300 in all for the ten informants. The midpoint of

each tonic vowel was calculated by examining the wide-band spectrograms. For the

recognition software system, five male speakers produced 100 isolated CVCV and

VCCV words in which the consonant was always a stop or a fricative. The words were

then digitized and the values for bandwidth, amplitude, intonation, and F1-F4 were

calculated at 10ms time steps throughout the production16.

16 Very little detail pertaining to the methodology of this study was provided. It is unclear if the words used in this task are real or nonce words. It is also unclear if the same five male participants completed both production tasks.

53

Statistical analyses of the F1 and F2 measures for the male and female

participants revealed that the first formant frequency was sufficient to distinguish the

Spanish vowels in terms of height, grouping the vowels into three categories: high, mid,

and low. The second formant frequency was confirmed as an accurate predictor of vowel

backness, resulting in three distinct groups: front, central, and back. The author argues

that F2 is actually the more important of the two cues in distinguishing the five vowels

based on the higher discriminant functions that were obtained when only F2 was

included. The second formant predicts with 100% accuracy the recognition of the low

vowel /a/ and with approximately 80% accuracy the other four vowels, whereas F1 did

not predict any of the vowels with 100% accuracy and the back vowels /o/ and /u/ with

only 52 and 53% accuracy, respectively.

An additional finding of the formant analysis revealed that male and female

formant distributions differed significantly from one another. Overall, the analyses

indicated the F1 and F2 values for the female speakers were higher than those of the male

speakers, resulting in an acoustic distribution that was shifted downward and farther front

when plotted in the acoustic domain. These differences are similar to those reported by

Quilis and Esgueva (1983), who described a larger, more expanded vowel space for

females.17

The investigations conducted by Quilis and Esgueva (1983) and Martínez-Celdrán

(1995) are two of the earliest and most comprehensive acoustic analyses of Spanish

vowels.18 Each study provided vital information pertaining to the acoustic nature of the

17 Neither Martínez-Celdrán (1995) nor Quilis and Esgueva (1983) normalized the formant values produced by the male and female speakers. 18 Two additional acoustic studies of Mexican Spanish vowels were conducted by Servín and Rodríguez (2001) and Martín Butragueño (2011). Both of these investigations provided formant values for male and

54

Spanish vowel system, arguing for little differences in vowel production based on lexical

stress and dialect, notable differences in pronunciation based on speaker gender, and

confirmed that the first two formant frequencies were the most important when

characterizing Spanish vowel production. These investigations were general, focusing

on the overall distribution and the basic characteristics of the system. Several other

acoustic investigations have examined more specific aspects of Spanish vowel

pronunciation, in particular the effects of lexical stress, syllable type, and speech style on

the acoustic properties of vowels.

2.2.3.1 Lexical Stress

Despite the fact that Navarro-Tomás (1918) and several studies of Mexican

Spanish (Boyd-Bowman, 1952; Matluck, 1952; Lope Blanch, 1972) argued for

differences in vowel quality based on lexical stress, most textbook descriptions of

Spanish vowels argue that the language exhibits little differences between the quality of

tonic and atonic vowels (Hualde, 2005). A very early study conducted by Delattre (1969)

confirmed the lack of reduction of Spanish atonic vowels, especially when compared to

three other languages. He examined the differences between stressed and unstressed

vowels in Spanish, English, German, and French via the reading of isolated words with

alternating stress patterns. Five male native speakers of each language read each of the

target words twice. The first and second formant frequencies were then extracted from

the target vowels in each word, and were plotted in order to compare the degree of

centralization in each language. Delattre’s analyses revealed that English showed the female speakers of Mexican Spanish, but also had more specific foci. Servín and Rodriguez (2001) specifically examined the effect of syllable type on vowel production, and for this reason will be discussed in greater detail in section 2.2.3.2. Martín Butragueño (2011) examined the amount of overlap present in the vowel categories of male and female speakers, and also examined how speech style on vowel production. This study will be explained in greater detail in section 2.3.2.3.

55

greatest degree of unstressed vowel reduction, followed by German and French, and

Spanish with the least amount of reduction. A closer examination of the patterns of

reduction indicated that the high vowels were not significantly affected by lexical stress,

that the mid vowels showed some centering, and that the /a/ was raised and centered

when atonic. In addition, he argued that the front vowels showed a greater effect of

lexical stress than the back vowels, indicating that the unrounding of the back vowels did

not play a significant role in unstressed reduction. His analysis also revealed that Spanish

did not have the vowel schwa that occurs in English, which lead Delattre to argue that

this is “one more sign of syllabic equality” in Spanish (p. 323). He subsequently argued

that the lack of reduction observed in Spanish is attributed to the “perfect balance of the

Spanish vowel system: two front, two back, and one central” (p. 324). The findings

reported in this investigation, thus, may have resulted in the overarching assumption that

Spanish does not, in fact, exhibit quality differences based on lexical stress.

A more recent acoustic analysis conducted by Delforge (2008) on Andean

Spanish did report differences in vowel production based on lexical stress, but not in

terms of vowel quality and centralization as argued by Matluck (1952) and Lope Blanch

(1972) for Mexican Spanish. Delforge’s investigation of the Spanish of Cusco was aimed

at more accurately describing (UVR) in Peruvian Spanish via spectrographic and

statistical analyses. Unstressed Spanish vowels were obtained from 10-minute-long

spontaneous speech samples produced by 16 native-speaking residents of Cusco, Peru.

Fourteen males and 2 females ranging in age from 24-90 years took part in the

interviews. Nine of the speakers were also bilingual in Quechua, although Delforge

56

confirmed that all of the bilinguals had learned Spanish first and were more accurately L2

speakers of Quechua.19

The examination of the vowel productions indicated that of the 16,581 unstressed

vowels produced across all speakers, 1,648 (9.9%) were identified as reduced.

Additional spectrographic analyses of the reduced tokens indicated that as suggested in

Lope Blanch (1972), “reduction” was a gradient phenomenon. Based on the acoustic

characteristics of the unstressed vowels, Delforge determined four degrees of reduction

which she argued were not necessarily mutually exclusive: devoiced/shortened, weakly

voiced, completely devoiced, and apparently elided. As more than 50% of the reduced

tokens exhibited some degree of devoicing, a more accurate description of the reduction

phenomenon in this variety of Spanish was determined to be “unstressed vowel

devoicing” as opposed to quality reduction.20 The vowels most subject to devoicing were

found to be /e/, /i/, and /u/. An analysis of word position indicated that although overall

devoicing rates were higher in word-final position, /u/ in word-medial position actually

exhibited more devoicing than /u/ produced in final position. The rate of devoicing in

final syllables closed by /s/ was also found to be less than that reported by Lope Blanch,

who argued that this environment most favored UVR. Finally, an analysis of the

consonantal contexts most favoring unstressed vowel devoicing revealed differences

between Delforge’s spectrographic analysis and Lope Blanch’s impressionistic account.

19 It is important to note that Quechua has a 3-vowel system consisting of /i a u/. 20 Additional analyses of the F1 and F2 of stressed and unstressed vowels confirmed that centralization in terms of quality was not a characteristic of Cusco Spanish. These additional analyses indicated that there were no significant differences between the formant values of unstressed reduced, unstressed unreduced, and stressed vowels in this variety.

57

Vowels that were in contact with any voiceless consonant, and most importantly

affricates and assibilated /r/, showed greater influences than /s/.

Overall, Delforge’s (2008) analysis of UVR in Andean Spanish showed how

detailed, acoustic analyses offer insight into processes that had previously been described

impressionistically. Through the use of spectrography, she was able to determine that

what had previously been described as “reduction” and assumed to refer to quality

reduction or centralization was more accurately a type of gradient devoicing. This

investigation also proved that at least for Andean Spanish, the high vowels were most

subject to devoicing, voiceless consonantal contexts other than /s/ resulted in devoicing,

and that this process occurred more frequently in word-medial position than traditionally

argued. Her findings also corroborate those of Delattre (1969) who argued for the

absence of centralization of unstressed vowels in Spanish.

Unstressed vowels have also been shown to reduce in duration. An acoustic

examination of vowel duration conducted by Marín Gálvez (1995) analyzed the

production of vowels by two male speakers of Peninsular Spanish. Each participant read

two texts taken from a newspaper that had been modified slightly in order to incorporate

tonic and atonic vowels in open and closed syllables. In total, 491 vowels were analyzed

acoustically for each speaker. The results of their assessment of duration indicated that

overall, atonic vowels were shorter than tonic vowels. This difference was found to be

statistically significant based on the results of t-tests. Analyses of each of the vowels

separately indicated that the pattern was consistent across each vowel as well; the atonic

productions exhibited significantly shorter duration than tonic vowels in all cases except

/u/ in which the difference was not statistically significant. Thus, these findings confirm

58

Navarro-Tomás’s impressions that unstressed vowels were shorter and reduced in

comparison to their tonic counterparts.

Taking into consideration all of the acoustic and impressionistic studies of

unstressed vowel reduction in Spanish therefore results in an unclear picture of the effect

of lexical stress on vowel production. Whereas some studies suggest the presence of

differences in quality and duration based on stress context (Lope Blanch, 1972; Marín

Gálvez, 1995; Navarro-Tomás, 1918), others indicated a lack of centralization (Delattre,

1969). Furthermore, what had previously been described as “reduction” was determined

by Delforge (2008) to actually be devoicing and not centralization for a Peruvian dialect.

These inconsistencies in the literature in part motivate the need to further examine the

effect of lexical stress on vowel production in Spanish. The present study includes an

acoustic analysis of the effects of this linguistic variable on both quality and duration,

thus offering insight into how stress differences are manifested in heritage Spanish.

2.2.3.2 Syllable type

Recent acoustic examinations of the effects of syllable structure on monolingual

Spanish vowel production conflict considerably with the detailed descriptions provided

by Navarro-Tomás (1918). As mentioned previously, even Matluck (1952) argued that

the closed and open syllable type contexts described by Navarro-Tomás, especially for

the mid vowel /e/, did not apply to Mexican varieties of Spanish. These early

impressionistic comparisons between the categories described by Navarro-Tomás suggest

that his distinction may not be robust, and that it may, in fact, differ according to dialect

region.

59

The only acoustic investigation that seems to have confirmed the presence of mid

vowel allophones was conducted by Skelton (1969). He analyzed the vowel production

of 20 native speakers of Spanish from varying regions within Spain and the Americas.

The utterances contained the five Spanish vowel phonemes in the contexts in which

Navarro-Tomás (1918) indicated there would be variation. Overall, Skelton reported a

considerable amount of overlap between the vowel categories. Of particular interest was

the overlap that occurred between the high front and mid front categories corresponding

to the vowels /i/ and /e/, respectively. He reported that the only examples of the mid

vowel /e/ that overlapped with the high vowel space were those that were produced in

open syllables and termed “close” by Navarro-Tomás. Based on Skelton’s description of

the back vowels, it is difficult to discern if the same pattern was observed for the mid

back vowel /o/. Thus, at least for the front mid vowel /e/, Skelton argues that the overlap

of the /e/ in open syllables with the space of /i/, but no such overlap with /e/ in closed

syllables evidenced two distinct allophones. Unfortunately, Skelton’s description of the

methodology and representation of the results are not clear. In addition, he did not use

any statistical methods to support his findings. For this reason, his investigation is

difficult to interpret and the findings should be taken with caution.

In contrast, several other investigations have provided evidence that the mid

vowel allophony that was argued to arise as a consequence of syllable structure is not

borne out by the acoustic data. Although not separate studies dedicated to the analysis of

syllable structure, Jurado and Arenas (2005) and Martínez-Celdrán (1984) do mention the

effect of syllable type on the pronunciation of Argentinian and Peninsular Spanish

vowels, respectively. Jurado and Arenas (2005) reported a radiographic study they

60

conducted in 1981 with one male speaker of Argentinian Spanish. They do not provide

information pertaining to the study itself in terms of how the data were collected and

analyzed. At the end of a short paragraph, they conclude that the mid vowels exhibit

variation between [e] and [ɛ] and [o] and [ɔ], but the open variants of the vowels only

occur in syllables closed by /r/.

Martínez-Celdrán (1984) analyzed the first and second formant frequencies of

vowels produced by male speakers. He then compared these productions to the

categories described by Navarro-Tomás in order to determine if the acoustic data

supported the distinction between mid, close, and open vowel allophones. For the mid

vowels specifically, he found that the correspondence between the productions of his

speakers and the categories set forth by Navarro-Tomás only amounted to about 50%.

Based on these findings, he concluded that the effect of syllable type on vowel

production in Spanish was not consistent as no clear pattern emerged.

Two larger-scale studies of mid vowel variation in Spanish were conducted by

Servín and Rodríguez (2001) and Morrison (2004). Servín and Rodríguez examined the

differences in vowel articulation based on syllable structure in the speech of four

speakers of Mexican Spanish. Two male and two female natives of Mexico City

completed a carrier phrase task in which 30 words were embedded. Six utterances of

each of the five vowels were included, with three words representing the vowel in an

open syllable and three representing the vowel in a closed syllable.21 The first three

formant frequencies were measured at the stable point within the vowel. A comparison

21 The consonantal contexts represented in Servín and Rodríguez do not correspond exactly to those described by Navarro-Tomás. In particular, a syllable closed by /s/ was classified with all other closed syllables, and not with the open syllables as indicated by NT. This categorization is consistent with the observation that for Mexican Spanish, a syllable closed by the consonant /s/ results in the open allophone [ɛ] (Matluck, 1952).

61

of the formant values for vowels produced in open and closed syllables indicated that

there were very little acoustic differences between the two. The mid vowels showed

slight differences, with those produced in closed syllables exhibiting a slightly more open

articulation than those in open syllables, but these differences were minimal. As no

statistical analyses were conducted, it was not possible to determine if the tendencies

were statistically significant or not. Regardless, the authors conclude that there is not

sufficient evidence to speak of vowels produced in open and closed syllables as separate

allophones.

Morrison (2004) set out to specifically examine the contexts in which Navarro-

Tomás (1918) indicated that close and open allophones of the mid vowels would occur.

One male and one female speaker of a highly educated dialect of Madrid, Spain, took part

in this experiment. Pairs of words which contained the mid vowels /e/ and /o/ in the

contexts in which Navarro-Tomás indicated there would be allophonic variation were

constructed and embedded within a carrier phrase.22 The participants said each of the 24

sentences 10 times in random order. The F1 and F2 of each vowel were measured at the

midpoint, and the frequency values were then converted to a mel scale. The acoustic

measures were then submitted to two different statistical analyses: a cluster analysis and

a discriminant analysis. The first test was chosen in order to determine if the vowels

clustered into distinct groups, thus evidencing separate categories. The Discriminant

analysis was used to determine if the categories identified by the cluster analysis were

differentiated from one another. The results of these analyses did not support the

22 Words were constructed in pairs, such as seca “dry” and secta “sect” in which the /e/ in the former would be expected to exemplify the close [e] and the latter the open [ε]. The words were then embedded into the carrier phrase Pues____ se ha dicho (“Well, ____ she/he said to herself/himself”). The contexts included vowels that were or were not in contact with /r/, /l/, /x/, /p/, /t/, /k/. Examples of vowels in syllables closed by /s/ were not included in this analysis.

62

existence of separate allophones for the Spanish mid vowels. There was considerable

overlap between the supposed allophones of /o/ for the male speaker, indicating a lack of

two separate categories. The analyses of the female’s productions of /o/ and both

speakers productions of /e/ provided slightly more separation which seemed to suggest

there might be a distinction, but Morrison argued that the differences observed might be

better accounted for based on articulatory constraints. Overall, he concluded that the

acoustic data did not support the existence of the allophonic categories described by

Navarro-Tomás.

In terms of vowel duration, Marín Gálvez (1995) reported minimal differences in

vowel length based on syllable type, and more importantly, based on the nature of the

following consonant. His analysis of the vowel duration of two male speakers (see

section 2.2.3.1. for methodological details) indicated that vowels produced in open and

closed syllables did not differ significantly with respect to duration. When the manner of

articulation of the following consonant was examined, however, the results showed that

vowels followed by fricatives in closed syllables were longer than vowels followed by

fricatives in open syllables. The opposite pattern was reported for vowels that occurred

in contact with nasal and liquid consonants. Despite the pattern, however, none of these

differences were shown to be statistically significant. Thus, like the previous studies of

vowel quality, the variability in vowel duration based on syllable structure that was

described by Navarro-Tomás (1918) was not corroborated with acoustic measurements.

The studies reported above indicate that the traditional claim that Spanish vowels

exhibit allophonic variation according to their syllable structure is not borne out by

acoustic data. In other words, although there may be some difference in terms of

63

pronunciation for vowels produced in closed and open syllables, especially for the mid

vowels, robust differences suggesting separate categories were not observed in this data.

The previous claims described in this section of the chapter motivate the need to examine

the effect to syllable type on heritage Spanish vowel pronunciation. Although most of

the acoustic research suggests that these speakers will not exhibit significant quality and

quantity differences based on syllable type, HS are also bilingual in English. It is well

accepted that consonantal context affects formant values of vowels produced in English

(Stevens & House, 1963) and Spanish (Guirao & Borzone de Manrique, 1975). Also

generally accepted is that the nature of the following consonant and the syllable structure

affect the duration of English vowels (House & Fairbanks, 1953). Thus, the interaction

of the two linguistic systems may result in HS exhibiting differences in vowel quality and

quantity as a consequence of syllable structure. If, however, HS do approximate native

speaker norms (i.e., native Spanish norms) as is typically argued in the literature, the

close/open mid vowel distinction may be absent.

2.2.3.3 Speech style

As discussed previously, one of the explicit claims in the work of Navarro-Tomás

(1918) is that the quality and quantity of Spanish vowels varies as a function of speech

style. Harmegnies and Poch-Olivé (1992) conducted one of the few studies to directly

examine the effects of speech style on vowel production. The goal of their investigation

was to examine the effect of speech style—either spontaneous or laboratory style—on

centralization and inter and intra-cluster variability: the degree to which the distributions

of the formants within the acoustic space cluster together both within and across speech

styles. Spontaneous speech data were collected from a semi-directed interview

64

conducted with one male informant. The participant was described as having an “average

cultural background” and no “marked dialectal accent” (p. 431). The laboratory speech

corpus consisted of words that were selected from the interview and then put into a word

list which was read by the participant. A total of 477 pairs of words, the same words in

both spontaneous and laboratory style, were analyzed. The first (F1) and second (F2)

formant frequencies for the vowels were calculated at the vowel midpoint, along with the

Euclidean distance of each of the vowels from schwa. The standard deviations of the first

and second formant frequencies were also calculated in order to examine the degree of

variability and clustering of each of the vowels within the acoustic space.

The statistical analyses indicated that on average, the vowels /i,e,o,u/ showed

significantly more centralization in the spontaneous speech sample than in laboratory

speech. The vowel /a/ showed the least amount of centralization, and the difference

between the two speech styles for this task was not statistically significant. The front

vowels /i,e/ also exhibited greater degrees of centralization than the back vowels /o,u/.

The analysis of intra-cluster variability revealed that the vowel tokens extracted from the

spontaneous speech task showed greater variability (i.e., were more centralized) than

those that were produced in the controlled speech task. The authors argue that the

differences in clustering between the two speech styles, as well as the varying degrees of

centralization, could potentially be explained by H&H Theory (Lindblom, 1990).23 The

spontaneous speech style may exhibit hypoarticulation, in which the phonetic targets are

undershot, resulting in differences (i.e., centralization) in timbre that are not observed in

the laboratory speech task.

23 H&H theory (Hyper- and Hypoarticulation) is a theory of speech production and perception that will be discussed in greater detail in chapter 5.

65

In addition to the acoustic analyses, the authors also conducted a recognition task

to determine the degree of overlapping of vowel categories in each of the speech styles.

They constructed a recognition task and report that the degree of centralization of a vowel

in the spontaneous speech task is not directly correlated to its rate of recognition. The

high vowels /i,u/ showed high recognition rates and were less confusable in the

recognition task than the vowel /a/, but exhibited greater degrees of centralization. The

vowel /a/ showed the lowest degree of centralization, but was the most highly confused.

Unfortunately, the authors provide virtually no description of the stimulus materials,

procedure, and statistical tests that were carried out in this experiment. For this reason,

these results and their implications should be interpreted with caution.

In a related study, Poch Olivé et al. (2008) reexamined the effect of speech style

on the production of Mexican Spanish vowels. Utilizing the same procedure as they

employed in their 1992 study, 30 tonic, open, examples of all five of the Spanish vowel

phonemes were extracted from a semi-informal interview. These same words were then

repeated later in an isolated context, and the vowels produced in the two speech styles

were subsequently compared. The distance of each vowels from the center of the vowel

space was also calculated using the same methodology employed in their earlier study.

The results of this investigation revealed several important similarities and

differences between the speakers of the two varieties. Like the speaker of Peninsular

Spanish described in Harmegnies and Poch Olivé (1992), the Mexican participant also

exhibited a significant degree of centralization of vowels produced in spontaneous speech

when compared to laboratory speech. A discriminant analysis of the data showed that for

both speakers, greater clarity was maintained between the vowel categories in laboratory

66

speech as compared to spontaneous speech. Comparisons between the two speakers,

however, indicated that the tendencies for centralization and the vowels affected differed.

Whereas the speaker of Peninsular Spanish described in the 1992 study exhibited a

pattern of centralization that corresponded with vowel aperture (i.e., the high vowels

showed the greatest rates of centralization, followed by the mid vowels, and so on), the

Mexican speaker only showed significant degrees of centralization for the non-back

vowels.

An examination of the overall areas of dispersion in laboratory speech revealed

additional similarities and differences between the two speakers as well. Generally

speaking, the vowels were situated in similar areas, but the Mexican speaker produced

high vowels with a more open articulation (i.e., lower) and also showed greater degrees

of overlap between the two back vowel categories. The patterns of reduction (i.e.,

centralization) also differed. Whereas the Peninsular speaker exhibited a proportional

decrease in the size of the acoustic space in spontaneous speech, the Mexican speaker’s

reduction resulted in the four non-low vowels clustering in a very narrow frequency

range. In other words, the Peninsular speaker’s spontaneous speech data maintained the

overall shape of the vowel distribution observed in laboratory speech, whereas the

Mexican speaker showed a greater clustering of /i/, /e/, /o/, and /u/ into a small acoustic

area. The centralization observed in Mexican Spanish was thus not symmetrical or

proportionally equal. The main findings of this investigation thus indicated that vowel

production varies as a function of speech style, but that the type of reduction and

centralization can vary across dialects.

67

The previous studies conducted by Harmegnies et al. (1992) and Poch Olivé et al.

(2008), although important and innovative, only presented the results of two speakers.

An additional investigation carried out by Martín Butragueño (2011) revisited the

question of style-induced reduction by analyzing male and female speakers from Mexico

City. He also compared the vowel productions of his speakers to those reported in Servín

and Rodríguez (2001). Four male and four female speakers of Mexican Spanish took part

in this investigation. Five tonic and five atonic examples of each of the five vowels were

extracted from a semi-informal interview, yielding 400 vowels total across all speakers.

The first and second formant frequencies were measured at the center of each vowel. The

comparisons between the speakers in Servín and Rodríguez’s (2001) study and those

obtained by Martín Butragueño (2011) again revealed a robust effect of speech style on

the rate of centralization. The methodology used in the former was more representative

of laboratory speech, thus the distance of the vowels from the center of the space was

greater than those values obtained from Martín Butragueño’s spontaneous speech task.24

These findings support a robust effect of speech style not only across dialects of Spanish,

as reported in Poch Olivé et al. (2008), but also within dialects.

Other than these three investigations, the only other acoustic examination of

speech style effects in Spanish was conducted by Willis (2005). His analysis of bilingual

English-Spanish speakers in the Southwestern United States will be discussed in greater

24 In addition to the findings pertaining to speech style, other important differences emerged. An analysis of the vowel pronunciations of male and female speakers revealed differences in dispersion and overlap. The male speakers, for example, exhibited a considerable degree of overlap between the /i/ and /e/ vowel categories and even more so for the /u/ and /o/ categories. On the other hand, the female speakers exhibited greater degrees of separation of the high and mid vowels, although there was some overlap. The range of F1 and F2 values for the high front vowel /i/ was also found to be much greater for male speakers than for females, but the females showed more variation in their production of /o/.

68

detail in section 2.3.1., but the basic findings confirm previous reports that speech style

has a centralization effect on vowel pronunciation.

The overall findings of these aforementioned studies confirm Navarro-Tomás’s

(1918) observation that Spanish vowels have a tendency to reduce in more naturalistic

speech contexts. Thus, in order to provide an accurate and complete description of the

Spanish vowel space, naturalistic and controlled speech samples should be examined in

order to determine how spontaneity affects vowel organization, distribution, and overlap.

As will be explained later on in this chapter and chapter three, this dissertation also

examines the effects of speech style on the production of heritage Spanish vowels. In so

doing, it offers a more complete description of the manner, degree, and type of reduction

and variation that can occur within the system.

2.2.3.4 Dialectal variation?

A final aspect of the Spanish vowel system that needs to be addressed is the claim

that vowel production across Spanish dialects is fairly stable. Interestingly, Navarro-

Tomás (1918) never argued that the pronunciation of Spanish vowels across dialects was

invariant. In fact, little mention of the stability of Spanish vowels appears in the early

investigations of pronunciation. In textbooks such as Hualde (2005), however, it is

argued that the majority of the dialectal variation observed in Spanish occurs with the

consonantal system and not with vowels. As discussed in section 2.2.1., Hualde

specifically mentioned that the quality of vowels across Spanish dialects is very stable

and does not exhibit the type of variation observed in English varieties.

Acoustic and impressionistic examinations of vowel pronunciation present

conflicting results in terms of dialectal stability. The investigation of Morrison and

69

Escudero (2007), for example, seems to confirm that vowel quality across dialects of

Spanish is similar. They examined the pronunciation of vowels as produced by

Peninsular and Peruvian speakers of Spanish, and found very few significant differences

in the productions. Quilis and Esgueva’s (1983) investigation suggested that there could

be slight differences in pronunciation based on dialect, but as no statistical methods were

employed, it was determined that the differences were not robust.

In contrast, O’Rourke (2010), which will be discussed in greater detail later in this

chapter, reported that vowel pronunciation within Peru differed significantly. Her

examination of vowel production in Lima and Cusco revealed significant differences in

dispersion and organization for both groups of Spanish monolinguals and Spanish-

Quechua bilinguals. In a study of monolingual Spanish vowel production in the

Dominican Republic, Willis (2008) observed differences in the acoustic distribution and

degree of overlap of Dominican Spanish vowel categories when compared to those

reported by Quilis and Esgueva (1983). In addition, differences in overlap of vowel

categories produced by each speaker were revealed, providing further evidence for

variability in vowel productions across speakers as well as across dialects. Poch Olivé et

al. (2008) also noted differences between Peninsular and Mexican Spanish not only in

terms of overall organization, but also with respect to the manner in which the vowels in

each variety centralized in spontaneous speech. Finally, even the earliest impressionistic

studies of Mexican Spanish conducted by Matluck (1952) and Lope Blanch (1972)

indicated that vowels produced in central Mexico differed from the traditional

descriptions of Peninsular vowels provided by Navarro-Tomás (1918).

70

Thus, the assumption of dialectal invariance in terms of vowel quality may have

resulted in the comparative lack of investigations of Spanish vowels. The assumption of

stability, however, could have also been a misinterpretation. It is perhaps not that

Spanish vowels exhibit little variation in quality across dialects, but rather that,

comparatively speaking, they do not vary as much as the dialects of American English.

Although the present investigation will not examine any cross-dialectal distinctions, the

finding that vowel quality differs in some dialects indicates that the vowel system of

Spanish may vary more than previously believed.

2.2.3.5 Summary of acoustic investigations

To summarize the findings discussed above, a number of acoustic investigations

of Spanish vowels have supported and refuted some of the claims put forth in the

impressionistic literature. With respect to lexical stress, there is evidence of reduction in

terms of quality and duration in Spanish, but Delattre (1969) proved that Spanish showed

the least amount of unstressed vowel reduction when compared to three other languages.

Marín Gálvez (1995), however, did report significant differences in vowel duration, with

atonic vowels exhibiting significantly shorter duration than tonic vowels. Finally, an

investigation by Delforge (2008) indicated that at least for Andean Spanish, “reduction”

of unstressed vowels might be better described as devoicing.

The investigations of syllable type and speech style both refute and support

traditional descriptions of the Spanish vowel system. In terms of syllable type, the

majority of the acoustic investigations (e.g., Morrison, 2004; Servín & Rodríguez, 2001)

did not find acoustic evidence of separate mid vowel categories. Their findings conflict

with Navarro-Tomás’s (1918) detailed description of quality differences based on

71

syllable type and surrounding context. The investigations of speech style, however,

support Navarro-Tomás’s observation that vowels produced in rapid, conversational, and

naturalistic speech samples tend to reduce and centralize when compared to more

emphatic, slow speech. The series of studies by Harmegnies and Poch-Olivé (1992),

Poch- Olivé et al. (2008) and Martín Butragueño (2011) all provided robust evidence of

centralization in spontaneous speech when compared to laboratory speech that was

elicited in a controlled context. Their findings are critical, because with the exception of

their three investigations and that conducted by Willis (2005), most investigations of

Spanish vowels have focused on very controlled speech samples. Many studies involved

the examination of isolated vowels or words, nonce words, or vowels produced in carrier

phrase tasks. That Harmegnies and Poch-Olivé, Poch- Olivé et al. and Martín

Butragueño have found significant differences in vowel production in controlled and

spontaneous speech samples indicates the need to examine more than one type of speech

style. Examining vowels that were produced in speech of varying levels of spontaneity

offers a more complete and potentially more accurate picture of the vowel system of a

certain speaker or speakers.

Finally, although not discussed in great detail, the general assumption that there is

considerable stability in vowel quality across dialects is only partially supported by the

acoustic data. Whereas some investigations reported only minimal differences across

speakers of different dialects (Morrison & Escudero, 2007; Quilis & Esgueva, 1983),

others indicated that Spanish vowels do vary regionally (O´Rourke, 2010; Poch Olivé et

al., 2008). Thus, although the Spanish vowel system may not exhibit as much variation

cross-dialectally as does English, most researchers appear to recognize that differences do

72

exist. That there is some variability in vowel production across dialects indicates that the

pronunciation of the “simple, stable, five-vowel system” may not be as simple and stable

as previously believed.

2.3 Bilingual and contact vowel systems

The goal of this section of the chapter is to present the findings of several studies

of bilingual Spanish and English vowel systems. Section 2.3.1 reviews several studies

pertaining to the pronunciation of Spanish vowels by different groups of Spanish-English

and Spanish-Quechua bilinguals. Section 2.3.2 presents several important research

studies of Chicano, Mexican American, and Mexican Heritage English vowels in two

main regions within the United States.

2.3.1 Bilingual and contact Spanish vowel systems

In addition to studies of monolingual Spanish vowels, several investigations have

examined the acoustic properties of vowels produced by bilingual speakers. With the

exception of O’Rourke (2010) who examined Spanish-Quechua bilinguals in Peru, the

other notable studies focus on Spanish-English bilingual adults and children living in

different regions in the United States.

O’Rourke’s (2010) investigation of Peruvian Spanish vowels had three main

goals. The first was to determine if there was evidence of dialectal variation in Peru by

examining the vowel pronunciation of speakers from two different cities: Lima and

Cuzco. The second goal was to assess the potential influence of Quechua on Peruvian

Spanish vowel pronunciation. The third and final objective was to examine if bilingual

speakers of Spanish and Quechua organized their vowel space differently than

monolingual speakers in the two cities. In order to achieve these three goals, four groups

73

of participants took part in this study, with three speakers in each group: native speakers

of Spanish from Lima, native speakers from Cuzco, bilingual speakers from Cuzco who

began learning Spanish and Quechua before entering school (referred to as bilinguals),

and native Quechua speakers who began learning Spanish after starting school (referred

to as learners). All participants were male speakers who were born in either Lima or

Cuzco, and were between the ages of 20-39. The vowel production data were collected

from a read corpus of sentences that was originally designed to assess intonation.

Participants were given index cards containing questions and answers written on them

that were associated with characters in a map task. The 28 sentences were read twice by

each of the twelve speakers, yielding 672 total vowels. All tokens were monophthong

vowels with penultimate stress, with the exception of /u/ which had antepenultimate

stress. The F1 and F2 values of each vowel were measured at the midpoint. Then the

values were converted to a Bark scale, and finally normalized following a procedure

described in Guion (2003).25

The first set of results of the statistical analyses revealed that that all speakers

showed a five-vowel distinction in terms of both height and backness, regardless of

dialect region or language background. The majority of the participants did not

distinguish mid vowel /e/ and the high vowel /u/ with respect to height. Several speakers

also failed to produce a distinction in backness for /o/ and /u/, with the latter occupying a

position more fronted in the acoustic space than the /o/. Based on these preliminary

analyses, O’Rourke concluded that Peruvian Spanish back vowels /o/ and /u/ are

25 As described by O´Rourke (2010), the first three formants were extracted and converted to a Bark scale. She explains that “for each speaker, a k-factor was calculated as a ratio of the average F3 value for /a/ (in Bark) for one specific speaker (S1) divided by the average for each individual speaker (Si). The k-factors for Lima and Cuzco speakers range from 0.97-1.11…Then the F1 and F2 values for all vowels for each speaker were multiplied by the k-factor for that speaker” (p.22).

74

considerably lower in height than what has been described for other Spanish dialects.

She argued that the position of the /u/ and its occasional fronting may result that /a o u/

are primarily distinguished in terms of height and not backness.

Despite these similarities, several notable differences between the monolingual

speakers from the two different regions were found, as well as differences between the

monolingual, bilingual, and learner groups in Cuzco. Comparisons across the regions

indicated that the /i/ was articulated higher for the Cuzco speakers than for the Lima

speakers. The pronunciation of /i/, /e/, and /a/ by the speakers in Cuzco were farther front

than those produced by the speakers from Lima. As a result, the speakers in Lima

exhibited vowels that were less dispersed and occupied a smaller portion of the acoustic

space when compared to the speakers from Cuzco. The comparisons between the

monolingual, bilingual, and Spanish learners revealed that the productions of /i/ and /u/

were lower for the Cuzco bilinguals than for the native Spanish speakers and Spanish

learners in this region. An additional important finding was that the bilingual speakers in

this region shared characteristics with both the native and the L2 systems, forming a

hybrid of the two. They produced /a/ and /u/ similarly to the native Spanish speakers, but

produced /e/ in the same acoustic region as the L2 speakers.

Combined, the results presented in O’Rourke (2010) indicate that not only does

the Peruvian vowel system differ from the standard description of Spanish vowels, but

that there was also variation within the country. Cuzco and Lima speakers pronounced

the vowels differently and exhibited different rates of dispersion and organization when

compared statistically. The comparison between the monolingual, bilingual, and Spanish

learner groups revealed the largest differences in pronunciation between the learners and

75

the natives. In contrast, the bilingual speakers shared characteristics of both vowel

systems, resulting in a modified and unique configuration not found in the other varieties.

Thus, these results support the presence of dialectal variation in Spanish vowel

production, as well as significant differences in pronunciation based on linguistic

experience.

Shifting focus to bilingual varieties in the United States, Willis (2005) conducted

an acoustic analysis of the Spanish vowels in the Southwestern United States (hereafter

SWS). The goals of his study were to provide an acoustic description of this bilingual

variety using data from a narrative, and also to examine the effect of lexical stress on the

low vowel /a/. The participants in the study were four female bilingual speakers of SWS

who had learned Spanish at home and began learning English upon entering elementary

school.26 The narrative task involved a retelling of Mercer Mayer’s picture story Frog,

where are you?. They also participated in an interview in which they spoke of their

experiences with Spanish, situations that resulted in fear or happiness, a discussion about

their family, and an additional reading task. The first five occurrences of /i e a o u/ in

tonic, open syllables were extracted from the narrative data. Additional tokens of /a/ in

atonic open syllables were extracted to analyze the effect of lexical stress on vowel

production. The F1 and F2 values at the midpoint of each vowel were extracted.

The mean formant values of SWS speakers were calculated and then compared to

the values obtained for Mexican speakers by Quilis and Esgueva (1983). Although no

statistical analyses were conducted, the visual representation of the vowels produced by

26 The description of the participants in Willis’s (2005) study sounds very similar to the definition provided earlier in this chapter for heritage speakers. Willis does not refer to his participants as HS. He does explain that they spoke 40% English and 60% Spanish beginning in high school up until the time they were interviewed. Additional demographic and instructional information is not provided in his study.

76

both speaker groups revealed considerable differences between the systems. The SWS

front vowels /i/ and /e/, for example, were produced lower and slightly farther front than

those reported in Quilis and Esgueva (1983). In addition, the SWS /a/ was lower and

produced farther front than what is typically described for the Spanish vowel system.

Interestingly, the /o/ category for the SWS speakers overlapped with the /a/ category for

the monolingual Mexican Spanish speakers. Last of all, the SWS /u/ was articulated

farther front than the monolingual productions reported in Quilis and Esgueva (1983),

and lower than previously described. The analysis of the production of /a/ in tonic and

atonic position revealed that only one speaker exhibited a significant difference in

production based on lexical stress. Given this result, Willis concluded that lexical stress

did not play a significant role in the production of /a/ in SWS, even though the

participants in his study were bilingual. There was no evidence that the unstressed tokens

of /a/ had moved to a location approximating the English schwa. Thus, there appeared to

be no direct transfer of a lexical stress effect from English.

The results of the visual comparison indicated that the SWS vowel system

differed from descriptions presented in the impressionistic literature as well as from the

monolingual system presented in Quilis and Esgueva (1983). Willis argued that these

differences (i.e., fronting of all vowels and overlapping of categories) indicated that the

Spanish vowel system exhibits more variation in quality than previously described. In

addition, a comparison of the narrative data to controlled speech data for the same

speakers collected in a pilot investigation revealed that SWS vowels were more

centralized in the narrative speech style than in a carrier phrase. This finding is

consistent with previous research conducted by Harmegnies and Poch-Olivé (1992) that

77

showed greater degrees of centralization in spontaneous speech. Finally, the lack of a

stress effect observed for SWS /a/ suggested that the bilingual speakers of this variety

were not integrating characteristics of English pronunciation into their vowel

pronunciation, at least not for that particular vowel. All in all, this study presented

evidence for variation within the Spanish vowel system across varieties and across speech

styles.

Another important study of Spanish-English vowel pronunciation was conducted

by Menke and Face (2010). Their investigation examined the pronunciation of Spanish

vowels by L2 learners of Spanish at different levels, paying specific attention to the effect

of lexical stress on pronunciation. Sixty native English speakers participated in this study

who were divided into three equal groups based on their level of language instruction:

fourth semester Spanish learners, graduating Spanish majors, and Ph.D. students in

Spanish. Six native speakers of Spanish were also a part of this study, three of which

were from Spain, two from Mexico, and one from Colombia. Although this participant

group is labeled “native” by Menke and Face, they were actually bilingual in English and

some had been speaking or studying English for as many as 20 years. All participant

groups first completed a language background questionnaire, followed by an oral reading

of the Spanish story Aniversario by Luis Romero. The text was modified slightly so that

there were sufficient examples of atonic and tonic vowels. One hundred tokens (20

tokens of each of the five Spanish vowels) were analyzed for each speaker that were

balanced across stress context (tonic and atonic) and syllable type (closed and open). The

F1 and F2 values of each vowel were measured at the midpoint.27

27 The text, Aniversario, is not included in the appendix, therefore it is not possible to examine if the vowels analyzed in Menke and Face’s (2010) study were balanced by consonantal context. This study also only

78

Overall comparisons of the three learner groups revealed differences in F1 and F2

frequencies according to level of instruction. The lowest level of learners (i.e., fourth

semester) produced vowels with F1 values that were significantly higher (i.e., lower

vowels) than those produced by the speakers in the other two groups. The fourth

semester learners also produced the front vowels with lower F2 values when compared to

the other two learner groups, resulting in /i/ and /e/ that were farther back in the vowel

space. The reverse pattern was observed for /u/, which was produced by the fourth

semester learners with a higher F2 value resulting in a more fronted articulation. The

fourth semester learners, thus, exhibited a reduced and lowered vowel space when

compared to the other two participant groups. The analysis of the native speaker vowel

productions indicated that neither of the two advanced participant groups differed

statistically form the natives. The results of the first set of statistical analyses thus

indicated that the higher the learner level, the more expanded the space. As learners

moved from the fourth semester onto more advanced levels of instruction, their vowel

pronunciations began to more closely approximate the articulations of a native speaker.

The analyses of lexical stress indicated that learners at all levels exhibited some

differences in the articulation of stressed and unstressed vowels. The only vowels

affected in the fourth semester learners’ speech were /a e u/. The atonic /a/ was produced

higher and farther front. Atonic /e/ and /u/ exhibited lower and higher F2 values,

respectively, resulting in all three vowels moving towards the center of the vowel space.

The graduating Spanish majors produced atonic /a/ higher and farther front than tonic /a/,

their production of atonic /i/ was produced lower and farther back, and the tokens of

examined F1 and F2 formant values, and did not examine vowel duration. As will be described in greater detail in chapter 5, the only study to the best of my knowledge that examines L2 Spanish learner vowel duration is Stevens (2011).

79

atonic /e/ and /o/ in atonic syllables were more centralized than the tonic productions.

The Ph.D. students exhibited a similar pattern, again with /a/ higher and farther front

when atonic, and /o/ was articulated higher. The front vowels /i/ and /e/ and the back

vowel /u/ exhibited F2 values that were more centralized. For the latter two learner

groups, then, the effect of lexical stress was greater along the F2 dimension, yielding

atonic vowels that were more centralized than their tonic counterparts. Even though most

descriptions of the Spanish vowel system argue for the absence of differences in vowel

quality based on lexical stress, the native speakers also exhibited some differences. The

only vowels affected, however, were /e/ and /u/, both of which were more centralized in

atonic productions than tonic.

The results of Menke and Face’s (2010) investigation revealed two important

characteristics of L2 Spanish learner and native speaker vowel production. The first is

that as a learner’s level of proficiency increased, their vowel space became more

expanded and more like that of a native speaker. In addition, the examination of lexical

stress demonstrated that learners at all levels of proficiency, as well as the native speakers

of Spanish in this investigation, showed some degree of centralization of unstressed

vowels in Spanish. The effect of lexical stress was strongest along the F2 dimension,

indicating that stress had a greater impact on the frontness or backness of the vowels and

less effect on their height. The differences in vowel quality based on lexical stress

became less extreme as the level of proficiency increased. As learners gained more

experience with the language, the influence of the English stress distinction was lessened

in the production of their Spanish vowels. Menke and Face’s study thus indicates that

lexical stress is a factor affecting the production of learner Spanish vowels, especially at

80

lower levels of proficiency. It also indicates, however, that previous accounts of the lack

of stress effects in Spanish may not hold true for all speakers and speaker populations.

A final and very recent acoustic investigation of bilingual vowel production in the

United States was conducted by Alvord and Rogers (2011). The goals of their study were

to determine if Miami Cuban Spanish exhibited unstressed vowel centralization, if the

English vowels /æ/ was integrated into the system of these speakers, and if vowel

pronunciation varied according to speech type. Six male and five female speakers of

Miami Cuban Spanish participated in this study, and were further divided into first,

second, or third generation speakers based on their age of arrival to the United States.28

The vowel production data were gathered from three speech tasks: a read word list, a

read story and comprehension questions, and a sociolinguistic interview. The first and

second formant frequencies of English and Spanish vowels extracted from the tasks were

measured at the midpoint. The formant measures were then normalized in order to

compare across genders. The differences between tonic and atonic vowel productions

were compared statistically via paired-samples t-tests.

Alvord and Rogers reported that Miami Cuban bilinguals exhibited differences in

vowel quality based on lexical stress, regardless of their generation.29 They observed that

in atonic syllables, /a/ was raised, /i/ and /e/ were backed, and /o/ was fronted. A

comparison of their English and Spanish vowel pronunciation revealed that this group of

bilinguals did not integrate characteristics of English into their Spanish. In fact, they

28 The first generation speakers (two males and one female) were born in Cuba and immigrated to the United States after age eleven. The 2nd generation group consisted of two male and two female speakers who immigrated to the United States before the age of six years or were born to a first generation parent. The 3rd generation group consisted of two male and two female speakers who were born to at least one 2nd generation parent. 29 Alvord and Rogers (2011) did not analyze vowel duration in their study.

81

found evidence of Spanish influence on their English vowel productions, especially with

the pronunciation of the vowels /æ/ and /ɑ/. Some speakers in the first generation did not

exhibit a lax/tense distinction with the front vowel /i/. In terms of task type, the overall

acoustic space of the vowels became more reduced as the speech style became less

formal. The vowels produced in the wordlist task were the most peripheral, the story task

resulted in vowels that were somewhat centralized, and the vowel tokens extracted from

the sociolinguistic interview were the most centralized. These findings are consistent

with Menke and Face (2010) who reported unstressed vowel centralization in Spanish for

learners and native Spanish-speaking bilinguals. In addition, the centralization effect

observed for spontaneous speech is consistent with several investigations of speech style

on vowel pronunciation (Hamegnies & Poch-Olivé, 1992; Poch-Olivé et al., 2008; Willis,

2005). Alvord and Rogers thus conclude that their investigation supports the notion of

variation and unstressed vowel centralization in Spanish based on stress context and

speech style.

The investigations of bilingual Spanish vowel pronunciation presented above

indicate a number of important characteristics about bilingual vowel systems. The first

and most important observation is that bilingual vowel systems are not identical to

monolingual varieties in the same dialect region. O’Rourke (2010) reported differences

between monolingual and bilingual speakers of Spanish and Quechua in Peru, and Willis

(2005) and Menke and Face (2010) observed that Spanish-English bilinguals differed

from the monolingual Spanish norms as well. These findings suggest a modified system

of pronunciation for bilinguals that often exhibits characteristics from both of the

82

bilinguals’ languages.30 It is important to note, however, that a bilingual may not

integrate characteristics of the majority language (i.e., English) into their native system.

Alvord and Rogers (2011), for example, reported that not only did Miami Cuban

bilinguals not integrate characteristics of their English pronunciation into their Spanish,

but that the reverse was observed with the low vowels /ɑ/ and /æ/.

A second important finding is that factors which are argued to not affect or not be

present in monolingual varieties of Spanish are often observed in a bilingual speaker’s

pronunciation. Lexical stress, for example, was shown to affect the Spanish vowel

pronunciations of L2 learners and Spanish bilinguals (Menke & Face, 2010). Atonic

vowels were found to centralize, especially along the F2 dimension, for these speakers,

whereas most monolingual varieties of Spanish do not exhibit such variation based on

lexical stress. Alvord and Rogers (2011) also reported stress effects in Miami Cuban

Spanish, with atonic vowels moving more towards the center of the space when

compared to the tonic productions.

The overall usage of the acoustic space and dispersion of vowels within the space

was also shown to vary as a function of linguistic experience. O’Rourke’s (2010) study

reported that monolingual and bilingual speakers of Spanish and Quechua showed

different rates of dispersion based on the age at which they began learning their second

language. Menke and Face (2010) also showed differences between L2 learners of

different levels, with the most advanced learners producing vowels that occupied the

largest acoustic area. Finally, Alvord and Rogers noted generational differences in vowel

30 The bilingual system s are “modified” in the sense that they differ from the acoustic description provided by Quilis and Esgueva (1983).

83

production, indicating that experience and length of exposure to the majority language

within a region can induce changes in a speaker’s L1 system.

A final important finding from these studies is that speech style was found to

affect bilingual vowel pronunciation in much the same way as it affects monolingual

vowel production. Alvord and Rogers (2011) found that as the spontaneity of speech

increased, vowels had a greater likelihood of centralizing. Willis (2005) reported similar

results for bilingual Spanish in the Southwestern United States. Thus, speech style has

robust and consistent effects on vowel pronunciation in bilingual varieties of Spanish as

well as monolingual varieties.

2.3.2 English vowel systems of heritage Spanish speakers

The investigations described above indicate that bilingual speakers of English and

Spanish, and also of Spanish and Quechua, pronounce Spanish vowels differently than

Spanish monolinguals. Some varieties exhibit different rates of dispersion of vowels,

whereas other bilingual Spanish varieties are characterized by notable differences in

production as a consequence of lexical stress. These findings support the need for an

examination of heritage Spanish vowels, and suggest that these speakers, too, will exhibit

a modified vowel system when compared to previous descriptions. Although no studies

of HS Spanish vowels in Chicago have been conducted to date, several important

investigations have analyzed the pronunciation of Mexican Heritage English (also

referred to as Chicano English, Mexican American English, or non-ethnic English) in

Southern California and the Midwestern United States. These studies offer insight into

how English-dominant bilingual speakers integrate characteristics of the majority dialect

into their speech, which characteristics of the Spanish system may remain intact in their

84

system, and how the minority communities’ relationship with the majority speakers

impacts their English pronunciation.

Godiñez and Maddieson (1985) analyzed the vowel duration and quality of

Chicano English speakers in Eastern Los Angeles, California. The participants in this

study consisted of 15 speakers of Chicano English who had little knowledge of Spanish,

15 Spanish/English bilinguals from the same area, and 15 speakers of General California

English who attended a neighboring high school in Los Angeles. All participants were

male and were in the 11th grade, ranging from 16 to 17 years of age. In order to analyze

vowel duration and quality, the participants were asked to read aloud a list of /hVd/

words embedded within a carrier phrase, in which the vowel represented one of the

phonemes present in American English. Two repetitions of each phrase were produced

by each participant. The total duration of the vowel and the F1 and F2 frequencies

measured at the steady-state portion were analyzed. The difference between F2-F1 was

also calculated, as this measure has been argued to provide a more accurate perceptual

dimension for vowel backness.

The analysis of duration showed no significant overall effect of group and no

significant interactions, although the Chicano English speakers showed a smaller range of

duration between the long and short vowel phonemes in English. This difference in

durational range, however, was not found to be statistically significant. In terms of vowel

quality, however, several important differences were reported. Three separate

comparisons were conducted: Chicano bilinguals vs. Chicano monolinguals, Chicano

monolinguals vs. General California English speakers, and Chicano bilinguals vs.

General California English speakers. For the first comparison between the Chicano

85

groups, the bilinguals tended to produce vowels that were lower than the monolinguals.

In addition, the front vowels produced by the Chicano monolingual group were slightly

farther back than those produced by the Chicano bilingual group. When comparing the

Chicano monolinguals to the General California English speakers, the Chicanos had a

higher F1 for all vowels other than /i/ and /u/ than the General group. For the Chicano

monolinguals, /u/ was produced farther back in the acoustic space than in General

California English, but the non-back vowels were produced farther front. Finally, the

comparison between the Chicano bilinguals and the General California English speakers

showed that /I ε æ/ were produced higher by the bilingual speakers, but /i/ and /a/ were

produced lower. The back vowels /ʊ/ and /u/ were produced farther back for the

bilingual speakers, but the other vowels were produced farther front when compared to

the General California pronunciations. Overall, Godiñez and Maddieson (1985) conclude

that the Chicano monolinguals produced vowels higher than the bilinguals, and that their

vowels were more similar to those produced by the bilinguals than they were to the

General California English pronunciation. Both of the Chicano groups produced /ʊ/ and

/u/ farther back in the vowel space relative to the English variety, which the authors argue

may be due to the influence of Spanish. In the end they conclude that Chicano English

“represents an “autonomous social dialect with distinct characteristics passed on by the

usual processes of linguistic transmission (p. 57).”

A subsequent study also focusing on California English was conducted by

Fought (1999). The goals of her study were to first determine if members of the minority

Latino community were integrating a majority sound change -- /u/-fronting – into their

speech (i.e., their Latino variety of English), and second to identify the sociolinguistic

86

factors connected with the presence or absence of this pronunciation. The fronting of the

vowel /u/ has been attested in general California English, but at the time of Fought’s

investigation, no study to date had analyzed the extent to which speakers of Latino

English were integrating this characteristic into their speech. Fought conducted

sociolinguistic interviews in English with 32 Latino young adults residing in western Los

Angeles. Many of the participants had also attended an alternative high school called

Westside Park, and had some affiliation, connection, or contact with Latino gangs in the

area. The pronunciation of the /u/ was examined by extracting as many tokens as

possible of this vowel from the interview data and measuring the first and second formant

frequencies.31

An examination of the data revealed that the Latino English speakers did exhibit

/u/-fronting in their speech, but that there was a considerable amount of variation in terms

of the degree of fronting. Some speakers fronted the /u/ such that it almost overlapped

with the space for the front vowel /i/, whereas others produced no significant fronting at

all. A second component of her analysis was to examine which social factors influence

the presence or absence of /u/-fronting in this minority community. She included

traditional social variables such as gender, age, and social class, but also included a

measure of gang-affiliation. The results of this analysis revealed that no single social

factor alone produced a clear pattern of /u/-fronting; that is, there was no clear,

observable tendency for males or females, middle or lower class speakers, or gang

members/gang affiliates and non-gang affiliates to produce a fronted /u/. Interactions

between these social factors, however, did reveal trends. She found a pattern that

31 Fought (1999) states that she extracted as many tokens as possible of the peripheral vowels /i/, /u/, /ɑ/ and /æ/ from the interview data. She extracted the tokens “well into” the interview “in order to access the most vernacular speech style.” She does not indicate how many total vowel tokens she analyzed.

87

differed for men and women based on the conjunct of social class and gang-affiliation.

Non-gang affiliated women overall favored /u/-fronting, but when social class was

factored in to the gang-affiliated group, the working and lower class speakers disfavored

/u/-fronting whereas the middle class gang-affiliated women favored it. The pattern for

men was a mirror image of that observed for women. The gang-affiliated men overall

disfavored /u/-fronting, whereas the non-gang-affiliated members differed according the

social class. The working and lower class non-gang affiliated men disfavored /u/-

fronting, but the middle class favored it.

The results of Fought’s (1999) investigation emphasized the importance of

including non-traditional sociolinguistic variables into these types of investigations.

Including measures of group affiliation and ties to certain networks (i.e., gang-affiliation)

revealed a trend that was otherwise difficult to observe. Her study also showed that

speakers of minority communities do integrate characteristics of majority sound changes

into their speech. The presence or absence of /u/-fronting was perhaps one way in which

individual speakers could signal their identity and membership in a specific community.

The results of Fought conflict slightly with the results of Godiñez and Maddieson (1985),

who showed that Chicano monolingual male speakers did not exhibit noticeable degrees

of /u/-fronting; their productions of /u/ were backed relative to the General California

English speakers, approximating the values of those produced by the Chicano bilinguals.

The lack of /u/-fronting in Godiñez and Maddieson’s study, however, may be attributable

to the social characteristics of the participants. If all participants were non-gang members

of the working or lower class, then according to Fought’s description this group of

speakers would not front the /u/. Regardless, the results of these two studies demonstrate

88

that /u/-fronting, a majority characteristic, does exist in a non-white community in Los

Angeles, and that the degree to which this phenomena occurs may be due in part to a

speaker’s speech community and other social characteristics.

In addition to studies of Chicano and Latino pronunciation in California, three

recent investigations have also examined the vowel pronunciation of Mexican Heritage

English speakers in portions of the Midwestern United States. Roeder (2010), for

example, analyzed the pronunciation of the English vowels by Mexican American

speakers living in Lansing Michigan—an area of the United States known to exhibit the

Northern Cities Vowel Shift (NCSV). Lansing presents as an interesting case of

linguistic diversity, however, in that not only are Mexican Americans exposed to the

standard Anglo variety, but also have contact with Tejano English from the migrant

population that passes through the area. In addition, many of the Mexican Americans in

Lansing lived in Texas and were exposed to Tejano English prior to moving to Michigan.

For this reason, Roeder wanted to determine if the Mexican American speakers living in

Lansing Michigan exhibited characteristics of Tejano English, or if they had adopted the

NCSV observed in the speech of Anglo-Americans.

The participants in this study included six female speakers of Mexican American

English between the ages of 14-22, eight older (26-45) male speakers of Mexican

American English, and 12 young-adult Anglo speakers of English from Lansing. Each

speaker read a list of 107 English words, 77 of which contained target vowels that were

all tonic. The F1 and F2 were measured at the midpoint of each vowel, and the Lobanov

method of normalization was performed in order to compare across gender. The tokens

of /æ/ were analyzed separately for those in prenasal versus nasal position based on

89

previous research indicating that the pronunciation of this vowel was affected by nasality.

The pronunciation of Tejano English vowels were based off of data presented in Thomas

(2001), and no Tejano speakers actually participated in this experiment.32

The analyses of the F1 and F2 values indicated that the Mexican American

speakers’ vowel productions were very similar to those produced by the Anglo speakers

in that they exhibited many characteristics of the Northern Cities Vowel Shift (NCSV).

In particular, they had established separate categories for the lax vowels that were shifted

in the direction expected for the NCSV, and exhibited a closeness of the vowels /æ/ and

/ε/. The proximity of these two vowels in the Mexican American speech indicated that

their variety was more similar to the Anglo English dialect than to Tejano English. It was

concluded that these characteristics indicated a high degree of accommodation to the

majority (i.e., Anglo) norm of vowel pronunciation by these speakers.

Despite the overall similarity between the systems, there were considerable

differences observed between genders and for the production of /æ/ in prenasal and

nonprenasal positions. The females exhibited more advanced stages of the NCVS when

compared to men, as evidenced by their greater degree of lowering of /I/ and backing of

/ε/. In addition, the pronunciation of the /æ/ in prenasal and non-prenasal positions

differed for men and women, and also between the Mexican American speakers and the

Anglo speakers. While the Mexican American women exhibited lower F1 values for the

32 Thomas (2001) provides further information pertaining to the characteristics of the variety of Mexican American English spoken in Laredo, Texas. Some of his findings are consistent with the studies conducted on California and Midwestern varieties of Mexican American English. He argued that the variety of English spoken by the speakers he investigated showed the influence of Spanish in their English, such that /o/, /u/, and /ʊ/ remained as back vowels—which conflicts with Fought’s (1999) study. He also reported that many of the Mexican American speakers did not distinguish between the front and back tense and lax vowels /e/, /ε/, /u/, /ʊ/. He also argued that these speakers had a tendency to merge /ɔ/ and /ɑ/ such that the resulting vowel is farther front than the Anglo Americans who also exhibit this merger. Finally, he indicated that these speakers did not exhibit a lowering or retraction of /æ/ as had been observed in some of the younger Anglo speakers in the same region.

90

prenasal /æ/ that were in line with the Anglo speakers’ productions, they failed to show

the distinction in F2. That is, although their productions of /æ/ before nasals were

significantly higher than the same vowel in other consonantal contexts thus

approximating that of Anglo females, they were not as far front as would be expected for

advanced stages of the NCSV. The Mexican American men showed almost no

accommodation to this characteristic of the NCSV. This research confirmed previous

arguments that women are more likely to accommodate to the standard variety, whereas

men and older speakers have been found to resist conforming to the regional norm. In

addition, the Mexican Americans showed differences in their production of /æ/,

indicating that their variety of English was still somewhat distinct from the majority

variety spoken in Lansing.

Two additional studies conducted on Mexican Heritage English pronunciation

were conducted by Konopka and Pierrehumbert (2008, in press). The goal of their first

(2008) study was to compare and contrast the English vowel pronunciation of Mexican

Heritage English speakers, L2 English speakers, and monolingual English speakers of the

Midland dialect. Mexican Heritage English speakers were defined as speakers whose

parents had emigrated from Mexico, but who were born in the United States and formally

educated in the Chicago school system.33 The L2 English speakers were English learners

of Mexican descent. The monolingual English (“non-ethnic”) control data was taken

from a study conducted by Hillenbrand et al. (1995), and contained the productions of the

English vowels of 48 adult females. The Mexican Heritage English participants (10

33 To clarify, the Mexican Heritage English (MHE) speakers in Konopka and Pierrehumbert’s investigations were native English speakers of Mexican heritage who were formally educated in English. Thus, the term “heritage” therefore refers to the speakers’ background, and not the manner in which they acquired English.

91

female) and L2 English participants (11 female) read out loud English words from a

wordlist that contained the 11 English vowels embedded in a CVC context. The list

consisted of 179 individual words, but once vowels in prenasal and preliquid contexts

were eliminated, the total number of words utilized was 92. The overall vowel duration

was measured for each vowel, as well as the F1 and F2 frequencies at the midpoint.

The analysis of the L2 English speakers revealed that they exhibited significant

overlap among the high and back vowels, not producing distinctions between /I/ and /i/

and /ʊ/ and /u/. The production of /æ/ for this group was found to be closer to the

Spanish /a/, evidencing the influence of the five-vowel system of Spanish. The Mexican

Heritage English speakers, however, showed the separation of categories into distinct

tense and lax vowels. Their /i/ and /u/ were raised and fronted relative to the non-ethnic

English variety (i.e., Anglo English). Their productions of the lax vowels /I/ and /ʊ/ were

lower than those produced by the Anglo speakers. The raising and fronting of the high

tense vowels and lowering of the high lax vowels resulted in a greater separation of the

tense/lax distinction. The greatest difference between the Mexican varieties of Spanish –

both the L2 English speakers and the Mexican Heritage English speakers – was the

pronunciation of /æ/. In the non-ethnic variety, the /æ/ was fronted, indicating the

presence of the NCVS. The Mexican English speakers, however, did not exhibit

fronting. Konopka and Pierrehumbert (2008) conclude that Mexican Heritage English

speakers in Chicago do not participate in the NCVS but rather have established their own

variety characterized by the raising of /i/ and /u/ and lack of /æ/-fronting. They argue that

“insular, close knit communities are more likely to maintain dialects distinct from the

larger regional population in which they are situated” (p.100).

92

In a related study, Konopka and Pierrehumbert (in press) re-examined Mexican

Heritage English vowel pronunciation, focusing not only on the static qualities of vowels

(F1 and F2 at the midpoint of the vowel) but also the dynamic properties of the vowel

system (vowel inherent spectral change, duration ratios between long and short vowels).

The goal of this study was to compare and contrast Mexican Heritage English, Anglo

English, and L2 English, as well as determine which properties, either static or dynamic,

contributed to the perception of a foreign accent. They analyzed the speech of four

different participant groups: 14 female speakers of Mexican Heritage English, 12 female

second-language learners of English, 12 female Anglo speakers, and 7 native Spanish

speaking females from Mexico.34 All speakers resided in the Albany Park Neighborhood

in Chicago. The English speaking participants (i.e., all but the Mexican natives) read out

loud English words from a wordlist that contained the 11 English vowels embedded in a

CVC context—the same instrument used in Konopka and Pierrehumbert (2008). The

Spanish-speaking participants (i.e., all but the Anglo speakers) read a Spanish wordlist

composed of the five Spanish vowels embedded with CVCV contexts in which the

surrounding consonants were always /p b t d/. Each word was repeated five times. The

formant frequencies of the English and Spanish vowels were then measured at the 20%,

50%, and 80% points throughout the vowel, and duration measures were calculated.

In their comparison of the varieties, they reported that the Mexican Heritage

English speakers exhibited a well-differentiated vowel system that was very similar to

34 The Mexican Heritage English speakers who participated in Konopka and Pierrehumbert (in press) have a similar profile to those who completed their 2008 study. As described earlier, Mexican Heritage English speakers were defined as speakers whose parents had emigrated from Mexico, but who were born in the United States and formally educated in the Chicago school system. The L2 English speakers were English learners of Mexican descent who were recruited from the advanced-level ESL courses. The Anglo English speakers were native speakers of the Inland North dialect. The monolingual Spanish speakers were recruited from beginning level ESL classes and most were from Michoacán, Mexico.

93

that of the Anglo English speakers. In contrast, the L2 English speakers did not exhibit a

well-differentiated English vowel system, collapsing the tense/lax vowel categories as

described for Konopka and Pierrehumbert (2008). In addition, the L2 English lacked a

long/short distinction between vowels in terms of duration. This durational characteristic

of the L2 vowels was more similar to that of the monolingual Spanish system which

showed uniformity in duration. The L2 English speakers also produced /æ/ in a region

that was more similar to Spanish /a/.

Overall, the Mexican Heritage English speakers more closely approximated the

pronunciation of the Anglo English speakers. Very subtle differences existed between

these two varieties, however, the most notable being the fronting of /i/ and /I/ in the

Mexican Heritage English relative to the Anglo variety. Both of these systems exhibited

long/short distinctions in vowel duration, but the ratio of the durational differences was

considerably less for the Mexican Heritage English speakers. In fact, the duration ratio of

long to short vowels in Mexican Heritage English was found to be less than that of Anglo

English, but greater than that observed for the L2 speakers.35 Another notable difference

between the Anglo and Mexican Heritage English variety was the difference in vowel

inherent spectral change (VISC), or formant movement throughout the course of the

vowel, for /æ/. The trajectory of /æ/ in Anglo English was considerably greater than it

was in Mexican Heritage English.

35 For the Anglo speakers, the long and short vowel systems had average values of .25 seconds and .15 seconds, respectively, indicating a separation of long vowel and short vowel systems. The MHE speakers also exhibited separation, in that their average long vowel duration was .23 seconds and the short vowel system was .18 seconds. The L2 English speakers showed less evidence of distinct long and short vowel subsystems, in that their long vowels had an average duration of .20 seconds and the short vowels and average of .18 seconds.

94

A second component of this investigation consisted of accent ratings. Thirty one

naïve speakers of American English provided accent ratings of the Mexican Heritage

English speakers’ pronunciation. An analysis of the static and dynamic properties that

contributed to a speaker’s nativeness revealed that the dynamic properties, and not the

static properties, contributed more to a speaker being rated as having little foreign accent

in English.36 They found that in terms of static properties, the less fronted productions of

/i/ and /I/ were perceived as exhibiting more foreign accent—which seems

counterintuitive given that the MHE speakers produced these vowels further front than

the Anglo speakers. In terms of the dynamic properties, most notable were the voice

duration ratio (i.e., the presence of vowel lengthening preceding voiced consonants), the

VISC of the vowel /æ/, and finally the duration ratio (i.e., the degree of distinction made

between short and long vowels). Higher ratios and greater trajectories contributed to a

speaker exhibiting less foreign accent in English. Therefore, the overall results of this

study indicate that although Mexican Heritage English speakers and Anglo English

speakers may exhibit very similar vowel distribution in static terms, the distinctions

between these groups may lie in the dynamics of their pronunciation. For this reason,

they concluded that it was important to analyze both the static and dynamic properties in

order to more accurately characterize the pronunciation of a certain group of speakers.

Although the review of Chicano and Mexican Heritage English varieties

presented in this chapter is not exhaustive, the five studies reviewed above indicate

several important characteristics of the English of different Latino populations in the

36 The static properties measured were the F1 and F2 values of the English vowels measured at the 50% point throughout the vowel. The dynamic properties included the long/short duration ratio, the +voice/-voice duration ratio, the VISC.

95

United States.37 Some of these studies revealed accommodation to, or adoption of certain

characteristics present in, the majority English variety spoken in the region, whereas

others did not. Godiñez and Maddieson (1985) reported the absence of /u/-fronting in the

Latino males they examined in Los Angeles, whereas Fought (1999) found evidence of

/u/-fronting within the same city. In Fought’s study, however, the presence and degree of

/u/-fronting was influenced by several social factors (i.e., gender, social class, gang-

affiliation) such that only certain participants integrated this characteristic into their

speech. In the American Midwest, Roeder (2010) reported that the female Mexican

Americans exhibited most of the characteristics of the NCVS and thus had

accommodated to the majority, non-ethnic English variety in the area. She did report,

however, that the men were at earlier stages in the NCSV, and that all speakers differed

from their Anglo-counterparts in terms of the backness of /æ/ in prenasal position. In

contrast, the two studies conducted by Konopka and Pierrehumbert (2008; in press)

reported that Mexican Heritage English speakers in Chicago had not completely

assimilated to the non-ethnic norm of pronunciation. Their productions of /æ/ were not

fronted in a way that is typically associated with the NCVS, and the formant trajectory

within this vowel was found to be less than the Anglo speakers.

Taken together, these findings indicate that although Mexican American English

varieties spoken in the United States are more closely related to the non-ethnic English

majority norms than to learner varieties and Monolingual Spanish varieties, they are still

distinct from the majority variety in specific ways. Pronunciation of specific vowels, for

example the position of /i/ and /u/, the lowering or raising of the lax high vowels /I/ and

37 See also Thomas (2001) for Tejano English and Amastae (1978) for the acquisition of English vowels by Spanish-English bilinguals and L2 learners.

96

/ʊ/, and the pronunciation of /æ/ have emerged as distinct from the matrix variety in all of

the Latino and L2 English varieties discussed. What is perhaps more important, however,

is that each of the Mexican-influenced English dialects are distinct from each other.

Konopka and Pierrehumbert (2008: 102) argue that the distinctions between the Chicago

speakers and those reported in Roeder (2010) indicate that there is perhaps no “supra-

regional” Mexican Heritage English variety. The manner in which the non-ethnic

pronunciations are integrated into the Latino variety differ according to the region, as

well as according to the relationship that the minority community maintains with the

majority group of speakers. Thus, the creation of a unique system of pronunciation is

complex, and depends on a number of regional, social, and individualcharacteristics. The

next section of this chapter focuses on a review of the extralinguistic and linguistic

factors that may also influence heritage Spanish vowel pronunciation.

2.4 Individual speaker characteristics and speech production

Thus far, this chapter has provided an overview of the HS population as well as a

comprehensive description of investigations pertaining to vowel production and the

Spanish vowel system. Although the primary goal of the present study is to examine how

lexical stress, syllable type, and speech style influence the pronunciation of HS vowels, a

secondary focus is to assess the relationship between individual variables and vowel

production.

In contrast to the technical studies of vowels described in previous sections, the

secondary component of this research is sociolinguistic in nature, and could be more

specifically classified as sociophonetics. As described by Thomas (2011), studies in the

area of sociophonetics have become more prevalent within the past several decades, but

97

the exact definition, components, and goals of a sociophonetic investigation depend on

the linguistic subfield in which it is carried out. For phoneticians, sociophonetics

includes studies of dialectal variation or examinations of the timing of articulatory

gestures (Thomas, 2011). The field of sociolinguistics, however, has adopted a more

specific definition. According to Thomas (2011) from the sociolinguistic perspective,

sociophonetics includes studies within the variationist framework that employ modern

acoustic techniques and experimental methods. The present investigation more closely

matches the sociolinguistic definition of sociophonetics; it employs acoustic analytic

techniques to accurately describe the pronunciation of a specific population of Spanish

speakers, while at the same time establishing a connection between the actual vowel

pronunciations and a number of sociocultural and individual characteristics.

A considerable amount of research has established that variables such as age of

learning and age of arrival to a country affect a bilingual speaker’s pronunciation and

perceived accent in the L2 or non-dominant language (Flege et al., 1995; Munro et al.,

1996). Sociolinguistic studies of vowel and consonant production have also correlated

pronunciation with traditional sociolinguistic factors such as socioeconomic class, age,

and gender (Sadowsky, 2011; Serrano, 2006; Perisinotto, 1975, amongst others).38 In

addition to these traditional sociolinguistic variables, however, a number of other factors

have been shown to correlate with HS performance on grammatical tasks and

pronunciation of certain segments. The present investigation will thus focus more

specifically on sociocultural and individual characteristics such as overt grammatical

38 As all of the HS who participated in this study were either born in the United States or moved to the country before age 6, age of learning Spanish and age of arrival to the U.S. will not be examined since all speakers are similar in these respects. The age range of these speakers is similar, thus it will not be possible to assess generational effects on vowel pronunciation. Information pertaining to each speaker’s socioeconomic class was not collected via the language background questionnaires.

98

knowledge, course level, use of Spanish in the home and community, frequency of travel

abroad, and cultural sensitivity. The known relationships between each of these factors

and HS and bilingual speech production will be described in greater detail below.

Several of the investigations conducted by Montrul (2005) and Montrul et al.

(2008a; 2008b) indicated that grammar proficiency, as determined by a formal, written

exam, affected performance on different types of morphosyntactic tasks. When learners

and HS were divided into low, intermediate, and advanced groups based on their scores,

performance on morphosyntactic tasks was found to differ according to proficiency level.

The advanced level learners in particular outperformed those who had lower grammar

proficiency scores, and this distinction was revealed for both the L2 learner groups and

HS groups. As distinctions in performance on grammaticality judgment tasks based on

level of proficiency have been reported, the question then arises as to whether or not

overt grammatical knowledge will also have an effect on pronunciation. Comparisons

between L2 learners and HS often indicated that HS possessed more native-like

pronunciation, but what about within HS populations? Chang et al. (2009) reported that

at least for Chinese, experience with Mandarin and comprehension of formal and

informal situations affected the pronunciation of vowels. Although no formal grammar

proficiency test was conducted in their study, the findings suggest a correlation between

pronunciation and comprehension of the heritage language and/or the L2.39

Two other factors which have been shown to affect both morphosyntactic

knowledge and pronunciation accuracy are how often a HS speaker uses their heritage

39 As will be discussed later in chapter 5, there is some suggestion that paper-based grammar proficiency exams may not be appropriate for heritage populations given that their exposure and experience with the Spanish language has primarily been oral. For this reason, the participants will also be divided according to their course level (i.e. intermediate or advanced), as will be described in chapter 3.

99

language outside of the classroom, and how frequently they travel abroad to visit with

family and friends in a Spanish-speaking country. One such study conducted by

Mikulski (2010) examined how age of arrival to the United States, use of Spanish in the

home and community, and frequency and length of travel to hispanophone countries

affected HS’s recognition and judgment of the Spanish subjunctive in volitional

constructions. The participants in this investigation were 32 heritage speakers of Spanish

who were further divided into two groups. Those who were born in the United States or

arrived before age 6 were referred to as “early childhood” bilinguals because their

exposure to English began as young children (n=24). The other 8 participants were

grouped as “late childhood bilinguals” because they had arrived to the United States

between the ages of 6-12 years. All participants completed a language background and

use questionnaire, a grammaticality judgment task, an editing task, and a cloze task to

ensure that the participants had not confused the thematic vowel in the Spanish verbal

conjugations.

Multivariate linear regressions were conducted in order to determine the

relationship between the sociocultural variables and HS performance on the grammatical

tasks. The results indicated that there were no significant differences in performance

based on age of arrival (“early” vs. “late” childhood bilinguals). Spanish use and travel

abroad, however, were found to affect performance on certain tasks. Increased use of

Spanish with family and longer stays abroad were positively correlated with the scores in

the editing task. Thus, those participants who reported speaking Spanish more frequently

at home and those who had spent at least two weeks abroad and had visited Spanish-

100

speaking countries more than once outperformed those HS who used less Spanish and did

not travel.

Duprey-Almeyda (2009) examined the effects of language attitudes and travel

abroad the on vowel perception and production by L2 learners of English residing in

Puerto Rico. The participants in this study were tenth graders at a Puerto Rican high

school who were studying English as a second language. A series of production and

perception experiments were conducted which analyzed the learners’ ability to produce

and identify English monophthong vowels. Additionally, several language background

and attitudes questionnaires were administered in order to assess the extent to which

these extra-linguistic factors influenced speech production and perception. The results of

the investigation indicated that although all of the students had a positive attitude toward

English and showed a high level of cultural sensitivity, those learners who had “state-side

ties” showed a greater ability to produce native-like vowels and also had higher

perception scores. Thus, those students who maintained familial ties or relationships with

people in the United States exhibited more native-like production of English vowels and

were better able to perceive them when compared to those students who did not have the

same connections with the U.S.

The study conducted by Mikulski (2010) indicated that the frequency with which

a bilingual speaker or HS uses their heritage language outside of the classroom and how

often they travel abroad had significant effects on pronunciation and morphosyntactic

knowledge. Duprey-Almeyda’s (2009) investigation revealed that maintaining strong ties

with the heritage culture had a positive impact on language attitudes as well as speech

production and perception. Thus, in an indirect way, Duprey-Almeyda’s study found that

101

the strength of the connection with the heritage culture was influential in their speech

production. The results of Mikulski and Duprey-Almeyda therefore suggest that there

may be a connection between HS vowel production, frequency of travel abroad and

Spanish use.

A final sociocultural variable addressed in this investigation is cultural sensitivity.

Alvord and Christiansen (2009) examined the effects of language attitudes, cultural

sensitivity, and motivation on the acquisition of spirantization in Spanish by adult

learners in a study-abroad context. A group of 34 male participants who were studying

abroad in various Spanish speaking countries with a service-oriented program completed

narrative and elicitation tasks containing the target sounds /b d g/ and their spirantized

allophones. In addition to the linguistic tasks, the participants completed a cultural

sensitivity questionnaire developed by Cushner (1986), which was designed to assess

each participant’s general degree of cultural awareness and sensitivity. The statistical

analysis indicated that motivation and overall cultural integration were two of the factors

selected as significantly affecting acquisition of the target sounds. That is, those

participants who scored higher on the cultural sensitivity questionnaire and exhibited

higher motivation scores produced target sounds that more closely approximated native

speaker norms.

The present investigation examines each speaker’s level of cultural sensitivity by

utilizing the same questionnaire employed by Alvord and Christiansen (2009).40 As will

40 Although Alvord and Christiansen (2009) reported that motivation and attitudes towards language were also significant predictors of pronunciation accuracy, this dissertation will not directly examine these additional variables. Despite including open-ended and multiple choice questions that were meant to assess each speaker’s attitudes toward the language and motivation to speak it, all participants responded similarly. Because of the homogeneity in the responses, it was not possible to include these variables in the analysis.

102

be explained further in chapter three, this questionnaire is further divided into subscales

which can subsequently be correlated with pronunciation accuracy and performance.

Given the limited number of participants in this study, only the total sensitivity score will

be examined.

In summary, the literature presented in this section indicates that HS

pronunciation could potentially be influenced by a number of extra-linguistic factors.

Grammar proficiency and course level have been shown to correlate with

morphosyntactic knowledge, but thus far it is unclear if HS with more advanced

grammatical skills and those enrolled in upper-level courses will exhibit differences in

their pronunciation when compared to those with less overt grammatical knowledge and

formal instruction. The use of Spanish outside of the classroom and frequency of travel

abroad have been shown to significantly affect the pronunciation of vowels and

performance on morphosyntactic tasks. Thus, there may be a correlation between these

two variables and HS vowel pronunciation, even within this small sample size. Finally,

additional research has established a connection between a speaker’s cultural sensitivity

and their pronunciation. As cultural sensitivity increased, so did the pronunciation

accuracy of specific L2 segments. Further detail pertaining to the instruments and

scoring rubrics will be provided in the following chapter.

2.5 Research questions and hypotheses

The literature described in the previous sections of this chapter motivates the need

for further investigation of HS pronunciation as well as a more in-depth investigation of

Spanish vowels. Heritage speakers are a population of interest given that, despite the

large number of investigations focusing on morphosyntax and pedagogy, fewer studies

103

have focused on HS pronunciation. Vowel systems are also of interest in this

investigation. There are few acoustic studies of Spanish vowels to date, and of those

existing studies, there is a considerable amount of disagreement among researchers as to

which linguistic factors influence vowel pronunciation in Spanish—especially when

bilingual varieties of Spanish are taken into consideration. In addition, most of the

investigations focus on highly controlled and unnatural speech styles such as carrier

phrases, thus, little is known about how, and in what way, the vowel system of Spanish

can vary as a consequence of style. Previous studies of bilingual Spanish and bilingual

English vowel systems show that speakers of contact varieties possess a modified system

of pronunciation that differs from the monolingual norm and/or matrix variety within the

same region. Systems of contact pronunciation, however, are complex, and are

influenced by a number of extra-linguistic factors. These general findings, then, have

guided the development of research questions and hypotheses for the present study:

Q1: How are HS vowels organized and dispersed throughout the system, and how

does this organization and dispersion compare to traditional and acoustic

descriptions of monolingual Spanish varieties?

H1: If heritage speakers do exhibit native-like pronunciation, their vowel

productions will approximate the descriptions of monolingual Spanish vowel

systems presented in the literature.

H2: Heritage speaker vowel pronunciation will conform to sociocultural norms

associated with the population, such as the backed position of /e/ in Mexican

American Spanish described by Lipski (2008).

104

H3: The heritage Spanish vowel system will be a hybrid, exhibiting

characteristics associated with English such as the fronted position of /u/.

Q2. How do lexical stress, syllable type, and speech style affect HS vowel quality,

quantity, and dispersion?

Q2A: How does lexical stress affect HS vowel quality, quantity, and dispersion? Do

tonic and atonic vowels exhibit little quality and quantity differences as argued for

monolingual varieties of Spanish, or do they exhibit stress differences more in line

with what is argued for bilingual and learner varieties?

H1: Lexical stress will have little impact on vowel quality, dispersion, and

duration, as is traditionally claimed in descriptive accounts of monolingual

Spanish varieties (Delattre, 1969).

H2: Heritage speakers will exhibit the unstressed vowel reduction that is

characteristic of central Mexican varieties of Spanish. The front vowels /e/ and /i/

will be the most affected, with atonic /i/ and /e/ exhibiting shorter duration and a

position farther back in the vowel space when compared to their tonic

counterparts (Matluck, 1952; Lope Blanch, 1972).

H3: Heritage Spanish vowels will exhibit contact-induced differences in quality,

dispersion, and quantity similar to those described for other bilingual and L2

learner systems. In particular, atonic vowels will show movement towards the

center of the vowel space along the F2 dimension, will occupy less peripheral

105

locations in the acoustic space when compared to their tonic counterparts, and

will be shorter in duration than tonic vowels (Menke & Face, 2010; Willis, 2005).

Q2B: How does syllable structure (closed versus open syllables) affect vowel

quality, dispersion, and quantity? Is there variation based on syllable type, or do

HS fail show allophonic differences, as has been argued in the majority of the

acoustic literature?

H1: Heritage Spanish vowels will exhibit the allophonic variation based on

syllable type that is described for monolingual Spanish varieties (Navarro-Tomás,

1918). The mid vowels /e/ and /o/ will be characterized by more open

articulations in closed syllables, and close articulations in open syllables.

H2: Syllable type will not impact HS vowel quality, but will have an effect on

vowel duration (Marín Gálvez, 1995).

H3: Syllable structure will not impact the production of HS Spanish vowels,

indicating that the distinction is not present in this variety.

Q2C. How does speech style (spontaneous vs. controlled) affect HS vowel quality,

quantity, and dispersion? Do HS exhibit the same quality and quantity differences

based on speech style that have been found for monolingual and bilingual varieties?

H1: The HS in this study will not exhibit differences in vowel quality, dispersion,

or duration based on speech style. The absence of a style effect may suggest that

HS have not acquired the fine-grained distinctions in style exhibited by

monolingual speakers.

106

H2: Heritage speakers of Spanish will exhibit differences in vowel quality,

dispersion, and duration much like those reported for other monolingual and

bilingual populations. More specifically, vowels produced in controlled speech

will occupy a more peripheral location within the acoustic space when compared

to those produced in the spontaneous speech tasks (Alvord & Rodgers, 2011;

Harmegnies and Poch-Olivé, 1992; Poch-Olivé et al., 2008).

Q3: Do individual variables influence the way in which HS pronounce their vowels?

H1: There will be no relationship between individual variables (i.e., course level,

grammar proficiency, travel, Spanish use, and cultural sensitivity) and HS vowel

production.

H2: The same factors known to impact HS morphosyntactic knowledge (i.e.,

grammar proficiency, Spanish use, travel abroad) will impact HS pronunciation of

the Spanish vowels (Mikulski, 2010; Montrul, 2005).

H3: HS vowel production will be influenced by the same factors known to

influence L2 pronunciation accuracy and the acquisition of L2 phonology.

Course level, travel abroad, regular use of the L2, and high degrees of cultural

sensitivity have been positively correlated with the acquisition of L2 segments

(Alvord & Christiansen, 2009; Chang et al., 2009; Díaz-Campos, 2004; Menke &

Face, 2010).

In order to assess the research questions addressed above, a series of acoustic

experiments were designed and carried out with a group of HS from the Chicago area.

107

The next chapter provides detailed explanations of the participants, tasks, data analysis,

and statistical methods that were used to analyze the acoustic data.

108

3 METHODOLOGY

The purpose of this chapter is to describe the speech elicitation tasks and

questionnaires, the participants, the data collection and analysis, and the statistical

analyses that were conducted in the present investigation.41 Each of these

methodological components is described in greater detail below.42

3.1 Instruments

The participants took part in three oral speech tasks in order to elicit the

production of Spanish vowels in different contexts as well as in different speech styles.

A language background questionnaire and a cultural sensitivity questionnaire were

administered, along with a grammar proficiency test. The speech elicitation instruments

are discussed in sections 3.1.1 through 3.1.3. The other instruments are described in

sections 3.1.4 through 3.1.6.

3.1.1 Narrative retelling task

The first speech elicitation instrument consisted of a narrative retelling task

(hereafter NRT) that was included as a means to gather spontaneous speech. The

participants were asked to view the video “The Tortilla Rag” (Malovrh, 2008) and then

asked to orally narrate the series of events upon completion of the film. “The Tortilla

Rag” is a short, silent video that depicts two people preparing a tortilla española – a

41 Prior to collecting the official dissertation data, the instruments were piloted several times to ensure they would prompt the intended lexical items and to ensure that enough vowel tokens would be produced by the tortilla rag. Three separate pilot investigations were collected. Two were conducted in Bloomington, Indiana in August of 2010 and in September of 2010. Two participants took part in each pilot. The third pilot was conducted in Chicago, Illinois, during October of 2010. Four participants completed this version of the experiment. All lexical items that could not be consistently elicited in the picture memory task were either eliminated or were moved to the carrier phrase task. 42 The pilot version of this experiment as well as the final dissertation instruments are approved by the Indiana University Institutional Review Board Protocol # 0911000827.

109

traditional Spanish dish that resembles an omelet, consisting of eggs, potatoes, and

onions. Although it was originally designed to elicit direct and indirect object pronouns

(Malovrh, 2008), there are numerous actions and objects that prompt the production of

Spanish vowels in a variety of contexts. The author of the video gave his permission to

include it as one of the instruments in this dissertation.43 Images from the video are

shown in Figure 3-1.

Figure 3-1. Images from the Tortilla Rag, narrative retelling task.

The data collected from the narrative provided semi-spontaneous, unplanned

speech that more closely resembled natural speech when compared to the other tasks

utilized in this dissertation. Using a video such as this also provided some consistency in

terms of the number and types of lexical items obtained from each participant. Although

this video did not elicit an equal number of tokens of each vowel in all of the prosodic

and syllabic contexts, it did provide a sufficient amount of vowels to examine the basic

distribution within the acoustic space. The duration of the narrative retellings varied in

length according to the amount of detail the participant included. On average, the

43 The video (in two parts) can be accessed via Dr. Paul Malovrh’s personal website: http://www.cas.sc.edu /dllc/ Spanish/faculty/Malovrh/malovrh.html

110

recordings lasted approximately 90 seconds, although one recording was as short as 45

seconds and another reached 4 minutes and 25 seconds. The average number of vowels

produced in this task was 70. One participant only produced 38 analyzable vowels,

whereas another produced 114.

3.1.2 Picture identification task

Upon completion of the NRT, the participants were asked to complete a picture

identification task (hereafter PIT). The goal of the PIT was to obtain semi-spontaneous

speech by asking participants to identify and name images of objects, animals, and food

presented on a computer screen. Unlike the NRT which consisted of spontaneous speech,

the PIT was semi-balanced by the syllabic and prosodic contexts under investigation (i.e.,

tonic vs. atonic; closed vs. open syllables). The images utilized in the PIT were carefully

selected to elicit the five Spanish vowels in different prosodic and syllabic contexts,

resulting in the production of 80 total vowels. Although most of the images used in the

PIT represented common, everyday objects, some less common words were included in

order to provide examples of all vowels in all contexts. A list of the words included in

the PIT is presented in Appendix A. In total, the PIT lasted approximately 10 minutes,

and consisted of 123 power point slides that included the contextualization of the task,

the instructions and models, the familiarization phase, and the final task phase. Further

information about the PIT and sample slides are presented in the following paragraphs.

The PIT was presented to the participants as if it were a memory game, and they

were given a scenario to help contextualize the lexical items they would be asked to say.

It consisted of two phases: a training phase and an actual task phase. Before the actual

task began, participants were familiarized with the names of the objects they were going

111

to see.44 For example, they were presented with an image like the one shown in Figure

3-2, and were asked to say out loud the number and the name of the object they saw. The

first few letters of the object name were provided to help eliminate any ambiguity as to

what the object was. Figure 3-2 is a model slide consisting of a picture of three bananas

that was meant to prompt the phrase tres plátanos (“three bananas”).

Figure 3-2. Sample training slide from PIT.

Once the participants had been familiarized with the objects, they proceeded to

the actual task. They were told that they were helping their friend Marta plan a surprise

party for their mutual friend Pablo. Marta had invited a number of guests to the party,

and had asked that they each bring a gift or a of food item to share at the party. She had

asked you (the participant) to keep a record of who had already arrived and what they

brought as a gift. The participants were shown a model like the one presented in Figure

3-3 so they understood that they needed to say the name of the person who brought the

item, as well as what, and how many, they brought. Figure 3-3 presents two model slides

44 The decision to include a familiarization phase was motivated by difficulties in the pilot experiments. In the pilot trials of the picture identification instrument, some participants did not say the word intended, resulting in the inability to analyze the vowel that was needed. In order to ensure that the participants would say the intended lexical item, they were first familiarized with each item. If they were not able to identify the item, the investigator wrote down the name of the object for them.

112

from the PIT. The first slide is a sample, and prompts the response María trajo dos

galletas (“María brought two cookies”). The first slide includes the word modelo

(“model”) and ves (“you see”) to indicate that the slides are for practice. The second

slide includes the word dices (“you say”) in red in order to demonstrate how the

participant would be expected to respond upon seeing the slide. The phrase ¿Qué trajo?

(“What did he/she bring?”) was included on each slide to remind the participants that

they needed to respond to the question in a complete sentence. During the actual task

phase, the words modelo, ves, and dices were not included on the slide – only the picture

of the person and their name, the images of the objects, and the question ¿Qué trajo?

were presented.

Figure 3-3. Model slide from actual task phase in the PIT.

In addition to the normal, every-day object names presented in the PIT, a number

of uncommon objects and animal names were included in order to obtain all five Spanish

vowels in all of the stress and syllable type contexts. To integrate the less common

lexical items into the party scenario, participants were stopped halfway through the task

and given further instructions. They were told that some of the guests did not follow

instructions and brought very strange “gifts” with them. As with the previous portion of

113

the PIT, the role of the participant was to report to Marta what each person brought so

that she could decide if they would be permitted to come to the party or not. Figure 3-4

depicts one of the “strange” gifts, and was intended to elicit the response Carlitos trajo

tres avispas (“Carlitos brought three wasps”).

Figure 3-4. Slide of "strange" gift from the PIT.

As mentioned previously, the images used in this task include lexical items

containing tonic and atonic vowels in both closed and open syllables. Care was taken to

avoid the creation of vowel coalescence or synalepha between the indefinite article and

vowel-initial target words by including multiple pictures of the object: either “dos,”

“tres,” or “seis.” Also, the majority of the target vowels were flanked by either stop or

fricative consonants to facilitate segmentation and avoid any potential effects or

difficulties due to the presence of liquids or nasals45.

In total, there were 64 different lexical items included in the task, with two to four

examples of each vowel in each of the contexts (i.e., tonic open /a/, tonic closed /a/,

atonic open /a/, atonic closed /a/). Only the utterances from the task phase were included 45 Given the difficulty in finding images that were recognizable and were appropriate for the party scenario, some instances of nasals and liquids were included. As will be discussed later on in this chapter, all formant measurements were taken at the midpoint of the vowel in an attempt to minimize the influence of the surrounding consonantal context.

114

in the overall analysis.46 Several of the categories included more than four lexical items

containing the target vowel, as can be seen in Appendix A. For the atonic vowels, only

those in pretonic position were analyzed. The decision to exclude final vowels was made

based on previous findings that final vowels in some dialects of Spanish can be devoiced,

reduced, or in contrast, even lengthened (Delforge, 2008; Hualde, 2005; Lipski, 1994;

Lope Blanch, 1972; Matluck, 1952).

3.1.3 Carrier phrase task

After completing the PIT, participants completed a traditional carrier phrase task

(hereafter CPT) intended to gather controlled speech. The participants were asked to

produce a series of Spanish words within the carrier phrase La contraseña es_____ (“The

password is_____”). Fifty six total words were included in this task, yielding 66

potentially analyzable vowels.47 All of the words included in the carrier phrase task are

presented in Appendix B. All words were presented on a computer screen via power

point over the course of 64 slides that included the contextualization of the task, model

slides, and the target words. Most participants completed the CPT in five minutes.

As with the picture description task, the elicitation task was couched within a

game-like scenario in which the participants were told that they needed to remember a

certain password to enter a science lab. The word list consisted of real Spanish words

(i.e., no non-words) containing the five Spanish vowels between all combination of

46 Initially the intention was to include elicitations from both the training and the task phase in the PIT. Many of the participants did not know the name of the object and had to be helped, stuttered, or did not agree with what the object was called. Due to these complications, only the second repetition in the actual task phase was analyzed. 47 As some of the words included in the CPT were bi-syllabic, both the atonic and tonic vowels were extracted when possible.

115

/ptks/.48 The near-minimal series beginning with /m/ (i.e., masa, mesa, misa, etc.) was

also included in this task. Finally, in addition to the bi-syllabic words mentioned above,

several tri-syllabic words were included in the CPT. The additional words include

examples of tonic and atonic vowels in closed and open syllables, and were items that

were too difficult to include in the PIT because they were either too uncommon or were

not successfully elicited in the pilot experiments. Sample slides from the CPT are

presented in Figure 3-5. These two slides were included as practice, and for this reason

they include the words modelo (“model”), ves (“you see”), and dices (“you say”).

MODELOVES:

La contraseña es….

perro

MODELODICES:

“La contraseña es perro.”

Figure 3-5. Sample slides from CPT.

The CPT was included in the present investigation for two main reasons. First of

all, a carrier phrase including vowels in neutral or similar consonantal contexts is a

frequently-employed methodology in vowel research (see Bradlow, 1995; Morrison &

Escudero, 2007; Servín & Rodríguez, 2001). Including a CPT in the present study

facilitates comparisons across studies between HS vowel productions and vowels

produced by monolingual speakers of Spanish. Second of all, the CPT represents what

has been referred to as “laboratory” speech, and represents a very controlled speech style. 48 Not all combinations of /ptks/ and the five vowels produced actual Spanish words. For example, combinations such as /kuso/, /pupo/, and /tuto/ were not included. The consonants /p,t,k/ were also chosen so that future investigations could examine HS VOT.

116

As discussed in chapter 2, section 2.2.3.3, previous research suggests that vowels are

produced differently in terms of quality based on the type of speech in which they are

produced (Harmegenies & Poch-Olivé, 1992, Poch-Olivé et al, 2008). Thus, the current

investigation includes three tasks that represent three different speech styles:

spontaneous, semi-spontaneous, and controlled.

3.1.4 Language background questionnaire

All participants completed a written language background and history

questionnaire in order to gather information pertaining to their experiences and usage of

both English and Spanish in their daily lives (see Appendix C). The responses to the

written questionnaire ensured that participants originated from roughly the same region

within the United States and/or Mexico; inquired whether or not they had studied another

foreign language; how often they traveled abroad; and how often and how frequently they

spoke, listened to, or wrote English and Spanish in different contexts. Participants’

responses pertaining to frequency of travel abroad and Spanish use were included in the

statistical model and correlated with HS vowel production. The procedure for obtaining

travel frequency and Spanish use are described in greater detail in sections 3.4.2.3 and

3.4.2.4, respectively.

3.1.5 Grammar proficiency test

All participants completed a short grammar proficiency exam consisting of 25

multiple choice questions (Geeslin & Gudmestad, 2010). The purpose of the grammar

proficiency activity was to ensure that the participants had comparable proficiency in

Spanish. The scores on this exam were then correlated with the experimental results in

order to examine the effects of Spanish grammar proficiency on vowel production. The

117

grammar test is presented in Appendix D. Further information pertaining to the grammar

proficiency activity and how it was included in the statistical model is discussed in

section 3.4.2.2.

3.1.6 Cultural sensitivity questionnaire

Finally, each participant completed Cushner’s (1986) Inventory of Cross-Cultural

Sensitivity (ICCS) (see Appendix E). The ICCS questionnaire contains a series of

statements in English. Each participant was asked to indicate how strongly they agreed

or disagreed with each statement. The scale ranged from 1-7, with 1 indicating a

response of “strongly disagree” and 7 a response of “strongly agree.” The responses

were then organized into several different subscales: Cultural Integration, Behavioral,

Intellectual, Attitude, and Empathy. The sum of the scores tabulated for each of the

subscales produced the overall cultural sensitivity score. The range of possible scores on

the ICCS is between 32 and 224 points. Higher scores indicate greater degrees of cultural

sensitivity, whereas lower scores indicate lesser degrees of cultural sensitivity.

Additional information pertaining to the ICCS and how it was correlated with HS vowel

production is described in section 3.4.2.5.

3.2 Participants

Sixteen heritage speakers of Spanish (13 females and 3 males) participated in this

study. A full table of demographic information is presented in Appendix F. All

participants were current residents of Chicago, IL, and were students at the University of

Illinois, Chicago. The mean age of the participants was 20.1 years. Thirteen participants

were born in the United States and had lived in Chicago their entire lives. Three

participants were born in Mexico and moved to the United States at the ages of 2, 6, and

118

8 years of age. Fifteen of the participants were of Mexican heritage, and one was of

Puerto Rican heritage. Fourteen of the participants were born to parents of the same

heritage (i.e., either both Mexican or both Puerto Rican). Two participants were of mixed

heritage: one was born to a Mexican mother and a Castilian Father, and the other was

born to an American mother and a Mexican father.

The educational background of the participants who completed this study was fairly

consistent. Twelve participants began formal study of Spanish in high school at age 14 or

in college at ages 17-18. Two participants indicated that they received formal Spanish

instruction in elementary school, and one participant was enrolled in a bilingual

education program for one year. One participant did not indicate the age at which she

began studying Spanish.

All participants indicated that they lived in predominantly Hispanic communities

within the Chicago area and spoke Spanish at home and within the community on a

regular basis.49 Additional information pertaining to Spanish use was gathered from the

language background questionnaire, and will be explained in greater detail in section

3.4.2.4. All participants reported that they overheard and spoke Spanish starting from a

very early age, and most reported that their exposure to Spanish began at birth. Eleven

participants indicated that their native language was Spanish, one participant indicated

that her native languages were both Spanish and English, and the remaining four

participants indicated that they considered their native language to be English.

49 Although not all participants indicated the neighborhood in which they lived, more than half reported that they lived near the University of Illinois, Chicago. Some specifically mentioned residing in Mexican-American communities such as Pilsen and Little Village, which are located to the south and the southwest of the University, respectively.

119

At the time the study was conducted, seven of the 16 participants were enrolled in

S114 (Spanish for Bilinguals II). Spanish for Bilinguals II is an intermediate-level

grammar course designed specifically for HS of Spanish. The goals of the course are to

re-introduce HS to key grammatical concepts and help develop additional reading,

speaking, and writing skills. The other nine participants had already completed S114 and

were enrolled in 200 and/or 300 level courses such as S202 (Advanced Spanish

Grammar), S204 (Intensive Writing), S210 (Literature), or S380 (Independent Study).

More detailed demographic information pertaining to linguistic and developmental

history, language use, and ethnic self-identification is presented in Appendix F.

3.3 Procedure

This section of the chapter describes the order in which the tasks were completed

and the devices that were used for stimulus presentation and recording. After completing

the informed consent documents, each participant was first asked to view the video

“Tortilla Rag” and subsequently asked to retell as much of the information as they could

remember. Next, each participant completed the PIT, followed by the CPT. Once the

three recorded tasks were finished, all participants completed the written grammar

proficiency test, the language background questionnaire, and the cultural sensitivity

questionnaire.50 Participants took part in the experiment one at a time, with only the

experimenter present to help guide them through the tasks if they had any questions.

50 In addition to the tasks for eliciting Spanish vowels described in section 3.1, the participants were also asked to provide examples of their English vowels by completing a carrier phrase task and paragraph reading in English. The phrase for the carrier was “I say ___ for you.” The paragraph reading involved a fictional story about a summer vacation in which words containing all of the cardinal vowels were embedded. The production of English vowels is not discussed in the present dissertation.

120

Upon completion of all of the materials, each participant was paid $10 for taking part in

the experiment.

All of the stimuli were presented on a Hewlett-Packard G62 laptop computer with

a 15.6 inch screen. The “Tortilla Rag” was presented via Windows Media Player, and the

PIT and CPT were presented with Microsoft Power Point 2010. Each participant’s

speech was recorded using a SHURE head-mounted microphone and a USBPre interface

that was run through a Dell Inspiron mini netbook.51 The participants’ speech was

subsequently analyzed using Praat Software (Boersma & Weenink, 2010). All of the

recordings took place in a quiet room or classroom on or near the UIC campus. The

series of experiments and questionnaires took approximately one hour to complete.

3.4 Data analysis

The purpose of this section of the chapter is to describe how the data were

segmented and analyzed and explain the statistical tests that were conducted on the

variables examined in this dissertation. Section 3.4.1 describes the procedure for

segmenting vowels, the criteria for excluding vowel tokens, extracting vowel formants,

calculating vowel duration, calculating the centroid and Euclidean distance measures, and

normalization. Section 3.4.2 explains the individual variables in greater detail, and

describes how the questionnaires were scored. Finally, section 3.4.3 describes the

statistical analyses employed in the present study.

51 For more information about the USBPre, see the following website: http://www.sounddevices.com /products/usbpremaster.htm

121

3.4.1 Acoustic measures

The audio recordings obtained from each participant were analyzed in Praat

Software v. 5.1.44 (Boersma & Weenink, 2010). Vowels were first segmented out of the

running speech samples, as described in section 3.4.1.1. The first and second formant

frequencies and vowel duration measures were calculated using procedures described in

sections 3.4.1.3 and 3.4.1.4, respectively. The first and second formant frequencies were

used to determine the center of each speaker’s vowel space, and the Euclidean distance

measure of each vowel to the center was subsequently calculated (section 3.4.1.5). The

first and second formant frequencies, duration, and Euclidean distance are the four

dependent variables in the statistical analysis. Further information about statistics is

presented in section 3.4.3.

3.4.1.1 Segmentation of vowels

All vowels were segmented out of the larger speech samples obtained in the three

elicitation tasks. The formant transitions into and out of vowels and the degree and

direction of formant trajectories are extremely sensitive to the surrounding consonantal

context (Rosner & Pickering, 1994; Stevens & House, 1963). Given this contextual

sensitivity, a series of cues found in the waveform and the spectrogram were utilized in

order to determine the onset and offset of each vowel.

In order to identify the onset of a vowel following a stop consonant, several main

acoustic cues were identified in the acoustic signal. First, the release burst from the stop

consonant was located, which signals the release of the stop consonant and the onset of

the following vowel. A second important cue was the increase in intensity in the

waveform and the spectrogram. As a stop consonant is characterized by the lack of

122

activity in the waveform and the spectrogram, the sudden increase in energy in the

acoustic signal signified the onset of the vowel. Periodicity, or cyclic repetition, in the

waveform was the third cue that facilitated segmentation of the vowel. Finally, the

onset of clear and regular formant structure was also used as a means to determine the

starting point for each vowel. In order to determine the offset of a vowel when it

preceded a stop consonant, the reverse of these cues was used. Thus, the end point of the

vowel was identified based on a sudden loss of intensity and energy in the waveform and

the spectrogram, a sudden lack of periodicity, and a shifting and breaking up of the

regular formant structure characteristic of a vowel.

Figure 3-6 illustrates the segmentation of the vowel [a] in the word tapa (“lid,”

“top”). The top panel is a representation of the waveform in which the boundaries of the

first “a” are delineated by the dashed lines. The second panel includes the spectrogram in

which the same vowel is segmented. Below the acoustic panels, a text grid is included

which serves to label the vowel sound and the entire word. The silent portion before the

segmented vowel is the closure for the stop consonant. The small burst represents the

release of the stop consonant, followed by the voiced onset time. After the VOT, there is

a large increase in intensity in the waveform as well as periodicity and clear formant

structure in the spectrogram. Once the vowel has ended, another period of silence, seen

in both the waveform and the spectrogram, corresponds to the following stop consonant

[p].

123

Figure 3-6. Waveform and spectrogram of the word tapa (“lid”) with acoustic cues for segmentation and measurement.

In order to identify vowels that were surrounded by fricative consonants, a

slightly different set of cues needed to be used given the nature of fricatives. As

fricatives allow for the partial passage of air through the oral cavity, they exhibit a degree

of aperiodic noise in the waveform as well as high frequency noise in the spectrogram.

Vowels in the context of fricatives were thus identified based on the increase in intensity

in the waveform as well as a shift from aperiodic noise to a periodic waveform. In the

spectrogram, the vowel was identified by the onset of clear formant structure. The offset

of a vowel in contact with a fricative was determined by the shift from periodic waves to

aperiodic noise, a decrease in intensity in the waveform, and the breaking up and shifting

124

of formant in the spectrogram. Figure 3-7 exemplifies the segmentation of the vowel [i]

in the word sistema (“system”), which is flanked by two sibilants. Prior to the vowel, one

can see the aperiodicity of the sound wave in the waveform, as well as high frequency

noise and lack of clear formant structure in the spectrogram. This corresponds to the first

[s] in sistema. The wave then becomes periodic and has a higher amplitude, signifying

the vowel. Steady formants also appear in the spectrogram, corresponding to the vowel

[i]. The wave amplitude decreases as the offset of the vowel approaches, and the wave

becomes less sinusoidal. The formants that were once clear in the vowel begin to

disperse and are replaced once again by high frequency noise that represents the second

[s] in sistema.

Figure 3-7. Waveform and spectrogram for the word sistema (“system”).

125

The segmentation of vowels in contact with resonant consonants can be less

straight-forward than in the previous examples. The difficulty in finding clear boundaries

for a vowel in contact with sonorant consonants is due in part to the fact that they share

many spectral characteristics with vowels. Sonorants, too, exhibit a high degree of

energy and periodicity in the waveform and have clear formant structure that is visible in

the spectrogram. The main cues for identifying vowels in contact with nasals and liquids

is the increase in intensity in the waveform relative to the neighboring consonants and a

shifting in formant structure as indicated by the formant tracker.52 Nasal consonants, for

example, are characterized by the coupling of the first and second formants, resulting in a

large concentration of energy in the low frequency ranges. When a nasal consonant is in

contact with a front vowel that has a high second formant such as [e], the second formant

must shift and rise as it transitions from the nasal into the following vowel. This

movement is one of the main cues that helps identify the onset of a vowel in this

consonantal environment. Figure 3-8 is a representation of the waveform and

spectrogram of the word mano (“hand”). The vowel [a] has been segmented out of this

word as indicated by the vertical lines and the label on the first tier of the text grid.

Notice that there is a decrease in wave amplitude in the nasal sounds, as well as a

concentration of energy in the low frequency ranges for the nasal that is not present in the

vowel.

52 The formant tracker is a component of Praat Software that identifies the center frequencies of the formants, thus indicating the frequency of the concentration of energy for a given speech sound.

126

Figure 3-8. Waveform and spectrogram for the word mano (“hand”).

The shifting and transitioning of formants and the relative increase in intensity in

the waveform are also the primary cues for identifying a vowel when it is in contact with

a lateral consonant. As there is a partial closure formed by the tongue and the alveolar

ridge when forming an [l] in Spanish, there is some decreased intensity in the waveform

even though the formant structure may be maintained in the spectrogram. As laterals

typically involve a closure, there is also an apparent break in the formant structure (a

discontinuity) indicating the transition between sounds (Ladefoged, 2003, p. 146). Figure

3-9 shows the segmentation of the vowel [a] in the word lapiz (“pencil”). Note the

shifting of formants and increase in wave amplitude as the vowel is produced.

127

Figure 3-9. Waveform and spectrogram of the word un lápiz (“a pencil”).

Finally, the segmentation of a vowel when in contact with a rhotic (trill or flap) is,

in some ways, similar to when a vowel is in contact with a stop consonant. As Spanish

rhotics are typically characterized by one or more closures in the oral cavity caused by

contact between the tongue and the alveolar ridge, a brief silence (white portion) is often

visible in the spectrogram. Although the closure may be too brief to obscure the formant

structure as indicated by the formant tracker in Praat, the silence in the spectrogram is

also accompanied by a rapid and brief decrease in intensity in the waveform. Figure 3-10

represents the waveform and spectrogram for the word cucharas (“spoons”). The first

repetition of [a] has been segmented out of the word as indicated by the vertical lines and

128

label on the first tier of the text grid. The brief silence following the vowel represents the

closure of the rhotic.53

Figure 3-10. Waveform and spectrogram of the word cucharas (“spoons”).

To summarize this section briefly, the onset and offset of the vocalic segments

analyzed in the present investigation were determined by taking into consideration a

combination of the acoustic cues mentioned above. The surrounding segmental context

determined which characteristics were the strongest cues signaling the onset and offset of

the vowel under investigation. The presence of at least two cues was preferred in order to

identify a vowel. If at least two cues were not identifiable, the vowel under investigation

53 The second /a/ in the figure is produced with creaky voice. As will be discussed later in this chapter, vowels produced with creak were not included in the present study.

129

was marked as questionable and was checked by another trained phonetician. If the

researchers were unable to reach agreement on the boundary of the vowel given the

presence of only one acoustic cue or two weak acoustic cues, the vowel was excluded

from the analysis.54 Care was taken in the design of the stimulus materials to include

lexical items with vowels in contexts that were conducive to segmentation (e.g., between

two stop consonants or between a stop and a fricative). In the narrative retelling task,

however, the lexical items produced by each participant were not always consistent. In

order to include vowels in all of the syllable and stress type contexts under investigation,

it was sometimes necessary to include vowels produced in less clear-cut segmentation

contexts.

3.4.1.2 Exclusion of vowels

The primary goal of this study is to provide a descriptive characterization of the

Spanish vowel system of HS of Spanish. For this reason, only monophthong (“pure”)

vowels were included in the present analysis. Only the vowels in the target words in the

PIT and CPT were included. In the NRT, all analyzable vowels that occurred in lexical

words (i.e., nouns, verbs, adjectives, and adverbs) were included in the analysis. Vowels

that were in absolute word-final position were excluded from the analysis because, as

mentioned earlier, final vowels in certain dialects are subject to lengthening, devoicing,

or reduction (Delforge, 2008; Hualde, 2005; Lipski, 1994; Lope Blanch, 1972; Matluck,

1952). Phonemic diphthongs were excluded, as well as mid vowels that had been

diphthongized. Diphthongs consist of a combination of two vowels within the same

54 In total, 104 vowels were excluded from the analysis because they were either produced with creak, devoiced, in contexts where segmentation was unclear, because the participant produced an unintended lexical item, or because the participant produced the intended lexical item in English.

130

syllable: a high vowel [i] or [u] that is weakened to a semivowel, and an additional

vowel (Hualde, 2005). Diphthongs differ from simple vowels in that they tend to be

longer in duration and are characterized by considerable spectral movement from one

vowel to another. Thus, the inclusion of diphthongs in this analysis would not provide an

accurate description of the Spanish five-vowel system, but rather show how vocalic

segments interact and combine with one another.

Vowels that occurred in the context of the approximant [j] were also excluded,

given the difficulty in locating the boundary between voiced approximants and vowels.

The location of the boundary between these sounds is problematic because they often

share many of the same acoustic attributes: high energy and intensity, periodic waves,

and clear and stable formant structure. In a word like cebolla (“onion”), for example, the

exact boundary of the [o] positioned between the approximants [β] and [j] is difficult to

locate because of the spectral similarity of the three sounds. In order to ensure

consistency in the segmentation, vowels in these types of contexts were avoided.

Vowels in the context of the voiced approximants [β,ð,ɣ] were included only if a clear

boundary could be determined based on decreased intensity in the waveform.

Individual speaker characteristics, most notably creaky voice, prevented certain

tokens from being analyzed. Creaky voice is a type of phonation that, unlike modal

voicing, is characterized by irregular glottal pulsation.55 This irregularity makes reliable

extraction and systematic analysis of the formants very difficult. In addition, the goal of

the present dissertation is to describe the production of vowels with modal voicing, and

not to provide an analysis of creak. For these reasons, creaked vowels, as well those that

55 For further information pertaining to creaky voice, see Johnson (2003, pp. 136-139).

131

were produced with breathy voice or that were devoiced were not included in the

analysis.

Finally, in rare instances, it was necessary to exclude vowel tokens as a

consequence of speaker characteristics or equipment malfunction. If the participant

spoke too softly and devoiced a vowel, the sound analysis software was unable to provide

accurate and consistent readings of the speech signal. If the participant touched or moved

the microphone or cleared his or her throat during the production of a specific vowel, this

also obscured the speech signal making it impossible to obtain an accurate measure of the

formant structure. In other cases, especially in the narrative retelling task which involved

more rapid speech, initial vowels were deleted almost entirely and therefore could not be

analyzed. The first [e] in the phrase dos espejos (“two mirrors”), for example, was

subject to deletion in Mexican speech because it is atonic and was positioned after a

sibilant consonant. Several of the participants simply did not produce this vowel, or what

was produced was too short to analyze or completely devoiced.56 Only vowels last had

durations of at least 40 milliseconds were included in the overall analysis. Further

explanation of duration is provided below in section 3.4.1.4.

3.4.1.3 Formant frequencies

In order to provide a descriptive account of the vowel system of heritage speakers

of Spanish, the first, second, and third formant frequency measures in Hertz of each of

the segmented vowels were extracted using an automated Praat script created by Lennes

56 Vowel reduction, devoicing, and deletion are known to be somewhat common in this phonetic environment, especially within the Andean and Mexican varieties (Matluck, 1952; Lope Blanch, 1972; Lipski, 1994; Delforge, 2008; Hualde, 2005).

132

(2003) and modified by Todt (2009).57 As described previously in chapter 2, the first

formant frequency (F1) is the acoustic correlate of vowel height; high vowels are

characterized by a low F1 whereas low vowels are characterized by a high. The second

formant frequency (F2) is the acoustic correlate of vowel backness. Front vowels, such

as /i/ and /e/ have higher F2 values whereas the back vowels /o/ and /u/ have lower F2

values. Several investigations of Spanish vowels have shown that the first two formant

frequencies are sufficient to distinguish all of the members of the five vowel system

(Hualde, 2005; Martínez-Celdrán, 1995; Quilis & Esgueva, 1983), and therefore, only the

first two formant frequencies will be analyzed in the present study.

Formant frequency measures of each vowel were taken at three different points

within the vowel: the 25% point, the 50% point, and the 75% point as calculated by

Praat. The midpoint of the vowel is the preferred point of analysis because it is the

farthest removed from the surrounding consonantal context and is thus less subject to

coarticulatory effects from neighboring consonants. Only the formant measurements

taken at the 50% point will be reported in chapter 4, and not those taken at the other

points within the vowel.58

Praat settings and maximum formant settings were adjusted as needed for each

speaker and for each vowel. The typical maximum frequency was 5500 Hz and the

dynamic range was set at 40. The number of formants analyzed ranged between 4 and 6

57 The third formant frequencies were measures by the automated script, but will not be discussed in the present study. The third formant (F3) is typically associated with rounding, r-coloring, and retroflexion. As rounding is not a contrastive feature in Spanish, this measure is not necessary to distinguish Spanish vowels. Future studies could focus on examining the similarities and differences in rounding between HS, monolingual Spanish speakers, and monolingual English speakers. 58 The measures taken at the 25% and the 75% points within the vowel will allow for further investigation of the effects of surrounding consonantal context on vowel production. Analyzing the formant values at the 75% point within the vowel, for example, may reveal additional quality differences between vowels in open and closed syllables. Thus, the additional measurements taken within the vowel will not be discussed in the present study but will provide the basis for future investigations of heritage Spanish vowels.

133

depending on the speaker’s gender and if the vowel was front or back. The

measurements extracted from the Praat script were written to a text file and then

transferred to Microsoft Excel for further analysis.59

3.4.1.4 Vowel duration

The duration of each segmented vowel was calculated in using a script written by

Lennes (2002) for Praat (Boersma & Weenink, 2010). The automated script measured

the duration of all vowels as they were labeled on the second tier of the text grid. The

output of the script was written to a text file, which was then transferred to Microsoft

Excel and sorted by vowel label. Vowels with extremely short durations (less than 40

ms) and long durations (greater than 110 ms) were hand-checked a second time to ensure

that the boundaries had been placed appropriately. The duration measurements extracted

by the Praat script were then submitted to statistical analysis to determine if stress,

syllable type, or task had a significant effect on the duration of Spanish vowels.

3.4.1.5 Vowel dispersion

In order to determine the degree of vowel dispersion within the acoustic space, the

center point of the vowel space, or “centroid,” was calculated for each individual speaker

using a process similar to the S-Centroid procedure described by Watt and Fabricius

(2002). The center of the vowel space was determined by calculating the average F1 and

59 The Praat script had default settings that could be adjusted overall for each speaker, but the maximum number of formants could not be altered for each individual vowel. Given the need to change the maximum number of formants analyzed by Praat depending on the vowel, many of the tokens were analyzed by hand using the “get formant” procedure from the drop down menu in Praat. The back vowel /u/ was especially problematic because the F1 and F2 values are positioned very close to one another. In some cases, the Praat script identified the third formant as the second, and reported an incorrect value. For this reason, all of the back vowels were carefully checked and hand-corrected. If the F2 of a back vowel was considerably out of range for an individual speaker (i.e., in the range of 2000Hz when all others had been near 1100-1200 Hz), the vowel was reexamined and the settings in Praat adjusted so that the formant tracker aligned with the second formant frequency and not the third.

134

F2 frequencies of the point vowels /i,a,u/ in tonic, open syllables produced in the CPT.60

The average F1 and F2 values of the point vowels determined a center point in the

acoustic space that was unique to each individual. The Euclidean distance of each of the

individual’s vowels was then measured relative to the centroid. The formula to calculate

the Euclidean distance is presented in Equation 1 below. The distance (d) of p

(individual vowel) from q (the centroid) was calculated by taking the square root of the

sum of the squared difference between the F1 value of the individual vowel and the F1 of

the centroid, and the F2 of the individual vowel and the F2 of the centroid.

d(p,q) = √(p1 – q1)2 + (p2 – q2)2

Equation 1. Euclidean distance formula utilized to measure the distance of vowels from the center of the vowel space.

The calculation of the Euclidean distance thus permitted an examination of how

far away, or dispersed, each individual’s vowels were situated from the center of the

vowel space by providing a single value in Hertz. A pictographic representation of the

center of the vowel space and the distance of the point vowels from the center is

presented in Figure 3-11. The “X” marks the location of the hypothetical centroid, and

the dashed lines represent the Euclidean distance of the point vowels from the center.

Note that this illustration does not include the mid vowels and is meant to represent a

hypothetical vowel space. In a perfectly triangular vowel space such as the one

60 Previous analyses from the pilot experiment indicated that the tonic vowels in open syllables produced in the CPT were the most peripheral. In order to calculate the truest center of the vowel space, the F1 and F2 values from the most peripheral and extreme vowels were used as opposed to the average of the point vowels extracted from all three tasks.

135

represented below, the Euclidean distance values of the point vowels from the center of

the space would be equal, as would the distances of the mid vowels from the center.

Figure 3-11. Representation of the centroid and distance of the point vowels from the centroid.

The Euclidean distance of the individual vowels from the center of the vowel

space was included as an additional dependent variable in the statistical analysis along

with the F1, F2, and duration of each speaker’s vowels. The statistical analyses are

discussed in detail in section 3.4.3.

3.4.1.6 Normalization

Normalization is a process that is traditionally performed on vowel formants in

order to remove differences in vowel formant frequencies that are a consequence of

physiology. That is, as males and females, as well as adults and children, have different

vocal tract sizes, the frequency ranges characterizing their vowels are inherently

different. In most acoustic studies of vowels where male and female speakers or adult

and child speakers are analyzed, it is important to remove as many of these differences as

136

possible so that the different groups can either be pooled and analyzed together or

compared to other speaker populations. As this investigation includes male and female

speakers, the raw formant data were normalized using the NORM vowel normalization

suite (Thomas & Kendall, 2007). The Lobanov (1971) method of normalization was

employed because it outperformed other methods of normalization, more accurately

reducing physiological differences while at the same time maintaining sociolinguistic

differences between speakers (Adank, et al., 2004). The normalized vowels were then

scaled to a vocal tract model in NORM, thus resulting in formant values that represented

the overall vowel space of all the heritage speakers who participated in this study.61

3.4.2 Individual variables

In addition to examining the influence of stress, syllable type, and speech style on

vowel production, a second goal of this investigation was to determine the relationship

between several individual variables and HS vowel pronunciation. This information was

gathered based on the language background questionnaire, the grammar proficiency

activity (Geeslin & Gudmestad, 2010), and the cultural sensitivity questionnaire

(Cushner, 1986) described earlier in this chapter. Each of these variables is described in

greater detail below.62

61 As is discussed in section 3.4.3.2, the final statistical model and the results reported in chapter 4 pertain to the non-normalized data (i.e,. those that were NOT normalized with the Lobanov (1971) normalization procedure). Statistical tests were conducted with both the Lobanov normalized and non-normalized formants, however, and some reference is made to the Lobanov normalized data in chapter 5. A comparison of the statistics conducted with the normalized and non-normalized data are presented in Appendix L. 62 The language background questionnaire also included a series of statements and further information about whether or not each participant spoke another language, how well they spoke another language, and demographic terms they felt best described them (e.g., latino/a, mexico/a, americano/a, etc.) This additional information was excluded from the analysis given the similarity of the responses provided by the participants. For example, it was intended to examine how strongly each speaker felt that speaking Spanish helped them feel connected to their heritage. As all speakers responded with either “strongly agree” or

137

3.4.2.1 Course level

The participants in this study were divided into two groups based on the Spanish

course-level in which they were currently enrolled. At the time of the investigation,

seven participants were enrolled in an intermediate-level Spanish course designed for

heritage speakers. The other nine participants had already completed this course and

were enrolled in upper 200-level or 300-level courses in literature, culture, or linguistics.

As the distribution of the participants was roughly even, their course level was one of the

variables included in the analysis. Course level was included in the statistical model as a

categorical covariate, meaning that each participant belonged to either the intermediate

group or the advanced group. The inclusion of course level as a variable may help

determine if greater or lesser degrees of overt language instruction influence the

pronunciation of HS vowels.

3.4.2.2 Grammar proficiency

As mentioned in chapter 2, an additional factor which could influence

pronunciation is a speaker’s grammar proficiency, or knowledge of formal grammar. In

order to examine the relationship between formal grammatical knowledge and

pronunciation, all participants completed a contextualized grammar activity designed by

Geeslin and Gudmestad (2010). This activity consisted of 25 multiple choice questions

which examined knowledge of object pronoun use, preterite and imperfect, ser and estar,

and conditional clauses, among other aspects of Spanish grammar. Each participant’s

“agree,” there was not enough variation in the responses to include this question in the analysis. There was very little variation in the selection of demographic terms, resulting in the same complication. Some participants indicated that they identified with all terms, whereas others indicated that they self-identified with just one term. Thus, the intention was to include more individual information in this portion of the analysis, but as there was very little variation between the speakers and a small number of total speakers, only the ICCS, grammar score, travel frequency, course level, and Spanish use were analyzed statistically.

138

score on this proficiency test was included in the statistical analysis to determine if

knowledge of formal Spanish grammar impacted HS vowel quality and/or quantity. The

grammar score was included as a continuous covariate, meaning that each individual

speaker’s score was entered into the model separately. In other words, the participants

were not grouped into low, mid, or high proficiency levels as is typical in previous

studies. As will be discussed further in section 3.4.3, the parameter estimates of fixed

effects were examined in order to determine the relationship between grammar

proficiency score and HS vowel production.

3.4.2.3 Travel abroad

The frequency with which each speaker traveled abroad to their country of

heritage or another Spanish-speaking country was obtained from the language

background questionnaire. The responses provided by each participant helped divide

them into 3 groups. The four speakers who indicated that they had never traveled abroad,

or had not done so in the past 8 years, were grouped into the “no travel group” (group 0).

Those who indicated that they visited a Spanish-speaking country every 3-4 years formed

the “infrequent/some travel” group (group 1, N=6). Finally, those participants who

indicated that they traveled every year formed the “frequent travel” group (group 2,

N=6). Travel abroad was included in the statistical model as a categorical covariate,

which will be described in greater detail in section 3.4.3. The group scores were entered

into the analysis to determine if frequency of travel abroad influenced the way in which

these speakers pronounced their Spanish vowels.

139

3.4.2.4 Spanish use

The language background questionnaire also contained a series of multiple choice

questions inquiring about how frequently each participant spoke or used Spanish in a

variety of contexts. The participants were asked to indicate how frequently (i.e., daily,

weekly, monthly, every few months, almost never) they spoke or used Spanish with

friends, family, watching television, reading books or magazines, listening to music or

the radio, watching movies, at the work place, and in school/class. For each category in

which the speaker indicated that they used Spanish daily, they were assigned the score of

“5,” for weekly a “4,” and so forth. The eight contexts and five degrees of use resulted in

a total possible score of 40. The higher the score, the more frequently Spanish was used.

Each participant was assigned their own individual Spanish score based on their

responses that was then included in the statistical analysis as a continuous covariate. The

goal of including this variable was to determine if speaking Spanish more or less had an

impact on how the Spanish vowels were pronounced.

3.4.2.5 Cultural Sensitivity

The final individual variable analyzed in this dissertation was the measure of

cultural sensitivity obtained via Cushner’s (1986) Inventory of Cross-Cultural Sensitivity

(ICCS). As described previously, participants were asked to rate a series of statements on

a scale of 1-7 to indicate their level of agreement or disagreement. The responses were

then organized into several different subscales (e.g., Cultural Integration, Behavioral,

Intellectual, Attitude, and Empathy), and the sum of the subscales resulted in the total

cultural sensitivity score. Given that the sample size in this investigation is small, only

the total ICCS score was submitted to the statistical analyses, and the not the individual

140

subscale totals. As with grammar proficiency and Spanish use, the ICCS total was

included in the statistical model as a continuous covariate.

3.4.3 Statistical analyses

The purpose of this section of the chapter is to discuss the dependent and

independent variables, the way the data were coded, and describe the statistical analyses

that were conducted with the normalized and non-normalized data.

3.4.3.1 Variables and coding

Prior to describing the variables and the coding procedure, it is important to first

briefly summarize the research questions investigated in this dissertation. A summarized

version of each of the questions is presented below:

1) What is the basic distribution and organization of HS Spanish vowels within the acoustic space?

2) What is the role of lexical stress (i.e., tonic vs. atonic) in determining vowel height (F1), vowel backness (F2), vowel dispersion (Euclidean distance), and vowel duration?

3) How does syllable type (i.e., closed vs. open) influence vowel height (F1), vowel backness (F2), vowel dispersion (Euclidean distance), and vowel duration?

4) What is the connection between speech style (i.e., spontaneous, semi-spontaneous, and controlled) and vowel height (F1), vowel backness (F2), vowel dispersion (Euclidean distance), and vowel duration?

5) What is the relationship between a participant’s course level, grammar proficiency, travel frequency, Spanish use, and cultural sensitivity and vowel height (F1), vowel backness (F2), vowel dispersion (Euclidean distance), and vowel duration?

Based on the research questions presented above, the dependent variables in the

statistical analyses were F1, F2, Euclidean distance, and duration. The independent

141

variables were subdivided into two groups: within-subjects variables and between-

subjects variables. The within-subjects variables included vowel, lexical stress, syllable

type, and speech style. The between-subjects variables included course level, grammar

proficiency, travel frequency, Spanish use, and cultural sensitivity.63 Whereas all of the

within-subjects variables were categorical, meaning vowels were either tonic or atonic, in

closed or open syllables, or produced in one of the three speech styles, the between-

subjects variables were both categorical and continuous. The categorical between-

subjects variables included course level (intermediate or advanced) and travel frequency

(never, infrequent, or frequent). Grammar proficiency, Spanish use, and cultural

sensitivity were continuous covariates, meaning that the scores varied along a continuum

and were not grouped together in any specific way.

Each individual vowel was coded based on the lexical stress category in which it

occurred, the syllable type in which it occurred, and the speech style in which it was

produced. Furthermore, the vowels were coded by each of the individual variables

relevant to the particular speaker who produced the vowel. The coded vowel tokens were

then submitted to a series of Mixed Linear Models with a random effect of speaker,

which is explained in greater detail in the following section.

3.4.3.2 Statistical analyses and final model

Determining which statistical analyses, and which variables and interactions to

include in the analyses, was not a straightforward process. Several different types of

statistical analyses were carried out on the data prior to deciding which statistical test 63 Within-subjects variables are factors in which the subject is tested on each level of the variable. Lexical stress, for example, has two levels, and each subject is tested on each level of the variable. That is, each speaker produced both tonic and atonic vowels. Between-subjects variables are independent variables in which a different group of subjects is tested on each level of the variable. Gender is an example of a between-subjects factor because the participants could only belong to one group: either male or female.

142

should be conducted and which interactions should be included. Three different rounds

of statistics were conducted on the data: separate 4-Way ANOVA’s for each individual

speaker, a Mixed Linear model with the random effect of speaker on the formant values

that had been normalized using the Lobanov (1971) method of normalization, and a

Mixed Linear Model with the random effect of speaker on the non-normalized formant

values.64 The final option (i.e., Mixed Linear Model with random effect of speaker on

non-normalized data) was selected as the statistical method for this data

The Mixed Linear Model was selected as the preferred statistical method for three

main reasons. First of all, this model permitted the inclusion of all variables (i.e., within-

and between-subjects variables) in a single analysis, therefore providing the ability to

make greater generalizations across the population. Second of all, the Mixed model

allowed for the analysis of both fixed effects and random effects. Including the random

effect of “speaker” in the model accounted for individual variation in vowel production in

much the same way that a traditional normalization procedure would. For this reason, the

64 Prior to deciding upon the Mixed Linear Model, separate four-way ANOVA’s, one for each of the dependent variables (F1, F2, duration, and Euclidean distance) were calculated for each individual speaker. A full-factorial design was conducted, thus including interactions between all of the independent variables. The results of each of the individual results were then compared to one another to gain an overall picture of which main effects and interactions consistently occurred across the group of speakers. This method of analysis, however, would not have allowed for a general picture of HS vowel pronunciation. Comparing the analyses conducted on completely different sets of data without taking into consideration the range of variation exhibited by each speaker in terms of formant values or the potential differences in speaking rate is not a meaningful comparison and does not allow for generalization. In addition, a simple ANOVA model would not have allowed for an analysis of the effects of individual variables on vowel pronunciation because each speaker could only belong to one group or one level. The inability to include between-subjects variables in the ANOVA thus resulted in it being ill-suited for this investigation. The full-factorial model also proved to be problematic, as some of the interactions were not meaningful and did not address the research questions. The interaction between stress and syllable type on the F2 dimension, for example, would not have provided a meaningful result. An interaction of this type would only reveal that the F2 values were higher or lower in certain syllable types based on lexical stress, but as front and back vowels are characterized by very different formant frequencies, the outcome would have been misleading or uninterpretable. An overall effect of stress according to syllable type without taking into consideration which vowel or vowels were being affected would not have offered insight into the effect of this variable on the HS vowel system. For these reasons, the use of ANOVA’s was not selected as the primary statistical method in this investigation.

143

raw formant values were entered into the model and statistically analyzed, and not those

that were first normalized with the Lobanov (1971) method.65 Third of all, this model

allowed for a custom design, so that only meaningful main effects and interactions could

be included.

Mixed Linear Models with a random effect of speaker were conducted separately

on non-normalized F1, F2, Euclidean distance, and duration and were custom designed so

that they only included the independent variables and covariates that were needed to

address the research questions. A full-factorial model was not used, as this would have

included too many factors and would have diminished statistical power. All main effects

were included in the final model for both the within-subjects (i.e., vowel, stress, syllable

type, and task) and between-subjects variables (i.e., group, gender,66 grammar score,

travel abroad, Spanish use, and cultural sensitivity). All simple interactions between

vowel and the other within-subjects variables, as well as three-way interactions between

vowel, task, and stress and vowel, task, and syllable type were included in the statistical

65 The decision to report the analyses conducted with the non-normalized data was motivated by several reasons. First and foremost, as mentioned in text, the random effects component helped account for individual speaker variation. With random effects included, the model created a separate intercept for each individual speaker, thus recognizing that each participant was a different speaker and should be treated differently. The formant values were therefore adjusted statistically to reflect the variation present in the data. A second reason for maintaining the formant values in their original form and not normalizing is that no direct comparisons between speakers or between genders were made in the present study. The distribution of male and female participants was too unequal for a meaningful comparison. As one of the main purposes of normalization is to account for variation in formant frequencies due to physiology, and the comparison between genders was not made, the need to normalize was not as extreme as it would have been if the vowel spaces of male and female HS were being compared. One of the primary differences between the random effect of speaker and a traditional normalization procedure is that the speaker effect did not scale the formant values. The lack of scaling, however, was not problematic, especially because the NORM normalization suite cautions against scaling data, arguing that it may not be accurate. The disadvantage of the random effect is that it does not control for between-subjects variation as well as a formal normalization procedure would. Thus, as expected, a gender effect was observed in the analysis of the non-normalized data that is partially removed in the analysis of the Lobanov (1971) normalized data. A comparison between the statistical results calculated with the normalized and non-normalized data are presented in Appendix L. As illustrated in the table of results presented in Appendix L, there were very few differences in the results conducted with the normalized and non-normalized data. 66 Gender was included in the model to remove some of the variance and to test the effectiveness of the normalization procedure.

144

model. Two-way interactions between vowel and each of the between-subjects variables

were also included (i.e., vowel and travel, vowel and Spanish use, etc.) in the statistical

analysis. No additional interactions were included in the final model in order to prevent

the model from over-fitting the data. The final statistical model included the main effects

and interactions presented in Table 3-1 below, with the “X” denoting an interaction.

Table 3-1

Variables included in Mixed Linear model Main effects Interactions Within-subjects Between-subjects Within-subjects Between-subjects

Vowel Course level Vowel X stress Vowel X course level

Stress Grammar score Vowel X syllable type Vowel X grammar score

Syllable type Travel abroad Vowel X task Vowel X travel abroad

Task Spanish use Vowel X task X stress Vowel X Spanish use

C2 manner Cultural sensitivity Vowel X task X syllable type

Vowel X cultural sensitivity

C2 voice Gender Note. C2 manner and C2 voice were only included in a second analysis conducted with duration. “C2 manner” refers to the manner of articulation of the following consonant, and “C2 voice” refers to the voicing characteristics of the following consonant.

Chapter four reports the statistical results of the analyses conducted on each of the

dependent variables.

145

4 RESULTS

This chapter presents the results of the statistical analyses described in chapter three.

Section 4.1 begins with summary information (i.e. mean, standard deviation) for each of

the dependent variables, and subsequently reports the effects of the within-subjects

variables (i.e., lexical stress, syllable type, and speech style) on the dependent variables.

The statistical results of the between-subjects variables (i.e., course level, frequency of

travel, grammar proficiency, Spanish use, and cultural sensitivity) are described in

section 4.2. An overall summary of results is presented in section 4.3. The total number

of vowel tokens analyzed in this dissertation and their distribution in each of the within-

subjects variable categories are presented in Appendix G. All of the statistical outputs

are included in Appendices H-K.

4.1 Within-subjects variables

Prior to describing the statistical results, the mean formant values, Euclidean

distances, and duration of vowels produced by the male and female participants are

presented in Tables 4-1 and 4-2. As the values are raw and not normalized or adjusted by

the statistical model, male and female data are presented separately. The tables include

the number of cases, mean, and standard deviation of F1, F2, Euclidean distance, and

duration of each vowel averaged across all male and female speakers, respectively.

146

Table 4-1

Basic characteristics of vowels produced by three male heritage speakers

F1 F2

Euclidean distance Duration

Vowel N Mean S.D. Mean S.D. Mean S.D. Mean S.D.

/a/ 172 652.67 73.62 1449.18 135.87 203.70 100.21 77.87 23.06

/e/ 154 474.32 47.57 1777.03 183.90 261.31 149.07 64.75 20.73

/i/ 120 350.42 41.59 2109.45 186.07 578.82 193.17 59.60 17.58

/o/ 111 506.71 42.82 1085.08 110.65 452.94 109.98 68.79 23.73

/u/ 82 398.18 37.38 1099.27 199.92 452.30 180.22 69.02 23.50

Note. The values are raw averages (Mean) and standard deviations (S.D.) that have not been adjusted by the statistical model. Formant values and the Euclidean distance are measured in Hertz. Duration is measured in milliseconds.

Table 4-2

Basic characteristics of vowels produced by thirteen female heritage speakers

F1 F2

Euclidean distance Duration

Vowel N Mean S.D. Mean S.D. Mean S.D. Mean S.D.

/a/ 765 778.50 103.94 1580.93 164.26 322.29 106.00 88.96 29.83

/e/ 642 548.98 72.07 1958.21 241.55 272.43 169.27 74.54 28.37

/i/ 484 383.86 53.38 2500.04 277.18 772.04 242.91 70.37 23.67

/o/ 444 573.29 71.81 1139.53 176.52 615.64 154.66 78.21 27.85

/u/ 368 429.31 52.25 1180.04 256.39 594.40 221.99 73.96 29.18

Note. The values are raw averages (Mean) and standard deviations (S.D.) that have not been adjusted by the statistical model. Formant values and the Euclidean distance are measured in Hertz. Duration is measured in milliseconds.

147

The remainder of this section reports the statistical results of the within-subjects

variables that were included in the Mixed Linear Models.67 The results will be further

divided by the main effects and simple interactions (section 4.1.1), and complex

interactions (4.1.2). Within each sub-section, the effect of the independent variables on

vowel quality (F1 and F2), Euclidean distance, and duration will be discussed in that

order.

4.1.1 Main effects and simple interactions

4.1.1.1 Vowel The independent variable of vowel was included in order to 1) examine how HS

vowels are organized by height and backness, 2) assess if they exhibit durational

differences, and finally 3) address how they are organized or dispersed within the

acoustic domain relative to the center of the vowel space. The effect of vowel was found

to have a significant effect on all four dependent variables, each of which will be

discussed in greater detail below. Only the main effect of vowel will be discussed in this

section, meaning that the results are collapsed across all stress, syllable, and task types.

The significant interactions involving vowel will be discussed in greater detail later on in

this chapter.

A total of 3,342 vowels were analyzed via the Mixed Linear Model. The

statistical analyses indicated that there was a significant main effect of vowel on both F1

and F2, indicating that the five Spanish vowels differed from one another in terms of

height and backness. As will be discussed later on in this section, the Euclidean distance

67 As a reminder, the raw F1 and F2 values in Hertz were calculated for each speaker, the Euclidean distance of the vowels from the centroid of each speaker’s vowel space was calculated based on the F1 and F2 of the point vowels, and the total duration in milliseconds was measured for each vowel. The results presented in this chapter are based on the estimated marginal means calculated by the Mixed Linear Models and include all speakers unless otherwise indicated.

148

measures also differed for each vowel. The F1, F2, and Euclidean distance measures of

each of the five Spanish vowels are presented in Table 4-3. The table presents the

estimated marginal means calculated by the statistical model, and not the raw averages

produced by all speakers.68

Table 4-3

F1, F2, and Euclidean distance measures of HS vowels

Vowel F1 (Hz) F2 (Hz) Euclidean Distance (Hz)

/i/ 687.81 2207.13 573.50 /e/ 499.14 1893.06 276.73 /a/ 687.81 1529.55 239.50 /o/ 537.44 1166.03 489.56 /u/ 420.64 1208.83 462.14

The overall distribution of the vowels is shown in Figure 4-1. A more in-depth

description of the differences between the vowels is provided below the figure.

68 Unless otherwise indicated, all tables and figures presented in this chapter present the estimated marginal means determined by the statistical model and not the raw averages. Standard deviations are not included because these are estimated marginal means and not the raw averages. Standard deviation is used to describe the variance within the population, and would need to be based off of the raw averages.

149

Figure 4-1. Overall acoustic distribution of HS vowels averaged across all speakers, stress contexts, syllable types, and tasks (N = 3,3,42).

The Mixed Linear Model with F1 as the dependent variable indicated a significant

main effect of vowel (F(4, 3254.08) = 16.56, p < .001), confirming that the HS vowels

were distinguished from one another in terms of vowel height. A closer examination of

this effect via pairwise comparisons indicated that the HS vowel system exhibits five

statistical degrees of height. The low vowel /a/ exhibited the highest F1 and was

articulated significantly lower in the vowel space than all other vowels (F1 = 687.81 Hz,

p < .001). The mid vowel /o/ was significantly higher than /a/ and significantly lower

than all other vowels with a value of 537.44 Hz (p < .001). The front mid vowel /e/ was

significantly higher than /o/ and /a/ and significantly lower than /i/ and /u/, with an

average F1 value of 499.14 Hz (p < .001). The high vowels /i/ and /u/ were significantly

higher than all other vowels, but differed significantly from each other in terms of their

relative height. The back vowel /u/ was produced with the second lowest F1 value

(420.64 Hz) and the front vowel /i/ with the lowest F1 value (368.35 Hz, p < .001).

/i/

/e/

/a/

/o/

/u/

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Overall vowel space

150

Thus, unlike traditional descriptions of the Spanish vowel system which argue for three

degrees of height, the HS vowel system exhibits five degrees. These differences are

represented in Figure 4-1.

The main effect of vowel on F2 was also significant (F(4, 3254.13) = 19.22, p <

.001), indicating the HS vowels were also distinct with respect to backness. Pairwise

comparisons of this main effect revealed four distinct degrees of backness. The front

vowels /i/ and /e/ were distinct from one another, with /i/ exhibiting the highest F2

frequency (2207.13 Hz) followed by /e/ (1893.06 Hz). The front vowels differed

significantly from each other and from all other vowels in terms of backness (p < .001).

The central vowel /a/ was distinct from all other vowels with an average F2 value of

1529.55 Hz (p < .001). Finally, the back vowels /o/ and /u/ were articulated significantly

farther back in the vowel space than all other vowels (p < .001) but did not differ

significantly from each other (p = .204). The /o/ was produced with an average value of

1166.03 Hz, and the /u/ with an average value of 1208.83 Hz. That the /u/ was produced

farther front in the vowel space than the /o/ conflicts with what is typically described for

the Spanish vowel system (see sections 2.2.2 and 2.2.3 concerning Spanish vowel

systems). Generally /u/, and not /o/ is argued to be articulated farthest back in the vowel

space. As these vowels did not differ statistically from one another, however, indicates

that for this group of HS, the back vowels occupy roughly the same region within the

acoustic space. The front vowels, on the other hand, differed significantly from one

another with respect to backness. Thus, a distinction in vowel backness was observed in

the higher frequency ranges for the front vowels, but not for the back vowels. The

differences in vowel backness can also be observed in Figure 4-1. Two characteristics

151

that stand out in this figure that differ from traditional characterizations of Spanish vowel

systems are the backing of /e/ and its overall distance from the high vowel /i/, as well as

the fronting of /u/.

The analysis conducted on the Euclidean distance measure indicated a significant

main effect of vowel (F(4, 3254.14) = 5.06, p < .001), revealing that HS vowels were

distributed differently with respect to the centroid (see Table 4-3 for values, and section

3.4.1.5 for the Euclidean distance formula). The overall vowel space, centroid, and

Euclidean distance of each vowel are represented in Figure 4-2. The values presented

here are based on the estimated marginal means tabulated for all subjects combined.69

Figure 4-2. Overall dispersion of HS vowels averaged across speakers, stress, syllable, and task (N=3,342).

69 Figure 4-2 represents the Euclidean distance calculated based on the estimated marginal means across all speakers. It is important to note that the statistical analysis conducted on the Euclidean distance was calculated based on each speaker’s Euclidean distance measure. That is, a centroid and Euclidean distance was calculated for each participant separately, and these measures were submitted to the statistical test. What is presented in the figure is a means of illustrating the overall trend observed by all speakers separately, and was the most cohesive way of representing this measure as opposed to including 16 separate vowel charts.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Overall vowel space with centroid

152

A closer examination of the Euclidean distance measures calculated for all vowels

via pairwise comparisons indicated that all vowels differed with respect to their proximity

to the centroid except for /u/ and /o/. The vowel situated closest to the centroid was /a/

(239.50 Hz), and this difference was significant from /e/ at the level of p = .013 and from

all other vowels at the level of p < .001. The mid vowel /e/ had the second smallest

Euclidean distance (276.73 Hz), differing significantly from /a/ at p = .013 and from all

other vowels at p < .001. The Euclidean distance of the back vowels /u/ and /o/ were

462.14 Hz and 489.56 Hz, respectively. As mentioned previously, these distances were

not significantly different from one another (p = .661), but were significantly different

from all other vowels (p < .001). Finally, the front vowel /i/ exhibited the greatest

Euclidean distance, indicating that it was articulated farthest from the centroid, and

farther than any other vowel (573.50 Hz, p < .001). The results of this analysis thus

demonstrate that HS vowels are not distributed evenly throughout the acoustic space with

respect to the centroid. This finding conflicts with arguments that the Spanish vowel

system is symmetrical in nature.

An examination of the overall vowel space and centroid indicate that /a/ was

produced closest to the center of the vowel space than any other vowel. The figure

representing this distribution, Figure 4-2, however, is somewhat misleading given the

scale on which F1 and F2 are represented. Based solely on the graphical representation

of the vowels of the centroid, the mid vowel /e/ appears to be the closest. If the F1 and

F2 scales are made identical, however, the proximity of /a/ to the center of the vowel

space becomes more apparent. Figure 4-3 provides a more accurate representation of the

proximity of the vowels to the center of the space.

153

Figure 4-3. Overall dispersion of HS vowels with adjusted scale.

Finally, the statistical analysis of duration revealed a significant main effect of

vowel, indicating that the vowels differed from each other with respect to duration across

differences due to task, stress type, or syllable type in which they were produced (F(4,

3254.16) = 3.80, p = .004). The average duration in milliseconds of all vowels are

presented numerically in Table 4-4.

Table 4-4

Overall HS vowel duration

Vowel Duration Standard Error

/i/ 61.67 3.77 /e/ 68.80 3.71 /a/ 79.61 3.66 /o/ 70.05 3.73 /u/ 65.83 3.93

A graphical representation of vowel duration is provided in Figure 4-4. The scale

of the graph is in milliseconds, and each bar represents the duration of one of the five

30050070090011001300150017001900210023002500

30050070090011001300150017001900210023002500

F1 H

z

F2 Hz

Overall vowel space with centroid

154

Spanish vowels averaged across all speakers, stress contexts, syllable types, and tasks.

Standard error bars are also included in the figure.70

Figure 4-4. Overall vowel duration for all speakers averaged across stress, syllable type, and task. (N = 3,342). The low vowel /a/ was the longest vowel with an average duration of 80

milliseconds, and differed significantly in length from all other vowels (p < .001). The

mid vowel /o/ exhibited the second longest duration (79 milliseconds) and differed

significantly from /i/ (p = .001) and / a/ (p < .001). The front mid vowel /e/ was produced

with an average duration of 69 milliseconds, and was significantly longer than /i/ (p =

.005) and shorter than /a/ (p < .001). The high vowels /i/ and /u/ exhibited the shortest

duration and were significantly shorter than all vowels but did not differ from each other.

The front vowel /i/ was significantly shorter than /a/ (p < .001), from /e/ (p = .001), and

70 Standard error bars are included on this graph, as opposed to standard deviations, because the values are the estimated marginal means based on the model’s predictions. The standard error is used to show the variation within the sample, whereas the standard deviation is used to represent the variation within the population. One of the assumptions of this model, however, is equal variance. For this reason, the standard error bars are approximately the same size. The error bars will not be included on all of the charts for vowel duration that represent estimated marginal means.

0 10 20 30 40 50 60 70 80 90

/i/

/e/

/a/

/o/

/u/

Duration in ms

Overall vowel duration

155

from /o/ (p = .005). The back vowel /u/ only differed significantly in length from /a/ (p <

.001). The results of this analysis indicate that HS Spanish vowels exhibit a degree of

intrinsic vowel duration similar to that described by Marín Gálvez (1995) for

monolingual Spanish. Generally speaking, vowel duration decreased with vowel height,

such that /a/ had the longest duration, the mid vowels /e/ and /o/ had intermediate

duration, and the high vowels had the shortest duration.

Another way of conceptualizing the intrinsic vowel duration is presented in

Figure 4-5. This figure represents the organization of the HS vowels within the acoustic

space. The size of each circle represents the vowel duration, with the largest circles

indicating longer duration and the smaller circles denoting shorter vowels.

Figure 4-5. Vowel duration based on acoustic distribution for all speakers averaged across stress, syllable type, and task.

To summarize briefly the main findings of this section, an examination of the

effect of vowel on F1, F2, Euclidean distance, and duration provide an overall view of the

organization of the HS Spanish vowel system. The analyses of F1 and F2 also revealed

/i/

/e/

/a/

/o/

/u/

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Vowel duration by acoustic distribution

156

subtle differences in vowel height and backness (i.e., a five-level height distinction and a

four-level backness distinction) that are typically not described in impressionistic

accounts of Spanish vowel systems.71 The analysis of Euclidean distance showed that

not all vowels are organized symmetrically or evenly throughout the acoustic space,

questioning the assumption that Spanish possess a simple, symmetrical vowel system.

Finally, the results of duration are in line with previous accounts that vowel duration

decreases as vowel height increases (Marín Gálvez, 1995). Although /u/, /e/, and /o/ did

not differ significantly from one another with respect to duration, a basic pattern

emerged: /a/ > /e/ and /o/ > /i/ and /u/.

4.1.1.2 Stress

This section examines the effect of lexical stress on vowel quality, quantity, and

dispersion. The results of the statistical analysis revealed several important main effects

and interactions that will be explained further throughout this section. As with the

previous section, the results presented here reflect the stress effects observed for all

vowel categories collapsed over syllable type and task. An interaction involving vowel

category and stress indicates that a stress effect depended upon which vowel was being

produced. Both the main effects and simple interactions will be discussed below.

The F1, F2, and Euclidean distance measures of the atonic and tonic HS vowels

are presented in Table 4-5.

71 O’Rourke (2010) also noted a five-level distinction in both height and backness in Peruvian Spanish.

157

Table 4-5

F1, F2, and Euclidean distance measures of HS vowels in each stress context

Vowel Stress F1 (Hz) F2 (Hz) Euclidean distance (Hz)

/i/ atonic 376.32* 2112.42* 483.60*

tonic 360.39* 2301.84* 663.41* /e/ atonic 465.18* 1872.56* 276.89

tonic 533.10* 1913.57* 276.58 /a/ atonic 647.83* 1524.60 214.47*

tonic 727.80* 1534.52 264.55* /o/ atonic 517.85* 1211.00* 440.54*

tonic 557.04* 1121.06* 538.59* /u/ atonic 420.42 1255.54* 421.66*

tonic 420.88 1162.13* 502.63* Note. The “*” denotes a statistically significant difference between tonic and atonic measures.

A graphical representation of the distribution of HS vowels based on stress type is

presented in Figure 4-6. The dashed lines represent the acoustic space occupied by the

atonic vowels, and the solid line represents the tonic vowel space. A detailed description

of the significant main effects and interactions is presented below the figure.

158

Figure 4-6. Tonic and atonic vowel productions for all speakers averaged across syllable type and task.

The Mixed Linear Model with F1 as the dependent variable revealed a significant

main effect of stress (F(1, 3254.20) = 204.223, p < .001) and a significant vowel by stress

interaction for F1 (F(4, 3254.16) = 62.18, p < .001). The main effect of stress on F1

indicated that overall, vowels in atonic syllables had lower F1 values (485.52 Hz), or

were produced higher in the vowel space, than tonic vowels (519.84 Hz). The vowel by

stress interaction indicated that the difference in vowel height was significant for all

vowels except for /u/. A closer examination of vowel by stress interaction via pairwise

comparisons indicated that atonic /a/, /e/, and /o/ were produced with significantly lower

F1 values than when they occurred in tonic syllables (all comparisons p < .001). Thus,

the atonic productions of the non-high vowels were raised when atonic as compared to

tonic. The opposite pattern was observed for the high vowel /i/, which exhibited a

significantly higher F1 value (i.e., produced lower in the vowel space) when it occurred

in an atonic syllable (p = .004).

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Tonic and atonic vowel productions

tonic atonic

159

Although there was no significant main effect of stress along the F2 dimension

collapsing across all vowels, the vowel by stress interaction was significant for F2 (F(4,

3254.24) = 46.59, p < .001). This interaction indicated that the effect of stress along the

front-back continuum depended upon the vowel. The pairwise comparisons indicated

that all vowels except for /a/ differed with respect to backness based on their stress type.

The front vowels /e/ and /i/ exhibited significantly lower F2 values in atonic productions

as compared to tonic productions, indicating that unstressed vowels had a more

centralized articulation than tonic vowels. Atonic /e/ was produced significantly farther

back in the vowel space when compared to tonic /e/ (atonic = 1872.55 Hz, tonic =

1913.56 Hz, p = .011), and the same pattern was observed for /i/ (atonic = 2112.42 Hz,

tonic = 2301.84 Hz, p < .001). The back vowels were also produced with a more

centralized articulation when compared to their tonic counterparts. Atonic /u/ and /o/

were produced with significantly higher F2 values than tonic /u/ and /o/ (for /u/, atonic =

1255.54 Hz, tonic = 1162.13 Hz, p < .001; for /o/, atonic = 1211.00 Hz, tonic = 1121.06

Hz, p < .001). The differences between atonic and tonic vowel articulations with respect

to backness are also represented in Figure 4-6. As can be observed, the atonic vowel

space, represented by the dashed line, falls within the tonic vowel space. Thus, all atonic

vowels were centralized in comparison to their tonic counterparts.

The dispersion of the vowels throughout the vowel space, as measured by the

Euclidean distance of each vowel from the centroid, was also affected by lexical stress in

much the same way as F1 and F2 (see Table 4-5 for actual values). The Euclidean

distance measure is another way of representing the combined influence of F1 and F2 on

the positioning of vowels within the acoustic space. An illustration of the distance of

160

each vowel from the centroid is represented in Figure 4-7, which includes the tonic and

atonic vowel spaces as well as the centroid measure.

Figure 4-7. Tonic and atonic vowel dispersion for all speakers averaged across syllable type and task.

The statistical analysis with Euclidean distance as the dependent variable revealed

a significant main effect of stress (F(1, 3254.39) = 190.06, p < .001) and a significant

vowel by stress interaction (F(4, 3254.30) = 25.01, p < .001). The main effect of stress

indicated that overall, atonic vowels had a shorter Euclidean distance (i.e., were closer to

the centroid) than were the tonic vowels (atonic= 367.43 Hz, tonic = 449.15 Hz). The

significant vowel by stress interaction showed that the proximity of atonic and tonic

vowels to the centroid varied for each vowel. With the exception of the mid vowel /e/, all

other vowels exhibited significantly shorter Euclidean distance measures when they were

produced in atonic syllables as compared to tonic syllables (for all vowels, p < .001).

The difference for /e/, however, was not significant, indicating that stress did not have an

effect on the location of this vowel with respect to the centroid (p = .981). These results

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Tonic and atonic dispersion

tonic atonic centroid

161

are similar to those reported for F1 and F2, in that the atonic vowels were found to be

articulated closer to the center of the vowel space than the tonic vowels.

Moving on to vowel duration, the statistical analyses revealed that lexical stress

had a significant influence on vowel duration. The duration in milliseconds of the atonic

and tonic vowels produced by all HS averaged across tasks and syllable types are

presented in Table 4-6.

Table 4-6

Atonic and tonic vowel duration Vowel Atonic Tonic

/i/ 53.78 69.57 /e/ 53.85 83.75 /a/ 65.15 94.07 /o/ 56.38 83.73 /u/ 52.56 79.10

Note. The atonic vowels were always significantly shorter than their tonic counterparts at the level of p < .001.

The differences between atonic and tonic vowel duration are also represented

graphically in Figure 4-8. The gray bars represent atonic vowel duration, and the black

bars represent tonic duration.

162

Figure 4-8. Tonic and atonic vowel duration for all speakers averaged across syllable type and task.

Lexical stress had a significant effect on vowel duration as evidenced by a

significant main effect of stress (F(1, 3254. 41) = 749.66, p < .001) and a significant

vowel by stress interaction (F(4, 3254.32) = 7.25, p < .001). The main effect of stress

indicated that across all vowels, task types, and syllable types, the tonic vowels were

significantly longer than the atonic vowels (tonic vowel duration = 82 milliseconds,

atonic vowel duration = 56 milliseconds). The vowel by stress interaction was also

significant, indicating that the effect of stress depended upon the vowel (see Figure 4-8).

The pairwise comparisons of the interaction indicated that for all vowels, the duration of

tonic vowels was significantly longer than the atonic vowels (for all vowels, p < .001).

To summarize the overall results of this section, the statistical analyses of F1, F2,

duration, and Euclidean distance indicated that lexical stress has a considerable effect on

HS vowel production. In terms of vowel quality, the non-high vowels were found to

have lower F1 values (i.e., were situated higher in the space). With respect to F2, the

0 10 20 30 40 50 60 70 80 90 100

/i/

/e/

/a/

/o/

/u/

Duration in ms

Tonic and atonic vowel duration

atonic

tonic

163

atonic front vowels exhibited lower F2 values and the atonic back vowels higher F2

values than their tonic counterparts. This pattern resulted in a movement of unstressed

vowels towards the center of the space along the front-back dimension. This tendency to

centralize was reflected further based on the Euclidean distance measure. With the

exception of /e/, all atonic vowels exhibited significantly shorter distance measures than

the tonic vowels. Finally, tonic vowels were significantly longer in duration than atonic

vowels, and this distinction was statistically significant for all five of the vowel pairs.

Taken together, the results indicate that the HS vowel system shows reduction in terms of

quality and quantity. The presence of large stress differences conflicts with traditional

arguments that Spanish vowel quality and duration are fairly similar in tonic and atonic

syllables.

4.1.1.3 Syllable type

An additional component of the statistical model was to examine the effect of

syllable type on vowel height, backness, duration, and dispersion. The statistical analyses

revealed that there was no significant main effect of syllable type or simple interaction

between vowel and syllable type on F1. The lack of an overall main effect or simple

interaction indicated that vowels did not differ in vowel height based on the syllable type

in which they were produced. There was a significant three-way interaction involving

F1, however, which will be discussed in section 4.1.2.2.

The F1, F2, and Euclidean distances of the vowels produced in open and closed

syllables averaged across speakers, stress types, and tasks are presented in Table 4-7.

164

Table 4-7

F1, F2, and Euclidean distance measures of HS vowels produced in open and closed syllables

Vowel Syllable type F1 (Hz) F2 (Hz) Euclidean

distance (Hz) /i/ closed 369.90 2189.15* 553.31*

open 366.81 2225.11* 593.70* /e/ closed 497.37 1878.47 259.58*

open 500.92 1907.65 293.89* /a/ closed 690.76 1549.88* 227.85*

open 684.87 1509.24* 251.17* /o/ closed 537.97 1181.24 474.71*

open 536.93 1150.82 504.42* /u/ closed 426.45 1235.70* 432.10*

open 414.85 1181.98* 492.19* Note. The “*” denotes a significant difference between vowels produced in open and closed syllables.

A graphical representation of the vowel pronunciation based on syllable type is

shown in Figure 4-9. The solid line represents the vowel space for vowels produced in

closed syllables, and the dashed line represents the open syllable vowel space. It is

important to note that although the lines appear to be generally overlapping, the statistical

analysis examines the amount of variation within each of these categories. Even though

many of the category averages appear to be situated in similar locations, the statistical

tests indicate that the closed and open vowel categories show greater or lesser amounts of

variance. The statistical results will be described in greater detail below the figure.

165

Figure 4-9. Acoustic distribution of vowels produced by all speakers in open and closed syllables averaged across stress and task.

The effect of syllable type was present along the F2 dimension as evidenced by

the significant vowel by syllable type interaction (F(4, 3254.15) = 5.83, p < .001). The

significant interaction between vowel and syllable type indicates that although syllable

type did not have an overall effect on F2, it did have an effect on certain vowels.

Pairwise comparisons of this interaction indicated that the F2 of /a/ produced in closed

syllables (1549.87 Hz) was significantly higher than the F2 of vowels produced in open

syllables (1509.24 Hz, p = .003). Thus, the /a/ in closed syllables was significantly

farther front than the /a/ produced in open syllables. A similar result was found for the

back vowel /u/, which also exhibited higher F2 values in closed syllables (1235.70 Hz)

than in open syllables (1181.97 Hz, p = .019). The opposite pattern was observed for the

front vowel /i/. In closed syllables, /i/ was produced with a significantly lower F2 value

(2189.15) than in open syllables (2225.114 Hz, p = .036). Thus, the /i/ in closed syllables

was articulated further back in the vowel space than /i/ in open syllables. What is most

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Vowel production by syllable type

closed open

166

interesting about this result is that the mid vowels /e/ and /o/ did not exhibit any

significant differences with respect to syllable type, and these are the two vowels that are

traditionally argued to show allophonic variation based on syllable structure and the

nature of the following consonant (Navarro-Tomás, 1918).

Syllable type was also found to have a significant effect on the Euclidean distance

of vowels from the centroid. The Euclidean distance measures are presented above in

Table 4-7. Figure 4-10 represents the acoustic space occupied by vowels produced in

closed and open syllables, and includes the centroid as a point of reference.

Figure 4-10. Vowel dispersion based on syllable type.

The main effect of syllable type on Euclidean distance was significant (F(1,

3254.22) = 35.29, p < .001), indicating that across all vowels, stress, and task types, the

vowels produced in open syllables had greater Euclidean distances and were thus situated

farther from the centroid than vowels produced in closed syllables (closed syllables =

389.50, open syllables = 427.07 Hz). This difference was consistent for all vowels, and

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Vowel dispersion by syllable type

closed open centroid

167

this consistency explains why the vowel by syllable type interaction was not found to be

significant: The distance of vowels produced in open and closed syllables did not depend

on the vowel because vowels produced in closed syllables were always closer to the

centroid.72 To conceptualize this effect in a different way, the vowels produced in open

syllables were more greatly dispersed than those produced in closed syllables.

Finally, the analyses of duration revealed both a significant main effect of syllable

type (F(1, 3254.24) = 9.3, p = .002) and a significant vowel by syllable type interaction

(F(4, 3254.19) = 4.28, p = .002). The duration in milliseconds of vowels produced in

open and closed syllables are presented in Table 4-8.

Table 4-8

Vowel duration in closed and open syllables

Vowel Open Closed /i/ 56.90* 66.44* /e/ 65.90* 71.69* /a/ 79.64 79.57 /o/ 68.93 71.18 /u/ 66.95 64.71

Note. The “*” denotes a significant difference.

The durational differences are further represented in Figure 4-11. The gray bars

represent vowels produced in open syllables, and the black bars represent the duration of

vowels produced in closed syllables. A more detailed explanation of the statistical

analyses will be presented below the figure.

72 Even though the interaction was not significant, the pairwise comparisons indicated that the vowels produced in closed syllables were always significantly closer to the centroid than those produced in open syllables. The p-values are as follows: /a/-- p = .034; /e/ -- p = .008; /i/-- p = .004; /o/-- p = .026; /u/-- p = .001.

168

Figure 4-11. Vowel duration by syllable type. Values for all speakers averaged across stress and task.

The main effect revealed that averaged across all vowels, stress types, and tasks,

the vowels produced in closed syllables were significantly longer than those produced in

open syllables (closed syllables = 71 milliseconds, open syllables = 68 milliseconds).

The interaction between vowel and syllable type indicated that the effect of syllable type

on duration depended upon the vowel. The pairwise comparisons of this interaction

revealed that only the front vowels /e/ and /i/ exhibited significant differences in length

based on syllable type. The /e/ produced in closed syllables was significantly longer (72

milliseconds) than the /e/ produced in open syllables (66 milliseconds, p = .005). The

same pattern emerged for /i/, in which longer duration was observed in closed syllables

(66 milliseconds) than in open syllables (57 milliseconds, p < .001). No other vowels

exhibited significant differences with respect to syllable type.

The fact that vowels produced in closed syllables were longer than those in open

syllables led to the question of whether the following consonant could potentially

0 10 20 30 40 50 60 70 80 90

/i/

/e/

/a/

/o/

/u/

Duration in ms

Vowel duration by syllable type

open

closed

169

influence vowel length. In order to test for this effect, the Mixed Linear Model on

duration was re-run including two additional variables: the voicing characteristics of the

following consonant (voiced or voiceless) and the manner of articulation of the following

consonant (stop, fricative, approximant, affricate, lateral, rhotic, and nasal). The results

of this additional analysis revealed a significant main effect of voicing (F(1, 3219.53) =

21.792, p < .001) and a significant main effect of manner (F(6, 3219.40) = 16.26, p <

.001). Pairwise comparisons of the main effect of voicing indicated that overall, vowels

that were followed by voiced consonants were significantly longer (77.29 milliseconds)

than those that were followed by voiceless consonants (60.23 milliseconds). The main

effect of manner of the following consonant indicated that the length of the vowel varied

significantly based on the manner in which the following consonant was produced.

Figure 4-12 provides a general picture of how vowel duration varied according to the

following consonantal context.

Figure 4-12. Average vowel duration based on the manner of the following consonant.

0 10 20 30 40 50 60 70 80 90

fricative

affricate

occlusive

approximant

rhotic

lateral

nasal

Duration in milliseconds Vowel duration by following consonant

170

Vowels followed by fricatives were the longest (78.64 milliseconds) and those

preceding nasals were found to be the shortest (59.83 milliseconds).73 Because of the

unequal distribution of vowels in each of these categories, an in-depth discussion of

which consonants resulted in significantly longer vowel duration and pairwise

comparisons of this effect will not be discussed further.

The previous section discussed the effects of syllable type on HS vowel

production. There were no overall effects of syllable type on F1, indicating that syllable

structure did not have a significant effect on vowel height. An analysis of the effect of

syllable type on F2, however, did reveal several interesting effects. The significant

interaction between vowel and syllable type along the F2 dimension showed that vowels

in closed syllables had a slight tendency to be produced towards the center of the vowel

space. This difference was only found for the point vowels /i,a,u/, however, and not for

the mid vowels /e/ and /o/ which in the traditional Hispanic literature have typically been

argued to exhibit differences in F2 based on syllable type. The analysis of the Euclidean

distance measures revealed that vowels produced in closed syllables were closer to the

centroid than those produced in open syllables. Thus, the dispersion of vowels was

greater for vowels in open syllables and lesser for those in closed syllables. Syllable type

was also found to have an effect on vowel duration. Overall, vowels produced in closed

syllables were longer than those in open syllables, but this distinction was only

statistically significant for the front vowels /e/ and /i/.

73 Only one of the three tasks was somewhat balanced for preceding and following consonantal contexts due to the original goal of this research, and for this reason there were not an equal number of the five Spanish vowels in the context of each of the manners of articulation represented. In addition, many of the vowels in closed syllable contexts were in contact with /s/, which may explain why vowels produced in closed syllables were longer in duration than those produced open syllables, thus driving the vowel by syllable type interaction. These aspects of this investigation will be discussed in greater detail in chapter 5.

171

4.1.1.4 Task

The final within-subjects variable included in the analysis is task, or speech style.

As mentioned in chapter 3, the participants completed three different activities which

elicited different speech styles: a narrative retelling task which represents the

spontaneous speech, a picture information task which represents semi-spontaneous

speech, and a carrier phrase task which represents controlled speech. For ease of

reference throughout this section and the rest of the chapter, the narrative retelling task

(spontaneous) will be referred to as the “NRT,” the picture identification task (semi-

spontaneous speech) as the “PIT,” and the carrier phrase task (controlled speech) as the

“CPT.” Prior to discussing the statistical results, the formant values and Euclidean

distances of each of the HS vowels are presented in Table 4-9.

Table 4-9

Formant values and Euclidean distance measures of HS vowels produced in all three tasks

Vowel Task F1 (Hz) F2 (Hz) Euclidean distance (Hz)

NRT 396.01bc 2141.73bc 509.79bc /i/ PIT 367.07ac 2205.96ac 568.44ac

CPT 341.99ab 2273.71ab 642.29ab

NRT 527.01bc 1803.16c 213.73c /e/ PIT 489.49b 1830.22c 216.17c

CPT 480.93a 2045.80ab 400.30ab

NRT 663.11bc 1556.31c 207.43c /a/ PIT 679.00ac 1533.09 230.06c

CPT 721.33ab 1499.27a 281.04ab

NRT 550.19bc 1204.97c 452.28c /o/ PIT 528.75b 1173.00c 479.39c

CPT 533.41a 1120.12ab 537.02ab

NRT 446.75bc 1208.27 455.15 /u/ PIT 417.81ac 1250.79c 428.05c

CPT 397.38ab 1167.46b 503.24b Note. The “a” denotes a significant difference from the NRT, the “b” from the PIT, and the “c” from the CPT.

172

The F1 and F2 values for each vowel in each of the three tasks are represented

graphically in Figure 4-13. The solid line represents the vowel space in the NRT. The

dotted line represents the semi-spontaneous speech from the PIT. The dashed line

represents the controlled speech that was obtained in the CPT. As can be observed in the

figure, the controlled speech task occupies the greatest area of the acoustic space and

results in the most peripheral vowel productions. The NRT and PIT are more centralized

and pattern similarly to one another. The distribution of vowels based on speech style

and the statistical analyses will be described in greater detail below the figure.

Figure 4-13. Overall vowel space based on task (speech style). Values for all speakers averaged across stress and syllable type. NRT = narrative, PIT = picture identification, CPT = carrier phrase.

The Mixed Linear Model conducted with F1 as the dependent variable revealed a

significant main effect of task (F(2, 3254.38) = 25.24, p < .001) and a significant vowel

by task interaction (F(8, 3254.14) = 25.85, p < .001). The main effect of task indicated

that across all vowels, stress types, and syllable types, the vowels produced in the NRT

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz HS vowel organization by task

NRT PIT CPT

173

(spontaneous speech) exhibited higher F1 values than vowels produced in the other two

tasks. The average F1 value in the NRT was 516.61 Hz, while in the PIT it was 496.42

Hz, and in the CPT 495.00 Hz. Pairwise comparisons of this main effect indicated that

the F1 values in the NRT were significantly higher than those in the PIT and CPT (p <

.001), but the F1 values in the PIT and CPT did not differ significantly from one another

(p = 1.00).

The vowel by task interaction for F1 indicated that the height of some of the HS

vowels differed according to the speech style in which they were produced. The F1 value

of /a/ was significantly lower in the NRT (663.10 Hz) than in the PIT (679.00 Hz, p =

.002), and the F1 of /a/ in the PIT was significantly lower than in the CPT (721.33 Hz, p

< .001). The height differences observed between the NRT and CPT were also

significant (p < .001). Thus, the /a/ was produced higher in the vowel space as the speech

became more spontaneous. The opposite pattern was observed for the high vowels: the

tokens of /i/ and /u/ produced in the NRT exhibited the highest F1 values and in the CPT

the lowest F1 values. For /i/, the F1 of vowels produced in the NRT (396.01 Hz) was

significantly higher than the F1 of vowels in the PIT (367.06 Hz), which was

significantly higher than the F1 of /i/ produced in the CPT (341.99 Hz). The difference

between the NRT and the two other tasks was significant at the level of p < .001, and the

difference between the PIT and CPT was significant at the level of p = .002. The same

pattern was observed for /u/, in which the vowels in the NRT had the highest F1 values

(446.75 Hz), followed by the PIT (417.81 Hz), and finally the CPT (397.38 Hz). The

differences reported for /u/ were significant on all levels, such that the F1 values in the

NRT differed significantly from the PIT (p = .014) and the CPT (p < .001), and the

174

values in the PIT and CPT also differed from one another (p = .009). The high vowels

thus exhibited the opposite behavior of /a/: the F1 values of /i/ and /u/ increased as

speech became more spontaneous, resulting in their movement downward in the acoustic

space.

The effect of task on the mid vowels was slightly different in that neither /e/ nor

/o/ showed significant differences between the NRT and the CPT. For /e/, the vowels

produced in the NRT had significantly higher F1 values (527.00 Hz) and were thus

produced lower in the vowel space than those in the PIT (489.49 Hz) and the CPT

(480.92 Hz). The difference in the F1 values of /e/ between the NRT and PIT/CPT was

significant at the level of p < .001. A similar pattern was revealed for the mid vowel /o/.

The F1 of /o/ produced in the NRT (550.18 Hz) was significantly higher than that

observed in the PIT (528.74 Hz) and the CPT (533.40 Hz), but the latter two tasks did not

differ significantly from each other (NRT from PIT, p = .002; NRT from CPT, p = .021;

PIT from CPT, p = 1.00). As with the high vowels, as the spontaneity of the speech

increased, so did the F1 value of the vowel. Thus, the mid vowels were produced the

lowest in the vowel space in the narrative retelling task, and higher in the picture

information and carrier phrase tasks.

The statistical analysis with F2 as the dependent variable also revealed robust

effects of speech style (again, see Figure 4-13). The Mixed Linear model revealed a

significant main effect of task (F(2, 3254.55) = 6.37, p = .002) as well as a significant

vowel by task interaction (F(8, 3254.21) = 23.711, p < .001) for F2. An examination of

the main effect via pairwise comparisons revealed that overall, the vowels produced in

the CPT had the highest F2 values (1621.27 Hz). The F2 in the CPT was found to be

175

significantly higher (i.e., farther front) than the F2 values of vowels produced in the NRT

(1582.88 Hz, p = .001) and in the PIT (1598.61 Hz, p = .044). The overall differences in

backness between the NRT and PIT were not significantly different from one another (p

= .283).

The significant interaction between vowel and task on the F2 dimension indicated

that some vowels exhibited distinct degrees of backness in different tasks. A closer

examination of this interaction via pairwise comparisons revealed that generally

speaking, the F2 values produced in the CPT were the most peripheral in the vowel space

and tended to differ significantly from those produced in the NRT. For the low vowel /a/,

the F2 produced in the NRT was significantly higher (1546.31 Hz) than the F2 of vowels

produced in the CPT (1499.27 Hz, p = .009). In the most spontaneous speech task, then,

the /a/ was produced farther front than when it was produced in the most controlled

speech style. The front vowel /i/ exhibited the most extreme (i.e., farthest front) F2 value

in the CPT (2273.70 Hz). This F2 value differed significantly from the F2 of /i/ in the

PIT (2205.95 Hz, p = .006) and from the NRT (2141.73 Hz, p < .000). The F2 values of

/i/ in the NRT and PIT also differed significantly from one another at the level of p =

.006. The mid front vowel /e/ also exhibited the most extreme F2 values in the CPT

(2045.80 Hz), which differed significantly from the F2 values produced in the PIT

(1830.15 Hz, p < .001) and NRT (1803.15 Hz, p < .001). The F2 values of /e/ in the NRT

and PIT did not differ significantly from one another (p = .221). Thus, as the speech style

became more controlled, /e/ and /i/ were produced with an increasingly more fronted

articulation.

176

The back vowels showed the same pattern and tendency for more peripheral (i.e.,

lower) F2 values in the CPT, but in the opposite direction of the front vowels. For /o/,

the vowels produced in the CPT had the lowest F2 value (1120.12 Hz), and this was

significantly lower than the F2 in the PIT (1172.99 Hz, p = .019) and the NRT (1204.97

Hz, p < .001). There was no significant difference between the F2 values in the NRT and

PIT, however. Finally, /u/ only exhibited significant differences between the F2 values in

the PIT and CPT (PIT = 1250.79 Hz, CPT = 1167.46 Hz, p < .001). The /u/ produced in

the CPT had the lowest F2 value, thus resulting in this vowel being produced the furthest

back in the vowel space in the most controlled speech task. Based on these results, it can

be concluded that as the speech style became more controlled, the back vowels were

produced farther back in the vowel space.

An additional analysis of the effect of speech style on vowel production indicated

that the Euclidean distance of HS vowels from the center of the vowel space differs as a

consequence of speech style. Figure 4-14 illustrates the relative location of the five

Spanish vowels to the centroid, and how their positions varied in the different speech

styles. Refer to Table 4-9 for the actual Euclidean distance values.

177

Figure 4-14. Vowel dispersion based on task (speech style). Values for all speakers averaged across stress and syllable type.

The Mixed Linear Model conducted with Euclidean distance as the dependent

variable revealed a significant main effect of task (F(2, 3254.73) = 89.78, p < .001) and a

significant vowel by task interaction (F(8, 3254.26) = 6.16, p <. 001). The main effect of

task indicated that vowels were dispersed differently throughout the acoustic space

according to speech style. The pairwise comparisons of this main effect revealed that the

vowels produced in the CPT, the most controlled speech style, were significantly farther

from the centroid than the vowels produced in the other two tasks. The average

Euclidean distance of vowels produced in the CPT was 472.77 Hz, which was

significantly longer (p < .001) than those produced in the PIT (384.42 Hz) and the NRT

(367.67 Hz). Although the average Euclidean distance of vowels produced in the PIT

was greater than that of NRT, this difference was not statistically significant (p = .083).

A closer examination of the vowel by task interaction revealed that the same

pattern described previously for the main effect was observed for all other vowels except

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz HS vowel dispersion by task

NRT PIT CPT centroid

178

for /u/. For the low vowel /a/, the greatest distance from the centroid was observed in the

CPT (281.04 Hz), which was significantly greater than the distance in the PIT (230.05, p

= .002), and also significantly greater than the distance of vowels in NRT (207.42 Hz, p <

.001). The mid vowel /e/ was dispersed farthest from the centroid in the CPT (400.29

Hz), and this was significantly greater than the distance calculated in the PIT (216.17 Hz,

p < .001) and NRT (213.73 Hz, p < .001). The front vowel /i/ had the greatest degree of

dispersion overall, which was most apparent in the CPT (642.29 Hz). This distance was

significantly greater than those calculated in the NRT (509.78 Hz) and the PIT (568.43

Hz). Both comparisons with the CPT were significant at the level of p < .001. Finally,

/o/ exhibited the same decreasing pattern, with the longest Euclidean distance observed in

the CPT (537.01 Hz). Significantly closer to the centroid was the /o/ produced in the PIT

(479.39 Hz, p = .001), followed by the /o/ produced in the NRT (452.28 Hz, p < .001).

For the high back vowel /u/, the Euclidean distance of the vowels produced in the CPT

(503.23 Hz) was significantly greater than that of the PIT (428.04 Hz, p < .001), but not

when compared to the NRT (455.15 Hz, p = 2.10). The Euclidean distance of vowels

produced in the NRT and PIT were not significantly different for any of the five Spanish

vowels except for /i/, indicating that these two speech styles were characterized by

similar rates of dispersion from the centroid.

The duration of some HS vowels also differed statistically according to speech

style. The average vowel duration in milliseconds produced in each of the three speech

tasks are presented in Table 4-10.

179

Table 4-10

Vowel duration in each speech style

Vowel NRT PIT CPT

/i/ 60.73 61.43 62.86 /e/ 69.71 67.34 69.34 /a/ 72.44* 82.74 83.64 /o/ 60.12* 74.10 74.94 /u/ 63.01 68.07 66.41

Note. The “*” indicate that the duration of /a/ and /o/ in the NRT was significantly shorter than the duration in the PIT and CPT, but that the latter two did not differ significantly.

Figure 4-15 graphically represents the breakdown of vowel duration by each

speech style. The dark gray bars represent the NRT, the light gray bars the PIT, and the

black bars represent the CPT. As can be observed in the figure, the black bars have a

tendency to extend farthest to the right, indicating the longest duration, while the dark

gray bars are the shortest, indicating shorter duration. The light gray bars tend to have

intermediate values, although they are similar in length to the black bars. The statistical

analysis of duration will be discussed below the figure.

180

Figure 4-15. Vowel duration based on task (speech style). Values for all speakers averaged across stress and syllable type.

The statistical analysis of vowel duration revealed a significant main effect of task

(F(2, 3254.76) = 13.39, p < .001), as well as a significant vowel by task interaction (F(8,

3254.28) = 5.95, p < .001). The main effect of task indicated that collapsed across all

vowels, stress types, and syllable types, the vowels produced in the narrative retelling

task were significantly shorter than those produced in the other two tasks. The average

vowel duration in the NRT was 65.20 milliseconds. Pairwise comparisons of this main

effect indicated that the vowels produced in the NRT were significantly shorter than

vowels produced in the PIT (70.74 milliseconds) and the CPT (71.64 milliseconds).

These differences were significant at the level of p < .001. Although the vowels

produced in the CPT were the longest overall, they were not significantly longer than

those produced in the PIT.

The vowel by task interaction indicated that differences in duration based on

speech style were only significant for certain vowels. A closer examination of the

0 10 20 30 40 50 60 70 80 90

/i/

/e/

/a/

/o/

/u/

Duration in ms

Vowel duration in three speech tasks

NRT

PIT

CPT

181

interaction via pairwise comparisons revealed that only /a/ and /o/ differed significantly

with respect to duration. For /a/, the average vowel duration produced in the NRT (72.44

milliseconds) was significantly shorter than the average duration of /a/ in the PIT (82.74

milliseconds) and the CPT (83.64 milliseconds, p < .001 for all comparisons). There was

no statistically significant difference between the average duration of /a/ in the PIT and in

the CPT. The same pattern was observed for the back vowel /o/. The shortest duration

was observed in the NRT (60.12 milliseconds), followed by the PIT (74.10 milliseconds)

and the CPT (75.94 milliseconds). For /o/, the differences in duration between the NRT

and PIT, as well as between the NRT and CPT, were significant at the level of p < .001.

As with /a/, the vowels in the CPT were not significantly longer than those produced in

the PIT.

To conclude, this section presented the results of the analyses on speech style.

This variable had significant and robust effects on all four of the dependent variables. In

terms of vowel quality, there was a connection between the spontaneity of speech and the

degree of centralization in terms of height and backness. The vowels produced in the

spontaneous and semi-spontaneous speech styles occupied a more centralized location

within the vowel space when compared to the vowels produced in the carrier phrase task.

This difference in dispersion was further supported by the analysis of Euclidean distance.

The vowels produced in the carrier phrase task, the most controlled speech style,

exhibited the greatest distance from the center of the vowel space. Speech style also had

an impact on vowel duration. Vowels were the shortest in duration in the spontaneous

speech task and the longest in the controlled speech task, but the differences were only

statistically significant for /a/ and /o/. Taken together, the results of the analyses of

182

speech style indicate that vowels are produced differently in terms of both quality and

quantity depending on the type of speech in which they occur.

4.1.2 Complex interactions

In addition to the main effects and simple interactions described in the previous

four sections, two three-way interactions were included in the model in order to address

the effect of stress and syllable type within the different speech styles. The goal of this

section is to discuss how stress and syllable type effects manifested themselves within the

different speech styles, and how they impacted each of the dependent variables.

4.1.2.1 The interaction between vowel, task, and stress

The three-way interaction between vowel, task, and stress examined the

relationship between lexical stress and speech style for each of the five Spanish vowels.

This interaction was found to be significant for F1 (F(10, 3254.17) = 2.6, p = .004), F2

(F(10, 3254.26) = 13.86, p < .001), and Euclidean distance (F(10, 3254.26) = 14.74, p <

.001). These findings indicate that the effect of stress on each vowel depended upon the

task in which the vowel was produced. Prior to discussing the statistical results, the mean

F1, F2, and Euclidean distance values in Hertz of each vowel in each task are presented

Table 4-11. Vowel plots for each of the tasks will be presented throughout this section in

order to illustrate the statistical effects described in the text.

183

Forman

t values

and Eu

clidean

distan

ce meas

ures of

tonic a

nd aton

ic HS vo

wels pr

oduced

in each

speech

style

NRT

PITCP

T

Vowel

Stress

F1 (Hz

)F2

(Hz)

Euclide

an dist

ance (H

z)F1

(Hz)

F2 (Hz

)Euc

lidean

distanc

e (Hz)

F1 (Hz

)F2

(Hz)

Euclide

an dist

ance (H

z)/i/

atonic

414.49*

2021.05

*400

.60*362

.94216

7.67*

531.99*

351.53

2148.55

*518

.21*toni

c377

.53*226

2.41*

618.97*

371.19

2244.25

*604

.88*332

.45239

8.87*

766.37*

/e/aton

ic498

.32*170

7.00*

159.30*

459.34*

1765.84

*168

.62*437

.89*214

4.84*

502.74*

tonic

555.69*

1899.32

*268

.17*519

.65*189

4.61*

263.72*

523.96*

1946.77

*297

.85*/a/

atonic

627.87*

1587.49

*163

.60*642

.93*152

4.73204

.56*672

.68*146

1.57*

275.23

tonic

698.35*

1525.13

*251

.25*715

.07*154

1.46255

.56*769

.98*153

6.98*

286.85

/o/aton

ic528

.84*123

1.97*

419.17*

507.88*

1221.94

*429

.83*516

.84*117

9.10*

472.61*

tonic

571.54*

1177.98

*485

.40*549

.62*112

4.06*

528.96*

549.97*

1061.15

*601

.42*/u/

atonic

450.64

1215.37

452.00

418.96

1337.27

*352

.80*391

.66121

3.99*

460.18*

tonic

442.86

1201.16

458.30

416.66

1164.31

*503

.29*403

.11112

0.93*

546.30*

Note.

Asteris

ks deno

te signi

ficant d

ifferenc

es betw

een ton

ic and a

tonic fo

rmant v

alues an

d Eucli

dean di

stance m

easures

.

Tabl

e 4-

11

184

In spontaneous speech (the NRT), all vowels showed significant height

differences based on stress except for /u/. The differences in vowel quality in the NRT

are presented in Figure 4-16. The solid line represents the overall space of the tonic

vowels produced by all speakers, and the dashed line represents the atonic productions.

Figure 4-16. Tonic and atonic vowel space in spontaneous speech (NRT).

The pairwise comparisons of this interaction along the F1 dimension indicated

that the non-high vowels /a/, /e/, and /o/ exhibited significantly lower F1 values (i.e.,

were higher vowels) when unstressed (atonic) as compared to when they were stressed

(tonic). Thus, the atonic non-high vowels were produced higher in the acoustic space

than their tonic counterparts. The tonic/atonic comparisons for /a/, /e/, and /o/ were

significant with p-values < .001. In contrast, the high vowel /i/ exhibited significantly

higher F1 values in this task when it was atonic (p < .001). Atonic productions of /i/

were therefore situated lower in the acoustic space than the tonic productions.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Stress effects in the NRT

tonic atonic

185

Along the F2 dimension, all vowels except for /u/ showed significant stress

differences. The atonic productions of /a/ had higher F2 values than the tonic

productions, resulting in the atonic vowels being articulated further front in the vowel

space (p = .001). Likewise, the same pattern was observed with /o/. The atonic

productions of /o/ had significantly higher F2 values, resulting in a more fronted

production (p = .037). The front vowels /i/ and /e/ exhibited significantly lower F2

values when produced in atonic syllables when compared to tonic syllables (for /e/, p =

.022; for /i/, p < .001). The effects on F1 and F2 resulted in the atonic vowels occupying

a more centralized portion of the acoustic space when compared to the tonic productions.

As can be observed in Figure 4-16, the atonic vowel space falls within the tonic vowel

space.

The analysis of the Euclidean distance measure of vowels in the NRT indicated

that the atonic productions of /i/, /e/, /a/, and /o/ were all significantly closer to the

centroid than the tonic productions of these vowels. This difference was significant at the

level of p < .001 for /i/, /e/, and /a/, and at the level of p = .002 for /o/. The relationship

between these vowels and the center of the vowel space are presented in the Figure 4-17.

The center of the vowel space is marked with an “x” and helps conceptualize the

proximity of the vowels to the center of the space.

186

Figure 4-17. Tonic and atonic dispersion in spontaneous speech (NRT).

In the semi-spontaneous speech task (the PIT) a similar pattern emerged. The

stress effects in this task are represented in Figure 4-18. As with the previous figures, the

dashed line represents the atonic vowel productions and the solid line denotes the tonic

vowel productions.

Figure 4-18. Tonic and atonic vowel space in semi-spontaneous speech (PIT).

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Tonic and atonic dispersion in the NRT


300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z F2 Hz

Stress effects in the PIT

tonic atonic

187

In the PIT, the non-high vowels /a/, /e/, and /o/ exhibited significant differences in

vowel height based on stress context. The pairwise comparisons of this interaction

revealed that all three of these vowels were produced with significantly lower F1 values

when atonic as compared to tonic (for all comparisons, p < .001). Thus, the non-high

vowels moved up towards the center of the vowel space when they were unstressed.

Along the F2 dimension, all vowels except for /a/ showed significant movement towards

the middle of the vowel space when they were produced in atonic syllables. The front

vowels /i/ and /e/ exhibited significantly lower F2 values in atonic syllables and were thus

articulated farther back in the vowel space when compared to their tonic counterparts (for

/i/ tonic = 2244.24 Hz, atonic = 2167.67 Hz, p = .002; for /e/, tonic = 1894.61 Hz, atonic

= 1706.99 Hz, p < .001). The back vowels exhibited the same movement towards the

center of the vowel space when produced in atonic syllables. The atonic values of /o/ and

/u/ were 1221.94 Hz and 1337.27 Hz, respectively, which were significantly higher than

those of tonic /o/ (1124.05 Hz) and tonic /u/ (1164.30 Hz). The tonic and atonic

comparisons between these two vowels were both statistically significant (p < .001 for

both /o/ and /u/).

The Euclidean distance measures in the PIT were also found to differ significantly

based on stress context. The pairwise comparisons of this interaction revealed that all

atonic vowels exhibited significantly shorter Euclidean distance measures when

compared to their tonic counterparts. For all vowels, the comparisons between tonic and

atonic Euclidean distance were significant with a value of p < .001. The tonic and atonic

vowel space in the PIT is represented in Figure 4-19. The “x” marks the position of the

188

center of the vowel space and helps conceptualize the relationship of each vowel to this

centroid.

Figure 4-19. Tonic and atonic dispersion in semi-spontaneous speech (PIT).

Finally, several vowels showed stress effects in the controlled speech task (the

CPT). Figure 4-20 represents the tonic and atonic vowel productions in the CPT, with

the solid lines representing the tonic productions and the dashed lines the atonic vowel

productions.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Stress effects in the PIT


189

Figure 4-20. Tonic and atonic vowel space in controlled speech (CPT).

The pairwise comparisons of this interaction revealed that atonic productions of

/a/, /e/, and /o/ exhibited significantly lower F1 values in the CPT. This finding indicates

that the non-high vowels were situated higher in the vowel space when they were atonic

as compared to when they were tonic (p < .001 for /a/ and /e/; p = .002 for /o/). In terms

of vowel backness, the atonic productions of /a/ exhibited a significantly lower F2 vowels

when compared to tonic productions, and were therefore produced farther back in the

vowel space (p = .022). The same pattern was observed for atonic /i/, which had a

significantly lower F2 value when produced in atonic syllables as compared to tonic

syllables (p < .001). Thus, both /a/ and /i/ were produced farther back in the vowel space

when they were unstressed as compared to when they were stressed. The same pattern,

but in the opposite direction, was shown again for the back vowels /o/ and /u/. Both of

these vowels were characterized by significantly higher F2 values in atonic syllables as

compared to tonic syllables, resulting in the atonic productions of these vowels moving

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Stress effects in the CPT

tonic atonic

190

farther front, or more towards the center, of the vowel space. These differences were

statistically significant for both vowels (for /o/, p < .001; for /u/, p = .002). Interestingly,

in this task only, the atonic /e/ had a higher F2 value than tonic /e/ and was thus situated

farther front in the vowel space (p < .001). This unexpected result is explained in greater

detail later in this chapter.

The analysis of Euclidean distance calculated for the vowels produced in the CPT

revealed that even in the controlled speech task (CPT), stress had a significant effect on

the proximity of each vowel to the center of the space. The relationship between the

vowels produced in the CPT and the centroid are illustrated in Figure 4-21.

Figure 4-21. Tonic and atonic vowel dispersion in controlled speech (CPT).

The pairwise comparisons of the interaction between vowel, stress, and task in the

CPT revealed that atonic /i/, /o/, and /u/ had significantly shorter Euclidean distances

when compared to their tonic counterparts (all comparisons p < .001). The front vowel

/e/, on the other hand, had a significantly longer Euclidean distance when it was atonic as

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Tonic and atonic dispersion in the CPT


191

compared to when it was tonic, indicating that the atonic vowel was actually farther from

the centroid than the tonic vowel (p < .001). This finding is directly related to the F2

value calculated for atonic /e/ in the CPT. As reported earlier, the F2 of the atonic /e/ was

higher than the tonic vowel, resulting in it occupying a more fronted position within the

vowel space. When entered into the formula to calculate the distance from the centroid,

the larger F2 value thus resulted in a larger Euclidean distance.

The difference in vowel backness and Euclidean distance may be a consequence

of the lexical items and consonantal context in which atonic /e/ occurred in the CPT. The

carrier phrase task was more heavily weighted toward examining tonic vowels in open

syllables. The atonic /e/ vowels in this task were obtained from the words “eclipse”

(eclipse) and “equipo” (team). Both of these words contain word-initial /e/ in the context

of a voiceless velar consonant. The word “eclipse” is also a cognate, and some speakers

Anglicized their pronunciation, thus resulting in the elimination of this token from the

analysis. In the English pronunciation, the initial vowel is a high front /i/, and not the /e/

of Spanish. The combination of “equipo” and “eclipse” both containing an /e/ embedded

in the same consonantal context, and the possibility of producing the high front /i/ instead

of /e/ in “eclipse” could have potentially skewed the results. Thus, the findings with

respect to F2 and Euclidean distance for this vowel in this task should be interpreted with

caution.

An examination of the Type III effects in the analysis of duration revealed that the

three-way interaction between vowel, task, and stress was not significant for duration.

This result occurred because in all tasks and all speech styles, atonic vowels were always

significantly shorter than tonic vowels. The tendency for atonic vowels to be produced

192

with shorter duration (in milliseconds) is represented in Table 4-12, and represented

graphically in Figure 4-22.

Table 4-12

Tonic and atonic duration in three speech tasks Vowel Stress NRT PIT CPT

/i/ atonic 50.93 55.16 55.25

tonic 70.54 67.69 70.47

/e/ atonic 54.72 53.72 53.10

tonic 84.71 80.97 85.57

/a/ atonic 58.33 68.51 68.60

tonic 86.55 96.97 98.68

/o/ atonic 49.28 57.45 62.40

tonic 70.96 90.75 89.48

/u/ atonic 53.58 51.75 52.35

tonic 72.43 84.40 80.47

Note. All atonic vowels were significantly shorter than their tonic counterparts.

The interaction did not emerge significant because the effect of lexical stress on

duration does not depend on the task: it was consistent in all tasks and for all vowels.

The effects of lexical stress were confirmed by the pairwise comparisons of the

interactions, and were significant for all vowels in all tasks at the level of p < .001 except

for /i/ in the CPT (p = .001). The consistency of this pattern is depicted in Figure 4-22,

which represents the average vowel duration of tonic and atonic vowels in each of the

three speech styles. The gray bars represent atonic productions and the black bars tonic

productions. As shown below, the gray bars representing atonic vowel duration are

consistently shorter than the black bars, regardless of the speech style in which they were

produced.

193

Figure 4-22. Tonic and atonic duration in each of the three speech styles, averaged across syllable type.

To summarize the main results of this section, the significant interaction between

vowel, task, and stress indicated the interplay between these three factors. Overall, atonic

vowels in all speech styles exhibited significantly lower F1 values, resulting in vowels

that were produced higher in the vowel space in all three tasks (see Figure 4-16, Figure

4-18, and Figure 4-20). This raising of unstressed vowels, and lowering of the high

vowel /i/, seemed to be slightly more prevalent in the spontaneous speech task (NRT).

With respect to F2, nearly all atonic vowels in all speech styles exhibited formant values

that were more centralized, thus lower F2 values for the front vowels and higher F2

values for the back vowels. The low vowel /a/ showed some variation with respect to

0 10 20 30 40 50 60 70 80 90 100

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

atonictonic

NR

TPI

TCP

TN

RT

PIT

CPT

NR

TPI

TCP

TN

RT

PIT

CPT

NR

TPI

TCP

T

/i//e

//a

//o

//u

/

Duration in ms Tonic and atonic vowel duration in three tasks

194

speech style, and /e/ exhibited an unanticipated pattern in the CPT which may be the

result of the lexical items elicited in that task. The overall tendency of atonic vowels to

centralize in all three tasks was further corroborated by the Euclidean distance measures

(see Figure 4-17, Figure 4-19, and Figure 4-21). With few exceptions, the atonic vowels

were characterized by significantly shorter Euclidean distances and were therefore closer

to the center of the vowel space than tonic vowels. Finally, although the interaction

between vowel, task, and stress was not significant for duration, a closer examination of

vowel duration revealed that across the board, atonic vowels were shorter than tonic

vowels (see Figure 4-22). The interaction was not significant for duration given the

consistency of the pattern and the fact that the shortening of atonic vowels did not depend

on the vowel or the task in which it was produced.

4.1.2.2 The interaction between vowel, task, and syllable type

The inclusion of the three-way interaction between vowel, task, and syllable type

allowed for closer examination of how syllable structure effects manifested themselves in

different speech styles. This vowel by task by syllable type interaction was significant

for F1 (F(10, 3254.10) = 6.70, p < . 001), F2 (F10, 3254.16) = 4.44, p < .001), Euclidean

distance, (F(10, 3254.19) = 4.08, p < .001), and duration (F(10, 3254.21) = 3.24, p <

.001). The significant interaction on all four of the dependent variables indicated the

interplay of vowel, task, and syllable type on vowel height, backness, distance from the

centroid, and duration. Unlike the three-way interaction between vowel, task, and stress,

however, the interactions involving syllable type were not as robust or consistent. The

vowel quality (i.e., F1 and F2) and Euclidean distance results will be presented separately

for each task, followed by analysis of duration in each task.

195

Prior to discussing the statistical results, Table 4-13 presents the average F1, F2,

and Euclidean distance measures in Hertz of each of the five Spanish vowels produced in

open and closed syllables. The table is divided according to speech task. Additional

figures illustrating the effects will be presented throughout the section.

196

Forman

t values

and Eu

clidean

distan

ce meas

ures of

HS vow

els pro

duced i

n close

d and op

en sylla

bles in t

hree sp

eech st

ylesNR

TPIT

CPT

Vowel

Syllable

type

F1F2

Euclide

an dist

ance

F1F2

Euclide

an dist

ance

F1F2

Euclide

an dist

ance

/i/clos

ed400

.54212

0.32478

.33*368

.18220

1.12565

.84340

.96224

6.01615

.76*ope

n391

.48216

3.14541

.24*365

.95221

0.80571

.03343

.02230

1.40668

.82*/e/

closed

532.49

1817.42

210.10

483.53

1795.75

*195

.35*476

.08202

2.25373

.28ope

n521

.52178

8.90217

.37495

.46186

4.70*

236.99*

485.77

2069.35

427.31

/a/clos

ed650

.57*158

3.09*

183.74*

686.51*

1518.61

239.76

735.19*

1547.93

*260

.04ope

n675

.65*152

9.54*

231.11*

671.49*

1547.57

220.36

707.48*

1450.62

*302

.04/o/

closed

569.15*

1243.41

*417

.68*525

.89117

8.67474

.47518

.87*112

1.63531

.98ope

n531

.23*116

6.53*

486.88*

531.61

1167.33

484.32

547.94*

1118.61

542.06

/u/clos

ed456

.66123

4.49414

.24428

.50*124

3.13438

.04394

.18122

9.48*

444.01*

open

436.84

1182.05

496.07

407.12*

1258.45

418.05

400.58

1105.43

*562

.46*Not

e. Aster

isks den

ote sign

ificant d

ifferenc

es betw

een vow

els pro

duced i

n close

d and o

pen syl

lables.

Tabl

e 4-

13

197

The distribution of vowels produced in open and closed syllables in the NRT is

presented in Figure 4-23. The solid lines represent the vowels produced in closed

syllables, and the dashed lines represent the distribution of vowels produced in open

syllables.

Figure 4-23. Vowel space based on syllable type in spontaneous speech (NRT).

As depicted in Figure 4-23, the vowels /a/, /o/, and to a lesser degree /i/, showed

significant quality (i.e., F1 and F2) differences and Euclidean distances in the NRT based

on the syllable type in which they were produced. Pairwise comparisons of this

interaction revealed that only /a/ and /o/ showed significant height differences in this task

based on syllable type. The /a/ was characterized by a lower F1 when it was produced in

a closed syllable, indicating that it was articulated higher in the vowel space (p < .001).

The opposite effect was observed for /o/. The F1 of /o/ produced in closed syllables was

significantly higher, indicating that the vowel was produced lower in the acoustic space

(p < .001). With respect to F2, only /a/ and /o/ exhibited significant differences in

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Syllable type effects in the NRT

closed open

198

backness based on syllable type. For /a/, the closed syllables were characterized by

higher F2 values and were thus produced farther front in the vowel space when compared

to the vowels produced in open syllables (p = .006). The tokens of /o/ in closed syllables

also exhibited higher F2 values and were also more centralized when compared to vowels

in open syllables (p = .003).

In the NRT, only three vowels showed effects of syllable type on Euclidean

distance. The pairwise comparisons indicated that /a/, /i/, and /o/ produced in closed

syllables had significantly shorter Euclidean distance measures than their open

counterparts (for /a/, p = .003; for /o/, p = .001; for /i/, p = .017). Thus, the /a/, /i/, and /o/

produced in closed syllables were situated closer to the centroid than the same vowels

produced in open syllables. Figure 4-24 illustrates these distance effects.

Figure 4-24. Vowel dispersion by syllable type in spontaneous speech (NRT).

The analysis of the effect of syllable type in the PIT revealed few significant

results. The only vowels to exhibit significant differences in terms of height or backness

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Dispersion by syllable type in the NRT


199

in this task were /e/, /a/, and /u/. These differences are illustrated in Figure 4-25, and will

be explained in greater detail below the figure.

Figure 4-25. Vowel space based on syllable type in semi-spontaneous speech (PIT).

The pairwise comparisons of the interaction between vowel, task, and syllable

type indicated that only /a/ and /u/ exhibited height differences as a consequence of

syllable structure in the PIT. Both the /a/ and the /u/ produced in closed syllables had

higher F1 values, meaning that they were produced lower in the vowel space, when

compared to their open counterparts (for /a/, p = .016; for /u/, p = .014). In terms of

vowel backness (F2), only /e/ showed significant effects of syllable type. The /e/

produced in closed syllables in this task were produced with significantly lower F2 values

and thus were situated farther back in the vowel space when compared to the /e/ produced

in open syllables (p = .001). Although the effects of syllable type on /a/ and /u/ appear

marginal in Figure 4-25, the effect on /e/ is quite clear.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Syllable type effects in the PIT

closed open

200

The /e/ was the only vowel to show a significant difference in Euclidean distance

based on syllable type. According to the pairwise comparisons, /e/ produced in closed

syllables was situated significantly closer to the centroid than the /e/ produced in open

syllables. Figure 4-26 presents the relationship between the vowels produced in the PIT

and their proximity to the center of the vowel space.

Figure 4-26. Vowel dispersion by syllable type in semi-spontaneous speech (PIT).

In the carrier phrase task (the CPT), which elicited the most controlled speech, all

vowels except for /e/ exhibit some kind of difference in either height or backness as a

consequence of syllable type. The effects of syllable type in the CPT are presented in

Figure 4-27.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Dispersion by syllable type in the PIT


201

Figure 4-27. Vowel space based on syllable type in controlled speech (CPT).

As with the NRT, only /a/ and /o/ exhibited significant height differences as a

result of syllable type. Interestingly, however, the effects observed in the CPT are the

opposite of those reported for the spontaneous speech style (NRT). The /a/ produced in

closed syllables had a significantly higher F1 than the /a/ produced in open syllables,

resulting in it occupying a lower position in the acoustic space (p = .005). For /o/, the F1

value in closed syllables was significantly lower than the F1 in open syllables, meaning

that the /o/ was produced higher in the acoustic space in closed syllables (p = .005). Only

two vowels were affected by syllable type on the F2 dimension. The low vowel /a/ was

produced with a higher F2 value in closed syllables than in open syllables (p = .001).

The same effect was observed for /u/ (p < .001), indicating that both the /a/ and the /u/

were produced farther front in the vowel space when they occurred in closed syllables as

compared to open syllables.

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Syllable type effects in the CPT

closed open

202

The Euclidean distance measures of the vowels produced in the CPT were also

found to differ according to syllable type. The distribution of the vowels produced in

closed and open syllables in the CPT is illustrated in Figure 4-28.

Figure 4-28. Vowel dispersion by syllable type in controlled speech (CPT).

Pairwise comparisons of the interaction between vowel, task, and syllable type

indicated that only the high vowels exhibited significant differences in distance based on

syllable type. For both /i/ and /u/, the Euclidean distance measure was smaller for closed

syllables than for open syllables. In other words, in the controlled speech task, /i/ and /u/

produced in closed syllables were closer to the center of the vowel space than their open

counterparts. No other Euclidean distance measures were found to be significant in this

task.

With respect to vowel duration, the vowel by task by syllable type interaction was

also found to be significant (F(10, 3254.21) = 3.24, p < .001), indicating that the effects

of syllable type on length differed for certain vowels and in certain tasks. The length

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Dispersion by syllable type in the CPT


203

differences were only found to be significant for /e/, /i/, and /u/, but not in all speech

styles. Table 4-14 presents the average vowel duration in each task and in each syllable

type. Values are presented in milliseconds.

Table 4-14

Vowel duration by syllable type in three tasks Vowel Syllable

type NRT PIT CPT

/i/ closed 66.48* 69.99* 62.85 open 54.99* 52.86* 62.86

/e/ closed 74.38* 68.27 72.44 open 65.05* 66.42 66.24

/a/ closed 72.91 80.93 84.87 open 71.97 84.55 82.42

/o/ closed 61.25 77.49 74.81 open 58.99 70.71 77.07

/u/ closed 59.16 72.49* 62.48* open 66.86 63.65* 70.33*

Note. “*” denote a significant difference.

Pairwise comparisons of this interaction indicated that in the NRT, the /e/

produced in closed syllables was significantly longer than the /e/ produced in open

syllables (p = .001). The same effect was found for /i/ in the NRT and PIT, in which the

vowels in closed syllables were significantly longer than in open syllables (for the NRT,

p = .006; for the PIT, p < .001). The back vowel /u/ also exhibited differences in duration

according to syllable type in the PIT and the CPT. In the PIT, the vowels produced in

closed syllables were significantly longer than those produced in open syllables (p =

.009), but in the CPT, the vowels in closed syllables were significantly shorter than those

produced in open syllables (p = .040). Figure 4-29 represents the average vowel duration

by syllable type in each of the three tasks. The gray bars represent the duration in closed

syllables, and the black bars the duration of vowels in open syllables

204

Figure 4-29. Vowel duration in closed and open syllables in each task.

To summarize the findings presented in this section, the significant interaction

between vowel, task, and syllable type indicated that the effects of syllable type on HS

vowel quality and quantity varied according to the vowel and according to the speech

style. Differences in vowel quality and dispersion based on syllable type were present in

all three tasks, but none of the three speech styles seemed to show greater effects of

syllable structure than the others. The vowels /a/ and /o/ showed the greatest effects in

terms of vowel height. When comparing the spontaneous speech task (NRT) to the

carrier phrase task (CPT), however, the height of these vowels actually exhibited the

opposite pattern. In the NRT, the /a/ in closed syllables was produced higher than open

0 10 20 30 40 50 60 70 80 90

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

closedopen

NR

TPI

TCP

TN

RT

PIT

CPT

NR

TPI

TCP

TN

RT

PIT

CPT

NR

TPI

TCP

T

/i//e

//a

//o

//u

/

Duration in ms Vowel duration by syllable type in three tasks

205

syllables and the closed /o/ was produced lower than the /o/ in open syllables, but the

exact reverse occurred in the CPT. This interaction also revealed differences in backness

based on syllable structure. The general trend was for vowels in closed syllables to move

closer to the center of the vowel space, and this effect was most visible with the

pronunciation of /e/ in the PIT. When effects of Euclidean distance were observed, the

vowels produced in closed syllables were found to be closer to the centroid than those

produced in open syllables. Finally, syllable type effects were also observed for duration,

but only the vowels /i/, /e/, and /u/ were affected, and only in certain tasks. Generally,

these vowels were longer in duration when they occurred in closed syllables in the NRT

(spontaneous) and the PIT (semi-spontaneous). In the CPT the /u/ actually showed the

reverse pattern. Taken together, the results of this analysis indicated that the effects of

syllable type on vowel quality and quantity in different speech styles were not as

consistent as the effects of lexical stress. As few vowels showed effects and the effects

often differed according to the task, it can be concluded that speech style and syllable

type interact, but minimally and in variable fashion.

4.1.3 Summary of within-subjects effects

To summarize the findings presented in this section, HS Spanish vowels exhibit a

number of qualities that differ from traditional descriptions of monolingual Spanish

varieties. In terms of the overall organization and distribution of HS vowels, the analyses

revealed statistical differences in vowel height and backness. The high front vowel /i/

was found to be significantly higher than the back vowel /u/, and the front mid vowel /e/

was significantly higher than the back vowel /o/. The /a/ occupied the lowest position in

the acoustic space, and was significantly lower than all vowels. In terms of backness, the

206

front vowels /i/ and /e/ differed significantly on the F2 dimension, with the /i/

significantly farther front than the /e/. This same distinction was not observed for the

back vowels /o/ and /u/, which statistically did not differ along the F2 dimension. The

central vowel /a/ was different from all other vowels in terms of F2.

The overall organization and dispersion of HS vowels, as measured by the

Euclidean distance from the center of the vowel space, revealed that this system is not as

symmetrical as previous descriptions suggest. Not all vowels were located at an equal

distance from the center, which differs from arguments that the Spanish vowel system

exhibits a high degree of symmetry. The low vowel /a/ was actually closest to the center

in Euclidean terms, followed by /e/, then /o/ and /u/, and finally /i/ with the largest

Euclidean distance.

The initial examination of duration indicated that the HS vowels exhibit intrinsic

vowel duration. The overall length of the vowels varied as a function of vowel height,

with the low vowel /a/ exhibiting the longest duration and the high vowels /i/ and /u/ the

shortest duration. This shortening of duration as vowel height increases has been

documented in other languages as well as in monolingual varieties of Spanish.

In contrast to what is typically argued for Spanish vowels, lexical stress had

significant effects on vowel quality, quantity, and dispersion. In terms of quality, atonic

vowels, and especially the non-high vowels, were articulated significantly higher in the

acoustic space in comparison to tonic vowels. In terms of backness, atonic vowels

exhibited more centralized F2 values (i.e., lower for the front vowels and higher for the

back vowels) when compared to their tonic counterparts. The tendency for centralization

in atonic syllables was confirmed based on the Euclidean distance measures calculated

207

for each vowel in the two stress contexts. Overall, the atonic vowels were situated

significantly closer to the center of the vowel space than were tonic vowels, and this trend

was observed to some degree in all speech styles. Robust differences in vowel duration

were also reported. Collapsing across all syllable and task types, as well as within the

individual task types, atonic vowels were significantly shorter than tonic vowels.

The results of the analyses of syllable type were less transparent than those

reported for lexical stress. Although the Spanish mid vowels are typically argued to

exhibit allophonic variation along the F1 dimension based on syllable type, there is little

evidence to support this claim in the data. There was no significant main effect or simple

interaction for syllable type along the F1 dimension, indicating that overall, vowel height

was not affected by syllable type. When speech style was considered and examined via

the three-way interaction between vowel, task, and syllable type, however, /a/, /o/, and /u/

showed differences in height. Although the patterns that emerged were not consistent,

there was a slight tendency for vowels in closed syllables to be articulated higher in the

vowel space. Along the F2 dimension, only the point vowels showed significant

differences in backness. The /i/, /a/, and /u/ were all characterized by more centralized F2

values when they were produced in closed syllables as compared to open syllables. The

tendency for vowels in closed syllables to occupy a more centralized location was

observed in the different speech styles, and especially in spontaneous speech (NRT) and

controlled speech (CPT). The Euclidean distance measure painted a clearer picture of the

effects of syllable type on vowel quality. By taking into consideration both the F1 and F2

values, this measure showed that vowels produced in closed syllables tended to be closer

to the center of the vowel space than vowels produced in open syllables. Finally, the

208

analyses of duration showed that vowels produced in closed syllables were overall longer

than those produced in open syllables. This finding was particularly relevant for the front

vowels /e/ and /i/. When speech style was added to the equation, these vowels were

generally longer in closed syllables in the NRT and the PIT, but the reverse was revealed

for the CPT, and more specifically for /u/.

Finally, the HS Spanish vowels exhibited differences in quality, duration, and

distribution based on speech style. These findings corroborate previous descriptions of

the effect of speech style on vowel production in monolingual and bilingual varieties of

Spanish. In terms of quality, the vowels produced in the spontaneous speech task (NRT)

and the semi-spontaneous speech task (PIT) occupied a more centralized location within

the vowel space. Thus, the vowels produced in these tasks exhibited F1 values that were

higher (except for the high vowels), lower F2 values for the front vowels, and higher F2

values for the back vowels. The vowels produced in the controlled speech task (CPT),

however, exhibited more extreme F1 and F2 values resulting in their situating themselves

on the periphery of the vowel space. These findings were further confirmed by the

Euclidean distance measures. Overall, the vowels produced in the NRT were situated

closest to the center of the vowel space, followed by the vowels produced in the PIT, and

finally the CPT. Duration also differed as a consequence of speech style. Averaged

across all stress and syllable types, the vowels produced in the NRT were significantly

shorter than those produced in the other two tasks. Vowel duration in the PIT was

generally shorter than that observed in the CPT, but the differences were not statistically

significant.

209

Taken together, the results of the statistical analyses of the within-subjects

variables (i.e., lexical stress, syllable type, and speech style) revealed that the HS vowel

system is similar to descriptions of monolingual Spanish varieties in some ways but

differs in others. The overall distribution and positioning of the vowels within the space,

as well as the lack of symmetry, distinguishes the HS system from the traditional

descriptions of monolingual Spanish. The presence of intrinsic vowel duration is similar

to other Spanish vowel systems, although, as will be described and discussed in the

following chapter, the duration of HS vowel seems to be longer than values reported in

other studies. Although some dialects of Spanish, more specifically Mexican Spanish,

have been reported to exhibit reduction of atonic vowels, the general consensus in the

literature is that stress differences in Spanish tend to be minimal. The results described

previously in this section, however, present evidence for robust differences in vowel

pronunciation based on lexical stress. Syllable type effects in HS Spanish are less clear.

Although it was expected that the mid vowels would show differences in vowel height

and backness based on syllable type, there was no overall effect of syllable structure on

F1 and the effects on F2 were limited to the point vowels. Finally, HS vowels showed

the anticipated effects of speech style: vowels were more centralized in spontaneous

speech and more peripheral in controlled speech.

The following chapter will provide a more in-depth analysis and comparison of

the HS vowel system to other monolingual and bilingual systems. Before proceeding to

the discussion and conclusions, however, it is important to address how a different set of

variables, namely the social and individual variables, affected HS vowel production.

This topic will be discussed in section 4.2.

210

4.2 Between-subjects variables

In addition to the within-subjects variables described in section 4.1, a series of

individual characteristics (i.e., between-subjects variables) were also included in the

model to address how each speaker’s language background and use, travel frequency,

formal grammatical knowledge, and course level were correlated with their vowel

production (see chapter 3, section 3.4.2). These between-subjects variables can be

divided into two larger groups, each necessitating a different type of statistical analysis:

categorical variables and continuous variables. The categorical social variables are

course level, travel frequency74, and gender.75 These variables were analyzed in much

the same way as the within-subjects variables via an examination of the estimated

marginal means. Participants were grouped into categories based on the course in which

they were currently enrolled, how frequently they traveled to their country of heritage,

and by their gender. The analyses then assessed how each group differed with respect to

F1, F2, duration, and Euclidean distance.

74 As a reminder, travel frequency was categorical. Participants were divided into three groups based on how often they reported traveling to the heritage country: frequently, infrequently, and almost never. 75 Although both male and female speakers participated in this study, no direct comparison will be made concerning gender and vowel pronunciation. The ratio of males to females (3:13) is too uneven for a meaningful analysis to be conducted. The variable of gender was, nonetheless, included in the analysis to remove some of the variance caused by gender differences. It is well-accepted that male and female speakers produce vowels differently as a consequence of physiology (Hillenbrand et al., 1996; Peterson & Barney, 1952; Rosner & Pickering, 1994). The statistical tests included in this analysis proved that this was the case. The analyses revealed a significant main effect of gender on F1 (F(1, 8.004) = 7.58, p = .025) and on Euclidean distance (F(1, 8.00) = 15.16, p = .005). These results indicated that, as expected, the overall F1 values for males were lower than for females. The significant effect of Euclidean distance indicated that female speakers showed greater degrees of vowel dispersion than males. These effects are due to the physiological differences between the genders, as females have the capacity to produce sounds at higher frequency ranges. For this reason, their F1 frequencies were higher (i.e. lower vowels) and their F2 frequencies were also higher (i.e. front vowels that were farther front). The combination of the higher F1 and F2 values resulted in a greater degree of dispersion for female speakers as compared to males. These are the basic findings of the analysis of gender, and no further discussion will be provided. Had an equal number of male and female participants completed this study, a direct comparison of male and female vowel production could have been conducted.

211

In contrast, the continuous variables were analyzed using a different set of

statistical tests and comparisons because each speaker had their own numerical score and

was not placed into a larger group. These variables included grammar proficiency score,

Spanish use, and cultural sensitivity score. As these variables, as well as the dependent

variables, were both continuous, the model was not able to calculate estimated marginal

means like it could for the categorical within- and between-subjects factors. In order to

determine how the continuous independent variables affected the dependent variables, the

parameter estimates of fixed effects were used. The coefficients functioned much like

they do in a linear regression. The coefficient of the parameter estimate could be either

positive or negative, indicating a positive or negative relationship between the

independent factor(s) and the dependent variable. Thus, a negative coefficient indicated

that there was a negative correlation between the independent factor and the dependent

variable (i.e., as grammar score increased, F2 decreased), whereas a positive coefficient

indicated the reverse (i.e., as cultural sensitivity increased, F1 increased). The value of

the coefficient represented the steepness of the slope of the regression line. The larger

the number, the steeper the slope, and the larger the effect when compared to a slope of

zero.76 In the case of interactions between the vowels and a certain characteristic, the

interaction term was calculated by adding the parameter estimate of the individual vowel 76 It is important to address, however, that the vowel /u/ was always set to zero and served as the standard to which the other parameter estimates were compared. In some cases, however, the differences between the parameter estimate for a specific vowel as compared to the actual value of /u/ as observed in a separate portion of the parameter estimates table was greater than it was from zero. In certain cases, which will be explained in greater detail throughout this section as they apply, it is more meaningful to compare the largest parameter estimate to the smallest. This comparison not only indicates the direction of the effect (i.e. positive or negative correlation) but also the size of the effect. A parameter estimate of -10, for example, would indicate a greater (negative) effect than an estimate of +4. The comparison of these two values to each other, however, shows an even greater difference than comparing either of the values to zero. As will be explained later in this chapter, some of the significant interactions were driven based on the large differences between positive and negative parameter estimates.

212

to the parameter estimate of the main effect. This procedure will be described in greater

detail later on in this chapter as it applies to the results. All information pertaining to

each participant’s group, travel frequency, and scores for grammar proficiency, cultural

sensitivity, and Spanish use are presented in Table 4-15.

Table 4-15

Information pertaining to individual characteristics

Participant ID Gender Course level

Travel group

Score on grammar proficiency test

Score on cultural sensitivity questionnaire

Score on Spanish use survey

Speaker 1F F 1 1 20 159 37 Speaker 2F F 1 2 14 186 25 Speaker 3F F 1 2 22 186 30 Speaker 4F F 1 1 21 181 38 Speaker 1M M 1 0 16 183 40 Speaker 2M M 1 2 19 193 30 Speaker 5F F 1 0 19 162 29 Speaker 6F F 2 1 21 160 32 Speaker 7F F 2 0 19 151 32 Speaker 3M M 2 2 24 190 31 Speaker 8F F 2 2 21 186 25 Speaker 9F F 2 2 21 160 31 Speaker 10F F 2 0 25 203 38 Speaker 11F F 2 1 18 175 31 Speaker 12F F 2 1 19 166 26 Speaker 13F F 2 1 23 172 30

Note. For travel frequency, “0” indicates little/no traveling, “1” indicates traveling every three to four years, and “2” indicates traveling every year. The possible range of scores on the grammar proficiency test was 1-25 points. The possible range of scores on the cultural sensitivity questionnaire was 32-224 points. The score for Spanish use was tabulated based on self-reported usage in eight different contexts. Scores ranged from 1-40, with higher scores indicating greater usage of Spanish.

4.2.1 Course level

Although unanticipated at the start of this investigation, the first individual

variable included in the analysis was course level. The participants were fairly equally

213

divided into two groups of speakers: those who were currently enrolled in a second-

semester intermediate-level Spanish course designed for HS (N = 7), and those who had

already completed this course and were thus enrolled in an upper-level language or

linguistics course (N = 9). The inclusion of HS enrolled in two different course levels

thus permits the examination of how language level, or overt level of language

instruction, may affect HS pronunciation. If HS experience a “benefit” in pronunciation

based on their early exposure to the Spanish language but not as a result of classroom

instruction, then one would expect that there would be no significant differences in vowel

production based on course level. If, however, additional exposure to Spanish via the

classroom and additional speaking do influence HS pronunciation, one would expect that

these two groups of participants would differ in terms of their vowel production.

The main effect of course level was not found to be significant in any of the

analyses conducted with the four independent variables. However, the vowel by course

level interaction was significant in the analyses of F1 (F(4, 3254.07) = 2.44, p = .046), F2

(F(4, 3254.12) = 18.11, p < .001), and Euclidean distance (F(4, 3254.13) = 8.99, p <

.001). The F1, F2 values and Euclidean distance measures calculated by the statistical

model are presented in Table 4-16.

214

Table 4-16

Formant values and Euclidean distance measures for two course levels

Vowel Course level F1 (Hz) F2 (Hz) Euclidean

distance (Hz) /i/ intermediate 672.41 1533.17 244.30

advanced 703.21 1525.94 234.71 /e/ intermediate 487.36 1884.27 252.81

advanced 510.92 1901.85 300.66 /a/ intermediate 357.72 2183.81 538.92

advanced 378.99 2230.45 608.10 /o/ intermediate 534.94 1217.09 459.55 advanced 539.95 1114.97 519.58

/u/ intermediate 409.51 1287.16* 408.14* advanced 431.79 1130.52* 516.15* Note. The “*” denote a significant difference between groups.

The differences in vowel organization between the two course levels are

illustrated visually in Figure 4-30. The gray symbols represent the acoustic space for the

participants enrolled in the intermediate-level course, and the black lines represent the

vowel space produced by the participants in the advanced group.

215

Figure 4-30. HS vowel production by course level, averaged across stress, syllable type, and task.

The pairwise comparisons of the vowel by course level interaction along the F1

dimension indicated that although none of the comparisons were found to be statistically

significant, the mean difference value of the /a/ was considerably greater than the mean

difference values of the other four vowels. This difference (+/- 30.80, p < .215) indicated

that the /a/ produced by the HS in the intermediate course was overall higher (F1 = 672.

41 Hz) than the /a/ produced by the HS in the advanced classes (F1 = 703.21 Hz). The

mean difference values for the other vowels, and especially that of /o/ (+/- 5.01), were

considerably smaller, suggesting that the vowel by course level interaction for F1 was

driven by the differences in /a/. The differences in vowel height for the vowel /a/, and the

overall lack of height differences for the other four vowels, are presented in Figure 4-30.

The pairwise comparisons along the F2 dimension indicated that the vowel by

course level interaction for F2 was driven by a difference in backness of the back vowels

/u/ and /o/. The main differences between the productions of /o/ and /u/ by the

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Vowel production by course level

intermediate advanced

216

participants enrolled in the two course levels are also illustrated in Figure 4-30. The

intermediate-level group pronounced /u/ with an F2 value that was significantly higher

than the advanced group, indicating that those enrolled in the intermediate-level course

produced /u/ farther front in the acoustic space than those in the advanced classes

(intermediate = 1287.16 Hz, advanced = 1130.51 Hz; p = .031). Although it was not

statistically significant, an examination of the mean difference values for /o/ also

suggested that the participants in the intermediate-level course produced /o/ considerably

farther front than the speakers in the advanced classes. The average intermediate-level

value was 1217.09 Hz, while the advanced-level HS exhibited an average of 1114.97 Hz,

resulting in a mean difference of +/- 102.11 (p = .132). The significance of this

interaction was thus influenced by the large differences between these groups with

respect to /o/ and /u/.

The backing effects of /u/ and /o/ by the advanced-level speakers were further

confirmed by the analysis of Euclidean distance. The advanced-level participants tended

to exhibit larger Euclidean distances, indicating that their vowels were dispersed farther

from the center of the vowel space than those produced by the intermediate-level

speakers. This difference was only statistically significant for /u/ (intermediate = 408.14

Hz, advanced = 516.15 Hz; mean difference = +/- 108.00 Hz; p = .031). As with the

analysis of F2, however, an examination of the mean difference values between these two

groups offers additional insight into vowel dispersion. The mean difference of /o/ was

also considerably high (+/- 60.03), indicating that the participants enrolled in the

intermediate-level course produced /o/ with a Euclidean distance that was 60 Hz closer to

the centroid than the advanced group (p = .192). The Euclidean distance measure of /i/

217

from the centroid by the intermediate-level speakers was 538.91 Hz and for the

advanced-level speakers was 608.09 Hz. The mean difference value between the two

groups was 69.18 Hz, but the difference was overall not found to be statistically

significant (p = .136). The differences in vowel height, backness, and distance from the

centroid are presented in Figure 4-30.

To review, the analysis of course level thus revealed differences in the

pronunciation and acoustic distribution of HS vowels. The only vowel to marginally

show height differences between groups was /a/, which was produced higher in the

acoustic space by the intermediate group than by the advanced group. In terms of vowel

backness, the back vowels /o/ and /u/ exhibited the greatest difference based on group.

The /u/ and /o/ were both produced farther front in the vowel space by the intermediate-

level participants, but this difference was only statistically significant for /u/. The

Euclidean distance measures indicated differences in dispersion, again for the vowels /o/

and especially /u/. These vowels, but only statistically for /u/, were closer to the centroid

when produced by the intermediate-level speakers than by those in the advanced courses.

The front vowel /i/ showed an important trend: it was articulated closer to the centroid by

the intermediate group than by the advanced group. Although not all differences between

vowels were found to be statistically significant, an examination of Figure 4-30 shows an

overall trend: the advanced-level group produced vowels that were dispersed farther

from the centroid than the intermediate-level participants.

4.2.2 Frequency of travel

The frequency with which each of the participants traveled abroad was also of

interest in this investigation. As mentioned in chapter 2, previous research has shown

218

that speakers who maintain strong ties with the heritage culture pronounce the heritage

language differently than those who are less connected. The participants in this study

were divided into three groups based on how frequently they traveled abroad. Those who

had never traveled abroad or had not done so in more than 8 years were grouped into the

“0-little or no travel” group (N = 4). Those who indicated that they traveled abroad every

3-4 years were placed into the “1-some/infrequent travel” group (N = 6). Finally, those

who traveled once a year (or more) or every other year to a Spanish-speaking country or

their country of heritage were assigned to the “2-frequent travel” group (N = 6).

As with the effect of course level, there was no significant main effect of travel

frequency on any of the dependent variables. The vowel by travel interaction was found

to be significant for F1 (F(8, 3254.07) = 19.72, p < .001), F2 (F(8, 3254.13) = 23.61, p <

.001), and Euclidean distance (F(8, 3254.14) = 10.77, p < .001). The formant values and

Euclidean distance measures produced by the participants in the three travel groups are

presented in

219

Table 4-17. Significant differences between the formant values and Euclidean

distances are indicated by superscripted letters.

Table 4-17

Formant values and Euclidean distance measures for three travel groups

Vowel Travel group F1 (Hz) F2 (Hz) Euclidean distance (Hz)

/i/ Never (0) 369.72 2106.57c 489.36c

Infrequent (1) 347.69 2163.58 538.75c

Frequent (2) 387.65 2351.25a 692.40ab

/e/ Never (0) 498.59 1888.34 274.77

Infrequent (1) 465.79 1809.08 216.89

Frequent (2) 533.05 1981.77 338.54

/a/ Never (0) 681.61 1499.91 245.93

Infrequent (1) 629.23c 1529.14 215.57

Frequent (2) 752.61b 1559.63 257.02

/o/ Never (0) 558.28 1196.80 444.10

Infrequent (1) 505.02 1183.56 467.44

Frequent (2) 549.05 1117.73 557.16

/u/ Never (0) 423.36 1253.49 404.78

Infrequent (1) 409.03 1248.11 415.39

Frequent (2) 429.55 1124.91 566.27

Note. The symbols next to the formant values indicate significant differences between travel groups. The “a” denotes a significant difference from group 0 (no travel), a “b” denotes a significant difference from group 1 (infrequent travel), and a “c” denotes a significant difference from group 2 (frequent travel). Although not indicated here, the Euclidean distance measures for /u/ approached significance.

The distinctions between the three travel groups are presented in Figure 4-31.

The dashed line represents the frequent travel group (group 2), the solid line represents

those who travel every few years (group 1), and the dotted line represents those who do

not travel (group 0). Although not all of the differences in vowel production between

these groups were significant, an examination of this figure shows that the frequent

travelers utilized a larger portion of the vowel space relative to the other two groups.

220

Figure 4-31. Vowel distribution based on frequency of travel abroad. Values averaged across stress, syllable type, and task.

A closer examination of the vowel by travel interaction along the F1 dimension

via pairwise comparisons indicated that only the vowel /a/ showed height differences

based on travel. The difference was only significant between group 1 (some travel) and

group 2 (frequent travel). The speakers who reported traveling every few years (group 1)

pronounced /a/ with a significantly lower F1 value than those who traveled every year (p

= .006). In other words, those who traveled less produced this vowel higher in the

acoustic space. The comparisons between the other two groups were not statistically

significant, nor were any other vowels significantly different with respect to height based

on travel frequency.

With respect to vowel backness, the pairwise comparisons of the F2 values

indicated that the vowel by travel interaction was driven by a significant difference in the

pronunciation of /i/. Those speakers in group 2 (frequent travel) pronounced /i/ with a

significantly higher F2 value than those speakers who did not travel at all (frequent travel

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz

Vowel distribution by travel

little/no travel some travel frequent travel

221

= 2351.24 Hz, no travel = 2106.57 Hz; p = .046). Thus, the participants who spent more

time visiting the heritage country or another Spanish-speaking country pronounced /i/

farther front in the vowel space when compared to those who did not travel at all. There

were no significant differences between the frequent travelers and those who never

traveled group when compared to the infrequent travel group. No other vowel was

significantly affected by travel group. The differences between vowel height and

backness based on travel frequency are presented in Figure 4-31.

The analysis of Euclidean distance from the centroid indicated that the speakers in

group 2 (frequent travel) exhibited a significantly larger Euclidean distance of the vowel

/i/ when compared to the other two groups of travelers (group 0 = 489.36 Hz, group 1 =

538.75 Hz, group 2 = 692.40 Hz). The difference in Euclidean distance was significant

from those who never traveled at the level of p = .016 and from those who infrequently

traveled at the level of p = .045. The longer Euclidean distance indicated that the

participants who traveled abroad every year produced the /i/ significantly farther from the

centroid than the other two groups. Another interesting finding of this analysis was that

participants in the frequent travel group also produced the /u/ with a greater Euclidean

distance than the speakers in the other two travel groups. Although the difference was

not statistically significant, it closely approached significance, with the comparison

between groups 0 and 2 at the level of p = .055 and between groups 1 and 2 at the level of

p = .051. Taken together, the longer Euclidean distances revealed for the frequent

travelers’ productions of /i/ and /u/ indicate that those speakers who traveled and spent

more time abroad articulated the high vowels more peripherally.

222

To summarize these findings briefly, the analysis of the effect of travel revealed

several interesting differences, but only for certain vowels. Only the point vowels

showed differences in articulation in terms of height, backness, and distance from the

centroid. The low vowel /a/ was produced significantly lower in the vowel space by

those speakers who traveled frequently, and they also produced /i/ farther front in the

vowel space when compared to the other groups. Differences in the back vowel /u/ also

approached significance, with the frequent travelers pronouncing this vowel farther back

in the vowel space than the other two groups. Overall, the results of the vowel by travel

interaction suggest that the frequency of travel to the heritage country has a significant

effect on the pronunciation of some Spanish vowels.

4.2.3 Grammar proficiency

The third continuous covariate to be discussed is the relationship between each

participant’s grammar proficiency score and their pronunciation of the Spanish vowels.

All participants completed a short grammar assessment instrument designed by Geeslin

and Gudmestad (2010) (see chapter 3, section 3.4.2.2). Each participant’s score was

included in the model to determine if there was a connection between formal grammatical

knowledge and pronunciation. Although there was no main effect of grammar score on

any of the four dependent variables, there was a significant interaction between vowel

and grammar for F1 (F(4, 3254.03) = 10.28, p < .001), F2 (F(4, 3254.07) = 23.22, p <

.001), and for Euclidean distance (F(4, 3254.05) = 4.25, p = .002). The vowel by

grammar interaction indicated that although there was no overall effect of grammar score

on F1, F2, or Euclidean distance, performance on the grammar assessment instrument

correlated with the pronunciation of certain vowels.

223

Along the F1 dimension, the only vowel that was found to have a significant

parameter estimate was /o/ (6.39 t (3254.004) = 3.48, p = .001). The parameter estimate

for this vowel was 6.39, indicating a positive relationship with grammar score and the F1

of /o/. That is, as grammar score increased, there was a tendency for the F1 value of /o/

to also increase, thus resulting in a lower vowel. The estimate of the vowel /a/ also

approached significance (-3.2 t (3254.03) = -1.96, p = .05). The negative coefficient

indicated that there was an overall negative relationship between grammar score and the

F1 value of /a/. That is, as the grammar score increased, the F1 value of /a/ decreased,

resulting in it occupying a higher position within the acoustic space. An examination of

the other parameter estimates revealed that none were significantly different from zero.

Thus, there was no consistent effect of grammar score on the F1 of the vowels /e/, /i/, and

/u/ like there was for /o/ (and marginally for /a/). The differences in vowel height are

presented in Figure 4-32. For illustrative purposes, this figure plots the vowel production

of two female speakers, each with very different grammar scores. Speaker 2f had the

lowest grammar score (14 out of 25 points) and is represented by the dashed line.

Speaker 10f had the highest grammar score (25 points) and is represented by the solid

black line. The values represented are the raw, non-normalized averages produced by

each speaker and not the estimated marginal means that have been presented in previous

figures. The statistical results discussed above, however, were based on the vowel

productions of all of the speakers

224

Figure 4-32. Vowel distribution based on grammar score. Speaker 2f (dashed line) had a lower grammar score than speaker 10f (solid line). Values averaged across stress, syllable type, and task.

The analysis of the relationship between grammar and vowel backness (F2)

showed that only the estimates for /i/ and /e/ were found to be statistically significant.

Both of the coefficients were negative, indicating an inverse relationship between

grammar score and vowel backness (for /i/: -33.21 t (3254.07) = 6.11, p < .001; for /e/: -

27.47 t (3254.07) = 5.36, p < .001). In other words, as the grammar score increased,

there was a tendency for the F2 values of /i/ and /e/ to decrease. Speakers with higher

grammar scores produced /i/ and /e/ farther back in the vowel space. The differences in

backness of /i/ and /e/ are presented in Figure 4-33. The vowels produced by speaker 10f,

who scored 25 points on the grammar proficiency test, and speaker 7f, who scored 19 out

of 25 points on the grammar test, are included in the figure. The dashed lines represent

speaker 7f, and show how her productions of /i/ and /e/ are farther front in the acoustic

space when compared to those of speaker 10f.

300

400

500

600

700

800

900115014001650190021502400

F1 H

z

F2 Hz

Vowel by Grammar interaction

speaker 2f speaker 10f

225

Figure 4-33. Vowel distribution based on grammar score. Speaker 10f (solid line) had a higher grammar score than speaker 7f (dashed line). Values averaged across stress, syllable type, and task.

Finally, the interaction between vowel and grammar was found to be significant

for Euclidean distance based on the Type III Tests of Fixed Effects. A closer

examination of this interaction indicated, however, that none of the parameter estimates

for the individual vowels were significantly different from zero. A closer inspection of

the parameter estimates suggests that the interaction was driven not by differences

between the parameter estimates and zero, but rather by the differences between some of

the vowels themselves. For example, the estimate value for /u/ was -10.78 and the p –

value for this estimate was .212. This negative coefficient indicates an inverse

relationship between grammar and /u/: as grammar score increased, Euclidean distance

value decreased. Compare this value to the estimate for /a/, 4.98 (p = .216), which is

300

400

500

600

700

800

900

9001150140016501900215024002650

F1 H

z

F2 Hz

Vowel by Grammar interaction


226

positive.77 The difference between -10.78 and zero, and the difference between 4.98 and

zero, are both smaller than the comparison of these two parameter estimates to each

other. The distinction between the steepness of these slopes is what resulted in the

significance of this interaction, even though the parameter estimates for the vowels were

not statistically significant. The only vowel that approached significance in this

interaction was /i/ (-8.54 t (3254.05) = -1.94, p = .052). The negative coefficient

indicates that as grammar score increased, the Euclidean distance of /i/ from the centroid

decreased, but again this value only approached significance.

In summary, the participants’ scores on the grammar proficiency test were shown

to correlate with their pronunciation of certain vowels. In terms of vowel height, those

who scored higher on the grammar test tended to articulate their /o/ lower in the acoustic

space and their /a/ higher in the acoustic space when compared to participants with lower

grammar scores. The pronunciation of /i/ and /e/ was also affected, but with respect to

backness. As a participant’s grammar score increased, the F2 values of the front vowels

was found to decrease, resulting in vowels that were produced farther back in the acoustic

space. The analysis of the relationship between dispersion (Euclidean distance) and

grammar were difficult to interpret because none of the parameter estimates were found

to be significantly different from zero. All but one of the parameter estimates were

negative, indicating that there could be an inverse relationship between grammar score

77 As will be explained in greater detail later in this chapter, the parameter estimates for the individual vowels are relative to the parameter for the main effect, which in this analysis is also the vowel /u/. In the breakdown of the interaction, this value is converted to zero, and the estimates of the other vowels are compared to the zero value in order to determine their significance. Determining the coefficient of the parameter estimate and the size of the effect present for each vowel—“the interaction term” -- is actually determined based on the original parameter for the main effect. The interaction term for /a/ is thus actually negative (-5.8) because this is the difference between the main effect (-10.78) and the value reported for /a/ (4.98). As this coefficient was not found to be significant, however, it will not be discussed further here. A more in-depth explanation of the relativity of the parameter estimates is presented later in this chapter.

227

and distance from the centroid. Again, these findings were not significant, and therefore

it is difficult to draw firm conclusions of the effect of grammar on dispersion based on

this data.

4.2.4 Spanish use

As mentioned in chapter three (section 3.4.2.4), each participant completed a

questionnaire and indicated the frequency with which they spoke Spanish (e.g., daily,

weekly, monthly, etc.) in a variety of contexts (e.g., at home, TV, reading magazines).

Each person was assigned a score of Spanish use based on how often they indicated that

they spoke/read/listened to Spanish in a variety of contexts (refer to chapter 3 for a

detailed description of the scoring procedure). Of the three continuous social variables,

the frequency of use of Spanish had the clearest effect on vowel height, backness, and

distance from the centroid. Although there was no overall main effect of the use of

Spanish on vowel height, backness, and dispersion, the interaction between vowel and

Spanish use was found to be significant for F1 (F(4, 3254.08) = 5.70, p < .001), F2 (F(4,

3254.13) = 30.54, p < .001), and Euclidean distance (F(4, 3254.14) = 16.95, p < .001).

Each of these interactions will be described in greater detail in the following paragraphs.

The effect of Spanish use on vowel organization and dispersion is presented in

Figure 4-34. The figure represents the overall vowel space of two male participants’

productions of vowels: speaker 1m and speaker 3m. Speaker 1m had a Spanish use score

of 40 points, indicating that he spoke and used Spanish in all of the contexts included in

the questionnaire on a daily basis (see Appendix C for all contexts). Speaker 3m had a

Spanish use score of 30 points, meaning that he spoke Spanish frequently, but not in

every context on a daily, or even weekly, basis. As will be explained in greater detail

228

throughout this section, all vowels showed some kind of effect in terms of height,

backness, or Euclidean distance, such that the overall vowel space was greater for those

who spoke Spanish with the greatest frequency. The expansion effect is clearly presented

in the figure, except for the differences in the back vowel /u/, which for these two

speakers was situated in approximately the same location. The solid line represents the

overall vowel space for speaker 1m, and the dashed line represents the vowel production

of speaker 3m.

Figure 4-34. Representation of vowel production based on Spanish use. Speaker 1m (solid line) had a higher Spanish use score than speaker 3m (dashed line). Values averaged across stress, syllable type, and task.

The Table of Estimates of Fixed Effects indicated that the parameter estimates for

the vowels /a/ and /o/ were significant for F1. The coefficient for /a/ was positive,

indicating that as the use of Spanish increased (i.e., the participant reported speaking

Spanish more frequently), the F1 value of /a/ increased (2.39 t (3254.09) = 2.09, p =

.036). Thus, the /a/ was produced lower in the vowel space as the usage of Spanish

300

400

500

600

700

800

900115014001650190021502400

F1 H

z

F2 Hz

Vowel distribution by Spanish use

speaker 1m speaker 3m

229

increased. For the back vowel /o/, the coefficient was negative, revealing an inverse

relationship between Spanish use and vowel height (-2.72 t (3254.09) = -2.09, p = .036).

As Spanish use increased, the F1 value of /o/ decreased, resulting in it raising to a higher

position within the acoustic space.

Along the F2 dimension, every vowel had a parameter estimate that differed

significantly from zero.78 The parameter estimate for /u/, which also serves as the

comparison value, was -23.29 (t(10.02) = -2.77, p = .019). The negative coefficient

indicated an inverse relationship between the F2 value of this vowel and the Spanish use

score. As the Spanish use score increased, the value of F2 decreased, resulting in the /u/

situating itself farther back in the vowel space. The same pattern was observed for /o/,

which on its own had a positive coefficient (10.26 t (3254.14) = 2.58, p = .010), but the

interaction term (i.e., the sum of the coefficient for /o/ (10.26) and for /u/ (-23.29)) was

actually negative (-13.03). The negative interaction term for /o/ also indicate an inverse

relationship between Spanish use and the F2 value of /o/. As the Spanish use score

increased, the F2 value of /o/ decreased, indicating that the /o/ moved farther back in the

78 Determining the coefficient (+/-) of the parameter estimate and thus the direction and size of the effect is determined by comparing the parameter of the individual vowel to that of the main effect (which in this analysis is also the parameter for the vowel /u/). The parameter estimates are thus relative to the main effect parameter value. This relationship is referred to as the “interaction term” and it is calculated by adding the parameter estimate of the main effect to that of the individual vowel. In some cases, this results in the changing of the coefficient for the parameter value. Thus far, the comparisons between the individual vowels and the parameter estimates, or at least the significant comparisons, have not resulted in the changing of the coefficient from negative to positive or vice-versa. For F1, for example, the parameter estimate for /u/ is 1.97, and all other values are compared to it in order to determine the direction of the effect. The parameter estimate for /a/ is also positive (2.39), so the direction of the effect positive. In the case of /o/, the parameter estimate is negative (-2.722). The relative parameter estimate of /o/, or the interaction term, is thus actually -.75, but still negative. In examining the interaction between vowel and Spanish use along the F2 dimension, the parameter estimate of /u/, which is added to the other estimates to determine the interaction term, is a relatively large negative number (-23.29). All other values by themselves are positive, but when their values are determined relative to -23.29, the parameters for /a/ (18.51) and /o/ (10.26) are actually negative: -4.53 and -12.78, respectively. In the discussion of the interaction of Spanish use and vowel for F2, the parameter estimates for individual vowels will be used in parenthetical citations, but reference to their interaction terms (relative values) will also be made. This interaction term most accurately describes the direction of the effect.

230

acoustic space. The low vowel /a/ also exhibited this same pattern (18.51 t(3254.14) =

5.30, p < .001; relative to /u/, the estimate is -4.53). As Spanish use increased, the F2

value of /a/ decreased, resulting in a vowel that was produced farther back in the vowel

space. The front vowels exhibited the same pattern, but in the opposite direction. Both

/i/ and /e/ had parameter estimates that were positive, thus as Spanish use increased, so

did the F2 value (for /i/: 38.38 t (3254.18) = 10.09, p < .001; for /e/: 24.91 t(3254.29) =

6.90, p < .001). The front vowels were therefore produced farther front in the vowel

space as the use of Spanish increased.

The relationship between vowel and Spanish use and the effect on Euclidean

distance confirms the results reported for F1 and F2. The parameter estimates for all

vowels except /i/ were found to be significantly different from zero. The parameter

estimate for /u/, which also serves as the comparison value, had a positive coefficient

(16.07 t(10.86) = 2.79, p = .018). Thus, as Spanish use increased, the Euclidean distance

of /u/ from the centroid also increased. The same behavior was observed for /o/, /a/, and

/e/: the distance of these vowels from the centroid also increased as Spanish use

increased.79 The overall trend revealed is that as the Spanish use score increased, the

overall dispersion in the vowels within the acoustic space also increased. Those speakers

who indicated that they spoke Spanish more frequently and in a larger number of contexts

exhibited vowel productions that were farther dispersed from the centroid.

79 On its own, the coefficient for /o/ was -9.28 (t(3254.16) = -2.89, p = .004). The interaction term, however, is positive: 6.79. As explained in the previous footnote, the coefficient, which determines the direction of the effect, is the comparison value for the main effect of Spanish use (in this case, 16.071). The same procedure was used to determine that the coefficients for /e/ and /a/ were actually positive. The parameter estimates reported in the Table of Parameter Estimates of Fixed Effects for /e/ was -6.87 (t(3254.37) = -2.36, p = .018) and for /a/ was -14.47 (t (3254.16) = -5.13, p < .001. Their relative values are both positive, however, with values of 9.19 and 1.6, respectively.

231

Based on the findings presented above, there is a strong relationship between

vowel production and Spanish use. Taking into consideration the results of the analysis

on F1, F2, and Euclidean distance, the overall pattern that emerges is that those who use

Spanish more exhibit an expansion of vowels within the acoustic space and an expansion

relative to their own centroid.

4.2.5 Cultural sensitivity

The final continuous social variable to be discussed is cultural sensitivity. Each

participant completed a cultural sensitivity questionnaire at the start of the experiment

(see chapter 3, section 3.4.2.5 and Appendix E). This questionnaire contained a number

of statements that were grouped into different categories (e.g., Empathy scale, Behavioral

scale) based on the type of cultural sensitivity and/or awareness they aimed to examine.

As there were a small number of participants in this investigation, only the total score

was included in the analysis and not the scores tabulated for the different subscales. The

decision to use the total score was based partly on the inability of the statistical model to

maintain its power with so many interactions containing vowel, and partly on the small

sample size. Thus, the results reported in this section pertain only to the overall cultural

sensitivity as calculated by the sum of each of the subscales within the questionnaire.

The Mixed Linear Model again indicated no significant main effects of cultural

sensitivity on any of the four dependent variables. The interaction between vowel and

cultural sensitivity, however, was significant for F1 (F(4, 3254.07) = 2.86, p = .002) and

for Euclidean distance (F(4, 3254.14) = 4.83, p = .001). The relationship between vowel

and cultural sensitivity will be explained in greater detail below, and a comparison

between two speakers is presented in Figure 4-35.

232

Although the Type III Tests of Fixed Effects indicated that the interaction

between vowel and cultural sensitivity was significant along the F1 dimension, an

examination of the parameter estimates indicated that none of the parameter estimates

were significantly different from zero. Because of the lack of significance found within

the parameter estimates of this interaction, it is again likely that the interaction was

driven by the differences between the intercepts for /u/ (-.760) and /i/ (.275). The

parameter estimate and negative coefficient for /u/ suggest that as cultural sensitivity

increased, the F1 value of /u/ decreased. In other words, /u/ was produced higher in the

acoustic space as cultural sensitivity increased. The reverse could be hypothesized for /i/,

which had a positive coefficient. Thus, as cultural sensitivity increased, the F1 of /i/ also

increased, resulting in a lower vowel. The p-values for these parameters were not

significant, however, indicating that statistically, there was no effect of cultural

sensitivity on these vowels. On a final note, the parameter for /e/ approached

significance (-.556 t (3254.03) = -.367, p = .051), suggesting that as cultural sensitivity

increased, there was a tendency for the F1 of /e/ to decrease (i.e.,, the /e/ was produced

higher in the acoustic space). Due to the lack of statistical significance and a clear and

observable pattern, however, it is inconclusive how cultural sensitivity affected the height

of HS Spanish vowels.

The interaction between vowel and cultural sensitivity was also significant for

Euclidean distance, but only the parameter estimate for the vowel /i/ was found to differ

significantly from zero ( -2.00 t (3254.03) = -2.71, p = .007). The significant effect

observed for /i/ may be related to the fact that /i/ had the largest positive parameter

estimate in the analysis of F1. The negative coefficient of the parameter estimate

233

indicated that as cultural sensitivity increased, the Euclidean distance of /i/ from the

center of the vowel space decreased. In addition, and as with the analysis of F1, the

parameter estimate for /e/ also approached significance (-1.36 t (3254.11) = -1.93, p =

.053). The negative coefficient indicates an inverse relationship with cultural sensitivity:

as cultural sensitivity increased, the Euclidean distance of /e/ from the centroid decreased.

Thus, with respect to the connection between cultural sensitivity and vowel dispersion,

there was a tendency for the front vowels to move closer to the center of the vowel space

as cultural sensitivity increased, although this trend was only significant for /i/.

The effects described for F1 and Euclidean distance are represented in the Figure

4-35. The figure contains the overall vowel space for two female speakers. Speaker 2f

earned a higher score on the cultural sensitivity questionnaire (186 points) than speaker

7f (151 points). As described in the results section above, the distance of the vowel /i/

from the center of the space decreased as the cultural sensitivity increased. Also notable

is the marginal effect of /e/ raising in speaker 2f.

234

Figure 4-35. Vowel distribution based on Cultural Sensitivity. Speaker 2f (dashed line) scored higher on the cultural sensitivity questionnaire than speaker 7f. Values averaged across stress, syllable type, and task.

Taken together, the results of this analysis suggest that although cultural

sensitivity may have some effect on the positioning of the front vowels and their

proximity to the center of each speaker’s vowel space, the relationship between cultural

sensitivity and vowel pronunciation was minimal. Only the parameter estimate for /i/

was found to differ significantly in the analysis of Euclidean distance. There were no

effects on F2 or duration, indicating that for this group of participants, cultural sensitivity

was not a major factor in determining how they pronounced the Spanish vowels.

4.2.6 Summary of between-subjects effects

The results of the analyses discussed in sections 4.2.1 – 4.2.5 indicated that

several of the individual variables affected vowel production. Although there were no

significant main effects on any parameter, there were significant interactions between the

variable “vowel” and all of the covariates for F1, F2, and Euclidean distance. An

300

400

500

600

700

800

900

9001150140016501900215024002650

F1 H

z

F2 Hz

Vowel by cultural sensitivity interaction


235

examination of the significant interactions showed that certain vowels were produced

differently based on the participants’ course level, travel frequency, grammar proficiency,

Spanish use, and cultural sensitivity.

The overall findings of course level revealed several interesting differences

between those participants who were enrolled in an intermediate-level Spanish course

and those who had already completed this course. The intermediate-level participants

produced /a/ higher in the acoustic space, and /o/ and /u/ were farther front in the vowel

space. The back vowel /u/ and marginally /i/ were closer to the centroid for the

intermediate-level participants. This suggests that the intermediate-level participants’

vowels were more condensed and that they utilized a smaller portion of the acoustic

space when compared to the advanced group. The advanced group also showed less /u/-

fronting than the intermediate group.

The effects of grammar proficiency and cultural sensitivity on HS vowel

pronunciation were less clear and less robust. The analysis on the effect of grammar

proficiency indicated that /o/ tended to lower as grammar score raised, and that the front

vowels showed a movement farther back in the vowel space as the grammar score

increased. As very few parameter estimates were found to differ significantly from zero,

it can be concluded that there is not a strong relationship between formal knowledge of

prescriptive Spanish grammar and vowel pronunciation. For cultural sensitivity, the only

significant effect was found with the vowel /i/: the Euclidean distance of /i/ decreased as

the cultural sensitivity score increased. The lack of significance in these analyses leads to

the conclusion that cultural sensitivity did not have a major impact on HS Spanish vowel

production.

236

Travel frequency and Spanish use were the two individual variables that appeared

to have the greatest effect on HS vowel pronunciation. With respect to the frequency of

travel, those speakers who indicated that they traveled frequently (once every 1-2 years)

differed significantly from the other two travel groups. Most notable is that the overall

vowel space was expanded relative to the other two groups, but this difference was only

significant for the point vowels. A similar trend was observed for the use of Spanish.

Those participants who scored the highest on the Spanish use questionnaire, indicating

that they spoke the language more frequently and in a larger number of contexts, showed

greater degrees of vowel dispersion from the center of the vowel space. Thus the front

vowels were produced farther front, the back vowels farther back, and the /a/ was

lowered. These findings suggest that speakers who have a stronger connection with the

heritage culture by traveling and speaking Spanish more frequently utilize a larger

portion of the acoustic space than those who are less connected.

Interestingly, there were no overall main effects or significant interactions

between the individual variables and vowel duration. The lack of significance in these

analyses indicated that duration did not vary or show any consistent effects based on the

between-subjects variables examined in this study. Thus, although the formant values

and acoustic distribution of the vowels for each speaker significantly different, and at

times robust, the duration of HS vowels was fairly stable across the participant groups.

The lack of effects of the between-subjects variables on duration contrasts with the

results reported for the within-subjects variables, in which vowel duration was found to

be affected fairly consistently for stress, syllable type, and speech style.

237

4.3 Overall summary of results

Sections 4.1 and 4.2. provided an explanation of the statistical results for the

analyses of the within- and between-subjects variables, respectively. All of the within-

subjects variables were shown to have a significant influence on the acoustic and

durational properties of HS Spanish vowels. In terms of the organization of the system,

the results indicated that the HS system exhibits five degrees of phonetic height and four

degrees of backness—subtle differences in height and backness that have not been

included in traditional descriptions of the Spanish vowel system. Upon examining the

overall vowel space, one can observe that the /i/ is produced far front in the vowel space,

the /e/ is raised and more central than what is typically described for Spanish, and the /u/

is much farther front than what would be expected in a Spanish vowel system. The

dispersion of the vowels also indicates that the system it is not symmetrical. Although

most researchers would argue against total symmetry of the Spanish vowel system, the

HS vowels analyzed in the present study occupy a shape that barely resembles a

“triangle” – a term often used to describe the Spanish vowel space.

The within-subjects variables examined in this investigation were stress, syllable

type, and speech style. All of these variables had a significant impact on the

pronunciation of HS vowels in terms of both acoustic properties (formant values and

Euclidean distance) and duration. Lexical stress had the most robust effects. Overall,

atonic vowels were found to have lower F1 values, more centralized F2 values, shorter

Euclidean distances, and shorter duration. The overall effect of lexical stress on HS

vowel production is thus shortening and centralizing atonic vowels. Although previous

research has argued for minor differences in quality and duration based on lexical stress,

238

the findings reported in this study show much greater effects. As will be discussed in

greater detail in chapter 5, the stress effects may be conditioned by a variety of external

factors.

Syllable type also significantly affected HS vowel quality and quantity. In terms

of quality, the overall pattern based on the results of F1, F2, and Euclidean distance

suggests that vowels produced in closed syllables are slightly more central than those

produced in open syllables. This effect was observed especially for the point vowels /i/,

/a/, and /u/. The analysis of the Euclidean distance measure offered more insight into this

effect by showing that vowels produced in closed syllables were overall closer to the

center of the vowel space than vowels produced in open syllables. The effect of syllable

type on duration was fairly robust. Overall, vowels produced in closed syllables were

longer in duration than those produced in open syllables. A secondary analysis revealed,

however, that the voicing characteristics and manner of articulation of the following

sound played a role in duration. Vowels in contact with voiced consonants were longer

in duration than those in contact with voiceless consonants. Interesting, however, is that

the consonant which was shown to have the greatest lengthening effect on vowel duration

was voiceless: /s/. As many syllables were closed with this consonant, the longer

duration observed for vowels produced in closed syllables may have been due to a

contextual effect. This limitation and imbalance will be addressed further in the

discussion chapter.

HS vowel quality and duration were also affected by speech style. Vowels that

were produced in the most controlled speech task (i.e., the carrier phrase task) were

shown to occupy a more peripheral location the vowel space. They were dispersed

239

significantly farther from the center of the vowel space when compared to the vowels

produced in the other two tasks. Vowels produced in the spontaneous speech task (i.e.,

narrative retelling) tended to be the most centralized. The vowels produced in the semi-

spontaneous speech task (i.e., picture/memory task) tended to pattern more with the

spontaneous speech than with the controlled speech, although there were some significant

differences in quality between these two tasks. Vowel duration was also significantly

affected by speech style. The shortest vowel durations were observed in the spontaneous

speech sample and the longest in the controlled speech. The overall pattern that emerges

from these analyses is that as speech style becomes more controlled, the degree of

centralization and proximity to the centroid decrease, and the vowel duration increases.

The analyses of the between-subjects variables offered insight into which

individual characteristics had the greatest influence on the pronunciation of Spanish

vowels. These analyses revealed that some of the individual characteristics might be

considered better predictors of, or more closely related to, HS vowel pronunciation.

Grammar proficiency score and cultural sensitivity were found to have little influence on

the pronunciation of HS vowels. The only vowels that were affected by grammar

proficiency were /i/, /e/, and /o/. As grammar proficiency score increased, the analyses

revealed a tendency for /o/ to be produced lower in the vowel space and for /i/ and /e/ to

be produced farther back in the vowel space. The other vowels did not exhibit any

significant effects based on grammar score. The only significant effect revealed by the

analysis of cultural sensitivity was the movement of /i/ closer to the center of the vowel

space. That there were so few effects of cultural sensitivity and grammar proficiency

suggest that these characteristics have little influence on the vowel pronunciation of this

240

group of HS. The lack of effect may be due in part to the small sample size—a limitation

that will be discussed further in the following chapter.

Course level, travel frequency, and Spanish use were found to have greater

impacts on pronunciation. Although at the start of this project the inclusion of course

level was not intended, the analysis based on each participant’s class level revealed a

trend.80 Overall, the participants who formed the “advanced” group (i.e., had already

completed the intermediate-level Spanish course designed for HS), produced /u/

significantly farther back in the vowel space than those in the intermediate-level classes.

The advanced group also exhibited a vowel space that was expanded relative to the

intermediate group of HS, although the expansion was not statistically significant. The

expansion effect was also observed in the analysis of travel frequency. The statistical

tests revealed that those HS who travel abroad every year or every other year pronounced

vowels that were more dispersed. Thus, the “frequent travel” group exhibited the largest

vowel space, whereas the other two travel groups pronounced vowels that occupied a

smaller and slightly more central portion of the vowel space. The frequency of use of

Spanish also had an effect on HS vowel quality. Those participants who spoke Spanish

more frequently also produced vowels that were more peripheral and spread out within

the acoustic domain. The analyses of the parameter estimates for this variable suggest

that, in a general sense, for this group of HS, the distance of the vowels relative to the

center of the vowel space increases as Spanish use increases.

80 Interestingly, the trend observed for course level seems to contradict with the findings of the analyses on grammar score. As grammar score increased, the front vowels were shown to move backward in the acoustic space. The opposite pattern was observed based on course level. The advanced-level participants, or those who were enrolled in 300-level Spanish classes, exhibited a fronting of these vowels relative to the intermediate group. This contradiction will be addressed further in chapter 5.

241

Taken together, these analyses portray the HS Spanish vowel system as differing

considerably from traditional descriptions of the Spanish vowels. Characteristics that are

typically described as not having a significant effect on Spanish vowel pronunciation

were found to have significant effects on HS vowel quality and quantity. In other cases,

differences that were expected to occur were not found to be significant (i.e., the absence

of the effect of syllable type on /e/ and /o/). Information pertaining to each individual’s

background and language use also proved to be insightful, and suggested that some

factors play a larger role in determining how the HS Spanish vowels are pronounced. A

generalized summary of all of the results is presented in Table 4-18.

242

Table 4-18

Independent variable Effect

Lexical stress Atonic vowels were centralized along the F2 dimension, had shorter Euclidean distances, and shorter duration than tonic vowels.

Syllable type

Few consistent effects on height and backness; Shorter Euclidean distances (i.e., less dispersion) for vowels produced in closed syllables; Longer vowel duration in closed syllables, only significant for front vowels.

Speech style

Vowels in spontaneous speech were the most centralized and showed less dispersion than controlled speech; Shortest duration in spontaneous speech, although significant effects only observed for /o/ and /a/.

Course level Intermediate-level participants produced /u/ that was significantly farther front when compared to advanced-level participants.

Travel frequency Frequent travelers produced point vowels more peripherally and exhibited greater dispersion overall when compared to the other two travel groups.

Grammar proficiency Front vowels were produced farther back in the vowel space and /o/ was produced lower in the vowel space by those participants who scored higher on the grammar proficiency test.

Spanish use Participants who reported speaking Spanish more produced vowels that were more peripheral than those who reported less Spanish use.

Cultural sensitivity The high front vowel /i/ was produced less peripherally (i.e., closer to the centroid) as cultural sensitivity score increased.

A more in-depth analysis and explanation of the effects described in the results

chapter will be presented in the following chapter, which contains the discussion.

243

5 DISCUSSION

The goal of this chapter is to revisit the research questions that guided the

investigation, explain the extent to which the hypotheses were confirmed or refuted by

the acoustic analyses, and discuss the implications of the findings. Sections 5.1 through

5.3 address each of the main research questions separately, providing a brief summary of

the findings and a more in-depth explanation of their significance. Section 5.4 provides

further discussion and interpretation of the expansion effect described in section 5.3. The

limitations of this investigation and suggestions for future research are discussed in

section 5.5. Finally, section 5.6 provides a brief summary of the discussion points.

5.1 Overall distribution and organization of HS vowels

The first research question guiding this study is presented below:

Q1: How are HS vowels organized and dispersed throughout the system, and

how does this organization and dispersion compare to traditional and acoustic

descriptions of monolingual Spanish varieties?

The analyses of the vowel pronunciation of the HS in this study revealed that the

distribution of their vowels throughout the acoustic space differs from traditional

impressionistic and acoustic descriptions of monolingual Spanish varieties in several

ways. Figure 4-1, presented in the results chapter, is replicated below as Figure 5-1, and

depicts the overall vowel space of the HS investigated in this study. The primary

differences will be described in greater detail below.

244

Figure 5-1. Overall vowel space for HS of Spanish averaged across all speakers, stress contexts, syllable types, and tasks.

First and foremost, the most notable characteristic presented in the figure above is

the asymmetrical distribution and organization of HS vowels within the acoustic space.

The asymmetry conflicts with previous descriptions of Spanish vowels, which argue for a

simple, stable, symmetrical, five-vowel system (Hualde, 2005). The statistical analyses

of vowel dispersion from the center of the vowel space of each individual speaker

indicated that the lack of symmetry was significant.81 The low vowel /a/ was found to be

located the closest to the center of the vowel space, followed by the front vowel /e/. The

back vowels /o/ and /u/ did not differ and were the third closest to the center, and the

81 It could potentially be argued that the centroid measure calculated for each speaker is not the true center of the vowel space. Previous studies which have calculated the center of the vowel space for a particular speaker, or group of speakers, have employed various methods. Harmegnies and Poch Olivé (1992), for example, determined the center of the vowel space based on a projected measure of schwa. In the present study, calculating the center of the space based on the F1 and F2 values of the point vowels assumed an equal weight along the F1 and F2 dimensions. That is, there was no adjustment made to account for the fact that small differences in F1 may be more perceptible than a small difference in F2 given the different frequency ranges that the formants tend to occupy. Even if this were not the exact center of the space, however, the distance of each of the vowels was at the very least measured from a stable point within the vowel space. This limitation will be discussed later on in this chapter.

/i/

/e/

/a/

/o/

/u/

300

400

500

600

700

800

90011001300150017001900210023002500

F1 H

z

F2 Hz Overall vowel space

245

front vowel /i/ was found to be the farthest from the center. If the HS vowel space were

symmetrical, as has been suggested, then one would expect that the high vowels /i/ and

/u/ would be roughly the same distance from the center of the space, and that /e/ and /o/

would be slightly closer and not differ significantly from one another. The statistical

analyses revealed that the triangular pattern typically attested for the Spanish vowel

system is not observed in the speech of bilingual HS of Spanish. The distinct pattern of

dispersion may be a characteristic specific to this group of speakers, but may also reveal

that some of the assumptions about the nature and organization of the Spanish vowel

system are not borne out when the acoustic properties of vowels are taken into

consideration.

Not only are the vowels not distributed symmetrically, but there are noticeable

differences in the interplay of height and backness between the high and the mid vowels.

The front vowels /i/ and /e/ were situated higher in the acoustic space than their back

vowel counterparts, /u/ and /o/. The statistical analyses presented in chapter 4 revealed

that the height differences were indeed significant, resulting in a five-level distinction in

phonetic vowel height. O’Rourke (2010) also reported a five-vowel distinction in height

for the bilingual and monolingual Peruvian speakers analyzed in her study. Thus, the

results of the present study support O’Rourke’s findings, suggesting that bilingual and

monolingual vowel systems exhibit fine-grained distinctions in height that cannot be

observed impressionistically.

Differences in displacement along the F2 (i.e., front-back) dimension are also

visible in Figure 5-1. The most outstanding characteristic is the position of the /u/. In

comparison to traditional descriptions of monolingual systems, the /u/ in heritage Spanish

246

is significantly farther front than would be expected. For several speakers, the /u/ was

actually pronounced farther front than the /o/, and the statistical analyses revealed that the

two back vowels were not significantly different from each other with respect to F2.

O’Rourke (2010) reported similar findings for the Peruvian bilinguals, and argued that

perhaps the distinction between the back vowels in Peruvian Spanish was due to

differences in F1 (height) and not F2 (backness). The results of the present study suggest

a similar conclusion: while the front vowels /i/ and /e/ are distinguished both in terms of

height and backness, the back vowels are only differentiated by height. The cue for

vowel identity for these speakers, thus, might be differences in F1 and not F2.

Further insight into the HS vowel system is gained by examining the distribution

of all vowels within the acoustic space, as well as the overall area of the acoustic space

they occupy. Figures 5-2 and 5-3 represent all of the vowel tokens produced by the

female and male HS analyzed in this study. As the figures below contain the raw formant

frequencies and not normalized values, the female and male data are presented separately.

Each color represents one of the five Spanish vowel categories, and the shapes represent

different speakers. Although they are particularly difficult to see for the female speakers

given the large number of data points, the ellipses represent one standard deviation from

the mean.

247

Figure 5-2. All vowel tokens produced by female HS (N = 2703). Green represents /i/, blue /e/, red /a/, orange /o/, and purple /u/.

Figure 5-3. All vowel tokens produced by male HS (N = 639). Purple represents /i/, blue /e/, green /a/, red /o/, and orange /u/

248

The individual vowel productions presented in Figures 5-2 and 5-3 demonstrate

that the vowels produced by HS occupy specific regions within the acoustic space.

Generally speaking, the acoustic areas are maintained separately, with each vowel

occupying its own specific region within the acoustic space. The figures do, however,

reveal that some vowel categories overlap slightly, and that the degree of overlap varies

by gender.82 The greatest degree of overlap is observed for the back vowels /o/ and /u/

for both genders. The front vowels /i/ and /e/ exhibit a greater degree of separation, with

only slight overlap between these two categories. As shown in Figure 5-2, some of the

female productions of /e/ were produced far back in the vowel space, so much so that

they overlap with the most fronted productions of /o/. In contrast, the male speakers did

not exhibit as much convergence of the mid vowel categories. Some male speakers did,

however, produce a few tokens that were extremely high and back, falling within the

boundaries of the space established for /i/ and /u/. Unlike the other vowels, the acoustic

space occupied by /a/ varies height-wise for both the male and female speakers. The low,

central vowel /a/ exhibited F1 values that varied roughly between 1100 Hz and 500 Hz

for the females, and 800 Hz and 600 Hz for the males. Thus, Figure 5-2 and Figure 5-3

indicate that HS Spanish vowels tend to cluster in separate regions within the acoustic

space, and that certain vowels show more variability in their productions along the front-

back dimension (F2) whereas others vary primarily in height (F1).

82 A comparison between the female data presented in Figure 5-2 and the male data presented in Figure 5-3 suggests that the amount of overlap between the vowel categories may be somewhat greater for females than for males. The unequal number of male and female participants, however, may have resulted in Figure 5-2 (females) appearing more crowded than Figure 5-3 (males). Figure 5-2 contains 4 times as many data points as Figure 5-3, leading to the appearance of greater overlap. Although it is possible that male and female speakers would exhibit differences in pronunciation, it is not possible to address this question further given the unequal distribution of gender in this study.

249

The overall cloud data presented in the figures above demonstrated some

separation of vowel categories for male and female speakers with the exception of the

back vowels. By examining just the mean values and the standard deviations from the

mean, however, a slightly different picture emerges when compared to the cloud data.

Figure 5-4 and Figure 5-5 illustrate the acoustic regions occupied by each of the vowel

categories by female and male HS, respectively. The ellipses represent two standard

deviations of F1 and F2, and the symbol in the center of the ellipse represents the mean

value of each vowel as averaged across all speakers of the same gender. The values

presented in Figure 5-4 and Figure 5-5 are non-normalized, and for this reason females

and males are presented in their own separate plots.

Figure 5-4. Dispersion of vowel categories for female HS (N = 2703). Ellipses represent two standard deviations from the non-normalized mean, and the symbol in the center of the ellipse represents the mean F1 and F2 value for each particular vowel category.

250

Figure 5-5. Dispersion of vowel categories for male HS (N = 639). Ellipses represent two standard deviations from the non-normalized mean, and the symbol in the center of the ellipse represents the mean F1 and F2 value for each particular vowel category.

Figures 5-4 and 5-5 above reveal several additional characteristics regarding the

organization and dispersion of HS vowels. First of all, the shape and size of the ellipses

confirm that certain vowels showed greater variation in terms of height, whereas others

showed more movement along the front-back continuum. The low vowel /a/ in particular

showed more variation along the height (F1) dimension and less dispersion along the

front-back continuum when compared to the high vowels /i/ and /u/. The mid vowels

exhibited somewhat equal amounts of variation in terms of both height and backness, as

evidenced by the near circular shape of the area it encompassed, especially for the /e/.

251

Thus, the type of variation that HS vowels exhibit is not uniform for all vowels, but rather

varies according to the vowel category. For this group of speakers, the majority of the

variation in /a/, for example, was observed in terms of height. The back vowel /u/, in

contrast, exhibited some height variation, but the movement was primarily observed

along the front-back (F2) dimension.

In addition, an examination of the ellipses confirms that there is some degree of

overlap between the vowel categories for both male and female speakers, especially when

two standard deviations from the mean are plotted.83 As can be observed above, there is

a considerable amount of overlap between the back vowel categories /o/ and /u/. This

result is not surprising given the proximity of the back vowels in the acoustic space (see

Figure 5-1). There is less overlap between /i/ and /e/, which follows given that the front

vowels exhibited a greater distance from one another in terms of pure averages when

compared to the back vowels. The mid vowel categories /e/ and /o/ showed very little

overlap, but the low vowel /a/ does encroach upon the space occupied by the mid vowels,

especially /e/.

In order to gain more perspective about the organization of the HS vowel space, the

non-normalized values for the male and female speakers in this study were compared to

those obtained by Servín and Rodríguez (2001) for male and females speakers of

Mexican Spanish. Although comparing across studies is less than ideal given that the

tasks and analysis techniques differ considerably, this comparison does offer an important

visual which illustrates the distinction between monolingual and bilingual Spanish

varieties. Figure 5-6 compares the vowel space for the female speakers based on their

83 When only one standard deviation is represented in the figures, all of the vowel categories are maintained separate, and there is no overlap.

252

utterances in the carrier phrase task to the values reported by Servín and Rodríguez

(2001). Figure 5-7 represents the same comparison for the male speakers. The dashed

lines represent the overall space produced by the HS, and the solid lines represent the

monolingual Spanish productions. Only the vowels produced in the carrier phrase tasks

are included here in order to maintain as much consistency across studies as possible.

Note also that these are the raw averages, whereas other figures have presented the

estimated marginal means determined by the statistical model. For this reason, the values

presented in these figures differ somewhat from those presented previously in chapter 4.

Figure 5-6. Comparison of vowels (N = 2703) produced by 13 female HS to vowels (N = 60) produced by 2 monolingual females in S&R (2001).

2003004005006007008009001000

7001200170022002700

F1 H

z

F2 Hz

Female HS compared to monolingual Mexican speakers in S&R (2001)

Female HS Female S&R (2001)

253

Figure 5-7. Comparison of vowels (N = 639) produced by 3 male HS to vowels (N = 60) produced by 2 monolingual males in S&R (2001).

Figures 5-6 and 5-7 demonstrate that the organization of HS vowels in the

acoustic space and the amount of space they occupy differ considerably from the

monolingual speakers of Mexican Spanish examined in Servín and Rodríguez (2001).

Generally speaking, the monolingual vowel space is larger than the bilingual HS vowel

space: the high front vowel /i/ is farther front, and the /u/ is positioned farther back in the

vowel space for the monolinguals. In addition, the /a/ is positioned farther back for the

HS females, and the male and female HS also produced this vowel lower than the

monolingual speakers. The mid back vowel /o/ is slightly lowered for both the male and

the female HS when compared to the native speakers. The HS /o/ is produced farther

front than the monolingual /o/, and farther back for the female HS. Finally, as discussed

previously, the HS /u/ is situated considerably farther front than the monolingual Spanish

productions of /u/.

200

300

400

500

600

700

800

900

6001100160021002600

F1 H

z

F2 Hz

Male HS compared to monolingual Mexican speakers in S&R (2001)

Male HS Male S&R (2001)

254

The fronted position of the back vowel /u/ in HS Spanish may be a consequence of

these speakers also being fluent, and some even dominant, speakers of English. The

location of the back vowel /u/ in English is known to differ from the position traditionally

described for Spanish. Bradlow (1994), for example, directly compared the production of

vowels across English, Spanish, and Greek. She reported F2 values for English ranging

between 1100 and 1300 Hz, whereas the average F2 value of Spanish /u/ was less than

1000 Hz (992 Hz in CVCV words). Chang et al. (2009) reported an even higher (i.e.,

more fronted) F2 value for English /u/-- nearly 1500 Hz. In contrast, the typical F2 value

of Spanish /u/ reported in acoustic studies of Spanish vowels falls between 700 and 900

Hz (Quilis & Esgueva, 1983; Servín & Rodríguez, 2001). In addition, the position of /u/

is known to exhibit even more fronting in certain varieties of American English as well as

in Mexican American varieties of English (Fought, 1999). Given that /u/ is produced

farther front in the vowel space in English than in Spanish, and that /u/-fronting has been

attested in varieties of Mexican American English, the fronted position of the /u/ in HS

Spanish may be induced by contact with English.84

Another distinct characteristic of the HS vowel system is the lowering and

backing of the /e/ relative to monolingual Spanish vowels. As shown in Figure 5-6, the

female participants in this study produced /e/ considerably farther back and lower than

the female monolingual speakers from Mexico City. The male HS also produced /e/

lower in the space and slightly farther back, but the degree is not nearly as great as that

exhibited by the female speakers. The pronunciation of /e/ for the HS is somewhat

consistent with the description provided by Lipski (2008) for Mexican American Spanish.

84 It is important to note, however, that O’Rourke (2010) also reported the fronting of /u/ among Spanish monolinguals and Quechua-Spanish bilinguals in Peru who do not have extensive contact with English.

255

He argued that there is a tendency for this vowel to be laxed and pronounced more like

/ɛ/, as in the word let, especially in final syllables, and that this characteristic was most

prevalent in the Midwestern United States. Although all of the vowels analyzed in this

study were non-final, it is possible that the lowering and backing of /e/ has extended to all

productions of this vowel for this group of HS, especially among the female speakers.

Whereas large differences in vowel quality were observed for HS, the analyses of

vowel duration revealed that the HS vowels patterned similarly to what has been

described for monolingual Peninsular Spanish (Marín Gálvez, 1995). Like the

monolingual speakers examined by Marín Gálvez, HS exhibited intrinsic vowel duration:

as vowel height increased, the overall duration of the vowel decreased. In this way, the

HS vowels patterned like monolingual Spanish speakers. It is important to mention,

however, that although the pattern of duration exhibited by HS and monolinguals is

similar, the HS vowels were overall longer than those produced by the monolingual

speakers reported in Marín Gálvez. The differences in monolingual and HS duration are

presented below in Figure 5-8. The black bars represent the HS vowel duration in the

CPT, and the gray bars the monolingual duration reported in Marín Gálvez, which were

obtained from a reading task.

256

Figure 5-8. HS and monolingual vowel duration in controlled speech. Marín Gálvez’s (1995) data represents Peninsular Spanish.

An examination of the vowel durations for each group of speakers indicates that

HS vowels were always slightly longer than those produced by monolinguals. The

durational differences were greatest for the low vowel /a/ and least for the high vowels /i/

and /u/. As no statistical comparisons between these two populations were made,

however, it is not possible to determine if these differences were significant. Even

though HS vowel duration is slightly longer than monolingual vowel duration, the former

is more similar to the monolingual productions than it is to the L2 learner durations

reported in Stevens (2011).85 The learner durations reported in Stevens’ study exhibited

intrinsic vowel duration as well, but were upwards of 200 milliseconds in length, even for

the high vowels. Thus, unlike vowel quality, the HS vowel durations were found to differ

85 As will be discussed in greater detail in section 5.3.2, Stevens (2011) examined the Spanish vowel durations of L2 learners in study abroad and at-home contexts. He compared L2 vowel duration to monolingual Peninsular vowel duration, and reported that L2 learners produced Spanish vowels that were significantly longer than monolingual productions. The vowels he analyzed were word-final, which, as mentioned previously in chapter 3, are often lengthened in some varieties of Spanish.

0 10 20 30 40 50 60 70 80 90

/i/

/e/

/a/

/o/

/u/

Duration in ms HS and monolingual vowel duration

HS vowels Marín Gálvez (1995)

257

very little from those reported in previous studies for monolingual Peninsular Spanish.

Overall vowel duration was therefore perhaps less affected by the influence of English

than was vowel quality. This assumption will be addressed further in section 5.2.1

pertaining to lexical stress.

The analyses of HS vowel quality and duration and the comparisons between their

productions and those of monolinguals indicate that HS of Spanish exhibit a unique

system of vowel pronunciation. The aforementioned finding therefore confirms two of

the hypotheses presented in chapter 2: 1) HS of Spanish would exhibit patterns of

pronunciation that conform to sociocultural norms associated with the population, and 2)

HS vowel productions would reflect influence from English. The pronunciation of the

vowels /e/ and /u/ in HS Spanish suggests that the participants in the present study may

be integrating some characteristics of English into their Spanish vowel pronunciations,

while at the same time maintaining pronunciations associated with their heritage variety

of Spanish. The production of /u/ is more similar acoustically to the English /u/, whereas

the /e/ was produced lower and farther back, especially for females—a characteristic

described by Lipski (2008) as typical of Mexican American Spanish in the Midwestern

United States.86 As will be discussed in greater detail in sections 5.2.1 and 5.2.2 below,

however, the lowering and backing of /e/ may have been the consequence of the contact

of many of the /e/ tokens with /s/. Central Mexican varieties of Spanish are argued to

exhibit unstressed vowel reduction, especially when the vowel is followed by /s/ (Lope

Blanch, 1972). Thus, the centralized position of the /e/ could be the result of contextual

effects, as many of the stressed and unstressed productions of /e/ occurred in this context.

86 It is important to note that Mexican American Spanish is a contact variety. Thus, the laxing of /e/ described for this variety of Spanish could be contact induced. Lipski (2008) does not offer any explanation as to the cause of this lowering and backing of /e/.

258

A final important aspect to discuss is the position of the /a/ in HS Spanish. As the

HS described in this investigation are residents of Chicago and surrounded by speakers

who exhibit the Northern Cities Vowel Shift in English (NCVS), it is interesting that the

position of the /a/ is still central, or at least farther back in the space than the monolingual

Spanish speakers’ /a/. The F2 values exhibited by the HS are within the range of possible

fronted /a/ tokens associated with the NCVS, but the range of variation exhibited by the

/a/ is primarily in height and not in backness. The lack of variation along the F2

dimension for /a/ suggests that the HS in this study have most likely not integrated this

particular characteristic of the NCVS into their Spanish. The lack of /a/ fronting is

somewhat consistent with Konopka and Pierrehumbert (2008; in press) who reported that

speakers of Mexican American English in Chicago did not exhibit all of the

characteristics of this English vowel shift. Thus, as the NCVS is not entirely present in

Mexican American English varieties in Chicago, the lack of /a/ fronting in heritage

Spanish is not surprising.

This section of the chapter has offered an analysis of the HS vowel system, its

internal organization and dispersion, and a comparison between HS vowel properties and

those reported for monolingual Mexican Spanish. The following sections discuss each of

the between- and within-subjects variables separately, and offer greater insight into the

role of these variables in determining the organization of the HS vowel system.

5.2 Within-subjects variables

The second research question addressed in the present investigation pertains to the

effects of three within-subjects variables on HS vowel pronunciation. This question is

presented again below:

259

Q2. How do lexical stress, syllable type, and speech style affect HS vowel quality,

quantity, and dispersion?

The effects of each of the within-subjects variables are described in their own

separate subsection. Lexical stress is discussed in section 5.2.1, syllable type in section

5.2.2, and speech style in section 5.2.3.

5.2.1 Lexical stress

Of all of the analyses conducted in this dissertation, the influence of lexical stress

on HS vowel production was the most robust. HS vowels exhibited significant

differences in atonic and tonic vowel production in terms of quality, dispersion, and

quantity. The main finding was that atonic vowels were more centralized, exhibited

smaller Euclidean distances from the center of the vowel space, and were significantly

shorter in duration when compared to their tonic counterparts. The distinctions between

tonic and atonic vowels were also observed to some degree in all of the speech styles.

Thus, it was not the case that differences in stress were only observed in the most

spontaneous speech productions, but that the distinction between tonic and atonic vowels

as also observed, and was significant, in the controlled speech task.

That HS exhibit differences in quality conflicts with the argument that Spanish

vowels do not exhibit quality differences based on lexical stress (Delattre, 1969).

Although the former did report very small degrees of reduction in the form of

centralization for Spanish, the degree of reduction was considerably less than what was

reported for English. The stress differences observed for HS are, however, consistent

with the impressions of Navarro-Tomás (1918) and the findings of the acoustic

investigation conducted by Menke and Face (2010). Based on impressionistic data,

260

Navarro-Tomás did argue that vowels exhibited, albeit slight, differences in quality based

on stressed context. Menke and Face (2010) also reported that L2 learners of Spanish at

varying levels, as well as the monolingual control group, produced atonic vowels that

were more centralized when compared to tonic vowels. The latter reported that the

difference was more robust along the F2 dimension, in that the atonic vowels exhibited

greater movement along the front-back continuum when compared to the height

continuum. This same tendency was also observed in HS Spanish, as all vowels except

for /a/ exhibited significant degrees of centralization along the F2 dimension, but only the

non-high vowels varied significantly according to height. Taken together, the height and

backness differences resulted in atonic productions that were raised and centered, but like

Menke and Face’s (2010) findings, the centralization effect was greater along the F2

dimension. The raising and centering of atonic vowels was further reflected by their

shorter Euclidean distances from the center of the vowel space, confirming the robustness

of the stress effect.

The atonic vowels produced by HS were also significantly shorter in duration than

the tonic vowels. The durational differences between tonic and atonic vowels are

consistent with Navarro-Tomás’s (1918) impressionistic description that atonic vowels in

Spanish are reduced. The present study also corroborates Marín Gálvez´s (1995) acoustic

results, which reported that atonic vowels were significantly shorter than tonic vowels.

However, as with the previous comparison between overall HS duration and the

monolingual values reported by Marín Gálvez, an important difference emerges. The

degree of shortening between the tonic and atonic vowels is greater for HS than for

monolinguals. The monolingual tonic vowel durations reported in Marín Gálvez were

261

argued to be 20.3% longer than the atonic vowels (tonic = 70.17 milliseconds, atonic =

58.31 milliseconds). The HS in this study, however, showed more than a 45% increase in

duration between the atonic and tonic vowels (tonic = 82.04 milliseconds, atonic =

52.34). In other words, although monolingual speakers of Spanish do exhibit some

differences in duration as a consequence of lexical stress, the effect is nearly two times

greater for HS of Spanish.

Taken together, the quality and quantity differences observed in HS Spanish

indicate that this bilingual variety of Spanish exhibits robust differences in vowel

production based on lexical stress. There are two potential explanations for these

differences in quality and duration: 1) the HS in this study exhibit the unstressed vowel

reduction (UVR) that is argued to exist in central Mexico (Boyd-Bowman, 1952; Lope

Blanch, 1972; Matluck, 1952), or 2) HS have integrated characteristics typically

associated with English vowel reduction into their Spanish.87

As described in chapter 2, section 2.2.2, one notable characteristic of central

Mexican Spanish is the reduction of unstressed vowels, especially the mid vowels when

in contact with /s/. The reduction observed in this bilingual variety of Spanish could,

thus, represent the reduction reported by Boyd-Bowman (1952), Lope Blanch (1972), and

Matluck (1952). If the input that these speakers received as children consisted of a

variety which exhibited this reduction, the HS in this study could have adopted this

characteristic into their Spanish as adults.

Delforge (2008), however, argued that “reduction”, at least in Andean Spanish,

was better characterized acoustically as devoicing. The acoustic analyses of the HS

87 As few acoustic studies of monolingual Spanish vowels have directly examined the effects of lexical stress in recent years, a third possible explanation is that all Spanish dialects exhibit unstressed vowel reduction.

262

vowel productions in this study, however, did not reveal great amounts of devoicing of

unstressed vowels. What is more is that the reduction affected all vowels fairly equally.

Although the mid vowel /e/ did exhibit a noticeable degree of centralization, especially

for the female speakers, all atonic vowels were centralized based on the Euclidean

distance measures calculated for each speaker. Matluck (1952) argued that /a/ in

particular was somewhat resistant to reduction in monolingual Spanish. The HS in this

study, however, produced atonic /a/ significantly higher in the vowel space than their

productions of tonic /a/. All unstressed vowels exhibited a tendency to centralize,

although the non-low vowels exhibited a greater movement along the F2 dimension

whereas the low vowel was primarily only raised. In terms of reduction with respect to

duration, all vowels again exhibited shortening in unstressed syllables as opposed to

stressed syllables. Thus, the vocalic reduction was not restricted to only the front vowels

as might be expected if this effect were attributed to a characteristic of the input variety

that these speakers overheard during childhood.88

A second potential explanation for the reduction of atonic vowels in HS Spanish

is influence from English. As mentioned previously, English is characterized by great

degrees of unstressed vowel reduction both in terms of quality and quantity. Delattre’s

(1969) instrumental study confirmed this tendency, revealing that unstressed vowels in

English reduced and centralized to the neutral vowel schwa /ǝ/ more than French,

German, and Spanish vowels. In addition, Spanish and English are typically classified as

belonging to two different rhythm classes. English is generally described as a stress-

88 Another notable finding is that the /i/ and /e/ were found to be significantly longer when produced in closed syllables as compared to open syllables. Interestingly, many of the syllables were closed by /s/. That /i/ and /e/ were longer and not shorter in this context provides further evidence against UVR (as described by Lope Blanch, 1972) as the primary cause of the lexical stress differences in HS Spanish. Further description of the effect of syllable type on duration will be provided later on in this chapter.

263

timed language, meaning that it exhibits more variability in vowel durations based on the

placement of stress and a greater degree of vowel reduction (Pike, 1945; Grabe & Low,

2002; Harris & Gries, 2011). Spanish, in contrast, is typically considered a syllable-

timed language, characterized by less variability in syllable durations and lesser amounts

of reduction of unstressed vowels (Pike, 1945; Grabe & Low, 2002; Harris & Gries,

2011).

Bilingual speakers who have access to languages with different rhythmic profiles

have been shown to integrate the rhythmic characteristics of one language into their other

language. Carter (2007), for example, examined the Pairwise Vowel Variability Index

(PVI) of speakers of Spanish, Latino varieties of English in North Carolina and Texas,

Anglo-American English, and Afro-American English. He reported that the speakers of

Latino English in both North Carolina and Texas produced their English with a pattern

that was more syllable-timed (i.e., more Spanish-like) than Anglo- and Afro-American

English speakers. The Latino English PVI, however, was greater than that of the

monolingual Spanish speakers, thus indicating that their rhythmic patterns fell between

that exhibited by monolingual English and Spanish speakers. In addition, an examination

of the rhythmic pattern of one female speaker from North Carolina indicated that over the

course of two years, her English became slightly more stress-timed (i.e., English-like)

than it was when she was first interviewed shortly after her arrival to the United States

from Mexico. Therefore, these reports suggest that not only can bilingual speakers create

a modified rhythmic system, but that over time the prosodic characteristics of one of their

languages can impact the other. With reference to the HS in this study, it is possible that

264

the large differences in vowel quality and quantity due to stress exhibited in their vowel

production are the result of exposure to and use of English.

The overall findings from the present investigation and the others cited above

indicate that bilingual systems of pronunciation exhibit stress effects that are traditionally

argued to be non-existent in monolingual varieties of Spanish. For HS of Mexican

heritage residing in Chicago, these lexical stress differences are manifested in terms of

both quality and quantity. Although it is possible that the lexical stress differences

produced by this group of speakers are representative of UVR known to exist in central

Mexico, it is perhaps more likely that the influence of English has resulted in the robust

degrees of centralization and reduction observed in these speakers’ productions. As

English atonic vowels are known to reduce to schwa and exhibit shorter durations, it is

possible that this stress effect is so noticeable that it is one of the first to transfer into the

speech of bilingual Spanish HS in this region.

5.2.2 Syllable type

In contrast to the results of lexical stress discussed above, the analyses of syllable

type were less robust, and revealed several unanticipated findings. The analyses of

syllable type revealed that at least for this population of HS, syllable structure had little

effect on vowel production. The effects that did emerge were not consistent: only certain

vowels exhibited differences in pronunciation based on syllable type, and some vowels

showed opposite effects in the different speech styles. One of the most interesting

findings of this analysis, however, is that not only did the mid vowels /e/ and /o/

generally not exhibit the supposed allophony claimed in previous descriptions of Spanish

265

vowels (Navarro-Tomás, 1918), but that the point vowels did exhibit minor differences in

production.

Averaging across all speech styles and stress contexts, none of the HS vowels

exhibited significant differences in height based on syllable type. The lack of height

differences conflicts with the description provided by Navarro-Tomás (1918), who

claimed that the high vowels, for example, exhibited more close productions (i.e., higher)

when produced in open syllables and more open (i.e., lower) productions when they

occurred in closed syllables. An examination of the high vowels (see Figure 4-9) does

show that the vowels produced in open syllables had higher F1 values than those

produced in closed syllables, but the differences were not statistically significant.

Significant differences on the F2 dimension were interestingly only revealed for the point

vowels when averaging across all three tasks. The /u/ and /a/ were produced significantly

farther front in closed syllables and the /i/ significantly farther back when compared to

those produced in open syllables. Thus, there is some indication that the high vowels do

exhibit some of the distinctions argued by Navarro-Tomás in that they are slightly

centralized and lowered when produced in closed syllables. The finding that /a/ differed

along the F2 dimension is also interesting, given that Navarro-Tomás argued that the /a/

was central regardless of whether it was produced in a closed or open syllable. He did,

however, describe that the /a/ exhibited differences in quality when in the context of velar

and palatal consonants and diphthongs.

The analyses of the Euclidean distance measures, however, indicated a more

robust pattern. As Euclidean distance is based off of both F1 and F2, it may offer a

clearer picture of how the vowels are dispersed and organized throughout the space. The

266

overall analysis of Euclidean distance showed that vowels produced in closed syllables

were closer to the center of the vowel space than those produced in open syllables. This

trend was statistically significant for all vowels when averaging across all three speech

styles. The smaller Euclidean distance measures for vowels produced in closed syllables

lead to two potential conclusions: 1) Vowels produced in closed syllables are slightly

more centralized (i.e., /a/ is higher, the non-high vowels are lower, the front vowels have

lower F2 values, and the back vowels have higher F2 values) than vowels produced in

open syllables, or 2) Vowels produced in open and closed syllables do exhibit minor

differences in pronunciation, but these differences are not noticeable when height and

backness are considered alone.

When the effects of syllable type on vowel production were examined in each of

the three tasks, significant differences in /e/ and /o/ did emerge, but were uncommon.

The closed and open tokens of /e/ were only found to differ significantly from one

another in the PIT, and only along the F2 and Euclidean dimensions. In the PIT, the /e/

in closed syllables was produced significantly farther back in the vowel space than the /e/

produced in open syllables; it was also slightly higher, although not statistically. The

backing and slight raising of /e/ in this task could be attributed to the allophonic variation

described by Navarro-Tomás, but if this were the case, then we might expect the /e/ to

have been produced lower (i.e., more open). Another explanation for this effect is the

influence of the following consonantal context, and more specifically, contact with /s/.

The PIT contained the greatest number of /e/, and also /i/, tokens in contact with /s/.

Front vowels in contact with postdental consonants typically show a lower F2 value than

those produced in null contexts, and the effect of consonantal context has generally been

267

argued to affect F2 more than F1 (Stevens and House, 1963). Thus, although the /e/ may

have exhibited the allophony argued to exist by Navarro-Tomás, it is perhaps more likely

that the effect of syllable type on /e/ can be attributed to consonantal context.

The /o/ exhibited differences in height, backness, and Euclidean distance in the

NRT and only in height in the CPT. In the NRT, the /o/ produced in closed syllables was

produced farther front and lower than the /o/ produced in open syllables. The lowering

and fronting of /o/ in closed syllables is consistent with Navarro-Tomás’s description: as

predicted, /o/ produced in closed syllables had a more open articulation. In the CPT,

however, the /o/ was actually produced higher in the closed syllables than in the open

syllables, contradicting what would be expected if the effects of syllable type on vowel

pronunciation were consistent. Although no clear explanation for this reverse height

pattern can be presented at this time, it is possible that the differences in height were the

result of a task effect. The spontaneous nature of the NRT resulted in greater degrees of

centralization, thus the lowering and fronting effect observed for /o/ in this task may have

less to do with syllable structure and more to do with task. The CPT, in contrast, resulted

in more peripheral vowel productions, thus the raising of /o/ in this task could be a form

of hyperarticulation, or overshoot. The varying position, and the fact that /o/ exhibited no

differences based on syllable type in the PIT, could also indicate that syllable structure

does not have a consistent or robust effect on the pronunciation of this vowel.

That the mid vowels /e/ and /o/ exhibited few significant differences in height and

backness is perhaps the most striking finding of the overall analysis of syllable type. The

lack of an effect of syllable type on the pronunciation of the mid vowels differs greatly

from Navarro-Tomás’s detailed description of the variation argued to exist for the mid

268

vowels. In contrast, the results from the present study confirm the results of other

acoustic investigations (e.g., Morrison, 2004), which suggest that the distinctions

between open and closed mid vowels do not exist in monolingual varieties. One potential

factor which could have influenced this lack of effect, however, pertains to how the

vowel tokens were coded and categorized in the present study. Navarro-Tomás argued

that syllables closed by /s/ actually resulted in a close articulation (i.e., [e] as opposed to

[ɛ]), and thus perhaps should have been coded as “open” instead of “closed” in the coding

scheme. Henríquez Ureña (1921) and Matluck (1952), however, argued that in Mexican

varieties of Spanish, /e/ in a syllable closed by any consonant (including /s/) was

produced with the open variant [ɛ] and thus differed from the Castilian variety of Spanish

described by Navarro-Tomás (1918). Servín and Rodríguez (2001), in fact, also

classified syllables closed by /s/ as “closed” syllables. To maintain consistency with

Servín and Rodríguez, syllables closed by /s/ were coded as closed syllables in the

present investigation. If Navarro-Tomás was correct, however, the coding of /e/ in the

context of /s/ as “closed” could have skewed the results, in that the analysis would have

included a disproportionate number of /e/ tokens in open syllables.

In addition to vowel quality, vowel duration also varied according to syllable

type. Vowels produced in closed syllables were found to be significantly longer than

those produced in open syllables. This lengthening effect observed in closed syllables is

consistent with the results of Marín Gálvez (1995), although the length differences

reported in his study were not statistically significant. An additional analysis of vowel

durations in the present study revealed that the voicing characteristics and manner of

articulation affected the length of the preceding vowel. Vowels tended to be longer

269

when followed by voiced consonants, and were also found to be the longest when

followed by a fricative. The only vowels to show significant effects of duration based on

syllable type were /e/ and /i/. Interestingly, these vowels frequently occurred in syllables

closed by /s/. Thus, the /e/ and /i/ may have exhibited durational differences based on

syllable type as a consequence of their contact with /s/.89

The analyses conducted on syllable type indicated that HS vowels did not exhibit

consistent quality differences based on the type of syllable in which they were produced,

the mid vowels did not exhibit distinct categories based on syllable structure, and that

only the front vowels exhibited differences in duration. The aforementioned findings do

not support the hypothesis that if differences in syllable type were revealed, that they

would be the most robust for the mid vowels. The overall effect of syllable type on

duration partially confirmed the hypothesis that vowels produced in closed syllables

would be longer than those produced in open syllables, but again, the duration effect was

only found for the front vowels. As discussed previously, it is likely that the longer

durations in closed syllables were due to the front vowels’ contact with the consonant /s/.

The statistical analyses of syllable type thus have implications for how the

Spanish vowel system is described. Most traditional accounts of Spanish vowels argue

for the presence of distinct mid vowel allophones based on syllable type, and this

argument appears in numerous textbook descriptions of the system (see Hualde, 2005,

among others). The acoustic results of the present study, coupled with those of Morrison

89 The duration of /i/ and /e/ also relates to the question of whether or not the reduction of atonic vowels in HS is a consequence of UVR or a transfer effect from English. The front vowels /i/ and /e/ are argued to be the two vowels that are most subject to reduction and/or devoicing in Mexican Spanish, and especially when they are in contact with /s/. That the front vowels actually exhibited a lengthening effect when in contact with /s/suggests that HS do not exhibit the same UVR that characterizes central Mexican Spanish. The reduction of the front vowels, thus, might be better described as centralization in terms of quality, and to a lesser degree reduction in duration.

270

(2004) and Servín and Rodríguez (2001), suggest that the differences attested for the

Spanish mid vowels may not be consistent or robust enough to result in statistically

distinct categories. Although slight differences in quality may emerge as a consequence

of articulatory timing and/or coarticulation, they may be so slight as to not be manifested

acoustically. The slight differences in articulation may even exist below the perceptual

level, which could explain why the HS in this study did not exhibit substantial vowel

quality differences based on syllable type. Only one of the tasks in this study was

somewhat balanced by syllable type (the PIT). A more carefully controlled study that is

balanced by consonantal context, like Morrison’s (2004) investigation, could reveal

distinct mid vowel categories. In light of his results, however, the present investigation

seems to provide further evidence against mid vowel allophony based on syllable

structure.

5.2.3 Speech style

The final within-subjects variable analyzed in this dissertation was speech style.

The main goal was to determine if and how different styles of speech influenced vowel

quality and vowel quantity in heritage Spanish. Like the results for lexical stress, the

analyses of speech style revealed robust effects on vowel quality, dispersion, and

quantity. The main findings of the analysis of speech style revealed that vowels

produced in controlled speech (the CPT) were produced more peripherally in the vowel

space than those produced in the spontaneous (NRT) and semi-spontaneous (PIT) speech

tasks and had greater Euclidean distances from the center of the vowel space. The

vowels produced in the NRT exhibited shorter durations than the other two tasks. In

many instances, the vowels produced in the NRT and PIT did not differ significantly

271

from one another with respect to quality, indicating that the PIT results in speech

production that resembles that obtained from a spontaneous speech task. The vowels

produced in the PIT were more centralized than those produced in the CPT, also

suggesting that a picture identification task results in speech that differs from that

obtained in a controlled task that requires participants to read written text. Taken

together, the results described above confirm the hypotheses that vowels produced in

spontaneous speech would exhibit greater degrees of quality and quantity reduction when

compared to vowels produced in controlled speech samples.

The differences in vowel centralization, overlap, and dispersion are more clearly

represented in the vowel cloud diagrams presented below. Figure 5-9, Figure 5-10, and

Figure 5-11 plot all of the vowels produced in the CPT, PIT, and NRT, respectively. The

male and female speaker data were combined and normalized using the Lobanov (1971)

method in NORM (Thomas & Kendall, 2007) so that all of the data could be presented

together in one figure.

272

Figure 5-9. Dispersion of normalized male and female vowels in the CPT (N = 984).

Figure 5-10. Dispersion of normalized male and female vowels in the PIT (N = 1247).

273

Figure 5-11. Dispersion of normalized male and female vowels in the NRT (N = 1111).

As demonstrated in Figure 5-9, Figure 5-10, and Figure 5-11, the clustering and

overlap of vowel categories differ considerably based on the speech style in which the

vowels were produced. Figure 5-9, for example, represents the vowels produced in the

CPT—the most controlled speech task. There is almost complete separation of the vowel

categories in this task, with the exception of the back vowels. As shown in Figure 5-10,

which represents the PIT, there is overlap between the vowel spaces and the positions of

the vowels are more centralized than those in the CPT (Figure 5-9), but slightly less so

than the vowels produced in the NRT (Figure 5-11). Finally, in Figure 5-11, all of the

vowel categories converge upon one another, and occupy a more centralized location in

the space when compared to the vowels produced in the PIT (Figure 5-10) and CPT

274

(Figure 5-9). The findings of the analyses of dispersion and overlap of vowels in the

three tasks employed in the present investigation are similar to those reported by

Harmegnies and Poch-Olivé (1992), Poch-Olivé et al (2008), and Martín Butragueño

(2011). All three of the studies mentioned above reported that vowels produced in

spontaneous speech gathered in a narrative task were more centralized and showed

greater overlap than vowels produced in a carrier phrase task. The similarities between

the previous studies and the current investigation indicate that HS, like native Spanish

monolinguals from Spain and Mexico, also produce vowels differently based on the

spontaneity of speech style.

Vowel duration also differed as function of speech style, but the effects were not

as robust as those reported for vowel quality and Euclidean distance. The overall main

effect of speech style revealed that vowels produced in the NRT had overall shorter

durations than those produced in the PIT and CPT, but that the latter two tasks did not

differ significantly from one another. The pairwise comparisons indicated that only two

vowels, namely /a/ and /o/, exhibited significantly shorter durations in the NRT when

compared to the other two tasks. The results of the analyses of duration, then, are not

identical to those revealed for vowel quality. Whereas quality and Euclidean distance

measures were similar between the NRT and PIT, similarities in duration were observed

for the PIT and CPT. In other words, the two most spontaneous speech tasks patterned

together with respect to quality, but the most controlled speech task (CPT) and the semi-

spontaneous speech task (PIT) were more similar with respect to duration.

The lack of a significant difference in duration between the PIT and the CPT may

be due to the similar nature of these two tasks. Although the PIT required identifying and

275

naming objects that were presented on a screen and perhaps involved more cognitive load

than simply reading a word out loud, the utterance type was similar to that of the CPT.

The words containing the target vowels were produced at the end of a phrase in an

isolated sentence for both tasks, and the speaking rate was somewhat similar. The

similarity in the utterance type and speed at which the CPT and PIT were completed may

explain why the duration of the vowels produced in these two tasks did not differ

significantly from one another. The NRT, on the other hand, prompted rapid and

spontaneous speech, and vowels were extracted from different points throughout the

phrase, not just utterance-finally. The combination of speaking rate differences and the

analysis of vowels produced at all points throughout the phrase may explain why vowel

length was significantly shorter in the NRT than in the other two tasks. Taken together,

these findings 1) indicate that vowels produced in spontaneous speech are shorter and

more reduced in quality than those produced in carrier phrase-like tasks, and 2) suggest

that vowels produced in picture identification tasks exhibit durations that are similar to

those produced in carrier phrase tasks, but result in greater degrees of quality reduction

similar to that observed in spontaneous speech.

The quality and quantity differences observed in the different speech styles could

be further explained within the framework of H&H theory (Lindblom, 1990). According

to Lindblom, speech varies along a continuum of hypoarticulated speech, in which

phonetic targets are undershot, and hyperarticulated speech, in which the phonetic targets

are overshot. A complex interplay of listener-oriented and speaker-oriented processes

determine where along the H&H continuum a speech utterance will fall. In situations

when a lexical item is predictable and/or there are sufficient contextual cues as to the

276

identity of a lexical item, the speaker will often default to a more economic means of

speaking – hypo speech. When lexical access could be impeded, a speaker may

hyperarticulate in order to ensure that the listener is able to perceive and process the

necessary lexical item or items in an utterance.

In the controlled speech task when emphasis was placed on one target word but

there were fewer contextual cues, speakers had a tendency to overshoot the vowel targets

to ensure that the target lexical item was produced and subsequently perceived clearly.

As a result, the vowels in the CPT were produced with more extreme and peripheral

formant values. In contrast, in the spontaneous speech task (NRT) when more contextual

cues to the lexical items’ identity were present, the participants undershot the vowel

target, resulting in more centralized formant values. Thus, the vowels produced in the

CPT could be classified as hyperarticulated, whereas the vowels produced in the NRT

might best be described as hypoarticulated. Evidence for hypoarticulated speech in

spontaneous speech tasks has also been reported in the studies conducted by Harmegnies

and Poch-Olivé (1992) for Spanish and Savy and Cutugno (1998) for Italian.

All in all, the analyses of speech style yield three important conclusions. First of

all, like monolingual speakers of Spanish, HS also exhibit greater centralization of

vowels in spontaneous speech when compared to controlled speech. Thus, the

aforementioned finding suggests that speech style effects are robust even in bilingual

populations. Second of all, the similarities between the NRT and the PIT indicate that a

picture identification task is an adequate means to gather semi-spontaneous speech data.

That the PIT and CPT differed also suggest that speech production tasks that do not

involve reading result in pronunciation that differs from that obtained from read-speech

277

tasks. Third of all, the analyses of speech style indicate that in order to fully understand

the vowel system of a language, it is important to examine vowels produced in different

types of speech. The general consensus in the vowel literature has remained that the

Spanish vowel system is relatively stable and exhibits little variation. The assumption of

stability has been supported by acoustic studies of vowels, but most investigations have

only examined isolated vowels or words produced in highly controlled speech tasks. As

explained above and illustrated in Figure 5-9, vowels produced in controlled speech

styles are more peripheral and more closely represent the vowel triangle typically

associated with the Spanish system. An examination of vowels produced in more

spontaneous speech samples, however, offers insight into the type of variation vowels can

exhibit and the acoustic dimension on which the variation occurs.

5.3 Between-subjects variables

The third and final point of investigation in this study was to assess how several

individual variables affected HS vowel production. The third research question is

presented below:

Q3: Do individual variables influence the way in which HS pronounce their

vowels?

5.3.1 Course level

The first between-subjects variable analyzed was the relationship between each

speaker’s course level (intermediate or advanced) and vowel production. The results

indicated that participants who were enrolled in the intermediate-level grammar course

for HS produced /u/ farther front in the vowel space than those enrolled in advanced-level

courses. In addition, even though the production of the front vowels did not differ

278

significantly for the two groups, the representation of the vowel space presented in

chapter 4, Figure 4-30, suggested that the vowels produced by the intermediate-level

group occupied a smaller, more centralized portion of the acoustic space when compared

to those who were enrolled in upper-level courses. The difference in the overall use of

the acoustic space between these groups is consistent with the findings reported by

Menke and Face (2010). Their analysis of advanced L2 learners of Spanish at different

levels revealed that as a speaker gained more experience with the Spanish language and

passed through different course levels, their vowel space expanded. That is, the higher

the course level, the more expanded the vowel space became. Chang et al. (2009)

reported the same pattern of expansion for native speakers, HS, and L2 learners of

Mandarin Chinese: as the level of experience with Mandarin increased, the F2 of the

back vowel /u/ decreased (i.e., moved farther back) and the F2 of the front rounded /y/

increased (i.e., moved farther front).90 The findings from the present study thus suggest

that formal classroom instruction may result in HS of Spanish producing vowels in a

more “native-like” way, if “native-like” refers to an expansion of the vowel space.91

The differences in the overall size of the vowel space produced by the

intermediate and advanced-level participants could potentially be explained, however, by

taking into consideration the distribution of male and female speakers in each of the

groups, and the fact that the statistical analyses were conducted with data that had not

90 Although Chang et al. (2009) described an expansion effect similar to that reported in this dissertation, it is important to note that the Mandarin Chinese and Spanish vowel systems differ considerably (e.g., inventory size, contrastive rounding). While the findings are parallel, the effects observed in Chang et al. may be more complex, relating to the phonemic contrast between /y/ and /u/. 91 As will be explained in section 5.4, the expansion of the vowel space may indicate that some speakers have acquired the formal speech register. In other words, it is possible that all HS are “native-like” with respect to their pronunciation of the Spanish vowels, but that only the most experienced speakers have acquired the formal register: the controlled speech style obtained in the CPT that was also the most expanded.

279

been normalized using a traditional method (i.e., Lobanov, 1971). The intermediate-level

group contained two male speakers, and the advanced group only one male speaker. The

statistical analyses were conducted with non-normalized data, and generally speaking,

male speakers produce vowels with lower formant frequencies than female speakers as a

consequence of vocal tract size. Thus, the males’ low vowel /a/ was produced higher in

the acoustic space than the female speakers’ /a/, and the males’ front vowels were farther

back in the vowel space than those produced by the female speakers. The lower F1 and

F2 values of the male speakers who formed part of the intermediate-level group of

participants may have lowered the average formant frequencies of the low and front

vowels, effectively skewing the overall size and shape of the vowel space and leading to

the appearance of a smaller, more centralized space for the intermediate group. The

advanced group of speakers, however, contained more participants overall and more

female speakers, thus resulting in overall higher formant values and the appearance of

greater expansion.

In order to test the potential influence of gender distribution throughout the

groups, the normalized formant values were analyzed statistically. The analyses of

normalized F1, F2, and Euclidean distance seem to confirm that the gender distribution

may have been partly responsible for the group effect, but only with respect to the height

of the vowel /a/. Figure 5-12 below plots the normalized formant values for the

intermediate and advanced-level speakers. The gray line represents the vowel space of

the intermediate-level participants, and the black line represents the vowel production of

the advanced-level participants.

280

Figure 5-12. Normalized formant values for speakers enrolled in intermediate and advanced-level courses.

When formants were normalized (see chapter 3, section 3.4.1.6) and analyzed

statistically, the effect of course level was no longer significant along the F1 dimension.92

The production of /a/ by the intermediate-level group was no longer higher in the acoustic

space than the /a/ produced by the advanced-level speakers. The effect of vowel height

for /a/ revealed in the analysis of the non-normalized formant values reported in chapter 4

thus may have been caused by the higher productions of /a/ by the two male speakers in

this group. When the values were normalized and the gender differences were removed,

the height effect was no longer observed.

The analysis of the normalized formant values also revealed that interestingly, the

vowel by course level interaction was still significant along the F2 and Euclidean

92 The vowel by stress interaction along the F1 dimension with the non-normalized data was marginal, and in fact, the only vowel that approached significance was /a/. The intermediate-level group exhibited a lower F1 value for /a/ (i.e., a higher production) than the advanced-level group. As explained with the non-normalized results, the interaction between course level and vowel was driven by the large mean difference values between the two groups’ productions of /a/, even though none of the pairwise comparisons themselves revealed significant differences.

300

350

400

450

500

550

600

100012001400160018002000

F1 H

z

F2 Hz

Normalized formant values by course level

intermediate advanced

281

distance dimensions, indicating that the male participants’ lower F2 values for the front

vowels did not cause the apparent centralization effect observed for the intermediate-

level participants.93 The vowel by group interaction conducted with normalized data was

significant for F2 (F(4, 3255.05) = 4.81, p = .001) and for Euclidean distance

(F(4,3255.53) = 3.29, p = .011). In both of the interactions, the only vowel to exhibit a

significant effect was /u/. The results were therefore essentially identical to those

conducted on the non-normalized data. The pairwise comparisons of F2 indicated that

the /u/ produced by the participants in the intermediate-level course was significantly

farther front than the /u/ produced by those in the advanced courses (p = .003). The

pairwise comparisons of the Euclidean distance of /u/ indicated that the Euclidean

distance was smaller (i.e., closer to the center of the vowel space) for the intermediate-

level participants than for the advanced speakers (p = .020). Thus, even with normalized

values, the effect of course level still emerged as significant, but only for one vowel.

The analyses of normalized and non-normalized formant values thus indicate that

only /u/ exhibited significant differences in production based on course level. That the

same result was obtained in both analyses suggests that the overall vowel space of the

participants enrolled in the advanced classes was not expanded overall. Additional

classroom instruction therefore has little effect on HS vowel pronunciation. That the /u/

was the only vowel affected, however, is an important result. The /u/ was produced

farther back in the vowel space for the advanced-level group, and therefore more similar

93 Note that the analysis conducted with non-normalized formant values did not reveal a significant difference between groups with respect to their production of /i/ and /e/. Although the figure representing the overall vowel space led to the appearance of more retracted front vowels for the intermediate-level speakers, the differences were not statistically significant.

282

to the position of this vowel in monolingual Spanish varieties as it is typically described

in the literature.

5.3.2 Grammar proficiency

Each participant’s score on the grammar proficiency exam was also correlated

with their pronunciation of the Spanish vowels. The results of the statistical analyses

with non-normalized data revealed that as grammar proficiency increased, the F2 values

of /i/ and /e/ decreased, resulting in front vowels that were more centralized than those

with lower grammar scores. The position of the low vowel /a/ was marginally correlated

with grammar score, such that as grammar score increased, the position of the /a/ raised.

The mid vowel /o/ also exhibited differences in F1 frequency as a function of grammar

score, exhibiting higher F1 frequencies (i.e., a lower position) as grammar proficiency

increased. Taken together, the effects of grammar proficiency on vowel production

suggested that participants with higher grammar proficiency scores had a more

condensed vowel space.

The results of the analysis of grammar proficiency thus conflict with previous

arguments that increased proficiency in a second language or heritage language results in

a more expanded vowel space (Menke & Face, 2010; Chang et al., 2009). An additional

confounding aspect is that although no official statistical tests were conducted, there

could be a connection between grammar proficiency and course level (described in the

previous section). The average grammar proficiency score for each of the course level

groups (i.e., intermediate and advanced) was calculated, revealing that the advanced-level

participants scored higher on the grammar proficiency exam (average 22.1 points) than

those enrolled in the intermediate-level classes (average 18.7 points). Based on the

283

potential relationship between grammar proficiency and course level it thus might be

expected that the statistical analyses would yield similar results; the higher grammar

scores would be correlated with back vowels that were produced farther back in the

vowel space just like they were for course level, and potentially with front vowels that

were produced farther front, resulting in vowels occupying a larger acoustic space for

those with higher grammar proficiency scores.

The analysis of the connection between grammar proficiency and vowel

pronunciation described in the first paragraph of this section, however, revealed the

opposite trend: those who scored higher on the grammar proficiency exam produced the

front vowels /i/ and /e/ farther back in the vowel space – so less expanded – than those

who scored lower on the grammar proficiency test. The seemingly conflicting results of

the analyses of grammar proficiency score and course level on vowel pronunciation could

again potentially be explained based on gender distribution. For this reason, an additional

analysis was conducted with the Lobanov (1971) normalized formant values to test if the

retraction of the front vowels by the participants with higher grammar scores was at all

influenced by gender.94

The analyses of the non-normalized formant values described in section 4.2.3

indicated that as grammar proficiency increased, the F2 value of the front vowels /i/ and

/e/ decreased, resulting in a production that was farther back in the vowel space. When

the normalized formant values were analyzed statistically, however, there was no main

effect or interaction involving the F2 frequencies or the Euclidean distance measures.

Thus, the backing effect observed with non-normalized formant values was no longer

94 An examination of the grammar scores for each participant revealed that one of the males scored 24 out of 25 points. The average scores for the males and females were somewhat similar, however, with the females averaging 20.3 points and the males 19.6 points.

284

significant when the normalized values were analyzed statistically. The Mixed Linear

Model conducted with the normalized formant values only revealed a significant vowel

by grammar interaction for F1 (F(4,3255.54) = 10.89, p < .001). The pairwise

comparisons of this interaction revealed that only the height of the vowel /o/ was

correlated with grammar proficiency score (4.55t(3254.37) = 4.17, p < .001). The

positive coefficient indicated that as grammar proficiency increased, the F1 value of /o/

also increased, resulting in a lower production of the vowel.

Thus, an examination of the effect of grammar proficiency on the normalized

formant values revealed a trend that differs from that revealed for the non-normalized

formant values. Only the /o/, and not the /i/ and /e/, exhibited a significant difference

based on grammar score. The combined results of the analyses conducted on course level

described in the previous section and grammar proficiency described in this section

indicate that only /u/ and /o/ differed significantly based on course level and grammar

proficiency, respectively. These results suggest that formal classroom instruction and

knowledge of prescriptive grammar may only affect the pronunciation of the back

vowels. The high back vowel /u/ was produced farther back (i.e., more “Spanish-like”)

by the participants enrolled in the advanced-level Spanish courses, and the /o/ was

produced lower by those participants with higher grammar proficiency scores, indicating

that the combination of higher course level and higher grammar proficiency scores may

result in a greater separation of the back vowel categories. Thus, as HS gain more

experience with the Spanish language and receive more formal language instruction, their

back vowel space may become more separated, and more closely resemble the

configuration that is traditionally described for monolingual Spanish vowels.

285

In summary, the analyses of HS grammar proficiency and course level combined

indicated that even HS of Spanish, who are typically argued to be “native-sounding,”

may “benefit” or alter their pronunciation as a result of more formal language instruction

and increased knowledge of prescriptive grammatical structures. That the analyses

conducted with normalized and non-normalized formant values were somewhat

conflicting for grammar proficiency, however, indicates that the relationship between HS

vowel production and grammar proficiency in particularshould be interpreted with

caution. Research on HS has also suggested that paper-based grammar proficiency tests

may not be appropriate for heritage populations, given that their experience with the

Spanish language has been primarily aural and oral (Montrul, 2005). Heritage speakers

are less accustomed to formal, written proficiency tests, which may have resulted in the

grammar proficiency exam yielding inaccurate assessments of their knowledge of formal

grammatical concepts. Thus, the connection between HS course level, grammar

proficiency, and vowel production should be addressed in future investigations with

different (or additional) instruments, and with a larger sample size.

5.3.3 Travel

The frequency with which HS traveled abroad to their heritage country was found

to influence the pronunciation of their Spanish vowels.95 To briefly summarize the

results presented in section 4.2.2, only the vowels /i/ and /a/ exhibited statistically

significant differences along the F1 and F2 dimensions, respectively. The low vowel /a/

was produced significantly lower in the vowel space by participants who traveled

frequently when compared to those who traveled infrequently (i.e., every 3-4 years). The

95 The language background questionnaire only inquired how frequently each speaker traveled abroad and not how long they typically stayed in Spanish-speaking country.

286

front vowel /i/ was produced significantly farther front in the vowel space by the frequent

travelers, but only when compared to those who never traveled. The Euclidean distance

of /i/ from the center of the vowel space was also found to be significantly greater for the

frequent travel group when compared to the other two groups of participants. Finally, the

same pattern was observed for /u/. The frequent travelers exhibited larger Euclidean

distances of /u/ from the center of the vowel space when compared to those who never

traveled and those who traveled infrequently, although the results only approached

significance.

Taken together, the statistical results of the vowel by travel interaction suggest

that HS who travel and spend more time abroad produce their Spanish vowels more

peripherally and exhibited greater dispersion or expansion of the vowels within the

acoustic space than those who travel less frequently. The trend of an expansion effect for

frequent travelers is an important result, and could potentially be interpreted to mean that

frequent travelers pronounce Spanish point vowels that more closely resemble

monolingual productions. Menke and Face (2010) reported that the native speakers of

Spanish examined in their study produced vowels that occupied the largest portion of the

acoustic space and were more peripheral than those produced by the groups of L2

learners. In addition, the comparisons presented in Figures 5-6 and 5-7 suggest that

native monolingual speakers of Spanish produce the point vowels more peripherally and

utilize a larger portion of the acoustic space than the HS investigated in this study.

Therefore, the expansion effect revealed via the vowel by travel interaction supports the

argument that frequent visits to the heritage country influence HS vowel pronunciation,

resulting in more native-like pronunciations.

287

Although the analyses revealed few statistically significant comparisons, the

majority of the significant effects were observed between the frequent travel group and

those who never traveled. The comparisons between the speakers who never or rarely

traveled and those who traveled infrequently did not reach significance. The infrequent

travelers did, however, pronounce /a/ significantly higher than the frequent travelers, and

/i/ significantly closer to the centroid than the frequent travelers. The lack of statistically

significant differences between the infrequent travelers and those who never traveled

suggests that only regular and frequent travel has significant effects on HS vowel

production. That is, traveling every three to four years may not be sufficient exposure for

HS to alter their pronunciation. As there were a small number of participants in each of

the three groups (frequent travel N = 6, infrequent travel N = 6, no travel N=4), however,

the lack of effects between the no travel and infrequent travel groups should be

interpreted with caution.

Travel abroad may have an effect on HS vowel pronunciation for a few reasons.

First of all, traveling abroad provides additional opportunities to speak and hear Spanish

in a wider variety of contexts. Although most participants in this study indicated that

they spoke Spanish outside of the classroom on a fairly regular basis, complete

immersion results in more contact with the heritage language and the ability to speak

Spanish in contexts where it might not be understood or appropriate in the United States.

Secondly, increased input via hearing Spanish more often could also result in greater

awareness of the fine-grained details of Spanish pronunciation. Large quantities of input

may facilitate HS in creating or maintaining the phonetic categories associated with

Spanish vowels. Thirdly, HS who travel abroad more frequently may receive pressure

288

from family and friends abroad to maintain their “accent” in Spanish. During informal

conversations after the experiments had been completed, several participants in this

investigation reported that their friends and grandparents abroad had commented on how

their Spanish had become English-accented. In order to maintain a stronger connection

with the heritage culture and “fit-in” during their stays abroad, some HS may make a

concerted effort to pronounce Spanish in a way that they feel is more “correct.” Those

who travel abroad more frequently may thus have more motivation to assimilate to the

native monolingual norm of vowel pronunciation than those who do not spend a

significant amount of time in their heritage country.96

The results of the present investigation also have an important connection with the

L2 literature examining the effects of context of learning (i.e., study abroad versus at-

home) on the acquisition of L2 phonology. Although some investigations have provided

mixed results in terms of the benefits of studying abroad on L2 segmental production

(e.g., Díaz-Campos, 2004), the general literature argues that study abroad learners exhibit

gains in pronunciation accuracy when compared to learners in traditional foreign

language classrooms, or “at-home” learners.97 Díaz-Campos (2006), for example,

reported that L2 learners who studied abroad exhibited higher rates of target-like

pronunciation of word-initial unaspirated /p t k/, lateral /l/, and palatal /ɲ/ in

conversational speech styles when compared to at-home learners.98 An earlier study

96 During a recent presentation, an anonymous audience member asked if I had inquired about how much Spanish the HS spoke when they traveled abroad. She indicated that when she visits her family in Mexico, she speaks English because her friends and relatives want to practice speaking English with her. The questionnaire utilized in this dissertation did not ask participants to specify which language they spoke when they traveled abroad. In the future, additional questions pertaining to how much English and Spanish is spoken while traveling should be included. 97 For a full review, see Lafford (2006). 98 Díaz-Campos (2006) is a follow up study to Díaz-Campos (2004). His 2004 investigation provided mixed results, in that the study abroad learners did not exhibit significant gains in pronunciation of the

289

conducted by Simões (1996) revealed that two of the five learners who studied abroad in

Costa Rica produced fewer instances of schwa and exhibited less vowel reduction in the

posttest as compared to the pretest.99 Finally, Stevens (2011) compared the Spanish

vowel durations of study abroad learners, at-home learners, and monolingual Spanish

speakers from Spain.100 He reported that while the at-home learners did not exhibit any

gains in vowel duration, the study-abroad learners produced Spanish vowels with

significantly shorter durations in the posttest as compared to the pretest. As the

monolingual native speakers who participated in this study had shorter durations when

compared to the two L2 groups, Stevens (2011) interpreted the decrease in duration for

the study abroad learners to indicate that their vowel duration “improved” and became

more “native-like”. Another important finding of Stevens’ study was that only the point

vowels /i a u/ exhibited significant degrees of duration reduction for the study abroad

learners: the same vowels that exhibited the expansion effect (i.e., more “native-like”

pronunciation) for the HS who traveled the most frequently.

voiceless aspirated stops, voiced approximants, alveolar lateral /l/, and palatal nasal in a read-aloud text (i.e., controlled speech). The subsequent 2006 study compared not only context of learning (i.e., study abroad vs. at-home) but also the effect of speech style. When conversational speech was analyzed, the study abroad learners exhibited more target-like pronunciation than the at-home learners with respect to all segments except for the voiced approximants. 99 The methodology employed in Simões (1996) is unclear. Speech was obtained from the ACTFL Oral Proficiency Interview, and then random segments of the speech were analyzed acoustically. He explains that he randomly selected 25-30 two second segments of speech within the interviews for each speaker and then calculated the syllable nuclei “when targeted correctly” (p. 90). He does not provide further information as to what “targeted correctly” means, or exactly which portions of the syllable nuclei he examined. 100 The investigation conducted by Stevens (2011) analyzed the vowel durations produced in 40 randomized sentences containing all five Spanish vowel phonemes in stressed and unstressed position at the end of the word. The first analysis he conducted compared the durations of the L2 learners’ vowels to those of the monolinguals. The findings indicated that L2 learner Spanish vowels were significantly longer than those of the monolingual speakers. Unfortunately, no formal method was employed to control for speaking rate. Thus, the differences in duration observed between the L2 and monolingual groups could have been due to native monolingual speakers’ ability to speak faster than the L2 learners, resulting in shorter vowel durations.

290

Overall, the results of the vowel by travel interaction confirmed the hypothesis

that traveling abroad would impact HS pronunciation. In addition, the findings from the

present study further support Mikulski (2010) and Duprey-Almeyda (2009), who

reported that traveling abroad and maintaining strong connections with the heritage

culture were positively correlated with HS morphosyntactic knowledge and vowel

pronunciation, respectively. The results of the present study combined with those of

Mikulski (2010), Duprey-Almeyda (2009), Díaz-Campos (2004; 2006), Simões (1996)

and Stevens (2011) suggest that just like L2 learners, HS benefit from additional

experience abroad both in terms of grammatical knowledge and pronunciation. Even HS,

who are already argued to exhibit “native-like” pronunciation, can benefit and alter their

pronunciation of vowels as the result of increased exposure to their heritage language.

5.3.4 Spanish use

Of all of the between-subjects variables analyzed in this dissertation, the use of

Spanish outside of the classroom proved to have the greatest impact on HS vowel

production. The results of the statistical analyses revealed a very clear and robust trend:

as Spanish use increased, HS Spanish vowels were more greatly dispersed and produced

in more peripheral locations within the acoustic space. As described previously in this

chapter, an expansion of the vowel space may indicate that HS more closely approximate

the vowel productions of native monolinguals of Spanish. The results of the analyses of

Spanish use and vowel production therefore confirm the hypothesis that HS who speak

Spanish more would produce vowels differently than those who use Spanish less

regularly outside of the classroom.

291

The connection between Spanish use and HS morphosyntax and pronunciation on

the one hand, and Spanish use and L2 pronunciation accuracy on the other hand, has

already been established in the literature. Mikulski (2010), for example, reported that the

HS who reported speaking Spanish more outside of the classroom performed better on an

editing task when compared to HS who reported speaking Spanish less often. Au et al.

(2008), who analyzed HS who were further divided into childhood “speakers” and

childhood “overhearers,” suggested that Spanish use may have an impact on

pronunciation. They reported that HS who spoke Spanish more outside of the classroom

outperformed those who reported speaking Spanish less often, but the differences were

not statistically significant.101 Finally, although the goal of Díaz-Campos (2004) was to

examine the effects of context of learning on L2 segmental pronunciation accuracy, the

analyses revealed that speaking Spanish more regularly (i.e., 4-7 days a week and 4+

hours per day) positively correlated with L2 pronunciation accuracy. That is, those L2

learners who reported speaking Spanish more often produced Spanish sounds that were

more native-like than those who spoke Spanish less frequently. The findings of the

present dissertation indicate that additional speaking outside of the classroom does have a

significant impact on HS pronunciation as suggested by Au et al. (2008) for HS of

Spanish and by Díaz-Campos (2004) for L2 learners.

Much of the same logic used to explain the expansion effect for frequent travelers

abroad can also be applied to Spanish use. As with traveling abroad, additional speaking 101 As explained in section 2.1.2, Au et al. (2008) examined childhood speakers’ and childhood overhearers’ performance on a series of morphosyntactic and production tasks. The estimated number of hours that childhood speakers and childhood overhearers were exposed to Spanish impacted their pronunciation and the well-formedness of the constructions they produced in a narrative task. Those who were exposed to Spanish more during childhood produced a greater number of well-formed constructions in the narrative task, and exhibited more native-like VOT values. Spanish use and motivation to speak Spanish did not correlate with pronunciation, although the authors argue that the lack of an effect could have been due to the fact that they only analyzed the speech of 10 childhood speakers.

292

allows for greater opportunities to establish and maintain the Spanish vowel categories.

Participants who watch movies and listen to music in Spanish on a regular basis also

receive additional auditory input that may shape pronunciation. Thus, “native-sounding”

heritage speakers benefit from additional speaking and listening practice both inside and

outside of the classroom in much the same way as L2 learners.

5.3.5 Cultural sensitivity

The final between-subjects variable examined in this dissertation was cultural

sensitivity. Previous research conducted by Alvord and Christiansen (2009) with L2

learners found a connection between cultural sensitivity and the pronunciation of the

Spanish intervocalic stops. Their findings for learners suggested that there may be a

relationship between a HS’s level of cultural sensitivity and his or her pronunciation of

Spanish vowels. The statistical analyses of cultural sensitivity conducted in the present

investigation revealed that there was little connection between this particular individual

variable and HS vowel production. The only statistically significant effect involving

cultural sensitivity was found for the Euclidean distance of /i/: the Euclidean distance of

/i/ decreased as cultural sensitivity increased. The effect of cultural sensitivity also

approached statistical significance for /e/ on the F1 and Euclidean distance measures,

suggesting that like /i/, as cultural sensitivity increased, the /e/ was produced closer to the

center of the vowel space.

As was previously mentioned in sections 5.3.2 and 5.3.3, comparisons between

native monolingual speakers of Spanish and HS both in the present study and in Menke

and Face (2010) indicate that monolinguals utilize a larger area of the acoustic space and

may pronounce Spanish vowels more peripherally than HS and L2 learners, at least on

293

formal tasks. Thus far, HS who traveled more and reported speaking Spanish more often

have exhibited an expansion effect, leading to the conclusion that a more expanded vowel

space is more similar to the native-speaker norm for formal speech. Alvord and

Christiansen (2009) reported that L2 learners who scored higher on the motivation and

empathy scales produced intervocalic /b,d,g/ in a more native-like way, suggesting that

high degrees of cultural sensitivity resulted in native-like pronunciation for learners.

Thus, it might be hypothesized that HS with higher cultural sensitivity scores would

produce Spanish vowels more peripherally if there were a strong connection between

cultural sensitivity and vowel production. High levels of cultural sensitivity resulted in

/i/, and to some degree /e/, exhibiting shorter Euclidean distances, resulting in a retracted

position. If high cultural sensitivity were correlated with native-like pronunciation, the

front vowels would have been produced with longer Euclidean distance measures and

therefore more peripherally. That the front vowels were not produced farther front in the

vowel space by the participants with higher cultural sensitivity scores therefore conflicts

with the initial prediction that high degrees of cultural sensitivity would result in native-

like pronunciation.

The general lack of an effect of cultural sensitivity on HS vowel production could

be attributed to 1) the small sample size and lack of variability in scores, or 2) that only

the total cultural sensitivity score, and not the subscales, were submitted to the analysis.

Only 16 participants took part in the present study, whereas Alvord and Christiansen’s

(2009) investigation consisted of 34 participants. The sample size in the present study

may not have been large enough to statistically correlate cultural sensitivity and vowel

production. In addition, the range in cultural sensitivity scores may have been too small

294

to show an effect. Although the lowest score was 151 points and the highest 203 points –

a total range of over 50 points – many of the participants exhibited scores in the range of

172 to 186 points. Thus, there may not have been sufficient variability in the scores to

reveal a significant and consistent effect. Finally, Alvord and Christiansen (2009)

analyzed the scores on the different subscales (i.e., attitudes, behavioral, motivation, etc.),

and not the total cultural sensitivity score, which is the sum of all of the subscales. As

explained in chapters 3 and 4, only the total cultural sensitivity score was analyzed in the

present study in order to maintain as much statistical power as possible given the small

number of participants. Initial exploratory statistical analyses also revealed that none of

the subscales correlated with any of the dependent variables and that only the total score

emerged as significant. Future investigations with a large sample size could potentially

test the effect of the total cultural sensitivity and each of the subscales. A larger sample

size and additional analyses could more reliably tease apart which aspects of cultural

sensitivity impact HS vowel production.

The general lack of results, and the conflict between the analysis of cultural

sensitivity and the other between-subjects variables, suggests that the effect of cultural

sensitivity on vowel production in this investigation should be interpreted with caution.

Future studies including larger subject populations may reveal a more consistent and

robust effect. At present, however, the connection between HS vowel production and

cultural sensitivity is somewhat unclear.102

102 Although the statistical analyses examining the relationship between cultural sensitivity and vowel production were not robust, the total scores on the questionnaire did reveal that overall, HS of Spanish living in Chicago exhibit high levels of cultural sensitivity. The scores ranged between 151 and 203 points. The highest possible score on the ICCS is 224 points.

295

5.4 Interpreting the expansion effect

The discussion of the results presented in sections 5.3.3 and 5.3.4 of this chapter

have revealed a consistent and robust trend for expansion of the vowel space for those HS

who speak Spanish more regularly and travel more frequently. Interestingly, the

expansion effect observed for travel and Spanish use parallels the same effect reported

for speech style: vowels produced in the most controlled speech task, the CPT, were

produced more peripherally than those produced in the spontaneous and semi-

spontaneous speech tasks. If the controlled speech task represents the most formal

speech style, this parallelism could evidence that the more experienced HS have acquired,

or are closer to acquiring, the formal speaking register. In other words, does increased

exposure to, and use of, the heritage language result in the ability to modulate vowel

productions based on speech style? If so, then one might expect that the HS with more

experience with Spanish would show greater task effects than those who are less

experienced.

In order to test the possibility that more experience with the heritage language

leads to the ability to fine-tune vowel productions to represent different speech styles, an

additional interaction between vowel, task, and travel was added into the statistical model

post-hoc.103 The results of the statistical analyses indicated that this interaction was

significant for F1 (F(20, 3234.16) = 3.63, p < .001), F2 (F (20, 3234.24) = 2.68, p <

.001), and Euclidean distance (F(20, 3234.20) = 3.00, p < .001). Figures 5-13, 5-14, and

5-15 represent HS vowel production in each of the three speech styles produced by the

103 Only the interaction involving travel frequency, and not course level or Spanish use, will be discussed in this section. An exploratory analysis including the vowel by task by course level interaction was not found to be significant. An additional test including the vowel by task by Spanish use interaction was found to be significant, but only for F1, and only the height of the vowel /a/ was correlated with Spanish use.

296

non-travelers, infrequent travelers, and frequent travelers, respectively. Given the

complexity of this interaction (five vowels by three travel groups by three tasks), a more

generalized description will be presented following the figures and will be organized by

dependent variable (i.e., F1, F2, Euclidean distance). The tables of Fixed Effects and the

pairwise comparisons of this interaction can be viewed in Appendix M.

Figure 5-13. Vowel production in three speech tasks for the non-travel group (N = 4). Values based on the estimated marginal means, and averaged across all stress and syllable types.

300

400

500

600

700

800

900

90011001300150017001900210023002500

F1 H

z

F2 Hz

Vowel production in three tasks by non-travelers

NRT PIT CPT

297

Figure 5-14. Vowel production in three speech tasks for the infrequent travel group (N = 6). Values based on the estimated marginal means, and averaged across all stress and syllable types.

Figure 5-15. Vowel production in three speech tasks for the frequent travel group (N = 6). Values based on the estimated marginal means, and averaged across all stress and syllable types.

300

400

500

600

700

800

900

90011001300150017001900210023002500

F1 H

z

F2 Hz

Vowel production in three tasks by infrequent travelers

NRT PIT CPT

300

400

500

600

700

800

900

90011001300150017001900210023002500

F1 H

z

F2 Hz

Vowel production in three tasks by frequent travelers

NRT PIT CPT

298

The results of the interaction on F1 did not reveal a clear and consistent pattern to

suggest greater or lesser height differences across tasks based on travel frequency.

Generally speaking, all three groups showed significant differences in height for all

vowels between the NRT (most spontaneous) and the CPT (most controlled). The

infrequent travelers presented in Figure 5-14 did not produce /o/ with any significant

differences in height across the three tasks. The frequent travelers presented in Figure 5-

15 did not exhibit any significant height differences across tasks for the vowel /u/. The

non-travelers, shown in Figure 5-13, actually exhibited the largest number of differences

in height across tasks. For the non-travelers, the F1 values in the NRT and CPT differed

for all vowels: /a/ produced in the CPT was significantly lower than the /a/ in the NRT,

and /e/, /i/, /o/, and /u/ produced in the CPT were significantly higher than the same

vowels in the NRT. The statistical results, therefore, do not support the hypothesis that

more experienced HS would show greater task effects, at least not with respect to vowel

height.

The analyses conducted with F2 and Euclidean distance, however, suggest that

non-travelers did exhibit fewer distinctions in vowel backness and dispersion across the

three tasks than those who reported traveling abroad more often. The pairwise

comparisons of the interaction involving F2 revealed that the non-travelers did not

produce the vowels /a/ and /o/ significantly differently across tasks. In contrast, all

vowels produced by the infrequent travelers, and all vowels except for /a/ for the frequent

travelers, exhibited significant differences in backness: the front vowels produced in the

CPT were produced significantly farther front than those produced in the NRT, and the

299

back vowels produced in the CPT were significantly farther back than those produced in

the NRT.

The analyses of the interaction with Euclidean distance revealed findings similar

to those reported for F2. For the infrequent and frequent travelers, the Euclidean

distances of all vowels were significantly longer (i.e., farther from the centroid and more

peripheral) in the CPT when compared to at least one of the other tasks.104 The

Euclidean distances of the vowels /a/ and /o/ produced by the non-travelers, however,

were not statistically different when comparing across tasks. That is, speech style did not

significantly impact the distance of /a/ and /o/ from the center of the vowel space for this

group of HS. Therefore, the results of the analysis indicate that while the infrequent and

frequent travelers produced all vowels in the CPT more peripherally and more dispersed

than the vowels produced in the other two tasks, the non-travelers did not exhibit this

distinction for all five vowels.

The results of the additional interaction between vowel, task, and travel suggest

that more experience with Spanish via traveling to the heritage country regularly not only

results in a more expanded vowel space, but may also promote the acquisition of fine-

grained differences in pronunciation associated with speech style. The non-travelers did

not produce /a/ and /o/ significantly differently across tasks, indicating that they may not

have the experience and ability to adjust their pronunciation of these two vowel sounds to

correspond with speech style. As there were few participants in each group, however, the

results of this interaction should be interpreted cautiously. Future investigations with a

104 Differences were primarily significant between the CPT (most controlled) and the NRT (most spontaneous), although in some cases all three comparisons were significant. For the frequent travelers, the Euclidean distance of /u/ in the CPT was significantly longer than that produced in the PIT, but not the NRT.

300

larger number of participants should be conducted, and may reveal more robust and

consistent effects than those described here. Additional variables such as 1) length of

stay and 2) which languages and language registers are used during visits abroad may

offer additional insight into the connection between traveling abroad and the acquisition

of fine-tuned distinctions in speech styles.

5.5 Limitations and future directions

Despite the interesting and important findings described in chapter 4, the present

investigation does have some limitations. The purpose of this section of the chapter is to

discuss some of these limitations, and describe how they might be addressed in future

studies. Section 5.5.1 discusses some of the issues with the experimental design and how

they might be addressed in future studies. Section 5.5.2 explains the connection between

language and identity that was unfortunately not examined in the present study. Finally,

section 5.5.3 acknowledges the lack of comparison between HS vowel productions and

monolingual control groups.

5.5.1 Experimental design

The instruments utilized in this investigation were designed to examine the

production of vowels in different speech styles. The narrative retelling task (NRT) was

aimed at gathering spontaneous speech while at the same time providing some control

over the lexical items and vowels produced by each speaker. In other words, based on

the objects that appeared throughout the video, there was some consistency among

speakers as to how they retold the story and the words they used to describe the plot of

the film. Unfortunately, some participants were much more detailed in their retelling of

the story than others, resulting in an unequal number of vowels produced by each

301

speaker. Whereas some participants spent nearly five minutes recalling as many details

as possible, others only spoke for one to two minutes and provided a basic summary of

the movie. Thus, although some participants produced well over 100 analyzable vowels

in this task, others produced only 50.

The NRT posed two additional difficulties. First, it was not always possible to

obtain examples of all five Spanish vowels in the four contexts of interest (i.e., atonic

open, atonic closed, tonic open, tonic closed). Although tonic and atonic vowels in open

syllables were easily obtained for all speakers, analyzable vowels in closed syllables were

infrequently encountered in this task. The lack of examples in all four categories for each

individual speaker was part of the reason why individual analyses of three-way

interactions were not always possible, and motivated the need for a pooled analysis. In

other words, the vowel by task by syllable type interaction would not have provided

sufficient data to yield interpretable findings for a participant who did not produce atonic,

closed /u/ in the narrative retelling task, because there would have no vowel to compare

across the three speech styles. When the data were pooled across all speakers, however, a

sufficient number of tokens in all four categories were present, allowing for the

comparison of stress type and syllable type across the three tasks. Ideally, multiple

examples of each of the five Spanish vowels should have been analyzed in each of the

four contexts for each participant in all tasks. In spontaneous speech samples, however,

the investigator is not always afforded the ability to control the output enough to obtain a

balanced corpus. Thus, future investigations of HS Spanish vowels could focus on

creating narrative retelling/spontaneous speech instruments that are carefully designed to

prompt sufficient tokens in both stress types and syllable types.

302

Second, in contrast to the PIT and CPT, vowels were extracted from different types

of lexical items in the NRT, and also from different portions throughout the phrase. In

addition to analyzing vowels produced in nouns, vowel tokens were also extracted from

verbs and adjectives. In the other two tasks (the PIT and CPT), only nouns were

analyzed. Although lexical words (i.e., nouns, verbs, adjectives, and adverbs) tend to

pattern together with respect to lexical stress, there may be differences in the way that

vowels are produced in the different word classes. In addition, the vowel tokens analyzed

in the NRT occurred in varying positions within the phrase, and were not phrase-final as

they were in the other two tasks. As Schmidt and Willis (2011) showed that phrase

position can result in different rates of /s/ voicing in Mexican Spanish, it may also be

possible that phrasal position influences vowel pronunciation as well. Thus, the

combination of different word classes and different positions within the phrase may have

introduced more variability into the analyses than was intended. Word class and phrasal

position are thus two variables that could be included in future investigations of vowel

pronunciation.

The picture identification task (PIT) and carrier phrase tasks (CPT) also posed

several difficulties. First and foremost, the vowels analyzed in the PIT and CPT were

embedded within a variety of consonantal contexts. Thus, neither of the tasks was

completely phonetically balanced (see Appendix A for the words presented in the PIT

and Appendix B for the CPT). The PIT was balanced by stress type and syllable type,

but as the goal of the task was to obtain speech that was not read, the lexical items had to

be represented by pictures. It was too difficult to find words that were frequent enough

and identifiable enough to be represented by images in addition to phonetically balancing

303

the PIT. The CPT was slightly more balanced in that it included all five Spanish vowels

embedded within the context of /p,t,k,s/. The CPT included additional words in order to

obtain atonic vowels and vowel tokens in closed syllables, but the following consonantal

context was variable and included many voiceless consonants. Although the F1 and F2

measurements were extracted at the midpoint of each vowel in an attempt to reduce the

influence of the following context, Stevens and House (1963) reported differences in

vowel pronunciation based on context even when the steady-state portion was measured.

The lack of phonetic balance in the CPT and especially the PIT could have introduced

unintended variability into the vowel productions of the HS examined in the present

study.

Secondly, in both tasks, many of the syllables were closed by the consonant /s/.

The inclusion of /s/ as a closing consonant could have significantly impacted the formant

values of vowels, as well as potentially introduced a new variable into the analyses of

syllable type. With respect to the effect of /s/ on the formant values of vowels, Stevens

and House (1963) reported differences in vowel pronunciation based on both voicing and

manner of articulation. Vowels produced in the context of voiceless consonants

exhibited higher F1 values than those produced in the context of voiced consonants.

Thus, a voiceless consonantal context resulted in a vowel that was produced lower in the

acoustic space when compared to vowels produced in a voiced context. An examination

of stop, fricative, and null contexts in Stevens and House’s study revealed additional

differences in vowel pronunciation based on manner of articulation. The effect of

manner of articulation was not consistent along the F1 dimension, but the F2 values of

the front vowels especially were shown to differ. Front vowels produced in the context

304

of a fricative consonant exhibited lower F2 values than those produced in the context of

stops and in the null context. Thus, front vowels in the context of a fricative were

produced farther back in the acoustic space than vowels produced in the other two

contexts.

Based on Stevens and House’s (1963) findings, one might predict that front vowels

in the context of a voiceless fricative /s/ would be produced lower and farther back in the

acoustic space: the exact pattern observed for the pronunciation of HS /e/. The peculiar

position of the HS /e/ has already been described in this chapter in section 5.1. Although

the lowering and backing could be the result of laxing and the influence of English as

described by Lipski (1994), contact with /s/ may also be partly responsible for the

lowered and backed position of the /e/ in HS Spanish when compared to monolingual

Spanish varieties (see Figure 5-6 and Figure 5-7 for a comparison). That the /e/ was also

produced farther back in the vowel space in the NRT (spontaneous speech task) in which

there was much less contact with /s/, however, indicates that consonantal context may not

be the sole reason for the position of /e/ within the acoustic space. Although it is possible

that a combination of these influences (i.e., English contact or contact with /s) contributed

to the way in which HS produced their /e/ in Spanish, the influence of /s/ could be

determined by simply excluding all of the vowel tokens in contact with /s/ and

reanalyzing the data. A reanalysis without /s/ would reveal the influence of this

particular voiceless fricative consonant on the acoustic properties of vowels, and indicate

if the lowered and backed articulation of the /e/ was indeed due to contact with /s/ or if it

was more likely caused by contact with English.

305

The consonant /s/ was included primarily as a means to close a syllable in which a

vowel occurred so that the effect of syllable type on HS vowel production could be

analyzed. The phonotactic constraints of Spanish are more restrictive than English, and

therefore Spanish exhibits a simpler syllable structure and fewer consonant clusters than

English. Consonant clusters beginning with /s/ are prohibited in the onset in Spanish,

resulting in the /s/ forming part of the coda of the preceding syllable. As designing the

PIT was already difficult due to the necessity of including common items representable

by pictures, one of the simplest ways to close a syllable in this task was to select words

that included /s/ plus another consonant. The /s/ was the most frequent closing consonant,

almost to the exclusion of all other consonants (see Appendix A and Appendix B).

Including /s/ as a closing consonant resulted in a conflict with Navarro-Tomás’s

(1918) description of the contexts in which the close and open vowel allophones were

argued to occur. He argued that for peninsular Spanish, syllables closed by /s/ actually

prompted the close allophone of /e/ (i.e., [e]), and not the open [ɛ] that was argued to

occur in syllables closed by most other consonants. In contrast, Matluck (1952) argued

that in Mexican varieties of Spanish, the open [ɛ] did occur in syllables closed by /s/.

Thus, the vowel tokens that were closed by /s/ were coded as “closed” in the present

study as opposed to “open” had the analysis provided by Navarro-Tomás been strictly

followed. The absence of the effect of syllable type may have been due to the coding

scheme utilized in the present study and the lack of a variety of consonants in the words

containing closed syllables. Future investigations of HS vowels and Spanish vowels in

general could focus on including more phonetically balanced instruments, excluding /s/

entirely, or analyzing vowels in contact with /s/ separately from vowels in contact with

306

other consonants. The vowels in contact with /s/ in the present study could also be

recoded as “open,” and the analyses could be conducted a second time. The results of a

secondary analysis with recoded data could offer insight into the effect of the following

consonant /s/ on HS vowel production. Robust syllable type effects with recoded data

would indicate that the tokens were initially coded incorrectly, and lend evidence in

support of Navarro-Tomás’s (1918) argument that /e/ in the context of /s/ is actually

produced as close [e].

A final limitation in the experimental design of the present study is that the lexical

items included in the PIT and CPT were not balanced by lexical frequency or

neighborhood density, and included some cognates. Although both tasks, and especially

the PIT, were designed to include identifiable items and terms for common objects, food,

and animals, some of the lexical items were arguably less common than others. Several

participants had difficulty remembering terms like ostras (“oysters”), or difficulty

pronouncing words in the CPT such as cuscurro (“crouton”). Previous research has

indicated that lexical frequency and neighborhood density influence spoken word

recognition as well as speech production (Luce & Pisoni, 1998; Bybee, 2001; Vitevitch &

Stamer, 2006; Vitevitch & Stamer, 2009; Munson & Solomon, 2004; Yao, 2011).

Words with high lexical frequency have a greater tendency to exhibit reduction than low

frequency words (Bybee, 2001). Additional studies have indicated that neighborhood

density (i.e., whether a word has a high or low number of phonological neighbors) also

plays a role in speech production (Munson & Solomon, 2004; Yao, 2011). Munson and

Solomon (2004), for example, reported that lexically difficult words (i.e., low frequency

words with many phonological neighbors) were produced with greater vowel space

307

expansion than lexically easy (i.e., high frequency words with few phonological

neighbors). Yao (2011) also reported that words in high density neighborhoods consisted

of vowels that were shorter in duration and also more centralized than those in sparse

neighborhoods. Although the centralization and shortening effects of lexical frequency

and neighborhood density are well-supported for English, two investigations by Vitevitch

and Stamer (2006; 2009) suggest that processing may vary across languages. Their

investigations of picture naming tasks in Spanish revealed a trend that is the opposite of

what is typically described for English: participants named Spanish words from dense

phonological neighborhoods more slowly than those in sparse neighborhoods.

As no formal measures of lexical frequency or neighborhood density were

calculated for the words included in this study, it is not possible at this time the analyze

the extent to which these characteristics influenced HS vowel pronunciation. Based on

Vitevitch and Stamer’s (2006; 2009) findings, which conflict with those reported for

English, it is unclear how frequency and density would affect the pronunciation of HS

Spanish vowels. The complex nature of the bilingual lexicon, lexical access and priming

effects, and the role of frequency and neighborhood density could be explored in future

investigations. The corpora utilized in future studies could carefully control for

frequency and density in order to compare the pronunciation of vowels in high and low

frequency words that belong to dense and sparse neighborhoods.

In addition, several words utilized in the PIT and CPT were cognates with English

words. The word eclipse (“eclipse”), for example, was sometimes pronounced in its

English form, resulting in its elimination from the analysis. In fact, the inclusion of

eclipse in the third task may have resulted in the extremely high and fronted /e/ that was

308

produced in the CPT. As discussed in 4.1.2.1, the atonic /e/ in the CPT did not exhibit

centralization like it did in the other two tasks, which may have been caused by the some

of the atonic tokens being extracted from the word eclipse. In English, the first vowel is

pronounced as the high front vowel /i/. The participants’ knowledge and familiarity with

the English translation of “eclipse” may have resulted in their altering their pronunciation

and producing a vowel that was higher and farther front. Additional cognates included

eskeletos (“skeletons”), tostadora (“toaster”), cristales (“cristals”), and espaguetis

(“spaghetti”). The inclusion of cognates in these two tasks was a methodological

oversight that could have a significant impact on the pronunciation of the vowels within

these particular words. When the cognates were pronounced in their English form or

were Anglicized, they were excluded from the statistical analyses.105 Future

investigations of Spanish vowels, and especially of bilingual vowels, should focus on

excluding cognates, or examining cognate and non-cognate words separately.

Taken together, the limitations described in this section of the chapter suggest ways

in which speech elicitation instruments could be improved in future studies of HS and

monolingual Spanish vowels. Future investigations could utilize different spontaneous

speech instruments to ensure that a more equal number of Spanish vowels are obtained.

True spontaneous speech data in the form of sociolinguistic interviews could also be

examined, which may yield results that differ from spontaneous speech gathered in semi-

directed narratives. In addition, picture identification and carrier phrase tasks should be

comprised of more phonetically balanced corpora, and the inclusion of vowels in contact

105 Cognates were selectively excluded based on a variety of cues. If the lexical items were pronounced in English or were Anglicized, they were not included in the analysis. For example, if the /t/ in “espaguetis” was pronounced as a flap, the vowels were not included in the analysis. If the final /e/ was not produced in “eclipse,” or the initial vowel was not produced with F1 and F2 values within the range of /e/ tokens for that particular speaker, the vowels extracted from this word were excluded.

309

with /s/ could be analyzed separately and with caution. Finally, lexical frequency and

neighborhood density may have a significant impact on HS vowel pronunciation and

could be examined in greater detail in future studies.

5.5.2 Language and identity

One individual characteristic that may have a significant impact on HS vowel

pronunciation is the connection between pronunciation and identity. As explained by

Gatbonton et al. (2005, p. 491), a speaker’s accent, or pronunciation of certain segments,

is one of the most “visible” indicators of one’s ethnic group, and can also prompt

stereotyping. They suggest that bilingual speakers and language learners with more than

one social group affiliation will sometimes alter speech to avoid stereotyping and

ridicule, or change their speech practices in order to suppress or emphasize one of their

identities (p. 492). An L2 learner, for example, who wishes to remain loyal to the home

ethnic group may purposely pronounce the L2 with an accent in order to maintain ties

with their home identity, whereas in other situations, the learner may strive to pronounce

the L2 differently because they see a reward in mastering the L2 and gaining membership

in another ethnic or social group (Gatbonton et al, 2005, p. 505). A third alternative is

that the learner or bilingual will learn which aspects of the pronunciation signal group

affiliation, and choose to only alter those specific characteristics when speaking in one or

the other of their two languages (Gatbonton et al, 2005, p. 506). In many ways, the

bilingual speaker and the L2 learner are faced with the challenge of deciding in which

ethnic or social group they desire membership, how they can signal that membership, and

the consequences of this decision with respect to social ties with other groups.

310

Not all instances of language contact result in suppressing or stressing one of the

bilinguals’ identities, however. In fact, in some cases, bicultural speakers will integrate

characteristics associated with both heritages in order express their cultural and linguistic

hybridity. Potowski and Matts (2008), for example, investigated the connection between

language and identity among MexiRican young adults living in Chicago. MexiRicans are

speakers of mixed parentage, with one parent of Mexican heritage and the other of Puerto

Rican heritage. Twenty-four MexiRican young adults responded to a series of questions,

including how they would categorize their own speech (i.e., as either Mexican Spanish or

Puerto Rican Spanish), how they self-identified, and how strongly they felt that speaking

Spanish signaled their identity.106 In addition, the participants completed several speech

elicitation and vocabulary tasks that were later rated as either “Mexican” or “Puerto

Rican” by a group of outside observers with varying backgrounds.107

Potowski and Matts (2008) reported that many of the participants were clearly

rated as either Mexican or Puerto Rican, and that the ratings assigned by the outside

listeners often coincided with the way the speakers had identified their own speech.108

Several participants, however, were found to exhibit dialectal characteristics associated

with both Mexican and Puerto Rican varieties of Spanish, such as mixing syllable-final

/s/ retention (typically Mexican) with rhotacism (typically Puerto Rican). Some

106 Potowski and Matts (2008) examined several other aspects of the MexiRican community. In addition to basic questions about family history, background, and language use, participants were also asked to discuss their participation in cultural activities, what type of music they listened to, their attitudes towards Spanish, and to explain their awareness of stereotypes associated with each of their heritages. 107 The outside raters were the following: two Mexican Americans, one mainland US-raised Puerto Rican, one island-raised Puerto Rican, one island-raised Dominican, and three Anglo-Americans who were fluent in Spanish. The raters were asked to listen to the story told by each of the participants and indicate if the speaker was 1) definitely Mexican, 2) probably Mexican, 3) unsure/neither, 4) probably Puerto Rican, and 5) definitely Puerto Rican. Each rater was then asked to indicate why they made their choice, and which characteristics signaled the dialect they chose. 108 Another important finding of this investigation is that speakers often adopt the dialectal characteristics of the mother into their own speech.

311

participants also argued that they were able to “switch” dialects, which Potowski and

Matts (2008, p. 156) argue indicates that these particular speakers “consciously used

language to mark either their MX or PR identity in given contexts.”

Although none of the HS who participated in the present investigation were

MexiRican and only two were of mixed parentage, all could possibly be described, to

some extent, as bi-cultural. All of the HS were raised in Spanish-speaking households

and communities, but were also current residents of the United States, attended an

American university, and were fluent (and some even dominant) speakers of English.

These facts lead to the questions 1) could HS of Spanish combine characteristics of

English and Spanish vowels as a means to emphasize their hybrid identity? or 2) could

language dominance, social or familial pressures, and the desire to “fit in” to a particular

social or ethnic group result in their stressing or deemphasizing characteristics of one of

their two languages?

Although the present study intended to examine the connection between identity

and Spanish vowel pronunciation by means of the language background questionnaire,

methodological difficulties prevented the analysis. The background questionnaire

included a series of questions which asked the participants to indicate which ethnic terms

they would use to self-identify, as well as statements about how strongly they felt

speaking Spanish helped them feel connected with their heritage. Nearly all participants

indicated that they identified as latino/as, mexicano/as, and americano/as. In addition,

they all indicated that they either “agreed” or “strongly agreed” with the statement

“Speaking Spanish helps me feel connected with my heritage.” There was thus very little

variability within the group of participants in terms of how they responded to these items

312

on the background questionnaire. Future investigations could incorporate more detailed

and open-ended questions in order to further explore the connection between HS

language use, pronunciation, and identity. A larger subject population and more carefully

designed instruments may reveal that HS identity is linked to the pronunciation of their

vowels in Spanish.

5.5.3 Control data

The goal of the present study was first and foremost, to describe HS vowel

pronunciation in a systematic way by assessing the extent to which linguistic variables,

speech style, and individual characteristics known to affect monolingual and L2 learner

varieties also affect HS speech production. Although this study does provide a

comprehensive acoustic examination of HS vowels, several important questions about HS

speech production cannot be addressed in this dissertation due to the lack of control data.

One of the primary arguments in the heritage literature is that HS exhibit a “benefit” in

the domain of pronunciation due to their early exposure to Spanish, resulting in their

sounding more native-like when compared to L2 learners who acquired the language later

in life. Comparisons between HS vowels and monolingual Spanish vowel pronunciation

obtained in other studies offered some insight into which characteristics HS share with

native monolinguals and how they differ, but comparing across studies is less than ideal.

As studies employ different instruments and analysis techniques, the comparisons

between two separate investigations are often not reliable. In order to truly understand

how “native-like” HS pronunciation is, vowel productions of monolingual Spanish

speakers from a similar dialect region, as well as the pronunciation of late L2 learners of

Spanish, need to be obtained.

313

An examination of monolingual Spanish vowels produced in the same tasks

utilized in this dissertation would reveal with greater reliability the characteristics that HS

share with monolinguals as well as how they differ. As few cross-dialectal studies of

Spanish vowels have been conducted, it remains unclear if the HS pronunciations of /e/

and /u/ actually fall within the range of possible productions among monolingual

speakers. In other words, the backed /e/ and fronted /u/ produced by HS may be a

characteristic of monolingual Spanish spoken in central Mexico, and not the result of

English influence as has been suggested throughout this dissertation. Most acoustic

studies of Spanish vowels have examined peninsular varieties (the exceptions being

Servín and Rodríguez (2001), Serrano (2006), and the impressionistic studies carried out

by Matluck (1952), Boyd-Bowman (1952) and Lope Blanch (1972), and therefore little is

known about the vowel system of central Mexico. Thus, obtaining monolingual vowel

productions with the same speech elicitation instruments employed in this dissertation

would not only indicate if the backing of /e/ and fronting of /u/ are present in

monolingual Spanish, but also contribute to the general body of literature pertaining to

Spanish vowels.

In order to understand how early exposure to a language affects HS vowel

production later in life and if HS actually do “benefit” in the domain of pronunciation, a

comparison between HS and late learners of Spanish is also necessary. A number of

investigations have indicated that age of learning and age of arrival to a foreign country

impact pronunciation in the heritage language or the L2 (Chang et al., 2009; Flege et al.,

1995; Godson, 2003, among others). A comparison between Spanish vowels produced

by HS and late L2 learners of Spanish could support previous findings if the HS vowel

314

productions were found to more closely resemble the monolingual Spanish vowel

productions than the vowels produced by L2 learners.

Another question that cannot be fully answered based solely on the data obtained

from HS is the extent to which HS have integrated characteristics of English into their

Spanish. In order to understand if and how the heritage Spanish vowel system is

modified or hybridized directly due to English contact, an analysis of the English vowels

produced by the same HS who participated in this study should also be conducted and

compared to the English vowels of monolingual English speakers from the Chicago area.

Such an analysis would allow for three important comparisons. First of all, an analysis of

HS English vowels would allow for an examination of how lexical stress, syllable type,

and speech style affect their non-heritage language, and how the effects of the

aforementioned variables differ across their two languages. Second, comparing HS

English vowels to monolingual English vowels would indicate the extent to which HS

integrate characteristics of the Northern Cities Vowel Shift (NCVS) into their English.

Third, a comparison across languages between HS Spanish vowels and monolingual

English vowels would reveal if HS of Spanish have integrated characteristics of the

NCVS into their Spanish. Although several studies have examined Heritage English

pronunciation in the Northern Cities region (Konopka and Pierrehumbert, 2008, in press;

Roeder, 2010), none of the previous investigations have analyzed if and how the NCVS

in English affects the pronunciation of Spanish vowels.

To summarize this section briefly, analyses and comparisons between

monolingual Spanish, English, and L2 learner vowels would offer additional insight into

the HS vowel system. Comparisons between HS, monolingual Spanish speakers, and

315

advanced late L2 learners would indicate the extent to which HS pronounce Spanish in a

more native-like fashion, and potentially support the argument that early exposure to a

language results in a benefit in pronunciation later in life. An examination of HS English

and monolingual English vowels would reveal if the unique distribution of HS vowels

and the presence of robust stress effects are partly attributable to English influence.

Finally, a cross-language comparison between monolingual English vowels and HS

Spanish vowels might suggest that characteristics of the NCVS have influenced the

Spanish vowel pronunciation of this population of HS of Spanish from Chicago. Taken

together, the additional experiments and comparisons would allow for a more complete

understanding of HS vowel pronunciation and the factors that influence this population’s

productions of Spanish vowels.

5.6 Summary of discussion

This chapter focused on discussing the results of the statistical analyses described

in chapter 4. Section 5.1 described the overall distribution of Spanish vowels, comparing

the HS patterns of pronunciation to native speaker norms described in the literature. The

organization of the HS vowel system differed from traditional descriptions of the

monolingual Spanish vowel system in that it was less symmetrical than typically

described, and exhibited an /e/ that was produced farther back and an /u/ that was

produced farther front than traditional characterizations of monolingual Spanish vowels.

Section 5.2 of this chapter discussed how lexical stress, syllable type, and speech

style affected HS vowel production. The atonic vowels produced by the HS in this study

were more centralized and shorter than the tonic vowels: a pattern that is generally

unattested in monolingual Spanish varieties. As discussed in section 5.2.1, the shortening

316

and centralization could be attributed to HS exhibiting the unstressed vowel reduction

argued to exist in central Mexico, or could be the result of contact with English. Section

5.2.2 focused on discussing the influence of syllable type on HS vowel production.

Overall, the anticipated mid vowel allophony argued to exist for monolingual varieties of

Spanish was not observed in the HS system. The overall lack of a syllable type effect in

HS Spanish could indicate that the differences in vowel production based on syllable

structure are not robust enough to manifest themselves categorically or statistically.

However, the lack of an effect of syllable type on pronunciation could have been due, in

part, to the inclusion of /s/ as the coda consonant in many of the closed syllable examples.

Finally, section 5.2.3 discussed how speech style affected the organization of HS vowels.

Like monolingual speakers of Spanish, HS also exhibited greater degrees of

centralization and duration reduction in spontaneous speech when compared to controlled

speech. The differences in vowel dispersion revealed across the three tasks confirmed

that in order to truly understand the vowel space of a specific group of speakers, it is

important to analyze different speech styles.

A discussion of the effects of the between-subjects variables was presented in

section 5.3. The analysis of the connection between course level and vowel production

revealed that only the /u/ differed statistically between the groups: the advanced-level

participants produced an /u/ that was farther back in the vowel space than the

intermediate-level speakers. Section 5.3.2 discussed the relationship between grammar

proficiency and vowel production. The analyses of the non-normalized formant values

suggested that participants with higher grammar proficiency scores had a tendency to

produce vowels that occupied a smaller portion of the acoustic space than those with

317

lower grammar proficiency scores: a finding that conflicted with the results of Menke

and Face (2010) and Chang et al. (2009). When the analyses were conducted with the

normalized formant values, however, the only vowel to show a significant relationship

with grammar proficiency was /o/. Combined with the results of course level, it was

concluded that HS with more overt language instruction and more formal knowledge of

prescriptive grammar might exhibit a greater separation of the back vowel categories.

However, as traditional grammar proficiency tests are perhaps not the most appropriate

means of examining HS proficiency in Spanish, the results should be interpreted with

caution.

Section 5.3.3 and section 5.3.4 described how traveling abroad and speaking

Spanish outside of the classroom impacted HS vowel production. Those who traveled

abroad more frequently were found to produce the point vowels more peripherally than

those who traveled less frequently. A similar trend for expansion was revealed for

Spanish use. The participants who reported speaking Spanish more often and in a wider

variety of contexts tended to produce the Spanish vowels more peripherally. The trend

for expansion based on travel frequency and Spanish use was interpreted to mean that

traveling and speaking Spanish frequently promote more “native-like” vowel

pronunciation, if “native-ness” refers to expansion of the vowel space. The question of

“native-ness” was discussed further in section 5.4., which described an additional

interaction between vowel, task, and travel. The results of the interaction suggested that

more experienced HS have an expanded vowel space because they have acquired, or are

closer to acquiring, the formal speech style: the most hyperarticulated speech represented

by the controlled speech task (CPT).

318

The findings of the present study also revealed an important connection between

HS and L2 learners. A large body of literature suggests that traveling abroad and

speaking more Spanish outside of the classroom have a positive impact on the acquisition

of L2 pronunciation. That the same behaviors affected HS vowel production suggests

that even HS, who are argued to “sound” like native speakers, may benefit from increased

exposure to Spanish.

Section 5.3.5 discussed the connection between cultural sensitivity and HS vowel

production. Overall, only the vowel /i/ was found to correlate with cultural sensitivity:

the Euclidean distance of the /i/ from the centroid decreased as the cultural sensitivity

score increased. The results of this analysis suggest that, unlike the L2 learners in Alvord

& Christiansen (2009), there was very little connection between HS vowel production

and cultural sensitivity. Section 5.3.5 discussed two possible explanations for the

absence of a connection between cultural sensitivity and vowel production: 1) only the

total cultural sensitivity score was analyzed and not the subscales, and 2) there may not

have been sufficient variability within the total scores to show a consistent effect.

Last of all, this chapter discussed some of the limitations of the present

investigation and suggested directions for future research. One of the limitations of this

study was difficulty in creating phonetically balanced instruments (i.e., PIT and CPT that

could be performed by HS of Spanish). A second limitation was the inability to directly

examine the connection between HS language use and identity with the language

background questionnaire utilized in this study. Finally, as there were no control groups

included in this dissertation, the question of how “native-like” HS pronunciation really is

cannot be fully addressed until native monolingual vowel productions in multiple speech

319

styles are analyzed. In the future, additional studies of HS and monolingual Spanish

vowels may consider balancing the corpora by consonantal context and lexical frequency,

as well as integrating additional instruments that can more accurately address the

connection between language use and identity.

320

6 CONCLUSIONS

The present investigation has offered the first acoustic characterization of the HS

Spanish vowel system by systematically measuring the F1 and F2 frequencies and

duration, calculating a centroid and the Euclidean distance of vowels from the centroid,

and analyzing these measures statistically. The analyses examined the effects of several

variables on HS vowel production, indicating that both language internal and language

external factors play an important role in how HS of Spanish pronounce their vowels.

To summarize the general findings of this dissertation, HS exhibited robust

differences in vowel quality, dispersion, and duration based on lexical stress: atonic

vowels were produced in a more central location within the vowel space, were situated

closer to the centroid, and were shorter in duration than their tonic counterparts. The

effects of speech style on vowel production were similar to those described for lexical

stress, in that vowels were more centralized and shorter in spontaneous speech styles

when compared to vowels produced in controlled styles. The effects of syllable type

were less clear and robust, suggesting that the mid vowel allophony argued to exist in

monolingual Spanish vowel systems is not present in the HS system.

Extralinguistic influences such as Spanish use and travel frequency were found to

have the greatest effect on pronunciation. Heritage speakers who reported traveling more

and speaking Spanish more regularly were found to produce vowels that were more

peripheral than those produced by infrequent travelers and speakers of Spanish. The

expansion of the vowel space suggests a closer approximation to the native speaker

vowel configuration reported in other studies. Course level and grammar proficiency had

some effects on HS vowel pronunciation and also suggested an expansion effect for more

321

advanced HS and those with higher proficiency scores, but the effects were primarily

limited to the back vowels. Cultural sensitivity had the least robust effects on

pronunciation, indicating that at least for this group of HS, cultural sensitivity does not

play an integral role in shaping vowel pronunciation.

Taken together, the overall findings of this dissertation have important implications

for our understanding of Spanish vowel systems, the HS Spanish vowel system, and the

complex interplay of factors that influence HS pronunciation. First and foremost, this

investigation contributes to the growing number of acoustic studies of Spanish vowels.

The general description of the HS vowel space provided further evidence against a

simple, stable, symmetrical, five-vowel system typically argued for Spanish. Although

the HS in this study are bilingual and therefore the distribution and organization of their

vowels could be influenced by language contact, the examination of HS vowels indicates

that Spanish vowels can vary from the established norms and still potentially be

perceived as “native-sounding.” In addition, the description of the vowel space indicated

how vowel productions can vary within the acoustic space, on which dimensions they

exhibited greater or lesser variation, and which vowels showed the greatest degree of

overlap. Understanding the distribution of HS Spanish vowels and the dimensions on

which they vary and overlap may help establish the possible acoustic ranges that Spanish

vowels can occupy.

The effects (or lack thereof) of lexical stress, speech style, and syllable type also

have implications for how the Spanish vowel system is described and analyzed in future

studies. That atonic vowels reduced in HS Spanish, but not in the direction of schwa,

may suggest that 1) Spanish vowels exhibit more quality and quantity differences based

322

on lexical stress than previously believed, and 2) the way that lexical stress effects

manifest themselves in Spanish differs from what is observed in English. The analysis of

syllable type lends more evidence against the argument that the Spanish mid vowels

exhibit allophonic variation: an assumption that had been questioned and partially

disproven by Morrison (2004) and Servín and Rodríguez (2001). Instead of arguing for

distinct allophones for the mid vowels, future descriptions of the system might emphasize

that small differences in pronunciation may result as a consequence of coarticulatory

effects or gestural timing based on the surrounding consonantal context. Finally, an

examination of HS vowel quality and duration in the three different tasks confirmed the

influence of stylistic effects on vowel pronunciation. The similar effects observed for

both HS and monolingual speaker populations indicate that in order to truly understand

the Spanish vowel system, it is important to analyze pronunciation in multiple speech

styles and on multiple levels, and not base descriptions of the system on controlled

utterances alone.

Combined, the statistical analyses carried out in this dissertation offer insight into

which characteristics HS share with monolingual speakers of Spanish, and provide

further support for a modified system of pronunciation for bilingual speakers. Like

monolingual speakers of Spanish, HS exhibited vowel quality and quantity reduction in

spontaneous speech, as well as the intrinsic vowel duration described by Marín Gálvez

(1995). In addition, they patterned with the monolingual speakers of Morrison (2004)

and Servín and Rodríguez (2001) in that they did not produce the Spanish mid vowels

differently in closed and open syllables. Despite the similarities, however, HS vowels

differ from traditional descriptions of monolingual Spanish vowels in several important

323

ways. In contrast to what has traditionally been described for Spanish, the HS vowel

space is asymmetrical and is characterized by five degrees of height and four degrees of

backness. In addition, the position of the /e/ and /u/ in the acoustic space differed from

presupposed monolingual productions in that /e/ was produced further back and lower in

the space, and the /u/ was significantly fronted. Finally, the degree of unstressed vowel

reduction exhibited by HS is unattested in monolingual varieties, although the absence of

a stress effect has been mostly assumed for Mexican varieties of Spanish and not

thoroughly acoustically examined.

Thus, although HS of Spanish are typically argued to produce Spanish sounds in a

native-like way, the present study indicates that their productions are not identical to

those of a monolingual, or at least not to previous descriptions of monolingual Spanish

vowels; their system is modified, sharing some characteristics with monolingual varieties

but differing on other dimensions. This first acoustic analysis of HS vowels therefore

lays the foundation for future comparisons to monolingual Mexican Spanish vowels

using the same speech elicitation instruments employed here. In-depth comparisons with

monolingual Mexican age-matched peers will provide further insight into the similarity

between HS and monolingual vowel production.

Another important aspect of this dissertation is that it has revealed additional

similarities between HS and L2 learners in terms of which individual behaviors and

practices impact speech production. The same behaviors argued to have a positive impact

on HS and L2 morphosyntactic performance and on L2 acquisition of pronunciation were

also found to influence HS pronunciation. That is, although HS are argued to “sound”

like native speakers, behaviors that are typically encouraged for L2 learners to acquire the

324

language were found to impact HS vowel production. Traveling abroad resulted in a

greater expansion of the acoustic space, as did speaking Spanish outside of the classroom

more frequently. In addition, there is some suggestion that formal grammatical

knowledge and instruction can also alter the way that HS produce their vowels even

though the effects of grammar proficiency and course level reported in this investigation

were less robust than those obtained for travel and Spanish use.

The finding that extralinguistic factors influence HS pronunciation contributes to

our general understanding of HS populations and has important implications for how HS

are described in the literature. That traveling abroad and more Spanish use, two

behaviors known to facilitate L2 acquisition of pronunciation, also affect HS production

of vowels suggests that there may be more similarities between HS and learners than

previously believed. Investigations focusing on morphosyntactic knowledge and

pedagogical needs have already indicated that HS and L2 learners exhibit similar

behaviors in both domains, and that traveling abroad, formal language instruction, and

Spanish use outside of the classroom are three factors that correlate with morpho-

syntactic knowledge. The assumption present in the literature, however, is that HS

pronunciation is already superior to, or more native-like than, that of L2 learners at

similar levels of proficiency. The results of this dissertation lend evidence to the

variability of HS vowel categories: just like L2 learners, HS pronunciation may be

shaped by a number of language- external influences.

The effect of extralinguistic influences on vowel production also confirms that

seemingly homogenous populations may exhibit considerable variability in

pronunciation, and that small changes in behavior can have a significant impact on

325

speech production. Although all of the HS who participated in this study had similar

backgrounds, their pronunciation of vowels was not identical and was affected by

multiple factors. The morphosyntactic and pedagogical literature has already recognized

the wide-range of variation in HS abilities, but variation in HS pronunciation has received

little attention. Thus, future descriptions of HS should recognize the variability not only

in proficiency level, but also in pronunciation, and not assume that all HS “benefit” from

early exposure to Spanish in the same way.

In conclusion, the acoustic and statistical analyses presented in this study have

contributed to our overall understanding of the Spanish vowel system, and have provided

the first comprehensive description of HS vowels. Taken together, the findings depict

HS of Spanish as a unique population of bilingual individuals who, although often

equated with native speakers in terms of their pronunciation, exhibit a distinct

organization of vowels that is influenced by a complex interaction of linguistic, stylistic,

and extralinguistic factors. All in all, this investigation emphasizes the importance of

acoustic analyses of pronunciation and demonstrates how detailed, acoustic studies of

speech production can offer insight into the system that is not achievable by impressions

alone.

326

REFERENCES

Adank, P., Smits, R., & van Hout, R. (2004). A comparison of vowel normalization

procedures for language variation research. The Journal of the Acoustical Society

of America, 116(5), 3099-3107.

Alvord, S., & Christiansen, D. (2009). Acquisition of spirantization of voiced stops

during anextended stay abroad. Paper presented at the 9th Hispanic Linguistics

Symposium 21-24 of October, 2009. Universidad de Puerto Rico, Río Piedras,

Puerto Rico.

Alvord, S., & Rogers, B. (2011). Vowel quality and language contact in Miami-Cuban

Spanish. Paper presented at The 2011 Hispanic Linguistic Symposium 6-9 of

October, 2011. University of Georgia, Athens, GA.

Amastae, J. (1978). The Acquisition of English Vowels. Papers in Linguistics, 11(3-4),

423-457.

Amastae, J. (1982). Mid vowel raising and its consequences in Spanish. Linguistics, 20,

157-202.

Amengual, M. (2011). Spanish and Catalan in Majorca: Are there contact-induced

changes in the Catalan vowel system? In L. A. Ortiz-López (Ed.), Selected

Proceedings of the 13th Hispanic Linguistics Symposium (pp. 214-223).

Somerville, MA: Cascadilla Proceedings Project.

Au, T. K., Knightly, L. M., Jun, S.-A., & Oh, J. S. (2002). Overhearing a language during

childhood. Psychological Science, 13, 238–243.

Au, T.K., Oh, J.S., Knightly, L.M., Jun, S-A., & Romo, L. (2008). Salvaging a childhood

language. Journal of Memory and Language, 58, 998-1011.

327

Boersma, P., & Weenink, D. (2010). Praat: doing phonetics by computer [Computer

program]. Version 5.1.44, retrieved 4 October 2010 from http://www.praat.org/

Bolger, P., & Zapata, G. (2011). Psycholinguistic Approaches to Language Processing in

Heritage Speakers. The Heritage Language Journal, 8(1), 1-29.

Boomershine, A. (2011). The perception of English vowel contrasts: The case of

Monolingual, Bilingual, and Heritage Speakers of Spanish and English. Paper

presented at The 2011 Hispanic Linguistic Symposium 6-9 of October, 2011.

University of Georgia, Athens, GA.

Boyd-Bowman, P. (1952). La pérdida de vocales átonas en la altiplanicie Mexicana.

Nueva Revista de Filología Hispánica, 6(2), 138-140.

Bradlow, A. (1994). A comparative acoustic study of English and Spanish vowels. JASA,

97(3), 1916-1924.

Bullock, B. E. (2009). Prosody in contact in French: A case study from a heritage variety

in the USA. International Journal of Bilingualism, 13(2), 165–194.

Bybee, J. (2001). Phonology and language use. Cambridge,U.K.: Cambridge University

Press.

Carter, P. (2007). Phonetic variation and speaker agency: Mexicana identity in a North

Carolina Middle school. University of Pennsylvania Working Papers in

Linguistics, 13(2), 1-14.

Chang, C., Haynes, E., Yao, Y., & Rhodes, R. (2009). The Phonetic Space of

Phonological Categories in Heritage Speakers of Mandarin. In M. Elliott et al.

(Eds.), Proceedings from the 43rd Annual Meeting of the Chicago Linguistic

Society: The Main Session, 31-45. Chicago, IL: Chicago Linguistic Society.

http://www.praat.org/

328

Christiansen, D. (2009). Acquisition of spirantization of voiced stops during an

extended stay abroad. Master’s Thesis, Brigham Young University.

Clopper, C., Pisoni, D., & de Jong, K. (2005). Acoustic characteristics of the vowel

systems of six regional varieties of American English. Journal of the Acoustical

Society of America, 118 (3), 1661-1676.

Cushner, K. (1986). The Inventory of Cross-Cultural Sensitivity. Kent State University:

School of Education.

Delattre, P. (1969). An acoustic and articulatory study of vowel reduction in four

languages. International Review of Applied Linguistics in Language Teaching,

7(4), 295-325.

Delforge, A. (2008). Unstressed vowel reduction in Andean Spanish. In L. Colantoni & J.

Steele (Eds.), Selected Proceedings of the 3rd Conference on Laboratory

Approaches to Spanish Phonology (pp.107-124). Somerville, MA: Cascadilla

Proceedings Project.

Díaz-Campos, M. (2004). Context of learning of Spanish second language phonology.

Studies in Second Language Acquisition, 26, 249-273.

Díaz-Campos, Manuel. (2006). The effect of style in second language phonology: an

analysis of segmental acquisition in study abroad and regular-classroom students.

In C. Klee & T. Face (Eds.), Selected Proceedings of the 7th Conference on the

Acquisition of Spanish and Portuguese as First and Second Languages (pp. 26-

39). Somerville, MA: Cascadilla Proceedings Project.

329

Duprey-Almeyda, M. (2009). Language use, attitudes, perception and production of

English vowels among tenth graders in a public high school in Aguadilla, Puerto

Rico. University of Puerto Rico: Mayaguez, Puerto Rico.

Flege, J. (1991). Age of learning affects the authenticity of voice-onset time (VOT) in

stop consonants produced in a second language. Journal of the Acoustical Society

of America, 89(1), 395-411.

Flege, J. & Eefting, W. (1987). The production and perception of English stops by

Spanish speakers of English. Journal of Phonetics, 15, 67-83.

Flege, J. & Eefting, W. (1988). Imitation of a VOT continuum by native speakers of

English and Spanish; Evidence for phonetic category formation. Journal of the

Acoustical Society of America, 83(2), 729-740.

Flege, J., Munro, M., & MacKay, I. (1995). Factors affecting degree of perceived foreign

accent in a second language. Journal of the Acoustical Society of America, 97,

3125-3134.

Fought, C. (1999) A majority sound change in a minority community: /u/-fronting in

Chicano English. Journal of Sociolinguistics, 3, 5-23.

Geeslin, K. & Gudmestad, A. (2010). An exploration of the range and frequency of

occurrence of forms in potentially-variable structures in second language Spanish.

Studies in Second Language Acquisition, 32(3), 433-463.

Godinez, M. & Maddieson, I. (1985). Vowel differences between Chicano and general

Californian English? International Journal of the Sociology of Language,

43-58.

http://jimflege.com/files/Flege_Eefting_English_stops_JP_1987.pdf

http://jimflege.com/files/Flege_Eefting_English_stops_JP_1987.pdf

http://jimflege.com/files/Flege_Munro_factors_affecting_JASA_1995.pdf

330

Godson, L. (2003). Phonetics of Language Attrition: Vowel Production and Articulatory

Setting in the Speech of Western Armenian Heritage Speakers. Doctoral

dissertation, University of California, San Diego.

Goldstein, U. (1980) An articulatory model for the vocal tracts of growing children.

Doctoral dissertation ,M . I . T .

Grabe, E., & Low, E. (2002). Durational variability in speech and the rhythm class

hypothesis. In C. Gussenhoven & N. Warner (Eds.), Papers in Laboratory

Phonology 7 (pp. 515-546). Berlin: Mouton de Gruyter.

Guirao, M., & Borzone de Manrique, A. (1975). Identification of Argentine Spanish

vowels. Journal of Psycholinguistic Research, 4(1), 17-25.

Guion, Susan G. (2003). The vowel systems of Quichua-Spanish bilinguals. Phonetica,

60, 98-128.

Harmegnies, B., & Poch-Olivé, D. (1992). A study of style-induced vowel variability:

Laboratory versus spontaneous speech in Spanish. Speech Communication, 11(4-

5), 429-437.

Harris, M., & Gries, S. (2011). Measures of speech rhythm and the role of corpus-based

word frequency: a multifactorial comparison of Spanish(-English) speakers.

International Journal of English Studies, 11(2), 1-22.

Henríquez Ureña, P. (1938). El español en Méjico, los Estados Unidos y la América

Central. Buenos Aires: Biblioteca de Dialectología Hispanoamericana 4.

Hillenbrand, J., Getty, L., Clark, M., & Wheeler, K. (1995). Acoustic characteristics of

American English vowels. Journal of the Acoustical Society of America, 97,

3099-3111.

331

House, A. & Fairbanks, G. (1953). The influence of consonant environment upon the

secondary acoustical characteristics of vowels. In N. Lass, (Ed.), UC Berkeley

Phonology Lab Annual Report (2011) (pp. 79-99).

Hualde, J.I. (2005). The Sounds of Spanish. Cambridge: Cambridge University Press.

Johnson, K. (2003). Acoustic and Auditory Phonetics. Malden: Blackwell.

Jurado, M., & Arenas, M. (2005). La fonética del español: Análisis e investigación de

los sonidos del habla. Buenos Aires: Editorial Quorum.

Kagan, O. (2005). In support of a proficiency-based definition of Heritage Language

learners: The Case of Russian. The International Journal of Bilingual Education

and Bilingualism, 8(2-3), 213-221.

Knightly, L., Jun, S.A., Oh, J., & Au, T. (2003). Production benefits of childhood

overhearing. Journal of the Acoustical Society of America, 114(1), 465-474.

Koenig, W., Dunn, H. K., & Lacey, L. Y. (1946). The sound spectrograph. Journal of the

Acoustical Society of America, 18, 19-49.

Konopka, K. & Pierrehumbert J.B. (in press). Vowel dynamics of Mexican Heritage

English: Language contact and phonetic change in a Chicago community. Papers

from the 46th Annual Meeting of the Chicago Linguistic Society.

Konopka, K. & Pierrehumbert, J.B. (2008) Vowels in Contact: Mexican Heritage English

in Chicago. SALSA, 10, 94-104.

Labov, W. (1972). Sociolinguistic Patterns. Philadelphia: University of Pennsylvania

Press.

Labov, W., Ash, S., & Boberg, C. (2006). Atlas of North American English. New York:

Mouton de Gruyter.

332

Lafford, B. (2006). The effects of study abroad vs. classroom contexts on Spanish SLA:

Old assumptions, new insights and future research directions. In C. A. Klee & T.

L. Face (Eds.), Selected proceedings of the 7th Conference on the Acquisition of

Spanish and Portuguese as First and Second Languages (pp. 1-25). Somerville,

MA: Cascadilla Proceedings Project.

Ladefoged, P. (1980). What are linguistic sounds made of? Language, 56 (3), 485-502.

Ladefoged, P. (2001). A course in phonetics (4th ed.). Boston: Thomson.

Ladefoged, P. (2003). Phonetic data analysis. Malden: Blackwell.

Ladefoged, P. (2005). Vowels and consonants (2nd ed.). Malden: Blackwell.

Lennes, M. (2002). Calculate_segment_durations.praat. http://www.helsinki.fi/

~lennes/praat-scripts/ (15 October 2010).

Lennes, M. (2003). Collect_formant_data_from_files.praat. http://www.helsinki.fi/

~lennes /praat-scripts/ (15 October 2010).

Lindblom, B. (1990). Explaining Phonetic Variation: a Sketch of the H&H Theory. In H.

J. Hardcastle & A. Marchal (Eds.), Speech Production and Speech Modeling,

NATO ASI Series D: Behavioural and Social Sciences (Vol. 55, pp. 403–439).

Dordrecht: Kluwer.

Lipski, J.M. (1994). Latin American Spanish. London: Longman.

Lipski, J.M. (2008). Varieties of Spanish in the United States. Washington, D.C.:

Georgetown University Press.

Lope Blanch, J. (1972). En torno a las vocales caedizas del español mexicano. Estudios

sobre el español de México. México: Editorial Universidad Nacional Autónoma

de México.

333

Lobanov, B.M. (1971). Classification of Russian vowels spoken by different listeners.

Journal of the Acoustical Society of America, 49, 606-08.

Luce, P., & Pisoni, D. (1998). Recognizing spoken words: The neighborhood activation

model. Ear and Hearing, 19 (1), 1–36.

Lynch, A. (2003). The Relationship between Second and Heritage Language

Acquisition:Notes on Research and Theory Building. The Heritage Language

Journal, 1(1), 1-18.

Malovrh, P. (2008). A multifaceted analysis of the interlanguage development of

Spanish direct-object clitic pronouns observed in L2-learner production. Doctoral

Dissertation, Indiana University, Bloomington.

Marín Gálvez, R. (1995). La duración vocálica en español. E.L.U.A., 10, 213-226.

Martín Butragueño, P. (2011). Vocales en contexto. In El homenaje a Thomas C. Smith-

Stark. Mexico: Colegio de Mexico.

Martínez-Celdrán, E. (1984). Cantidad e intensidad en los sonidos obstruyentes del

castellano: Hacia una caracterización acústica de los sonidos aproximantes.

Estudios de fonética experimental, 1, 73–129.

Martínez-Celdrán, E. (1995). En torno a las vocales del español. En torno a las vocales

del español: análisis y reconocimiento. Estudios de Fonética Experimental, VII,

PPU, 195-218.

Matluck, J. (1952). La pronunciación del español en el valle de México. Nueva Revista

de Filología Hispánica, 6(2), 109-120.

Mendoza-Denton, N. (1999). Sociolinguistics and linguistic anthropology of US Latinos.

Annual Review of Anthropology, 28, 375-379.

334

Menke, M., & Face, T. (2010). Second language Spanish vowel production: An acoustic

Analysis. Studies in Hispanic and Lusophone Linguistics, 3(1), 181-214.

Mikulski, A. (2010). Age of onset of bilingualism, language use, and the volitional

subjunctive in heritage learners of Spanish. Heritage Language Journal, 7(1), 28-

45.

Montrul, S. (2005). Second language acquisition and first language loss in adult early

bilinguals: exploring some differences and similarities. Second Language

Research, 2(3), 199–249.

Montrul, S. & Bowles, M. (2010). Is Grammar Instruction Beneficial for Heritage

Language Learners? Dative Case Marking in Spanish. Heritage Language

Journal, 7(1), 47-73.

Montrul, S., Foote, R., & Perpiñan, S. (2008a). Gender Agreement in Adult Second

Language Learners and Spanish HeritageSpeakers: The Effects of Age and

Contextof Acquisition. Language Learning, 58(3), 503–553.

Montrul, S., Foote, R., & Perpiñán, S. (2008b). Knowledge of Wh-movement in Spanish

L2 Learners and Heritage speakers. In J. Bruhn de Garavito & E. Valenzuela

(Eds.), Selected Proceedings of the 10th Hispanic Linguistics Symposium (pp. 93-

106). Somerville, MA: Cascadilla Proceedings Project.

Morrison, G. (2004). An acoustic and statistical analysis of Spanish mid-vowel

allophones. Estudios de Fonética Experimental, 13, 11-37.

Morrison, G., & Escudero, P. (2007). A cross-dialect comparison of Peninsular- and

Peruvian-Spanish vowels. Proceedings of the 16th International Congress of

Phonetic Sciences: Saarbrücken 2007.

335

Mrak, A. (2011). Heritage speakers and the Standard: Fighting linguistic hegemony. In

L. Ortiz-López (Ed.), Selected Proceedings of the 13th Hispanic Linguistics

Symposium (pp. 161-168). Somerville, MA: Cascadilla Proceedings Project.

Munson, B., & Solomon, N. (2004). The effect of phonological neighborhood density on

vowel articulation. Journal of Speech, Language, and Hearing Research, 47,

1048–1058.

Munro, M., Flege, J., & MacKay, I. (1996). The effect of age of second-language

learning on the production of English vowels. Applied Psycholinguistics, 17, 313-

334.

Navarro Tomás, T. (1918). Manual de pronunciación español (12th ed.). Madrid, Spain:

CSIC.

Navarro Tomás, Tomás. (1917). Cantidad de las vocales inacentuadas. Revista de

Filología Española, 4, 371–388.

Oh, J. S., Jun, S.-A., Knightly, L. M., & Au, T. K. (2003). Holding on to childhood

language memory. Cognition, 86, B53–B64.

O’Rourke, E. (2010). Dialect differences and the bilingual vowel space in Peruvian

Spanish. In O. Ortega-Llebaria (Ed.), Selected Proceedings of the 4th Conference

on Laboratory Approaches to Spanish Phonology (pp. 20-30). Somerville, MA:

Cascadilla Proceedings Project.

Peterson, G.E., & Barney, H.L., (1952). Control methods used in a study of the vowels.

Journal of the Acoustical Society of America, 24, 175-184.

Perrisinotto, G. (1975). Fonología del español hablado en la Ciudad de México. Ensayo

de un método sociolingüístico. México: El Colegio de México.

http://jimflege.com/files/Munro_Flege_production_vowels_AP_1996.pdf

336

Poch-Olive, D., Harmegnies, B.,& Butragueño, P. (2008). Influencia del estilo de habla

sobre las características de las realizaciones vocálicas en el español de la ciudad

de México. Actas del XV Congreso Internacional ALFAL. Montevideo,

Uruguay, 18-21 de agosto de 2008, CD.

Pike, K. (1945). The intonation of American English. Ann Arbor: University of Michigan

Press.

Potowski, K. (in press). Identity and heritage languages: Moving beyond essentializations. In

S. Beaudrie & M. Fairclough (Eds.), Spanish as a heritage language in the U.S.:

State of the science. Washington, DC: Georgetown University Press.

Potowski, K., & Matts, J. (2008). MexiRicans: Interethnic Language and Identity.

Journal of Language, Identity, and Education, 7, 137–160.

Potowski, K., Potowski, Jegerski, J., & Morgan-Short, K. (2009). The effects of

instruction on linguistic development in Spanish heritage language speakers.

Language Learning, 59 (3), 537-579.

Prieto, P., Mar Vanrell, M., Astruc, L., Payne, E., Brechtje, P. (In Press). Phonotactic

and phrasal properties of speech rhythm: Evidence from Catalan, English, and

Spanish. Speech Communication. doi: 10.1016/j.specom.2011.12.001.

Quilis, A., & Esgueva, M. (1983). Realización de los fonemas vocálicos españoles en

posición fonética normal. In M. Esgueva & M. Cantarero (Eds.), Estudios de

fonética (pp. 159-251). Madrid: Consejo Superior de Investigaciones Científicas.

Roeder, R. (2010). Effects of consonantal context on the pronunciation of /æ/ in the

English of speakers of Mexican heritage from south central Michigan. In N.

Niedzielski and D. Preston (Eds.), Studies in Sociophonetics (pp. 71-89). Mouton

de Gruyter.

337

Rosner, B., & Pickering, J. (1994). Vowel perception and production. Oxford: Oxford

University Press.

Rothman, J. (2007). Heritage speaker competence differences, language change, and

input type: Inflected infinitives in Heritage Brazilian Portuguese. International

Journal of Bilingualism, 11(4), 359-389.

Sadowsky, S. (2010). Los alófonos vocálicos como marcadores de identidad

sociocultural en el discurso oral del castellano de Chile. VI Encuentro Nacional

de Estudios del Discurso Santiago, Chile – 27 y 28 de septiembre de 2010.

Savy, R., & Cutugno, F. (1998). Hypospeech, vowel reduction, and centralization: how

do they interact? Proceedings of the XVIth International Congress of Linguistics.

Schmidt, L., & Willis, E. Systematic Investigation of Voicing Assimilation of Spanish /s/

in Mexico City. In S. Alvord (Ed.), Selected Proceedings of the 5th Conference

on Laboratory Approaches to Romance Phonology (pp. 1-20). Somerville, MA:

Cascadilla Proceedings Project.

Serrano, J. (2006). En torno a las vocales caedizas del español mexicano: una

aproximación sociolingüística. Líderes lingüísticos. México: El Colegio de

México.

Servín, E., & Rodríguez, M. (2001). Estrcutura formántica de las vocales del español de

la ciudad de México. In E. Herrera (Ed.), Temas de fonética instrumental (pp.

39-58). México, D.F.: El Colegio de México, Centro de Estudios Lingüísticos y

Literarios.

Simões, A. (1996). Phonetics in second language acquisition: An acoustic study of

fluency in adult learners of Spanish. Hispania, 79(1), 87–95.

338

Skelton, R B. (1969). The pattern of Spanish vowel sounds. International Review of

AppliedLinguistics in Language Teaching, 7, 231-237.

Stevens, J. (2011). Vowel duration in second language Spanish vowels: Study abroad

versus at-home learners. Arizona Working Papers in SLA & Teaching, 18, 77-104.

Stevens, K. N., & House, A. S. (1963). Perturbation of vowel articulations by

consonantal context: An acoustical study. Journal of Speech and Hearing

Research, 6, 111–128.

Thomas, E.R. (2001). An Acoustic Analysis of Vowel Variation in New World English.

Publication of the American Dialect Society 85. Durham, NC: Duke University

Press.

Thomas, E. R. (2011). Sociophonetics: An Introduction. Basingstoke, England: Palgrave

Macmillan.

Thomas, E.R. & Kendall, T. 2007. NORM: The vowel normalization and plotting suite. [

Online Resource: http://ncslaap.lib.ncsu.edu/tools/norm/ ]

Valdés, G. (2005). Bilingualism, heritage language learners, and SLA research:

Opportunities lost or seized? The Modern Language Journal, 89(3), 410-426.

Valdés, G., and Geoffrion-Vinci, M. (1998). Chicano Spanish: The Problem of the

'Underdeveloped' Code in Bilingual Repertories. The Modern Language Journal,

82, 473-501.

Vitevitch, M., & Stamer, M. (2006). The curious case of competition in Spanish speech

production. Language and Cognitive Processes, 21(6), 760–770.

http://ncslaap.lib.ncsu.edu/tools/norm/

339

Vitevitch, M., & Stamer, M. (2009). The influence of neighborhood density (and

neighborhood frequency) in spanish speech production: A follow-up report. In

Spoken Language Laboratory Technical Report, Volume 1, pp. 1–6. University of

Kansas.

Watt, D., & Fabricius, A. (2002). Evaluation of a technique for improving the mapping

of multiple speakers' vowel spaces in the F1 ~ F2 plane. In D. Nelson (Ed.), Leeds

Working Papers in Linguistics and Phonetics 9 (pp. 159-73).

Willis, E. (2005). An Initial Examination of Southwest Spanish Vowels. Southwest

Journal of Lingusitics, 24, 185-198.

Willis, E. (2008). No se comen, pero sí se mascan: Variación de las vocales plenas en la

República Dominicana. Actas del XV Congreso Internacional de la Asociación

de Lingüística y Filología de AméricaLatina (ALFAL). Montevideo, Uruguay,

August 18-21, 2008. 6pp.

Yao, Y. (2011). The effects of phonological neighborhoods on pronunciation variation

in conversational speech. Doctoral Dissertation, University of California,

Berkley.

Yavas, M. (1996). Differences in voice onset time in early and later Spanish-English

bilinguals. In A. Roca & J. Jensen (Eds.), Spanish in Contact (pp. 131-141).

Sommerville, MA: Cascadilla Press.

Zampini, M., & Green, K. (2001). The voicing contrast in English and Spanish: The

relationship between perception and production. In J. L. Nicol (Ed.), One Mind,

Two Languages: Bilingual Language Processing (pp. 23-48). Oxford: Blackwell

Publishers.

340

Appendix A: List of words presented in the picture identification task (PIT)

Vowel Syllable type Tonic Atonic

A open lápiz, pescado, llaves, taza, cucharas paquetes, café, avispas,

espaguetis, avestruz, mariscos, tostador

closed máscaras, cáscaras, durazno, tanque sandía, bacteria, pasteles,

langostas

E open espaguetis, bacteria, pasteles, peces, eskeletos, espejos, panqueques

películas, lechugas, revistas, pelota, eskeletos

closed bistéc, insectos, césped pescado, espaguetis, espejos,

escobas, eskeletos

I open películas, cursiva, cuchillos, piscinas, oxígeno piscinas, bicicleta, chicharrones

closed avispas, revistas, mariscos bistéc, insectos, cristal

O open chicharrones, osos, escobas fotografías, autobúses

closed langostas, ostras, tostador tostador, oxígeno

U open lechugas, luces, uvas, estufa, pelota, burbujas, autobúses

cuchillos, cucharas, judías verdes

closed fútbol, avestruz cursiva, durazno, burbujas

341

Appendix B: List of words presented in the carrier phrase task (CPT)

Vowel Syllable type Tonic Atonic

A open mano, masa, caso, paco, papa, paso, pata, codazo, taco, tapa, taza, papaya

closed pasto, atlas atleta

E open mesa, atleta, pecho, pepe, peso, quepo, queso, sistema, techo equipo

closed peste eclipse

I open misa, pico, pipa, piso, pupila, quiso, quito, tico, tipo, tiza, tiburón

closed Eclipse, equipo, pista sistema

O open coco, copa, cosa, mono, mozo, pozo, toco, toso codazo

closed poste conduzco, costumbre, oscura

U open cupo, cuscurro, oscura, puso, sutura pupila, sutura

closed conduzco, costumbre, susto cuscurro

342

Appendix C: Language Background questionnaire Participant Background questionnaire

Gender: M F

Date of birth : __________ Current age: _______

Where did your mother live until age 14? ______________________

Where did your father live until age 14? ______________________

In what country were you born? ____________________________

What do you consider to be your native language? ______________________________

Do you speak another language(s) fluently other than your native language? __________

Please list the countries and states in which you have lived in the last 10 years. If you need more space, please continue on the back of this page. If you do not wish to list the specific location, you may leave this blank.

Number of years spent

Ages spent City, State, Country

1.

2.

3.

4.

Please list any foreign languages you have studied, the duration for which they were studied, and rate your abilities in each language. If you need more space, please continue on the back of this page.

Language Ages, years, and place studied

Abilities

1. Listening: good fair poor Reading: good fair poor Speaking: good fair poor

2. Listening: good fair poor

343

Reading: good fair poor Speaking: good fair poor

3. Listening: good fair poor Reading: good fair poor Speaking: good fair poor

Please rate the frequency with which you complete the following activities in ENGLISH.

Activity Daily Weekly Monthly Every few months

(Almost) Never

Speak with friends

●

●

●

● ●

Speak with family

● ● ●

● ●

Watch television ● ● ●

● ●

Read books or magazines

● ● ●

● ●

Listen to the radio or music

● ● ●

● ●

Watch movies ● ● ●

● ●

At the workplace ● ● ●

● ●

At school/ in class

● ● ●

● ●

Please rate the frequency with which you complete the following activities in SPANISH.

Activity Daily Weekly Monthly Every few months

(Almost) Never

Speak with friends

●

●

●

● ●

Speak with family

● ● ●

● ●

Watch television ● ● ●

● ●

Read books or magazines

● ● ●

● ●

Listen to the radio or music

● ● ●

● ●

344

Watch movies ● ● ●

● ●

At the workplace ● ● ●

● ●

At school/ in class

● ● ●

● ●

Please respond to the following questions. If there is a question you do not wish to answer, you may leave this part blank.

Do you ever travel abroad to visit family or friends in a Spanish-speaking country? If so, which countries to you visit, and how often to you travel there?

__________________________________________________________________

__________________________________________________________________

How comfortable do you feel speaking English? Do you feel as though you have any difficulties with the language?

__________________________________________________________________

__________________________________________________________________

How comfortable do you feel speaking Spanish? Do you feel as though you have any difficulties with the language?

__________________________________________________________________

__________________________________________________________________

345

Please rate on a scale of 1-5 (one being STRONGLY DISAGREE, 5 being STRONGLY AGREE) your feelings about the following statements.

STRONGLY DISAGREE

1

AGREE

2

NO OPINION

3

DISAGREE

4

STRONGLY AGREE

5 I like speaking Spanish because it feels natural.

● ● ● ● ●

It is NOT difficult for me to UNDERSTAND Spanish.

● ● ● ● ●

I like speaking English because it feels natural.

● ● ● ● ●

It is NOT difficult for me to UNDERSTAND English.

● ● ● ● ●

I read a lot of books and magazines in Spanish.

● ● ● ● ●

I read a lot of books and magazines in English.

● ● ● ● ●

I understand Spanish grammar well.

● ● ● ● ●

I wish I spoke more Spanish at home and with friends.

● ● ● ● ●

I understand English grammar well.

● ● ● ● ●

I wish I spoke more English at home and with friends.

● ● ● ● ●

Speaking Spanish helps

● ● ● ● ●

346

me feel connected with my heritage. There are a lot of Spanish words that I don’t know.

● ● ● ● ●

There are a lot of English words that I don’t know.

● ● ● ● ●

I wish I understood more about Spanish grammar.

● ● ● ● ●

I wish I understood more about English grammar.

● ● ● ● ●

Speaking both English and Spanish is important to me.

● ● ● ● ●

Please indicate YES or NO for the following questions. If the answer is yes, please indicate the language spoken. If you do not wish to answer any of the following questions, you may circle NR for “No Response”.

I have family members who are non-native speakers of English. Yes No NR

I have classmates who are non-native speakers of English. Yes No NR

I have co-workers who are non-native speakers of English. Yes No NR

I have friends who are non-native speakers of English. Yes No NR

347

Appendix D: Language Proficiency Test (Spanish) From:

Geeslin, K. and Gudmestad, A. (forthcoming, 2010). An exploration of the range and frequency of occurrence of forms in potentially-variable structures in second language Spanish. Studies in Second Language Acquisition, 32, 3.

Read the story below about a Hispanic female college student and select the answer that best completes each sentence.

Creo que es muy interesante _____ de los hábitos alimenticios de la gente. Yo, por mi parte,

a. hablo b. hablar c. hablando

_____ vegetariana. Cuando voy a eventos sociales, como por ejemplo fiestas, bodas o bailes,

a. soy

b. estoy

c. tengo

espero que _____ comida vegetariana allí. Algunas personas dicen que _____ representa un

a. hay a. le

b. haya b. los

c. sea c. les

inconveniente proveer _____, pero yo creo que no _____ que ser así. De hecho, la comida

a. lo a. tiene b. la b. tenga

c. le c. tengo

vegetariana es muy fácil _____ preparar. Y cuando no _____ ofrece, puede ser _____ gran

a. en a. la a. un

348

b. a b. le b. una

c. de c. se c. el

problema. Yo recuerdo una vez que _____ a una fiesta de cumpleaños y _____ ser todo un

a. fui a. resultó

b. iba b. resultaba

c. voy c. resulté

desastre. La fiesta _____ en la casa de un amigo, y él había invitado a mucha gente. Me

a. estaba

b. era

c. fue

sorprendió porque _____ ser un estudiante de postgrado con poco dinero, _____ una gran

a. entre a. tuvo

b. por b. tenía

c. para c. tuviera

variedad de comida para los invitados. Yo creo que si me _____ tocado a mí dar la fiesta, no a. hubiera

b. habría c. había

_____ dado ni la mitad de lo que _____ allí. Pero pronto me _____ cuenta que él no había

a. hubiera a. era a. doy

b. habría b. había b. daba

c. había c. hubiera c. di

preparado nada vegetariano. Yo no pongo problemas por ese tipo de cosas, pero una amiga

349

_____ sí _____ hace. _____ a quejarse en frente de todo el mundo, mientras el anfitrión sólo

a. mía a. le a. Empezó

b. mi b. se b. Empezaba

c. de mí c. lo c. Empezado

sólo _____ la escena con _____ boca abierta. Yo le dije a mi amiga que _____ de causar tanto

a. miró a. su a. dejaba

b. miraba b. una b. deje

c. miraría c. la c. dejara

escándalo, pero no me puso atención. Por fin, el anfitrión dijo: “La próxima vez que tenga una

fiesta, _____ algo vegetariano.” Yo le dije después a mi amiga: “Mejor tarde que nunca, ¿no?”

a. prepararé

b. prepararía

c. preparara

350

Appendix E: Language attitude and cultural sensitivity questionnaire

From : Cushner, K. (1986). The Inventory of Cross-Cultural Sensitivity. Kent State University: School of Education.

-------------------

Circle the number that best corresponds to your level of agreement with each statement below.

1 = Strongly Disagree 7 = Strongly Agree

1. I speak only one language.

1 2 3 4 5 6 7

2. The way other people express themselves is very interesting to me.

1 2 3 4 5 6 7

3. I enjoy being with people from other cultures

1 2 3 4 5 6 7

4. Foreign influence in our country threatens our national identity.

1 2 3 4 5 6 7

5. Others’ feelings rarely influence decisions I make.

1 2 3 4 5 6 7

6. I cannot eat with chopsticks.

1 2 3 4 5 6 7

7. I avoid people who are different from me.

1 2 3 4 5 6 7

8. It is better that people from other cultures avoid one another.

1 2 3 4 5 6 7

9. Culturally mixed marriages are wrong.

1 2 3 4 5 6 7

351

10. I think people are basically alike.

1 2 3 4 5 6 7

11. I have never lived outside my own culture for any great length of time.

1 2 3 4 5 6 7

12. I have foreigners over to my home on a regular basis.

1 2 3 4 5 6 7

13. It makes me nervous to talk about people who are different than me.

1 2 3 4 5 6 7

14. I enjoy studying about people from other cultures.

1 2 3 4 5 6 7

15. People from other cultures do things differently because they do not know any other way.

1 2 3 4 5 6 7

16. There is usually more than one good way to get things done.

1 2 3 4 5 6 7

17. I listen to music from another culture on a regular basis.

1 2 3 4 5 6 7

18. I decorate my home or room with artifacts from other countries.

1 2 3 4 5 6 7

19. I feel uncomfortable when in a crowd of people.

1 2 3 4 5 6 7

20. The very existence of humanity depends upon our knowledge about other people.

1 2 3 4 5 6 7

21. Residential neighborhoods should be culturally separated.

352

1 2 3 4 5 6 7

22. I have many friends.

1 2 3 4 5 6 7

23. I dislike eating foods from other cultures.

1 2 3 4 5 6 7

24. I think about living within another culture in the future.

1 2 3 4 5 6 7

25. Moving into another culture would be easy.

1 2 3 4 5 6 7

26. I like to discuss issues with people from other cultures.

1 2 3 4 5 6 7

27. There should be tighter controls on the number of immigrants allowed into my country.

1 2 3 4 5 6 7

28. The more I know about people, the more I dislike them.

1 2 3 4 5 6 7

29. I read more national news than international news in the daily newspaper.

1 2 3 4 5 6 7

30. Crowds of foreigners frighten me.

1 2 3 4 5 6 7

31. When something newsworthy happens I seek out someone from that part of the world to discuss the issue with.

1 2 3 4 5 6 7

32. I eat ethnic foods at least twice a week.

1 2 3 4 5 6 7

353

Scoring the ICCS

The ICCS can be scored by subscales. Simply enter the number for each question you answered in the space provided under each subscale heading. Reverse the values for the items marked with an asterisk (*). For example, reverse scoring results in:

7=1, 6=2, 5=3, 4=4, 3=5, 2=6, 1=7

A total ICCS score is obtained by adding the various subscale scores together. Individuals can be ranked relative to others in a particular group. You can also identify relative strengths and weaknesses that may lead to more focused orientation and planning.

ICSS Scoring Guide

C Scale

Item Score

1*

6*

11*

12

17

18

23*

24

29*

32

TOTAL

B Scale

Item Score

2

7*

13*

19*

25*

30

TOTAL

I Scale

Item Score

3

8*

14

20

26

31

TOTAL

354

A Scale

Item Score

4*

9*

15*

21*

27*

TOTAL

E Scale

Item Score

5*

10

16

22

28*

TOTAL

TOTALS

C Scale

B Scale

I Scale

A Scale

E Scale

TOTAL ICCS SCORE

The five subscales and the range of scores include:

Subscale Range of Scores

Cultural Integration (C Scale) 10-70

Behavioral Scale (B Scale) 6-42

Intellectual Interaction (I Scale) 6-42

Attitude Toward Others (A Scale) 5-35

Empathy Scale (E Scale) 5-35

TOTAL SCORE RANGE 32-224

355

Appendix F: Participant demographic information

ID# Gen. Age Country born

Parents' country of

origin

Native language

Time spent studying Spanish

Age when started

studying Spanish

Current Spanish

class

Other languages

studied

Travel abroad Self ID terms

1F F 19 USA Mexico, D.F. Spanish 1 yr college 18 S114

(Biling Spanish II)

French

Every 3 years or so

visits Mexico

Latina, Hispanic, Mexicana,

Midwesterner, American

2F F NR USA Mexico English 1 yr college 20 S114

(Biling Spanish II)

French (fair), Mandarin (fair, poor,

poor), Japanese

(poor)

Mexico every

summer No response

3F F 19 USA Mexico English 1 yr

elementary, 1 yr college

8 S114

(Biling Spanish II)

French (poor, good,

fair)

Mexico, every

summer

Hispanic, Mexicana

4F F 18 USA Puerto Rico English and Spanish

2 yrs HS, 2 sem college 16

S114 (Biling

Spanish II) No

Puerto Rico,

every 3-4 years

Puertorriqueña

1M M 24 Mexico: moved to

US at 8 yrs Mexico Spanish 1 sem HS, 1

sem college 18 S114

(Biling Spanish II)

No No

Latino, Hispanic, Mexicano, American

2M M 19 USA

Mother - Valparaiso

Father -Zacatecas (Mexico)

Spanish 1 sem college 19 S114

(Biling Spanish II)

No Once a year to Mexico

Latino, Hispanic, Mexicano, American

356

ID# Gen. Age Current city

Country born

Parents' country of

origin

Native language

Time spent

studying Spanish

Age when started

studying Spanish

Current Spanish

class

Other languages

studied

Travel abroad

Self ID terms

5F F 20 Chicago, Il USA Mexico Spanish

4 yrs HS, 1 yr

college 14 S114 (Biling

Spanish II) French (poor)

Infrequent. Mexico

last year, first time in 10 yrs

Latina, Hispanic, Mexicana

6F F 22 Chicago, Il USA

Mother - DF

Father - Jalisco

(Meixco)

Spanish

preschool up

through 3rd grade; then high

school and college

5 (preschool aged)

S206 and S210,

finished 114

Italian (good)

Every few years

Latina, Hispanic, Chicana,

Mexicana, American


Mother - DF

Father - Michoacan

Spanish No response No response 212 and 303;

finished 114 No No

Latina, Hispanic, Chicana, Mexicana

3M M 19 Chicago, Il USA

Mother-Mexico Father-Spain

Spanish

5 yrs in high

school and college

14

202-grammar,

210 literature,

204-writing

French (good, fair,

good), English (good)

Mexico twice a year, Spain every

summer

Latino, Mexicano, American,

other: Spanish

8F F 21 Chicago, Il

Mexico: moved

to US at 2 yrs

Mexico, D.F. English

1 yr HS, 3 Sem

college 17

202--Grammar,

already took 113-114

Italian (good,

fair, fair)

Mexico, every two

years Latina

357

ID# Gen. Age Current city

Country born

Parents' country of origin

Native language

Time spent

studying Spanish

Age when started

studying Spanish

Current Spanish class

Other languag

es studied

Travel abroad

Self ID terms

9F F 22 Chicago, Il USA Mexico Spanish

4-8 yrs preschool; 3rd grade--bilingual education

4

380--Professional,

Latinos in Chicago, Ind

study

No Every year Mexico Mexicana

10F F 21 Chicago, Il

Mexico, DF.

Moved to US at 6

yrs

Mexico Spanish

bilingual preschool; then high

school and college

overhearing at birth; study at 4 yrs and 17

yrs

202-grammar, 210

literature, 204-writing,

206-linguistics

English (good)

Spain once, visiting

friends and family in

Mexico and Venezuela

soon

Mexicana

11F F 19 Chicago, Il

USA--Chicago

Mexico, D.F. Spanish

overheard Spanish

from birth;

English in school;

Spanish in college

overhearing at birth; study 18

yrs

204 English, Spanish

(fair)

every few years

No response


Mother-Chicago Father-

Guanajuato

English

17-19 yrs of age in

high school and

college

birth-grew up

overhearing, study at

17

204--writing, 210 text analysis

NO

Guanajuato, about every 3 years

Latina, Mexicana Hispanic

F13 F 19 Chicago, Il USA Jalisco,

Mexico Spanish High

school and college

15

202--Grammar,

already took 113-114

NO every few years to Jalisco,

Latina, Hispanic, Chicana,

Mexicana, American

358

Appendix G: Total vowels analyzed in the analyses (N)

1) Total number of vowel tokens collapsed across all speakers, stress types, syllable types, and tasks.

vowel N

a 937 e 796 i 604 o 555 u 450 Total 3342

2) Total number of tonic and atonic vowels, organized by stress.

stress vowel N

A a 304

e 370

i 176

o 231

u 168

Total 1249

T a 633

e 426

i 428

o 324

u 282

Total 2093

Total a 937

e 796

i 604

o 555

u 450

Total 3342

359

3) Total number of vowels produced in closed and open syllables, organized by syllable type.

syll vowel N

C a 295

e 243

i 149

o 201

u 121

Total 1009

O a 642

e 553

i 455

o 354

u 329

Total 2333

Total a 937

e 796

i 604

o 555

u 450

Total 3342

4) Total number of vowels produced in the three speech tasks, organized by vowel.

vowel task N

a 1 378

2 332

3 227

Total 937

e 1 294

2 324

3 178

Total 796

i 1 171

2 214

360

3 219

Total 604

o 1 180

2 174

3 201

Total 555

u 1 88

2 203

3 159

Total 450

Total 1 1111

2 1247

3 984

Total 3342

5) Total number of atonic and tonic vowels in each of the three speech tasks.

task stress vowel N

1 A a 120

e 188

i 63

o 90

u 35

Total 496

T a 258

e 106

i 108

o 90

u 53

Total 615

Total a 378

e 294

i 171

o 180

u 88

361

Total 1111

2 A a 147

e 156

i 83

o 84

u 86

Total 556

T a 185

e 168

i 131

o 90

u 117

Total 691

Total a 332

e 324

i 214

o 174

u 203

Total 1247

3 A a 37

e 26

i 30

o 57

u 47

Total 197

T a 190

e 152

i 189

o 144

u 112

Total 787

Total a 227

e 178

i 219

o 201

u 159

Total 984

362

Total A a 304

e 370

i 176

o 231

u 168

Total 1249

T a 633

e 426

i 428

o 324

u 282

Total 2093

Total a 937

e 796

i 604

o 555

u 450

Total 3342

6) Total number of vowels produced in open and closed syllables in each of the three speech tasks.

task syll vowel N

1 C a 108

e 82

i 36

o 83

u 11

Total 320

O a 270

e 212

i 135

o 97

u 77

Total 791

363

Total a 378

e 294

i 171

o 180

u 88

Total 1111

2 C a 142

e 133

i 74

o 58

u 61

Total 468

O a 190

e 191

i 140

o 116

u 142

Total 779

Total a 332

e 324

i 214

o 174

u 203

Total 1247

3 C a 45

e 28

i 39

o 60

u 49

Total 221

O a 182

e 150

i 180

o 141

u 110

Total 763

Total a 227

364

e 178

i 219

o 201

u 159

Total 984

Total C a 295

e 243

i 149

o 201

u 121

Total 1009

O a 642

e 553

i 455

o 354

u 329

Total 2333

Total a 937

e 796

i 604

o 555

u 450

Total 3342

7) Total number of vowels organized by course level.

CLevel vowel N

1 a 379

e 316

i 236

o 224

u 181

Total 1336

2 a 558

e 480

i 368

365

o 331

u 269

Total 2006

Total a 937

e 796

i 604

o 555

u 450

Total 3342

8) Total number of vowels organized by travel group.

Travel vowel N

0 a 274

e 229

i 166

o 164

u 121

Total 954

1 a 325

e 283

i 207

o 184

u 159

Total 1158

2 a 338

e 284

i 231

o 207

u 170

Total 1230

Total a 937

e 796

i 604

o 555

u 450

366

Travel vowel N

0 a 274

e 229

i 166

o 164

u 121

Total 954

1 a 325

e 283

i 207

o 184

u 159

Total 1158

2 a 338

e 284

i 231

o 207

u 170

Total 1230

Total a 937

e 796

i 604

o 555

u 450

Total 3342

367

Appendix H: Results of statistical analyses conducted with F1

1) Table of Type II Tests of Fixed Effects indicates which main effects and interactions were significant.

Type III Tests of Fixed Effectsa

Source Numerator df Denominator df F Sig.

Intercept 1 7.989 19.436 .002

vowel 4 3254.080 16.560 .000

task 2 3254.389 25.244 .000

stress 1 3254.209 204.223 .000

syll 1 3254.123 1.986 .159

Gender 1 8.004 7.580 .025

Group 1 8.010 .829 .389

Travel 2 7.998 2.682 .128

Grammar 1 8.010 .007 .934

ICCSTotal 1 7.991 1.502 .255

Spanuse 1 8.002 .451 .521

vowel * task 8 3254.141 25.853 .000

vowel * stress 4 3254.160 62.184 .000

vowel * syll 4 3254.093 .853 .491

vowel * task * stress 10 3254.174 2.600 .004

vowel * task * syll 10 3254.103 6.704 .000

vowel * Gender 4 3254.100 68.444 .000

vowel * Group 4 3254.072 2.423 .046

vowel * Travel 8 3254.078 19.723 .000

vowel * Grammar 4 3254.031 10.281 .000

vowel * ICCSTotal 4 3254.079 2.865 .022

vowel * Spanuse 4 3254.081 5.704 .000

a. Dependent Variable: F1.

368

2) Pairwise comparisons of the main effect of vowel, which indicates which vowels differ statistically from each other.

Pairwise Comparisonsb

(I) vowel (J) vowel

Mean

Difference (I-

J) Std. Error df Sig.a

95% Confidence Interval for

Differencea

Lower Bound

Upper

Bound

a e 188.671* 4.697 3254.027 .000 175.477 201.865

i 319.458* 5.014 3254.126 .000 305.373 333.543

o 150.365* 4.831 3254.229 .000 136.795 163.936

u 267.164* 5.747 3254.121 .000 251.020 283.308

e a -188.671* 4.697 3254.027 .000 -201.865 -175.477

i 130.787* 5.228 3254.066 .000 116.102 145.472

o -38.305* 5.054 3254.217 .000 -52.502 -24.109

u 78.493* 5.936 3254.111 .000 61.820 95.167

i a -319.458* 5.014 3254.126 .000 -333.543 -305.373

e -130.787* 5.228 3254.066 .000 -145.472 -116.102

o -169.093* 5.351 3254.314 .000 -184.123 -154.063

u -52.294* 6.186 3254.063 .000 -69.671 -34.917

o a -150.365* 4.831 3254.229 .000 -163.936 -136.795

e 38.305* 5.054 3254.217 .000 24.109 52.502

i 169.093* 5.351 3254.314 .000 154.063 184.123

u 116.799* 6.043 3254.255 .000 99.824 133.774

u a -267.164* 5.747 3254.121 .000 -283.308 -251.020

e -78.493* 5.936 3254.111 .000 -95.167 -61.820

i 52.294* 6.186 3254.063 .000 34.917 69.671

o -116.799* 6.043 3254.255 .000 -133.774 -99.824

Based on estimated marginal means

*. The mean difference is significant at the .05 level.

a. Adjustment for multiple comparisons: Bonferroni.

b. Dependent Variable: F1.

369

3) Pairwise comparisons of the main effect of task (speech style). “1” represents the

NRT, “2” the PIT, and “3” the CPT.


(I) task (J) task

Mean

Difference (I-J) Std. Error df Sig.a


Differencea

Lower Bound Upper Bound

1 2 20.189* 3.078 3254.616 .000 12.818 27.561

3 21.605* 3.557 3254.445 .000 13.086 30.123

2 1 -20.189* 3.078 3254.616 .000 -27.561 -12.818

3 1.416 3.034 3254.126 1.000 -5.852 8.683

3 1 -21.605* 3.557 3254.445 .000 -30.123 -13.086

2 -1.416 3.034 3254.126 1.000 -8.683 5.852





4) Pairwise comparisons of the main effect of stress. “T” indicates tonic, and “A” indicates atonic.


(I)

stress

(J)

stress

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


A T -34.322* 2.402 3254.209 .000 -39.031 -29.613

T A 34.322* 2.402 3254.209 .000 29.613 39.031





370

5) Pairwise comparisons of the significant vowel by task interaction.


vowel (I) task (J) task

Mean

Difference (I-



Differencea


a 1 2 -15.891* 4.582 3254.095 .002 -26.867 -4.916

3 -58.223* 6.287 3254.036 .000 -73.282 -43.163

2 1 15.891* 4.582 3254.095 .002 4.916 26.867

3 -42.331* 6.175 3254.040 .000 -57.123 -27.540

3 1 58.223* 6.287 3254.036 .000 43.163 73.282

2 42.331* 6.175 3254.040 .000 27.540 57.123

e 1 2 37.513* 4.953 3254.414 .000 25.648 49.377

3 46.078* 7.534 3254.212 .000 28.032 64.124

2 1 -37.513* 4.953 3254.414 .000 -49.377 -25.648

3 8.565 7.243 3254.046 .711 -8.783 25.914

3 1 -46.078* 7.534 3254.212 .000 -64.124 -28.032

2 -8.565 7.243 3254.046 .711 -25.914 8.783

i 1 2 28.944* 6.829 3254.216 .000 12.587 45.301

3 54.020* 8.076 3254.186 .000 34.675 73.364

2 1 -28.944* 6.829 3254.216 .000 -45.301 -12.587

3 25.075* 7.221 3254.053 .002 7.780 42.370

3 1 -54.020* 8.076 3254.186 .000 -73.364 -34.675

2 -25.075* 7.221 3254.053 .002 -42.370 -7.780

o 1 2 21.442* 6.221 3254.378 .002 6.541 36.343

3 16.782* 6.203 3254.380 .021 1.923 31.640

2 1 -21.442* 6.221 3254.378 .002 -36.343 -6.541

3 -4.660 6.336 3254.062 1.000 -19.836 10.516

3 1 -16.782* 6.203 3254.380 .021 -31.640 -1.923

2 4.660 6.336 3254.062 1.000 -10.516 19.836

u 1 2 28.938* 10.260 3254.406 .014 4.364 53.513

3 49.368* 10.752 3254.358 .000 23.615 75.121

2 1 -28.938* 10.260 3254.406 .014 -53.513 -4.364

3 20.429* 6.860 3254.038 .009 3.998 36.861

3 1 -49.368* 10.752 3254.358 .000 -75.121 -23.615

2 -20.429* 6.860 3254.038 .009 -36.861 -3.998 Dependent variable: F1. Based on estimated marginal means *. The mean difference is significant at the.05 level. a. Adjustment for multiple comparisons: Bonferroni.

371

6) Pairwise comparisons of the vowel by stress interaction.


vowel (I) stress (J) stress

Mean

Difference (I-

J)

Std.

Error df Sig.a

95% Confidence

Interval for Differencea

Lower

Bound

Upper

Bound

a A T -79.971* 4.627 3254.093 .000 -89.042 -70.899

T A 79.971* 4.627 3254.093 .000 70.899 89.042

e A T -67.921* 5.294 3254.187 .000 -78.300 -57.542

T A 67.921* 5.294 3254.187 .000 57.542 78.300

i A T 15.929* 5.524 3254.153 .004 5.098 26.760

T A -15.929* 5.524 3254.153 .004 -26.760 -5.098

o A T -39.189* 5.341 3254.263 .000 -49.660 -28.717

T A 39.189* 5.341 3254.263 .000 28.717 49.660

u A T -.459 5.973 3254.161 .939 -12.170 11.252

T A .459 5.973 3254.161 .939 -11.252 12.170 Based on estimated marginal means




7) Pairwise comparisons of the significant interaction between vowel, task, and stress. The 1,2,3 stand for the NRT, PIT, and CPT respectively. “T” denotes tonic, and “A” denotes atonic.


vowel task

(I)

stress

(J)

stress

Mean

Difference (I-

J)

Std.

Error df Sig.a

95% Confidence


Lower

Bound

Upper

Bound

a 1 A T -70.477* 6.246 3254.232 .000 -82.723 -58.232

T A 70.477* 6.246 3254.232 .000 58.232 82.723

2 A T -72.141* 6.203 3254.061 .000 -84.304 -59.978

T A 72.141* 6.203 3254.061 .000 59.978 84.304

372

3 A T -97.295* 10.743 3254.107 .000 -118.359 -76.230

T A 97.295* 10.743 3254.107 .000 76.230 118.359

e 1 A T -57.375* 6.828 3254.171 .000 -70.764 -43.986

T A 57.375* 6.828 3254.171 .000 43.986 70.764

2 A T -60.316* 6.694 3254.066 .000 -73.441 -47.190

T A 60.316* 6.694 3254.066 .000 47.190 73.441

3 A T -86.073* 12.670 3254.087 .000 -110.916 -61.231

T A 86.073* 12.670 3254.087 .000 61.231 110.916

i 1 A T 36.962* 9.051 3254.612 .000 19.217 54.708

T A -36.962* 9.051 3254.612 .000 -54.708 -19.217

2 A T -8.259 7.922 3254.066 .297 -23.791 7.274

T A 8.259 7.922 3254.066 .297 -7.274 23.791

3 A T 19.082 11.412 3254.006 .095 -3.294 41.459

T A -19.082 11.412 3254.006 .095 -41.459 3.294

o 1 A T -42.699* 8.493 3254.747 .000 -59.351 -26.048

T A 42.699* 8.493 3254.747 .000 26.048 59.351

2 A T -41.737* 8.581 3254.098 .000 -58.563 -24.912

T A 41.737* 8.581 3254.098 .000 24.912 58.563

3 A T -33.130* 10.537 3254.013 .002 -53.789 -12.471

T A 33.130* 10.537 3254.013 .002 12.471 53.789

u 1 A T 7.773 12.625 3254.294 .538 -16.982 32.527

T A -7.773 12.625 3254.294 .538 -32.527 16.982

2 A T 2.297 8.027 3254.024 .775 -13.440 18.035

T A -2.297 8.027 3254.024 .775 -18.035 13.440

3 A T -11.448 9.875 3254.039 .246 -30.811 7.915

T A 11.448 9.875 3254.039 .246 -7.915 30.811 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent Variable: F1

373

8) Interaction between vowel, task, and syllable type. The “C” stands for closed (syllable) and the “O” for open (syllable). The numbers 1, 2, 3 stand for the NRT, PIT, and CPT respectively.


vowel task

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a 1 C O -25.083* 6.436 3254.215 .000 -37.703 -12.463

O C 25.083* 6.436 3254.215 .000 12.463 37.703

2 C O 15.028* 6.221 3254.084 .016 2.830 27.225

O C -15.028* 6.221 3254.084 .016 -27.225 -2.830

3 C O 27.709* 9.943 3254.011 .005 8.214 47.204

O C -27.709* 9.943 3254.011 .005 -47.204 -8.214

e 1 C O 10.972 7.308 3254.242 .133 -3.357 25.301

O C -10.972 7.308 3254.242 .133 -25.301 3.357

2 C O -11.928 6.798 3254.040 .079 -25.256 1.400

O C 11.928 6.798 3254.040 .079 -1.400 25.256

3 C O -9.693 12.286 3254.046 .430 -33.782 14.396

O C 9.693 12.286 3254.046 .430 -14.396 33.782

i 1 C O 9.069 10.636 3254.103 .394 -11.785 29.923

O C -9.069 10.636 3254.103 .394 -29.923 11.785

2 C O 2.229 8.133 3254.080 .784 -13.716 18.175

O C -2.229 8.133 3254.080 .784 -18.175 13.716

3 C O -2.056 10.270 3254.029 .841 -22.192 18.079

O C 2.056 10.270 3254.029 .841 -18.079 22.192

o 1 C O 37.923* 8.490 3254.179 .000 21.276 54.571

O C -37.923* 8.490 3254.179 .000 -54.571 -21.276

2 C O -5.723 9.030 3254.021 .526 -23.429 11.982

O C 5.723 9.030 3254.021 .526 -11.982 23.429

3 C O -29.074* 10.376 3254.029 .005 -49.419 -8.729

O C 29.074* 10.376 3254.029 .005 8.729 49.419

u 1 C O 19.815 18.287 3254.362 .279 -16.040 55.670

O C -19.815 18.287 3254.362 .279 -55.670 16.040

2 C O 21.376* 8.660 3254.014 .014 4.396 38.355

O C -21.376* 8.660 3254.014 .014 -38.355 -4.396

3 C O -6.400 9.758 3254.064 .512 -25.532 12.732

374

O C 6.400 9.758 3254.064 .512 -12.732 25.532 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent variable: F1 9) Pairwise comparisons of the significant interaction between vowel and course level. In the table, the variable is labeled as “group.” The “1” represents the intermediate-level HS, and “2” denotes the advanced-level group.


vowel (I) Group

(J)

Group

Mean

Difference (I-

J)

Std.

Error df Sig.a

95% Confidence Interval

for Differencea

Lower Bound

Upper

Bound

a 1 2 -30.800 22.987 8.557 .215 -83.214 21.614

2 1 30.800 22.987 8.557 .215 -21.614 83.214

e 1 2 -23.556 23.103 8.731 .335 -76.066 28.954

2 1 23.556 23.103 8.731 .335 -28.954 76.066

i 1 2 -21.261 23.325 9.071 .386 -73.963 31.441

2 1 21.261 23.325 9.071 .386 -31.441 73.963

o 1 2 -5.012 23.414 9.210 .835 -57.794 47.770

2 1 5.012 23.414 9.210 .835 -47.770 57.794

u 1 2 -22.283 23.536 9.404 .367 -75.179 30.613

2 1 22.283 23.536 9.404 .367 -30.613 75.179





375

10) Parameter estimates for the significant vowel by grammar score interaction. The [vowel=u]*grammar category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval

Parameter Estimate Std.

Error df t Sig. Lower Bound

Upper Bound

Grammar -1.232 4.040 9.429 -0.280 0.785 -11.121 8.656

[vowel=a]*grammar -3.203 1.632 3254.033 -1.963 0.050 -6.402 -0.004

[vowel=e]*grammar 2.018 1.676 3254.036 1.204 0.229 -1.267 5.304

[vowel=i]*grammar -0.826 1.779 3254.032 -0.464 0.643 -4.315 2.664

[vowel=o]*grammar 6.380 1.834 3254.004 3.478 0.001 2.782 9.976

[vowel=u]*grammar 0 0 . . . . . 11) Parameter estimates for the significant vowel by cultural sensitivity interaction (ICCS total). The [vowel=u]*ICCS total category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval



Upper Bound

ICCS total -0.761 0.765 9.331 -0.995 0.345 -2.482 0.960

[vowel=a]*ICCS total -0.101 0.275 3254.031 -0.367 0.714 -0.641 0.439

[vowel=e]*ICCS total -0.556 0.285 3254.063 -1.953 0.051 -1.115 0.002

[vowel=i]*ICCS total 0.275 0.298 3254.020 0.921 0.357 -0.310 0.861

[vowel=o]*ICCS total -0.322 0.303 3254.152 -1.062 0.288 -0.918 0.272

[vowel=u]*ICCS total 0 0 . . . . . 12) Parameter estimates for the significant vowel by Spanish use interaction. The [vowel=u]*Spanish use category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval



Upper Bound

Spanish use 1.980 3.118 9.443 0.635 0.540 -5.023 8.982

[vowel=a]*Spanish use 2.395 1.142 3254.090 2.098 0.036 0.156 4.635

[vowel=e]*Spanish use 0.521 1.178 3254.201 0.442 0.658 -1.789 2.832

[vowel=i]*Spanish use -0.050 1.245 3254.117 -0.041 0.968 -2.492 2.391

[vowel=o]*Spanish use -2.722 1.298 3254.090 -2.096 0.036 -5.269 -0.175

[vowel=u]*Spanish use 0 0 . . . . .

376

Appendix I: Results of statistical analyses conducted with F2




Intercept 1 8.013 34.084 .000

vowel 4 3254.136 19.229 .000

task 2 3254.550 6.374 .002

stress 1 3254.309 2.416 .120

syll 1 3254.193 2.323 .128

Gender 1 8.033 3.921 .083

Group 1 8.041 .463 .515

Travel 2 8.025 .162 .853

Grammar 1 8.042 .151 .708

ICCSTotal 1 8.016 1.697 .229

Spanuse 1 8.030 .348 .571

vowel * task 8 3254.218 23.711 .000

vowel * stress 4 3254.244 46.598 .000

vowel * syll 4 3254.153 5.837 .000


vowel * task * syll 10 3254.167 4.444 .000

vowel * Gender 4 3254.162 75.974 .000

vowel * Group 4 3254.125 18.114 .000

vowel * Travel 8 3254.133 23.615 .000

vowel * Grammar 4 3254.070 23.228 .000

vowel * ICCSTotal 4 3254.134 .850 .493

vowel * Spanuse 4 3254.137 30.542 .000


377



(I) vowel (J) vowel

Mean

Difference (I-



Differencea

Lower Bound

Upper

Bound

a e -363.502* 14.347 3254.064 .000 -403.801 -323.202

i -677.573* 15.315 3254.197 .000 -720.592 -634.555

o 363.528* 14.756 3254.336 .000 322.080 404.976

u 320.720* 17.554 3254.191 .000 271.412 370.029

e a 363.502* 14.347 3254.064 .000 323.202 403.801

i -314.071* 15.967 3254.117 .000 -358.923 -269.220

o 727.030* 15.437 3254.321 .000 683.669 770.391

u 684.222* 18.130 3254.177 .000 633.295 735.149

i a 677.573* 15.315 3254.197 .000 634.555 720.592

e 314.071* 15.967 3254.117 .000 269.220 358.923

o 1041.101* 16.343 3254.450 .000 995.195 1087.008

u 998.294* 18.895 3254.113 .000 945.219 1051.369

o a -363.528* 14.756 3254.336 .000 -404.976 -322.080

e -727.030* 15.437 3254.321 .000 -770.391 -683.669

i -1041.101* 16.343 3254.450 .000 -1087.008 -995.195

u -42.808 18.458 3254.371 .204 -94.655 9.039

u a -320.720* 17.554 3254.191 .000 -370.029 -271.412

e -684.222* 18.130 3254.177 .000 -735.149 -633.295

i -998.294* 18.895 3254.113 .000 -1051.369 -945.219

o 42.808 18.458 3254.371 .204 -9.039 94.655 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent Variable: F2.

378




(I) task (J) task

Mean



Differencea

Lower Bound

Upper

Bound

1 2 -15.725 9.400 3254.854 .283 -38.239 6.790

3 -38.385* 10.863 3254.626 .001 -64.403 -12.367

2 1 15.725 9.400 3254.854 .283 -6.790 38.239

3 -22.661* 9.267 3254.198 .044 -44.857 -.464

3 1 38.385* 10.863 3254.626 .001 12.367 64.403

2 22.661* 9.267 3254.198 .044 .464 44.857





4) Pairwise comparisons of the vowel by task interaction. The numbers 1, 2, and 3 represent the NRT, PIT, and CPT respectively.


vowel

(I)

task

(J)

task

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a 1 2 23.222 13.996 3254.155 .291 -10.300 56.745

3 57.040* 19.203 3254.077 .009 11.044 103.035

2 1 -23.222 13.996 3254.155 .291 -56.745 10.300

3 33.817 18.861 3254.081 .219 -11.360 78.995

3 1 -57.040* 19.203 3254.077 .009 -103.035 -11.044

2 -33.817 18.861 3254.081 .219 -78.995 11.360

e 1 2 -27.066 15.129 3254.585 .221 -63.303 9.172

3 -242.642* 23.012 3254.314 .000 -297.760 -187.523

2 1 27.066 15.129 3254.585 .221 -9.172 63.303

3 -215.576* 22.122 3254.090 .000 -268.564 -162.588

3 1 242.642* 23.012 3254.314 .000 187.523 297.760

379

2 215.576* 22.122 3254.090 .000 162.588 268.564

i 1 2 -64.229* 20.858 3254.318 .006 -114.189 -14.270

3 -131.978* 24.667 3254.278 .000 -191.061 -72.895

2 1 64.229* 20.858 3254.318 .006 14.270 114.189

3 -67.749* 22.054 3254.099 .006 -120.573 -14.924

3 1 131.978* 24.667 3254.278 .000 72.895 191.061

2 67.749* 22.054 3254.099 .006 14.924 120.573

o 1 2 31.972 19.001 3254.536 .278 -13.539 77.484

3 84.847* 18.947 3254.538 .000 39.464 130.229

2 1 -31.972 19.001 3254.536 .278 -77.484 13.539

3 52.874* 19.352 3254.111 .019 6.522 99.227

3 1 -84.847* 18.947 3254.538 .000 -130.229 -39.464

2 -52.874* 19.352 3254.111 .019 -99.227 -6.522

u 1 2 -42.522 31.336 3254.574 .525 -117.580 32.535

3 40.807 32.839 3254.509 .642 -37.850 119.465

2 1 42.522 31.336 3254.574 .525 -32.535 117.580

3 83.330* 20.953 3254.079 .000 33.144 133.516

3 1 -40.807 32.839 3254.509 .642 -119.465 37.850

2 -83.330* 20.953 3254.079 .000 -133.516 -33.144 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent Variable: F2.

5) Pairwise comparisons of the vowel by stress interaction. “T” indicates tonic, and “A” indicates atonic.


vowel

(I)

stress

(J)

stress

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a A T -9.928 14.131 3254.153 .482 -37.635 17.779

T A 9.928 14.131 3254.153 .482 -17.779 37.635

e A T -41.008* 16.168 3254.279 .011 -72.709 -9.307

T A 41.008* 16.168 3254.279 .011 9.307 72.709

i A T -189.418* 16.872 3254.233 .000 -222.499 -156.337

T A 189.418* 16.872 3254.233 .000 156.337 222.499

380

o A T 89.938* 16.312 3254.382 .000 57.955 121.922

T A -89.938* 16.312 3254.382 .000 -121.922 -57.955

u A T 93.409* 18.243 3254.245 .000 57.640 129.179

T A -93.409* 18.243 3254.245 .000 -129.179 -57.640




b. Dependent Variable: F2. 6) Pairwise comparisons of the vowel by syllable type interaction. “C” indicates a closed syllable and “O” an open syllable.


vowel

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a C O 40.635* 13.624 3254.150 .003 13.923 67.346

O C -40.635* 13.624 3254.150 .003 -67.346 -13.923

e C O -29.176 16.115 3254.172 .070 -60.773 2.421

O C 29.176 16.115 3254.172 .070 -2.421 60.773

i C O -35.963* 17.131 3254.082 .036 -69.551 -2.374

O C 35.963* 17.131 3254.082 .036 2.374 69.551

o C O 30.413 16.471 3254.175 .065 -1.881 62.707

O C -30.413 16.471 3254.175 .065 -62.707 1.881

u C O 53.726* 22.820 3254.250 .019 8.984 98.469

O C -53.726* 22.820 3254.250 .019 -98.469 -8.984





381



vowel task

(I)

stress

(J)

stress

Mean

Difference (I-J)

Std.

Error df

Sig.a


for Differencea

Lower Bound

Upper

Bound

a 1 A T 62.358* 19.076 3254.341 .001 24.956 99.759

T A -62.358* 19.076 3254.341 .001 -99.759 -24.956

2 A T -16.729 18.947 3254.110 .377 -53.879 20.421

T A 16.729 18.947 3254.110 .377 -20.421 53.879

3 A T -75.413* 32.814 3254.173 .022 -139.751 -11.075

T A 75.413* 32.814 3254.173 .022 11.075 139.751

e 1 A T -192.323* 20.856 3254.258 .000 -233.215 -151.430

T A 192.323* 20.856 3254.258 .000 151.430 233.215

2 A T -128.771* 20.446 3254.116 .000 -168.860 -88.683

T A 128.771* 20.446 3254.116 .000 88.683 168.860

3 A T 198.070* 38.699 3254.145 .000 122.193 273.947

T A -198.070* 38.699 3254.145 .000 -273.947 -122.193

i 1 A T -241.357* 27.643 3254.849 .000 -295.557 -187.158

T A 241.357* 27.643 3254.849 .000 187.158 295.557

2 A T -76.574* 24.196 3254.117 .002 -124.014 -29.134

T A 76.574* 24.196 3254.117 .002 29.134 124.014

3 A T -250.323* 34.857 3254.036 .000 -318.667 -181.978

T A 250.323* 34.857 3254.036 .000 181.978 318.667

o 1 A T 53.990* 25.939 3255.029 .037 3.132 104.848

T A -53.990* 25.939 3255.029 .037 -104.848 -3.132

2 A T 97.882* 26.210 3254.160 .000 46.491 149.272

T A -97.882* 26.210 3254.160 .000 -149.272 -46.491

3 A T 117.943* 32.182 3254.046 .000 54.845 181.042

T A -117.943* 32.182 3254.046 .000 -181.042 -54.845

u 1 A T 14.213 38.562 3254.424 .712 -61.395 89.821

T A -14.213 38.562 3254.424 .712 -89.821 61.395

2 A T 172.963* 24.516 3254.060 .000 124.895 221.031

T A -172.963* 24.516 3254.060 .000 -221.031 -124.895

3 A T 93.052* 30.163 3254.080 .002 33.913 152.192

382

T A -93.052* 30.163 3254.080 .002 -152.192 -33.913 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent Variable: F2.



vowel task

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a 1 C O 53.552* 19.659 3254.318 .006 15.007 92.097

O C -53.552* 19.659 3254.318 .006 -92.097 -15.007

2 C O -28.963 19.001 3254.141 .128 -66.218 8.293

O C 28.963 19.001 3254.141 .128 -8.293 66.218

3 C O 97.315* 30.369 3254.043 .001 37.770 156.859

O C -97.315* 30.369 3254.043 .001 -156.859 -37.770

e 1 C O 28.515 22.322 3254.354 .202 -15.250 72.281

O C -28.515 22.322 3254.354 .202 -72.281 15.250

2 C O -68.950* 20.762 3254.082 .001 -109.658 -28.242

O C 68.950* 20.762 3254.082 .001 28.242 109.658

3 C O -47.094 37.526 3254.089 .210 -120.671 26.482

O C 47.094 37.526 3254.089 .210 -26.482 120.671

i 1 C O -42.813 32.486 3254.166 .188 -106.509 20.883

O C 42.813 32.486 3254.166 .188 -20.883 106.509

2 C O -9.683 24.840 3254.136 .697 -58.386 39.020

O C 9.683 24.840 3254.136 .697 -39.020 58.386

3 C O -55.391 31.367 3254.067 .078 -116.892 6.110

O C 55.391 31.367 3254.067 .078 -6.110 116.892

o 1 C O 76.876* 25.932 3254.270 .003 26.030 127.722

O C -76.876* 25.932 3254.270 .003 -127.722 -26.030

2 C O 11.343 27.581 3254.055 .681 -42.735 65.421

O C -11.343 27.581 3254.055 .681 -65.421 42.735

3 C O 3.021 31.693 3254.067 .924 -59.120 65.161

383

O C -3.021 31.693 3254.067 .924 -65.161 59.120

u 1 C O 52.441 55.854 3254.514 .348 -57.071 161.953

O C -52.441 55.854 3254.514 .348 -161.953 57.071

2 C O -15.312 26.450 3254.047 .563 -67.172 36.549

O C 15.312 26.450 3254.047 .563 -36.549 67.172

3 C O 124.050* 29.804 3254.115 .000 65.614 182.486

O C -124.050* 29.804 3254.115 .000 -182.486 -65.614 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent Variable: F2.

9) Pairwise comparisons of the significant interaction between vowel and course level. In the table, the variable is labeled as “group.” The “1” represents the intermediate-level HS, and “2” denotes the advanced-level group.


vowel

(I)

Group

(J)

Group

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a 1 2 7.229 60.659 8.789 .908 -130.494 144.951

2 1 -7.229 60.659 8.789 .908 -144.951 130.494

e 1 2 -17.580 61.070 9.030 .780 -155.660 120.500

2 1 17.580 61.070 9.030 .780 -120.500 155.660

i 1 2 -46.640 61.851 9.501 .469 -185.441 92.160

2 1 46.640 61.851 9.501 .469 -92.160 185.441

o 1 2 102.117 62.162 9.694 .132 -36.985 241.219

2 1 -102.117 62.162 9.694 .132 -241.219 36.985

u 1 2 156.644* 62.592 9.964 .031 17.112 296.175

2 1 -156.644* 62.592 9.964 .031 -296.175 -17.112





384

10) Pairwise comparisons of the significant interaction between vowel and travel group. The “0” refers to the “no travel” group, the “1” refers to the infrequent (every 3-4 years) travelers, and the “2” represents the frequent travel group.


vowel

(I)

Travel

(J)

Travel

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea

Lower Bound

Upper

Bound

a 0 1 -29.237 61.563 8.733 1.000 -211.041 152.567

2 -59.719 81.050 8.688 1.000 -299.352 179.914

1 0 29.237 61.563 8.733 1.000 -152.567 211.041

2 -30.482 74.444 8.698 1.000 -250.528 189.563

2 0 59.719 81.050 8.688 1.000 -179.914 299.352

1 30.482 74.444 8.698 1.000 -189.563 250.528

e 0 1 79.258 61.916 8.935 .698 -102.652 261.169

2 -93.428 81.580 8.918 .846 -333.215 146.360

1 0 -79.258 61.916 8.935 .698 -261.169 102.652

2 -172.686 74.840 8.884 .140 -392.845 47.474

2 0 93.428 81.580 8.918 .846 -146.360 333.215

1 172.686 74.840 8.884 .140 -47.474 392.845

i 0 1 -57.013 62.756 9.430 1.000 -239.269 125.243

2 -244.676* 82.469 9.313 .046 -484.812 -4.540

1 0 57.013 62.756 9.430 1.000 -125.243 239.269

2 -187.663 75.660 9.280 .103 -408.135 32.810

2 0 244.676* 82.469 9.313 .046 4.540 484.812

1 187.663 75.660 9.280 .103 -32.810 408.135

o 0 1 13.243 62.893 9.512 1.000 -169.082 195.568

2 79.067 83.403 9.742 1.000 -161.548 319.681

1 0 -13.243 62.893 9.512 1.000 -195.568 169.082

2 65.823 76.562 9.730 1.000 -155.106 286.753

2 0 -79.067 83.403 9.742 1.000 -319.681 161.548

1 -65.823 76.562 9.730 1.000 -286.753 155.106

u 0 1 5.385 63.619 9.959 1.000 -177.353 188.123

2 128.580 83.733 9.897 .468 -112.229 369.389

1 0 -5.385 63.619 9.959 1.000 -188.123 177.353

2 123.195 76.989 9.949 .422 -97.989 344.379

2 0 -128.580 83.733 9.897 .468 -369.389 112.229

385

1 -123.195 76.989 9.949 .422 -344.379 97.989





11) Parameter estimates for the significant vowel by grammar score interaction. The [vowel=u]*grammar category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval

Parameter Estimat

e Std.


Upper Bound

Grammar 16.887 11.709 9.999 1.442 0.180 -9.203 42.976 [vowel=a]*grammar -9.687 4.983 3254.072 -1.944 0.052 -19.459 0.083 [vowel=e]*grammar -27.475 5.118 3254.076 -5.368 0.000 -37.511 17.438 [vowel=i]*grammar -33.217 5.436 3254.071 -6.110 0.000 -43.876 22.557 [vowel=o]*grammar 7.487 5.602 3254.032 1.336 0.181 -3.497 18.472 [vowel=u]*grammar 0 0 . . . . .

12) Parameter estimates for the significant vowel by Spanish use interaction. The [vowel=u]*Spanish use category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval



Upper Bound

Spanish use -23.047 8.294 10.020 -2.779 0.019 -41.522 -4.572 [vowel=a]*Spanish use 18.518 3.488 3254.149 5.309 0.000 11.678 25.358 [vowel=e]*Spanish use 24.915 3.600 3254.298 6.920 0.000 17.855 31.975 [vowel=i]*Spanish use 38.389 3.803 3254.185 0.109 0.000 30.931 45.847 [vowel=o]*Spanish use 10.264 3.967 3254.149 2.587 0.010 2.485 18.042 [vowel=u]*Spanish use 0 0 . . . . .

386

Appendix J: Results of statistical analyses conducted with Euclidean distance.




Intercept 1 7.973 2.249 .172

vowel 4 3254.148 5.066 .000

task 2 3254.730 89.789 .000

stress 1 3254.393 190.064 .000

syll 1 3254.228 35.296 .000

Gender 1 8.002 15.168 .005

Group 1 8.013 1.865 .209

Travel 2 7.991 2.758 .123

Grammar 1 8.014 3.029 .120

ICCSTotal 1 7.978 .026 .877

Spanuse 1 7.998 4.156 .076

vowel * task 8 3254.264 6.165 .000

vowel * stress 4 3254.300 25.014 .000

vowel * syll 4 3254.172 .815 .515


vowel * task * syll 10 3254.192 4.089 .000

vowel * Gender 4 3254.185 20.543 .000

vowel * Group 4 3254.133 8.988 .000

vowel * Travel 8 3254.143 10.775 .000

vowel * Grammar 4 3254.055 4.257 .002

vowel * ICCSTotal 4 3254.145 4.830 .001

vowel * Spanuse 4 3254.149 16.695 .000

a. Dependent Variable: EUC.

387



(I) vowel (J) vowel

Mean

Difference (I-



Differencea

Lower Bound

Upper

Bound

a e -37.226* 11.593 3254.046 .013 -69.791 -4.660

i -333.997* 12.376 3254.235 .000 -368.759 -299.234

o -250.056* 11.924 3254.431 .000 -283.549 -216.563

u -222.637* 14.185 3254.226 .000 -262.483 -182.792

e a 37.226* 11.593 3254.046 .013 4.660 69.791

i -296.771* 12.903 3254.121 .000 -333.015 -260.527

o -212.830* 12.474 3254.409 .000 -247.869 -177.792

u -185.412* 14.651 3254.206 .000 -226.565 -144.259

i a 333.997* 12.376 3254.235 .000 299.234 368.759

e 296.771* 12.903 3254.121 .000 260.527 333.015

o 83.941* 13.206 3254.591 .000 46.845 121.036

u 111.359* 15.269 3254.116 .000 68.470 154.248

o a 250.056* 11.924 3254.431 .000 216.563 283.549

e 212.830* 12.474 3254.409 .000 177.792 247.869

i -83.941* 13.206 3254.591 .000 -121.036 -46.845

u 27.419 14.915 3254.480 .661 -14.477 69.315

u a 222.637* 14.185 3254.226 .000 182.792 262.483

e 185.412* 14.651 3254.206 .000 144.259 226.565

i -111.359* 15.269 3254.116 .000 -154.248 -68.470

o -27.419 14.915 3254.480 .661 -69.315 14.477




b. Dependent Variable: EUC.

388




(I) task (J) task

Mean



for Differencea

Lower

Bound

Upper

Bound

1 2 -16.744 7.596 3255.153 .083 -34.937 1.449

3 -105.099* 8.778 3254.837 .000 -126.123 -84.074

2 1 16.744 7.596 3255.153 .083 -1.449 34.937

3 -88.355* 7.488 3254.236 .000 -106.291 -70.418

3 1 105.099* 8.778 3254.837 .000 84.074 126.123

2 88.355* 7.488 3254.236 .000 70.418 106.291







(I)

stress

(J)

stress

Mean Difference (I-

J)

Std.

Error df Sig.a


Differencea


A T -81.722* 5.928 3254.393 .000 -93.344 -70.099

T A 81.722* 5.928 3254.393 .000 70.099 93.344





389

5) Pairwise comparisons of the main effect of syllable type. “C” indicates a closed syllable, and “O” indicates an open syllable.


(I) syll (J) syll

Mean



for Differencea

Lower

Bound

Upper

Bound

C O -37.567* 6.323 3254.228 .000 -49.965 -25.169

O C 37.567* 6.323 3254.228 .000 25.169 49.965







vowel

(I)

task

(J)

task

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


a 1 2 -22.631 11.310 3254.175 .136 -49.720 4.458

3 -73.614* 15.518 3254.064 .000 -110.782 -36.446

2 1 22.631 11.310 3254.175 .136 -4.458 49.720

3 -50.983* 15.242 3254.070 .002 -87.490 -14.476

3 1 73.614* 15.518 3254.064 .000 36.446 110.782

2 50.983* 15.242 3254.070 .002 14.476 87.490

e 1 2 -2.436 12.225 3254.779 1.000 -31.719 26.846

3 -186.562* 18.595 3254.399 .000 -231.102 -142.022

2 1 2.436 12.225 3254.779 1.000 -26.846 31.719

3 -184.126* 17.877 3254.083 .000 -226.944 -141.307

3 1 186.562* 18.595 3254.399 .000 142.022 231.102

2 184.126* 17.877 3254.083 .000 141.307 226.944

i 1 2 -58.649* 16.855 3254.405 .002 -99.020 -18.277

3 -132.501* 19.933 3254.349 .000 -180.245 -84.757

2 1 58.649* 16.855 3254.405 .002 18.277 99.020

390

3 -73.852* 17.821 3254.096 .000 -116.539 -31.166

3 1 132.501* 19.933 3254.349 .000 84.757 180.245

2 73.852* 17.821 3254.096 .000 31.166 116.539

o 1 2 -27.112 15.354 3254.711 .233 -63.889 9.665

3 -84.733* 15.311 3254.715 .000 -121.405 -48.061

2 1 27.112 15.354 3254.711 .233 -9.665 63.889

3 -57.621* 15.638 3254.113 .001 -95.078 -20.165

3 1 84.733* 15.311 3254.715 .000 48.061 121.405

2 57.621* 15.638 3254.113 .001 20.165 95.078

u 1 2 27.108 25.322 3254.765 .853 -33.543 87.760

3 -48.084 26.537 3254.674 .210 -111.645 15.477

2 1 -27.108 25.322 3254.765 .853 -87.760 33.543

3 -75.192* 16.931 3254.067 .000 -115.746 -34.637

3 1 48.084 26.537 3254.674 .210 -15.477 111.645

2 75.192* 16.931 3254.067 .000 34.637 115.746 Based on estimated marginal means a. Adjustment for multiple comparisons: Bonferroni. *. The mean difference is significant at the .05 level. b. Dependent Variable: EUC 7) Pairwise comparisons of the vowel by stress interaction.

Pairwise Comparisons

vowel

(I)

stress

(J)

stress

Mean Difference

(I-J) Std. Error df Sig.a


for Differencea

Lower

Bound

Upper

Bound

a A T -50.087* 11.419 3254.172 .000 -72.476 -27.698

T A 50.087* 11.419 3254.172 .000 27.698 72.476

e A T .309 13.065 3254.350 .981 -25.307 25.926

T A -.309 13.065 3254.350 .981 -25.926 25.307

i A T -179.805* 13.634 3254.286 .000 -206.537 -153.073

T A 179.805* 13.634 3254.286 .000 153.073 206.537

o A T -98.057* 13.182 3254.496 .000 -123.902 -72.212

T A 98.057* 13.182 3254.496 .000 72.212 123.902

u A T -80.970* 14.742 3254.302 .000 -109.874 -52.065

T A 80.970* 14.742 3254.302 .000 52.065 109.874 Dependent variables: Euc. Based on estimated marginal means a. Adjustment for multiple comparisons: Bonferroni. *. The mean difference is significant at the .05 level.

391



vowel task

(I)

stress

(J)

stress

Mean

Difference (I-J)

Std.

Error df Sig.a


for Differencea

Lower

Bound

Upper

Bound

a 1 A T -87.647* 15.415 3254.437 .000 -117.870 -57.423

T A 87.647* 15.415 3254.437 .000 57.423 117.870

2 A T -50.999* 15.311 3254.111 .001 -81.019 -20.979

T A 50.999* 15.311 3254.111 .001 20.979 81.019

3 A T -11.616 26.516 3254.200 .661 -63.606 40.374

T A 11.616 26.516 3254.200 .661 -40.374 63.606

e 1 A T -108.864* 16.854 3254.321 .000 -141.908 -75.819

T A 108.864* 16.854 3254.321 .000 75.819 141.908

2 A T -95.096* 16.522 3254.120 .000 -127.491 -62.701

T A 95.096* 16.522 3254.120 .000 62.701 127.491

3 A T 204.888* 31.272 3254.161 .000 143.573 266.202

T A -204.888* 31.272 3254.161 .000 -266.202 -143.573

i 1 A T -218.370* 22.337 3255.147 .000 -262.167 -174.573

T A 218.370* 22.337 3255.147 .000 174.573 262.167

2 A T -72.890* 19.552 3254.121 .000 -111.225 -34.554

T A 72.890* 19.552 3254.121 .000 34.554 111.225

3 A T -248.156* 28.168 3254.006 .000 -303.384 -192.928

T A 248.156* 28.168 3254.006 .000 192.928 303.384

o 1 A T -66.230* 20.960 3255.396 .002 -107.326 -25.134

T A 66.230* 20.960 3255.396 .002 25.134 107.326

2 A T -99.129* 21.180 3254.183 .000 -140.657 -57.601

T A 99.129* 21.180 3254.183 .000 57.601 140.657

3 A T -128.811* 26.006 3254.020 .000 -179.801 -77.822

T A 128.811* 26.006 3254.020 .000 77.822 179.801

u 1 A T -6.301 31.161 3254.554 .840 -67.398 54.796

T A 6.301 31.161 3254.554 .840 -54.796 67.398

2 A T -150.489* 19.811 3254.040 .000 -189.331 -111.646

T A 150.489* 19.811 3254.040 .000 111.646 189.331

3 A T -86.120* 24.374 3254.068 .000 -133.909 -38.330

392

T A 86.120* 24.374 3254.068 .000 38.330 133.909 Based on estimated marginal means a. Adjustment for multiple comparisons: Bonferroni. *. The mean difference is significant at the .05 level. b. Dependent Variable: EUC



vowel task (I) syll (J) syll

Mean

Difference (I-

J)

Std.

Error df Sig.a


for Differencea

Lower

Bound

Upper

Bound

a 1 C O -47.365* 15.886 3254.405 .003 -78.513 -16.218

O C 47.365* 15.886 3254.405 .003 16.218 78.513

2 C O 19.395 15.355 3254.156 .207 -10.710 49.501

O C -19.395 15.355 3254.156 .207 -49.501 10.710

3 C O -42.001 24.541 3254.016 .087 -90.118 6.115

O C 42.001 24.541 3254.016 .087 -6.115 90.118

e 1 C O -7.277 18.038 3254.456 .687 -42.643 28.089

O C 7.277 18.038 3254.456 .687 -28.089 42.643

2 C O -41.643* 16.777 3254.071 .013 -74.539 -8.748

O C 41.643* 16.777 3254.071 .013 8.748 74.539

3 C O -54.032 30.324 3254.081 .075 -113.488 5.424

O C 54.032 30.324 3254.081 .075 -5.424 113.488

i 1 C O -62.912* 26.252 3254.191 .017 -114.384 -11.441

O C 62.912* 26.252 3254.191 .017 11.441 114.384

2 C O -5.194 20.072 3254.149 .796 -44.550 34.161

O C 5.194 20.072 3254.149 .796 -34.161 44.550

3 C O -53.062* 25.347 3254.050 .036 -102.760 -3.364

O C 53.062* 25.347 3254.050 .036 3.364 102.760

o 1 C O -69.197* 20.955 3254.337 .001 -110.284 -28.110

O C 69.197* 20.955 3254.337 .001 28.110 110.284

2 C O -9.843 22.288 3254.034 .659 -53.543 33.857

O C 9.843 22.288 3254.034 .659 -33.857 53.543

3 C O -10.078 25.611 3254.050 .694 -60.292 40.137

O C 10.078 25.611 3254.050 .694 -40.137 60.292

393

u 1 C O -81.829 45.134 3254.679 .070 -170.323 6.664

O C 81.829 45.134 3254.679 .070 -6.664 170.323

2 C O 19.987 21.374 3254.021 .350 -21.921 61.895

O C -19.987 21.374 3254.021 .350 -61.895 21.921

3 C O -118.448* 24.084 3254.118 .000 -165.669 -71.227

O C 118.448* 24.084 3254.118 .000 71.227 165.669 Based on estimated marginal means a. Adjustment for multiple comparisons: Bonferroni. *. The mean difference is significant at the .05 level. b. Dependent Variable: EUC 10) Pairwise comparisons of the significant interaction between vowel and course level. In the table, the variable is labeled as “group.” The “1” represents the intermediate-level HS, and “2” denotes the advanced-level group.


vowel (I) Group (J) Group

Mean

Difference

(I-J)

Std.

Error df Sig.a


for Differencea

Lower

Bound

Upper

Bound

a 1 2 9.590 41.619 9.076 .823 -84.438 103.618

2 1 -9.590 41.619 9.076 .823 -103.618 84.438

e 1 2 -47.844 42.009 9.422 .283 -142.231 46.544

2 1 47.844 42.009 9.422 .283 -46.544 142.231

i 1 2 -69.180 42.749 10.103 .136 -164.300 25.940

2 1 69.180 42.749 10.103 .136 -25.940 164.300

o 1 2 -60.035 43.043 10.383 .192 -155.463 35.393

2 1 60.035 43.043 10.383 .192 -35.393 155.463

u 1 2 -108.008* 43.448 10.779 .031 -203.874 -12.141

2 1 108.008* 43.448 10.779 .031 12.141 203.874 Based on estimated marginal means a. Adjustment for multiple comparisons: Bonferroni. *. The mean difference is significant at the .05 level. b. Dependent Variable: EUC

394

11) Pairwise comparisons of the significant interaction between vowel and travel frequency. The “0” denotes those who did not travel, the “1” those who traveled every 3-4 years, and the “2” represents the frequent travelers.


vowel (I) Travel (J) Travel

Mean Difference

(I-J)

Std.

Error df Sig.a


for Differencea

Lower

Bound

Upper

Bound

a 0 1 30.360 42.212 8.995 1.000 -93.478 154.197

2 -11.091 55.547 8.931 1.000 -174.303 152.122

1 0 -30.360 42.212 8.995 1.000 -154.197 93.478

2 -41.450 51.025 8.945 1.000 -191.325 108.424

2 0 11.091 55.547 8.931 1.000 -152.122 174.303

1 41.450 51.025 8.945 1.000 -108.424 191.325

e 0 1 57.879 42.548 9.284 .617 -66.093 181.851

2 -63.762 56.051 9.260 .852 -227.169 99.646

1 0 -57.879 42.548 9.284 .617 -181.851 66.093

2 -121.641 51.402 9.212 .125 -271.659 28.377

2 0 63.762 56.051 9.260 .852 -99.646 227.169

1 121.641 51.402 9.212 .125 -28.377 271.659

i 0 1 -49.387 43.344 9.999 .843 -173.790 75.016

2 -203.037* 56.894 9.829 .016 -366.880 -39.194

1 0 49.387 43.344 9.999 .843 -75.016 173.790

2 -153.650* 52.179 9.782 .045 -304.060 -3.240

2 0 203.037* 56.894 9.829 .016 39.194 366.880

1 153.650* 52.179 9.782 .045 3.240 304.060

o 0 1 -23.341 43.474 10.119 1.000 -147.829 101.146

2 -113.063 57.776 10.453 .233 -277.495 51.368

1 0 23.341 43.474 10.119 1.000 -101.146 147.829

2 -89.722 53.032 10.437 .361 -240.695 61.251

2 0 113.063 57.776 10.453 .233 -51.368 277.495

1 89.722 53.032 10.437 .361 -61.251 240.695

u 0 1 -10.611 44.159 10.772 1.000 -135.603 114.381

2 -161.487 58.087 10.680 .055 -326.156 3.181

1 0 10.611 44.159 10.772 1.000 -114.381 135.603

2 -150.877 53.434 10.757 .051 -302.159 .406

2 0 161.487 58.087 10.680 .055 -3.181 326.156

395

1 150.877 53.434 10.757 .051 -.406 302.159





11) Parameter estimates for the significant vowel by grammar score interaction. The [vowel=u]*grammar total category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval



Upper Bound

Grammar -10.788 8.130 10.830 -1.327 0.212 -28.716 7.141 [vowel=a]*grammar 4.983 4.027 3254.057 1.238 0.216 -2.912 12.879 [vowel=e]*grammar -1.179 4.136 3254.063 -0.285 0.775 -9.290 6.930 [vowel=i]*grammar -8.542 4.393 3254.056 -1.944 0.052 -17.155 0.072 [vowel=o]*grammar -6.935 4.527 3254.001 -1.532 0.126 -15.812 1.941 [vowel=u]*grammar 0 0 . . . . .

12) Parameter estimates for the significant vowel by cultural sensitivity interaction (ICCS total). The [vowel=u]*ICCS total category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval

Parameter Estimate Std. Error df t Sig.

Lower Bound

Upper Bound

ICCS total 0.386 1.410 10.630 0.274 0.789 -2.730 3.503 [vowel=a]*ICCS total 0.150 0.679 3254.055 0.222 0.825 -1.182 1.483 [vowel=e]*ICCS total -1.363 0.703 3254.116 -1.938 0.053 -2.743 0.016 [vowel=i]*ICCS total -2.002 0.737 3254.034 -2.717 0.007 -3.448 -0.557 [vowel=o]*ICCS total 0.234 0.749 3254.286 0.313 0.754 -1.234 1.704 [vowel=u]*ICCS total 0 0 . . . . .

396

13) Parameter estimates for the significant vowel by Spanish use. The [vowel=u]*Spanuse category was set to zero. The significance values indicate which vowels had slopes that differed significantly from zero.

95% Confidence

Interval

Parameter Estimate Std. Error df t Sig.

Lower Bound

Upper Bound

Spanish use 16.071 5.760 10.860 2.790 0.018 3.374 28.769 [vowel=a]*Spanish use -14.472 2.828 3254.166 5.134 0.000 -19.999 -8.945 [vowel=e]*Spanish use -6.879 2.909 3254.377 2.364 0.018 -12.583 -1.170 [vowel=i]*Spanish use 4.669 3.073 3254.217 1.518 0.129 -1.359 10.693 [vowel=o]*Spanish use -9.281 3.205 3254.166 2.895 0.004 -15.567 -2.999 [vowel=u]*Spanish use 0 0 . . . . .

397

Appendix K: Results of statistical analyses conducted with duration.




Intercept 1 8.040 4.125 .077

vowel 4 3254.166 3.803 .004

task 2 3254.769 13.390 .000

stress 1 3254.419 749.661 .000

syll 1 3254.248 9.310 .002

Gender 1 8.070 1.684 .230

Group 1 8.082 1.248 .296

Travel 2 8.058 .608 .567

Grammar 1 8.083 .819 .392

ICCSTotal 1 8.045 .001 .975

Spanuse 1 8.066 .222 .650

vowel * task 8 3254.286 5.953 .000

vowel * stress 4 3254.323 7.252 .000

vowel * syll 4 3254.191 4.281 .002


vowel * task * syll 10 3254.211 3.247 .000

vowel * Gender 4 3254.204 .647 .629

vowel * Group 4 3254.150 .253 .908

vowel * Travel 8 3254.160 1.301 .238

vowel * Grammar 4 3254.069 .477 .753

vowel * ICCSTotal 4 3254.162 .761 .550

vowel * Spanuse 4 3254.166 1.313 .263

a. Dependent Variable: Duration.

398



(I) vowel (J) vowel

Mean

Difference (I-



Differencea


a e .011* .002 3254.060 .000 .006 .016

i .018* .002 3254.256 .000 .012 .023

o .010* .002 3254.459 .000 .004 .015

u .014* .002 3254.247 .000 .007 .020

e a -.011* .002 3254.060 .000 -.016 -.006

i .007* .002 3254.137 .005 .001 .013

o -.001 .002 3254.436 1.000 -.007 .004

u .003 .002 3254.226 1.000 -.004 .009

i a -.018* .002 3254.256 .000 -.023 -.012

e -.007* .002 3254.137 .005 -.013 -.001

o -.008* .002 3254.625 .001 -.014 -.003

u -.004 .002 3254.132 .856 -.011 .003

o a -.010* .002 3254.459 .000 -.015 -.004

e .001 .002 3254.436 1.000 -.004 .007

i .008* .002 3254.625 .001 .003 .014

u .004 .002 3254.510 .738 -.002 .011

u a -.014* .002 3254.247 .000 -.020 -.007

e -.003 .002 3254.226 1.000 -.009 .004

i .004 .002 3254.132 .856 -.003 .011

o -.004 .002 3254.510 .738 -.011 .002




b. Dependent Variable: Duration.

399




(I) task (J) task

Mean



Differencea


1 2 -.006* .001 3255.207 .000 -.008 -.003

3 -.006* .001 3254.879 .000 -.010 -.003

2 1 .006* .001 3255.207 .000 .003 .008

3 -.001 .001 3254.257 1.000 -.004 .002

3 1 .006* .001 3254.879 .000 .003 .010

2 .001 .001 3254.257 1.000 -.002 .004







(I)

stress (J) stress

Mean

Difference (I-



Differencea


A T -.026* .001 3254.419 .000 -.028 -.024

T A .026* .001 3254.419 .000 .024 .028





400

5) Pairwise comparisons of the main effect of syllable type. “C” indicates a closed syllable, and “O” indicates an open syllable.


(I) syll (J) syll

Mean



Differencea


C O .003* .001 3254.248 .002 .001 .005

O C -.003* .001 3254.248 .002 -.005 -.001







vowel

(I)

task

(J)

task

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


a 1 2 -.010* .002 3254.194 .000 -.015 -.006

3 -.011* .002 3254.079 .000 -.017 -.005

2 1 .010* .002 3254.194 .000 .006 .015

3 -.001 .002 3254.085 1.000 -.007 .005

3 1 .011* .002 3254.079 .000 .005 .017

2 .001 .002 3254.085 1.000 -.005 .007

e 1 2 .002 .002 3254.820 .663 -.002 .007

3 .000 .003 3254.426 1.000 -.007 .007

2 1 -.002 .002 3254.820 .663 -.007 .002

3 -.002 .003 3254.098 1.000 -.009 .005

3 1 .000 .003 3254.426 1.000 -.007 .007

2 .002 .003 3254.098 1.000 -.005 .009

i 1 2 -.001 .003 3254.432 1.000 -.007 .006

3 -.002 .003 3254.374 1.000 -.010 .005

2 1 .001 .003 3254.432 1.000 -.006 .007

401

3 -.001 .003 3254.111 1.000 -.008 .005

3 1 .002 .003 3254.374 1.000 -.005 .010

2 .001 .003 3254.111 1.000 -.005 .008

o 1 2 -.014* .002 3254.750 .000 -.020 -.008

3 -.016* .002 3254.753 .000 -.022 -.010

2 1 .014* .002 3254.750 .000 .008 .020

3 -.002 .002 3254.129 1.000 -.008 .004

3 1 .016* .002 3254.753 .000 .010 .022

2 .002 .002 3254.129 1.000 -.004 .008

u 1 2 -.005 .004 3254.805 .620 -.015 .005

3 -.003 .004 3254.711 1.000 -.013 .007

2 1 .005 .004 3254.805 .620 -.005 .015

3 .002 .003 3254.081 1.000 -.005 .008

3 1 .003 .004 3254.711 1.000 -.007 .013

2 -.002 .003 3254.081 1.000 -.008 .005





7) Pairwise comparisons of the vowel by syllable type interaction.


vowel

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


a C O -7.444E-5 .002 3254.186 .966 -.003 .003

O C 7.444E-5 .002 3254.186 .966 -.003 .003

e C O .006* .002 3254.218 .005 .002 .010

O C -.006* .002 3254.218 .005 -.010 -.002

i C O .010* .002 3254.086 .000 .005 .014

O C -.010* .002 3254.086 .000 -.014 -.005

o C O .002 .002 3254.223 .285 -.002 .006

O C -.002 .002 3254.223 .285 -.006 .002

u C O -.002 .003 3254.332 .444 -.008 .003

O C .002 .003 3254.332 .444 -.003 .008

402


vowel

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


a C O -7.444E-5 .002 3254.186 .966 -.003 .003

O C 7.444E-5 .002 3254.186 .966 -.003 .003

e C O .006* .002 3254.218 .005 .002 .010

O C -.006* .002 3254.218 .005 -.010 -.002

i C O .010* .002 3254.086 .000 .005 .014

O C -.010* .002 3254.086 .000 -.014 -.005

o C O .002 .002 3254.223 .285 -.002 .006

O C -.002 .002 3254.223 .285 -.006 .002

u C O -.002 .003 3254.332 .444 -.008 .003

O C .002 .003 3254.332 .444 -.003 .008







vowel task

(I)

stress

(J)

stress

Mean Difference

(I-J)

Std.

Error df

Sig.a


for Differencea

Lower

Bound

Upper

Bound

a 1 A T -.028* .002 3254.465 .000 -.033 -.023

T A .028* .002 3254.465 .000 .023 .033

2 A T -.028* .002 3254.127 .000 -.033 -.024

T A .028* .002 3254.127 .000 .024 .033

3 A T -.030* .004 3254.219 .000 -.038 -.022

T A .030* .004 3254.219 .000 .022 .038

e 1 A T -.030* .003 3254.344 .000 -.035 -.025

T A .030* .003 3254.344 .000 .025 .035

403

2 A T -.027* .003 3254.136 .000 -.032 -.022

T A .027* .003 3254.136 .000 .022 .032

3 A T -.032* .005 3254.179 .000 -.042 -.023

T A .032* .005 3254.179 .000 .023 .042

i 1 A T -.020* .004 3255.200 .000 -.027 -.013

T A .020* .004 3255.200 .000 .013 .027

2 A T -.013* .003 3254.137 .000 -.019 -.006

T A .013* .003 3254.137 .000 .006 .019

3 A T -.015* .004 3254.018 .001 -.024 -.006

T A .015* .004 3254.018 .001 .006 .024

o 1 A T -.022* .003 3255.458 .000 -.028 -.015

T A .022* .003 3255.458 .000 .015 .028

2 A T -.033* .003 3254.201 .000 -.040 -.027

T A .033* .003 3254.201 .000 .027 .040

3 A T -.027* .004 3254.033 .000 -.035 -.019

T A .027* .004 3254.033 .000 .019 .035

u 1 A T -.019* .005 3254.587 .000 -.029 -.009

T A .019* .005 3254.587 .000 .009 .029

2 A T -.033* .003 3254.054 .000 -.039 -.026

T A .033* .003 3254.054 .000 .026 .039

3 A T -.028* .004 3254.083 .000 -.036 -.021

T A .028* .004 3254.083 .000 .021 .036 Based on estimated marginal means *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent variable: Duration 9) Interaction between vowel, task, and syllable type. The “C” stands for closed (syllable) and the “O” for open (syllable). The numbers 1, 2, 3 stand for the NRT, PIT, and CPT respectively.


vowel task

(I)

syll

(J)

syll

Mean Difference

(I-J)

Std.

Error df Sig.a


Differencea


a 1 C O .001 .003 3254.432 .708 -.004 .006

O C -.001 .003 3254.432 .708 -.006 .004

2 C O -.004 .002 3254.173 .137 -.008 .001

404

O C .004 .002 3254.173 .137 -.001 .008

3 C O .002 .004 3254.028 .529 -.005 .010

O C -.002 .004 3254.028 .529 -.010 .005

e 1 C O .009* .003 3254.485 .001 .004 .015

O C -.009* .003 3254.485 .001 -.015 -.004

2 C O .002 .003 3254.086 .488 -.003 .007

O C -.002 .003 3254.086 .488 -.007 .003

3 C O .006 .005 3254.096 .197 -.003 .016

O C -.006 .005 3254.096 .197 -.016 .003

i 1 C O .011* .004 3254.210 .006 .003 .020

O C -.011* .004 3254.210 .006 -.020 -.003

2 C O .017* .003 3254.166 .000 .011 .023

O C -.017* .003 3254.166 .000 -.023 -.011

3 C O -1.481E-5 .004 3254.064 .997 -.008 .008

O C 1.481E-5 .004 3254.064 .997 -.008 .008

o 1 C O .002 .003 3254.362 .497 -.004 .009

O C -.002 .003 3254.362 .497 -.009 .004

2 C O .007 .004 3254.047 .055 .000 .014

O C -.007 .004 3254.047 .055 -.014 .000

3 C O -.002 .004 3254.064 .577 -.010 .006

O C .002 .004 3254.064 .577 -.006 .010

u 1 C O -.008 .007 3254.716 .282 -.022 .006

O C .008 .007 3254.716 .282 -.006 .022

2 C O .009* .003 3254.034 .009 .002 .015

O C -.009* .003 3254.034 .009 -.015 -.002

3 C O -.008* .004 3254.134 .040 -.015 .000

O C .008* .004 3254.134 .040 .000 .015 Based on estimated marginal means. *. The mean difference is significant at the .05 level. a. Adjustment for multiple comparisons: Bonferroni. b. Dependent variable: Duration

405

Appendix L: Normalized and non-normalized formant values

Comparison of statistical results conducted on normalized and non-normalized values. The darkened cells indicate differences between the two statistical outcomes.

Vowel 0.000 0.000 0.000 0.000 0.000 0.000 0.004Task 0.000 0.000 0.002 0.003 0.000 0.000 0.000Stress 0.000 0.000 0.000 0.000 0.000Syllable Type 0.000 0.000 0.002Vowel*Task 0.000 0.000 0.000 0.000 0.000 0.000 0.000Vowel*Stress 0.000 0.000 0.000 0.000 0.000 0.000 0.000

Vowel*Syllable Type 0.000 0.000 0.000 0.002

Vowel*Task*Stress 0.004 0.004 0.000 0.000 0.000 0.000

Vowel*Task*Syllable Type 0.000 0.000 0.000 0.000 0.000 0.000 0.000

Gender 0.025 0.005LevelTravelGrammarICCS TotalSpanish UseVowel*Gender 0.000 0.000 0.000 0.000 0.027Vowel*Level 0.046 0.000 0.001 0.000 0.011Vowel*Travel 0.000 0.000 0.000 0.000 0.000 0.000Vowel*Grammar 0.000 0.000 0.000 0.002Vowel*ICCS Total 0.022 0.006 0.001 0.001Vowel*Spanish Use 0.000 0.000 0.033 0.000 0.001

Effect/InteractionNormalized

F1F1 F2Normalized

F2Euclidean Distance

Normalized Euclidean Distance

Duration

406

Appendix M: Vowel by task by travel interaction

1) Type III tests of fixed effects: Dependent variable F1.



Intercept 1 7.990 19.335 .002

vowel 4 3234.081 16.014 .000

task 2 3234.352 27.277 .000

stress 1 3234.226 215.545 .000

syll 1 3234.128 2.100 .147

Gender 1 8.005 7.707 .024

Group 1 8.011 .903 .370

Travel 2 8.008 2.666 .130

Grammar 1 8.011 .012 .914

ICCSTotal 1 7.992 1.480 .259

Spanuse 1 8.004 .478 .509

vowel * task 8 3234.129 25.177 .000

vowel * stress 4 3234.167 61.150 .000

vowel * syll 4 3234.090 .807 .521


vowel * task * syll 10 3234.103 6.694 .000

vowel * Gender 4 3234.099 71.229 .000

vowel * Group 4 3234.073 2.762 .026

vowel * Travel 8 3234.088 20.770 .000

vowel * Grammar 4 3234.034 10.938 .000

vowel * ICCSTotal 4 3234.079 3.155 .013

vowel * Spanuse 4 3234.080 6.092 .000

vowel * task * Travel 20 3234.162 3.638 .000


407

2) Pairwise comparisons of interaction with dependent variable F1.


vowel Travel (I) task

(J)

task

Mean

Difference

(I-J)

Std.

Error df Sig.a

95% Confidence


Lower

Bound

Upper

Bound

a 0 1 2 8.845 7.917 3234.052 .792 -10.119 27.809

3 -24.486* 9.620 3234.002 .033 -47.528 -1.445

2 1 -8.845 7.917 3234.052 .792 -27.809 10.119

3 -33.332* 10.017 3234.012 .003 -57.325 -9.339

3 1 24.486* 9.620 3234.002 .033 1.445 47.528

2 33.332* 10.017 3234.012 .003 9.339 57.325

1 1 2 -9.341 7.381 3234.130 .617 -27.020 8.339

3 -55.862* 9.090 3234.096 .000 -77.633 -34.090

2 1 9.341 7.381 3234.130 .617 -8.339 27.020

3 -46.521* 8.938 3234.047 .000 -67.930 -25.112

3 1 55.862* 9.090 3234.096 .000 34.090 77.633

2 46.521* 8.938 3234.047 .000 25.112 67.930

2 1 2 -44.041* 7.233 3234.052 .000 -61.365 -26.717

3 -87.435* 8.678 3234.098 .000 -108.221 -66.648

2 1 44.041* 7.233 3234.052 .000 26.717 61.365

3 -43.394* 8.649 3234.039 .000 -64.111 -22.677

3 1 87.435* 8.678 3234.098 .000 66.648 108.221

2 43.394* 8.649 3234.039 .000 22.677 64.111

e 0 1 2 26.437* 8.468 3234.147 .005 6.155 46.720

3 52.785* 11.047 3234.054 .000 26.324 79.246

2 1 -26.437* 8.468 3234.147 .005 -46.720 -6.155

3 26.348 11.176 3234.029 .055 -.420 53.115

3 1 -52.785* 11.047 3234.054 .000 -79.246 -26.324

2 -26.348 11.176 3234.029 .055 -53.115 .420

1 1 2 48.158* 7.876 3234.426 .000 29.293 67.023

3 51.767* 10.604 3234.262 .000 26.368 77.167

2 1 -48.158* 7.876 3234.426 .000 -67.023 -29.293

3 3.610 10.026 3234.018 1.000 -20.406 27.625

3 1 -51.767* 10.604 3234.262 .000 -77.167 -26.368

408

2 -3.610 10.026 3234.018 1.000 -27.625 20.406

2 1 2 35.845* 7.911 3234.393 .000 16.896 54.793

3 36.723* 10.225 3234.266 .001 12.231 61.214

2 1 -35.845* 7.911 3234.393 .000 -54.793 -16.896

3 .878 9.859 3234.051 1.000 -22.737 24.493

3 1 -36.723* 10.225 3234.266 .001 -61.214 -12.231

2 -.878 9.859 3234.051 1.000 -24.493 22.737

i 0 1 2 21.292 11.354 3234.124 .183 -5.902 48.487

3 47.448* 12.025 3234.051 .000 18.647 76.249

2 1 -21.292 11.354 3234.124 .183 -48.487 5.902

3 26.156 11.463 3234.022 .068 -1.302 53.613

3 1 -47.448* 12.025 3234.051 .000 -76.249 -18.647

2 -26.156 11.463 3234.022 .068 -53.613 1.302

1 1 2 31.263* 10.718 3234.280 .011 5.592 56.934

3 69.969* 11.640 3234.294 .000 42.088 97.849

2 1 -31.263* 10.718 3234.280 .011 -56.934 -5.592

3 38.706* 10.127 3234.117 .000 14.450 62.961

3 1 -69.969* 11.640 3234.294 .000 -97.849 -42.088

2 -38.706* 10.127 3234.117 .000 -62.961 -14.450

2 1 2 34.251* 10.046 3234.110 .002 10.188 58.314

3 45.588* 10.837 3234.292 .000 19.631 71.544

2 1 -34.251* 10.046 3234.110 .002 -58.314 -10.188

3 11.337 9.946 3234.071 .763 -12.486 35.159

3 1 -45.588* 10.837 3234.292 .000 -71.544 -19.631

2 -11.337 9.946 3234.071 .763 -35.159 12.486

o 0 1 2 16.054 10.784 3234.192 .410 -9.775 41.884

3 32.632* 10.622 3234.191 .006 7.191 58.073

2 1 -16.054 10.784 3234.192 .410 -41.884 9.775

3 16.577 11.535 3233.999 .452 -11.051 44.205

3 1 -32.632* 10.622 3234.191 .006 -58.073 -7.191

2 -16.577 11.535 3233.999 .452 -44.205 11.051

1 1 2 16.149 10.791 3234.162 .404 -9.698 41.996

3 .061 10.604 3234.097 1.000 -25.338 25.460

2 1 -16.149 10.791 3234.162 .404 -41.996 9.698

3 -16.088 9.899 3234.136 .313 -39.798 7.622

3 1 -.061 10.604 3234.097 1.000 -25.460 25.338

2 16.088 9.899 3234.136 .313 -7.622 39.798

2 1 2 27.493* 10.078 3234.750 .019 3.353 51.633

409

3 18.964 9.727 3234.857 .154 -4.334 42.263

2 1 -27.493* 10.078 3234.750 .019 -51.633 -3.353

3 -8.528 9.742 3234.015 1.000 -31.862 14.805

3 1 -18.964 9.727 3234.857 .154 -42.263 4.334

2 8.528 9.742 3234.015 1.000 -14.805 31.862

u 0 1 2 59.298* 15.594 3234.186 .000 21.946 96.650

3 83.505* 16.101 3234.158 .000 44.939 122.071

2 1 -59.298* 15.594 3234.186 .000 -96.650 -21.946

3 24.207 11.744 3234.000 .118 -3.923 52.336

3 1 -83.505* 16.101 3234.158 .000 -122.071 -44.939

2 -24.207 11.744 3234.000 .118 -52.336 3.923

1 1 2 30.646 14.186 3234.149 .092 -3.332 64.625

3 51.969* 14.777 3234.093 .001 16.574 87.363

2 1 -30.646 14.186 3234.149 .092 -64.625 3.332

3 21.322 10.494 3234.099 .127 -3.814 46.459

3 1 -51.969* 14.777 3234.093 .001 -87.363 -16.574

2 -21.322 10.494 3234.099 .127 -46.459 3.814

2 1 2 4.377 13.770 3234.641 1.000 -28.605 37.360

3 20.452 14.498 3234.607 .475 -14.275 55.178

2 1 -4.377 13.770 3234.641 1.000 -37.360 28.605

3 16.074 10.235 3234.003 .349 -8.440 40.589

3 1 -20.452 14.498 3234.607 .475 -55.178 14.275

2 -16.074 10.235 3234.003 .349 -40.589 8.440





410

3) Type III tests of fixed effects: Dependent variable F2.



Intercept 1 8.013 34.166 .000

vowel 4 3234.136 19.511 .000

task 2 3234.503 5.957 .003

stress 1 3234.333 2.653 .103

syll 1 3234.200 2.305 .129

Gender 1 8.033 4.040 .079

Group 1 8.042 .419 .536

Travel 2 8.037 .153 .860

Grammar 1 8.042 .126 .732

ICCSTotal 1 8.016 1.709 .227

Spanuse 1 8.032 .325 .584

vowel * task 8 3234.201 23.463 .000

vowel * stress 4 3234.253 44.392 .000

vowel * syll 4 3234.148 6.472 .000


vowel * task * syll 10 3234.166 4.676 .000

vowel * Gender 4 3234.160 75.305 .000

vowel * Group 4 3234.125 18.597 .000

vowel * Travel 8 3234.146 24.514 .000

vowel * Grammar 4 3234.073 23.917 .000

vowel * ICCSTotal 4 3234.134 .797 .527

vowel * Spanuse 4 3234.135 31.692 .000



411

4) Pairwise comparisons of interaction with dependent variable F2.



(J)

task

Mean

Difference

(I-J)

Std.

Error df Sig.a

95% Confidence


Lower

Bound

Upper

Bound

a 0 1 2 22.769 24.252 3234.097 1.000 -35.321 80.859

3 44.422 29.467 3234.030 .395 -26.158 115.001

2 1 -22.769 24.252 3234.097 1.000 -80.859 35.321

3 21.653 30.684 3234.044 1.000 -51.841 95.147

3 1 -44.422 29.467 3234.030 .395 -115.001 26.158

2 -21.653 30.684 3234.044 1.000 -95.147 51.841

1 1 2 57.648* 22.610 3234.202 .032 3.493 111.804

3 114.107* 27.843 3234.157 .000 47.417 180.797

2 1 -57.648* 22.610 3234.202 .032 -111.804 -3.493

3 56.459 27.379 3234.091 .118 -9.119 122.037

3 1 -114.107* 27.843 3234.157 .000 -180.797 -47.417

2 -56.459 27.379 3234.091 .118 -122.037 9.119

2 1 2 -7.552 22.155 3234.097 1.000 -60.618 45.514

3 16.240 26.583 3234.160 1.000 -47.432 79.912

2 1 7.552 22.155 3234.097 1.000 -45.514 60.618

3 23.792 26.494 3234.079 1.000 -39.667 87.251

3 1 -16.240 26.583 3234.160 1.000 -79.912 47.432

2 -23.792 26.494 3234.079 1.000 -87.251 39.667

e 0 1 2 -37.384 25.939 3234.227 .449 -99.512 24.745

3 -248.313* 33.840 3234.100 .000 -329.367 -167.258

2 1 37.384 25.939 3234.227 .449 -24.745 99.512

3 -210.929* 34.232 3234.065 .000 -292.923 -128.935

3 1 248.313* 33.840 3234.100 .000 167.258 329.367

2 210.929* 34.232 3234.065 .000 128.935 292.923

1 1 2 .879 24.126 3234.602 1.000 -56.907 58.666

3 -180.006* 32.483 3234.382 .000 -257.809 -102.203

2 1 -.879 24.126 3234.602 1.000 -58.666 56.907

3 -180.885* 30.712 3234.051 .000 -254.448 -107.322

3 1 180.006* 32.483 3234.382 .000 102.203 257.809

2 180.885* 30.712 3234.051 .000 107.322 254.448

412

2 1 2 -43.789 24.232 3234.558 .213 -101.830 14.252

3 -292.299* 31.321 3234.387 .000 -367.320 -217.278

2 1 43.789 24.232 3234.558 .213 -14.252 101.830

3 -248.510* 30.200 3234.096 .000 -320.845 -176.175

3 1 292.299* 31.321 3234.387 .000 217.278 367.320

2 248.510* 30.200 3234.096 .000 176.175 320.845

i 0 1 2 33.323 34.778 3234.195 1.000 -49.978 116.625

3 -59.936 36.833 3234.095 .311 -148.159 28.287

2 1 -33.323 34.778 3234.195 1.000 -116.625 49.978

3 -93.260* 35.114 3234.057 .024 -177.366 -9.153

3 1 59.936 36.833 3234.095 .311 -28.287 148.159

2 93.260* 35.114 3234.057 .024 9.153 177.366

1 1 2 -67.100 32.829 3234.405 .123 -145.733 11.533

3 -129.950* 35.655 3234.425 .001 -215.351 -44.548

2 1 67.100 32.829 3234.405 .123 -11.533 145.733

3 -62.850 31.019 3234.186 .128 -137.148 11.448

3 1 129.950* 35.655 3234.425 .001 44.548 215.351

2 62.850 31.019 3234.186 .128 -11.448 137.148

2 1 2 -140.302* 30.774 3234.176 .000 -214.012 -66.593

3 -194.999* 33.194 3234.421 .000 -274.507 -115.491

2 1 140.302* 30.774 3234.176 .000 66.593 214.012

3 -54.697 30.466 3234.123 .218 -127.670 18.275

3 1 194.999* 33.194 3234.421 .000 115.491 274.507

2 54.697 30.466 3234.123 .218 -18.275 127.670

o 0 1 2 1.706 33.032 3234.287 1.000 -77.414 80.825

3 62.749 32.535 3234.286 .162 -15.181 140.679

2 1 -1.706 33.032 3234.287 1.000 -80.825 77.414

3 61.044 35.332 3234.025 .252 -23.585 145.672

3 1 -62.749 32.535 3234.286 .162 -140.679 15.181

2 -61.044 35.332 3234.025 .252 -145.672 23.585

1 1 2 46.773 33.054 3234.247 .471 -32.400 125.945

3 112.515* 32.482 3234.159 .002 34.714 190.315

2 1 -46.773 33.054 3234.247 .471 -125.945 32.400

3 65.742 30.321 3234.211 .091 -6.884 138.368

3 1 -112.515* 32.482 3234.159 .002 -190.315 -34.714

2 -65.742 30.321 3234.211 .091 -138.368 6.884

2 1 2 48.879 30.871 3235.037 .340 -25.064 122.822

3 82.877* 29.794 3235.180 .016 11.512 154.241

413

2 1 -48.879 30.871 3235.037 .340 -122.822 25.064

3 33.997 29.841 3234.047 .764 -37.478 105.473

3 1 -82.877* 29.794 3235.180 .016 -154.241 -11.512

2 -33.997 29.841 3234.047 .764 -105.473 37.478

u 0 1 2 -97.048 47.767 3234.279 .127 -211.462 17.366

3 -7.258 49.320 3234.241 1.000 -125.390 110.874

2 1 97.048 47.767 3234.279 .127 -17.366 211.462

3 89.790* 35.974 3234.026 .038 3.625 175.955

3 1 7.258 49.320 3234.241 1.000 -110.874 125.390

2 -89.790* 35.974 3234.026 .038 -175.955 -3.625

1 1 2 66.856 43.454 3234.229 .372 -37.225 170.937

3 156.335* 45.265 3234.153 .002 47.917 264.754

2 1 -66.856 43.454 3234.229 .372 -170.937 37.225

3 89.479* 32.145 3234.161 .016 12.484 166.475

3 1 -156.335* 45.265 3234.153 .002 -264.754 -47.917

2 -89.479* 32.145 3234.161 .016 -166.475 -12.484

2 1 2 -101.008 42.179 3234.893 .050 -202.037 .021

3 -29.834 44.409 3234.847 1.000 -136.204 76.536

2 1 101.008 42.179 3234.893 .050 -.021 202.037

3 71.174 31.351 3234.030 .070 -3.918 146.267

3 1 29.834 44.409 3234.847 1.000 -76.536 136.204

2 -71.174 31.351 3234.030 .070 -146.267 3.918





414

5) Type III tests of fixed effects: Dependent variable Euclidean distance.



Intercept 1 7.975 2.219 .175

vowel 4 3234.148 5.397 .000

task 2 3234.660 88.682 .000

stress 1 3234.424 183.485 .000

syll 1 3234.237 39.221 .000

Gender 1 8.003 14.819 .005

Group 1 8.016 1.861 .210

Travel 2 8.009 2.939 .110

Grammar 1 8.015 3.051 .119

ICCSTotal 1 7.980 .024 .881

Spanuse 1 8.001 4.170 .075

vowel * task 8 3234.239 5.937 .000

vowel * stress 4 3234.312 23.173 .000

vowel * syll 4 3234.165 1.032 .389


vowel * task * syll 10 3234.190 4.483 .000

vowel * Gender 4 3234.182 20.072 .000

vowel * Group 4 3234.132 9.170 .000

vowel * Travel 8 3234.162 10.768 .000

vowel * Grammar 4 3234.059 4.525 .001

vowel * ICCSTotal 4 3234.145 4.946 .001

vowel * Spanuse 4 3234.146 17.503 .000


a. Dependent Variable: EUC.

415

6) Pairwise comparisons of interaction with dependent variable Euclidean Distance.



(J)

task

Mean

Difference

(I-J)

Std.

Error df Sig.a

95% Confidence


Lower

Bound

Upper

Bound

a 0 1 2 -11.813 19.579 3234.093 1.000 -58.709 35.083

3 -45.966 23.789 3233.998 .160 -102.945 11.013

2 1 11.813 19.579 3234.093 1.000 -35.083 58.709

3 -34.153 24.771 3234.018 .504 -93.485 25.178

3 1 45.966 23.789 3233.998 .160 -11.013 102.945

2 34.153 24.771 3234.018 .504 -25.178 93.485

1 1 2 -31.757 18.253 3234.241 .246 -75.477 11.962

3 -95.287* 22.478 3234.177 .000 -149.126 -41.449

2 1 31.757 18.253 3234.241 .246 -11.962 75.477

3 -63.530* 22.103 3234.084 .012 -116.472 -10.589

3 1 95.287* 22.478 3234.177 .000 41.449 149.126

2 63.530* 22.103 3234.084 .012 10.589 116.472

2 1 2 -24.618 17.886 3234.094 .506 -67.458 18.222

3 -76.145* 21.460 3234.182 .001 -127.548 -24.743

2 1 24.618 17.886 3234.094 .506 -18.222 67.458

3 -51.528* 21.389 3234.068 .048 -102.758 -.297

3 1 76.145* 21.460 3234.182 .001 24.743 127.548

2 51.528* 21.389 3234.068 .048 .297 102.758

e 0 1 2 -9.245 20.940 3234.275 1.000 -59.401 40.912

3 -191.701* 27.319 3234.097 .000 -257.136 -126.266

2 1 9.245 20.940 3234.275 1.000 -40.912 59.401

3 -182.456* 27.636 3234.049 .000 -248.650 -116.263

3 1 191.701* 27.319 3234.097 .000 126.266 257.136

2 182.456* 27.636 3234.049 .000 116.263 248.650

1 1 2 9.711 19.476 3234.799 1.000 -36.939 56.361

3 -128.512* 26.223 3234.493 .000 -191.322 -65.702

2 1 -9.711 19.476 3234.799 1.000 -56.361 36.939

3 -138.223* 24.794 3234.028 .000 -197.610 -78.836

3 1 128.512* 26.223 3234.493 .000 65.702 191.322

416

2 138.223* 24.794 3234.028 .000 78.836 197.610

2 1 2 -6.973 19.562 3234.736 1.000 -53.829 39.882

3 -233.361* 25.285 3234.499 .000 -293.925 -172.797

2 1 6.973 19.562 3234.736 1.000 -39.882 53.829

3 -226.388* 24.380 3234.092 .000 -284.784 -167.992

3 1 233.361* 25.285 3234.499 .000 172.797 293.925

2 226.388* 24.380 3234.092 .000 167.992 284.784

i 0 1 2 37.507 28.076 3234.230 .545 -29.742 104.756

3 -59.117 29.735 3234.091 .141 -130.340 12.105

2 1 -37.507 28.076 3234.230 .545 -104.756 29.742

3 -96.624* 28.348 3234.037 .002 -164.523 -28.725

3 1 59.117 29.735 3234.091 .141 -12.105 130.340

2 96.624* 28.348 3234.037 .002 28.725 164.523

1 1 2 -80.764* 26.503 3234.524 .007 -144.244 -17.284

3 -158.243* 28.784 3234.552 .000 -227.187 -89.299

2 1 80.764* 26.503 3234.524 .007 17.284 144.244

3 -77.479* 25.042 3234.218 .006 -137.459 -17.498

3 1 158.243* 28.784 3234.552 .000 89.299 227.187

2 77.479* 25.042 3234.218 .006 17.498 137.459

2 1 2 -117.622* 24.843 3234.205 .000 -177.127 -58.116

3 -172.507* 26.798 3234.547 .000 -236.693 -108.321

2 1 117.622* 24.843 3234.205 .000 58.116 177.127

3 -54.885 24.595 3234.130 .077 -113.796 4.025

3 1 172.507* 26.798 3234.547 .000 108.321 236.693

2 54.885 24.595 3234.130 .077 -4.025 113.796

o 0 1 2 4.750 26.667 3234.360 1.000 -59.123 68.623

3 -50.574 26.266 3234.359 .163 -113.486 12.338

2 1 -4.750 26.667 3234.360 1.000 -68.623 59.123

3 -55.324 28.524 3233.992 .158 -123.644 12.997

3 1 50.574 26.266 3234.359 .163 -12.338 113.486

2 55.324 28.524 3233.992 .158 -12.997 123.644

1 1 2 -27.803 26.685 3234.303 .893 -91.718 36.113

3 -114.627* 26.222 3234.180 .000 -177.435 -51.818

2 1 27.803 26.685 3234.303 .893 -36.113 91.718

3 -86.824* 24.478 3234.254 .001 -145.454 -28.193

3 1 114.627* 26.222 3234.180 .000 51.818 177.435

2 86.824* 24.478 3234.254 .001 28.193 145.454

2 1 2 -59.586 24.921 3235.399 .051 -119.278 .107

417

3 -89.880* 24.052 3235.593 .001 -147.490 -32.269

2 1 59.586 24.921 3235.399 .051 -.107 119.278

3 -30.294 24.091 3234.023 .626 -87.997 27.408

3 1 89.880* 24.052 3235.593 .001 32.269 147.490

2 30.294 24.091 3234.023 .626 -27.408 87.997

u 0 1 2 74.528 38.562 3234.349 .160 -17.837 166.894

3 -1.932 39.816 3234.295 1.000 -97.299 93.436

2 1 -74.528 38.562 3234.349 .160 -166.894 17.837

3 -76.460* 29.042 3233.994 .026 -146.022 -6.899

3 1 1.932 39.816 3234.295 1.000 -93.436 97.299

2 76.460* 29.042 3233.994 .026 6.899 146.022

1 1 2 -62.956 35.080 3234.279 .218 -146.980 21.069

3 -144.475* 36.542 3234.172 .000 -232.002 -56.949

2 1 62.956 35.080 3234.279 .218 -21.069 146.980

3 -81.520* 25.951 3234.183 .005 -143.678 -19.361

3 1 144.475* 36.542 3234.172 .000 56.949 232.002

2 81.520* 25.951 3234.183 .005 19.361 143.678

2 1 2 71.384 34.051 3235.202 .108 -10.175 152.943

3 4.539 35.851 3235.138 1.000 -81.331 90.410

2 1 -71.384 34.051 3235.202 .108 -152.943 10.175

3 -66.845* 25.310 3234.000 .025 -127.467 -6.222

3 1 -4.539 35.851 3235.138 1.000 -90.410 81.331

2 66.845* 25.310 3234.000 .025 6.222 127.467





Rebecca Ellen Ronquest

EDUCATION 2012 Ph.D. Hispanic Linguistics, Indiana University Minor: General Linguistics Dissertation: An Acoustic Analysis of Heritage Spanish Vowels. Director: Dr. Erik W. Willis Committee: Dr. Kimberly Geeslin, Dr. Manuel Díaz-Campos, Dr. Kenneth de Jong 2004 M.A. Hispanic Linguistics, Indiana University M.A. Linguistics, Indiana University 2002 B.A. Spanish, The College of William & Mary

Summa Cum Laude

RESEARCH INTERESTS Acoustic phonetics, Sociolinguistics, Bilingualism, Heritage Spanish, Speech Perception

PROFESSIONAL EMPLOYMENT 2002-2011 Associate Instructor of Spanish, Indiana University 2005-2006 Research Assistant. National Institutes of Health (NIH) Pre- Summer 2007 doctoral Fellow, Indiana University Speech Research Lab. Summer 2008 Dr. David Pisoni, Director. 2001 Spanish Drill Instructor, The College of William & Mary

PUBLICATIONS Refereed Journals 2012 Ronquest, R. (Accepted). An acoustic examination of unstressed

vowel reduction in Heritage Spanish. Selected proceedings from the 2011 Hispanic Linguistics Symposium (HLS).

2010 Ronquest, R., Levi, S., & Pisoni, B. (2010). Language

identification from visual-only speech signals. Attention, Perception, and Psychophysics, 72(6), 1601-1613.

2009 Ronquest, R., & Díaz-Campos, M. (2009). Discriminating pitch-accent alignment in Spanish. In Masullo, P., O'Rourke, E., & Huang, C. (Eds.), Romance Linguistics 2007: Selected papers from the 37th Linguistic Symposium on Romance Languages (LSRL), (pp. 243-260). Philadelphia: Benjamins.

2007 Díaz-Campos, M., & Ronquest, R. (2007). La percepción de

acentos tonales en enunciados afirmativos. Estudios de fonética experimental, 16, 81-98.

Progress Reports 2007 Ronquest, R., Levi, S., & Pisoni, B. (2007). Language

identification from visual-only speech. In Research on Spoken Language Processing Progress Report No. 28, 95-118.

2006 Ronquest, R., & Hernandez, L. (2006). Lip-reading skills in

bilinguals. In Research on Spoken Language Processing Progress Report No. 27, 219-226.

PRESENTATIONS Conference Presentations 2012 Ronquest, R. (2012, March). An acoustic analysis of the effects of

syllable type on Heritage Spanish vowel production. Paper presented at The Ninth Annual Graduate Student Conference on Luso-Brazilian and Hispanic Linguistics, Literature and Culture, Bloomington, IN.

2012 Ronquest, R. (2012, February). The effect of speech style on the

acoustic distribution of Heritage Spanish vowels. Poster presented at Current Approaches to Spanish and Portuguese Second Language Phonology 2012, Colombia, SC.

2011 Ronquest, R. (2011, October). An analysis of mid-vowel

variation in Heritage Spanish. Poster presented at The 17th Annual Mid-Continental Workshop on Phonetics and Phonology, Urbana, IL.

2011 Ronquest, R. (2011, October). An acoustic characterization of

the Heritage Spanish vowel system. Paper presented at The 2011 Hispanic Linguistics Symposium, Athens, GA.

2008 Ronquest, R., Levi, S., & Pisoni, D. (2008, July). Visual cues to language identification. Paper presented at The Linguistic Society of America Summer Meeting, Columbus, OH.

2007 Ronquest, R., & Díaz-Campos, M. (2007,April). Discriminating

pitch-accent alignment in Spanish: A pilot investigation. Paper presented at The Kentucky Foreign Language Conference, Lexington, KY.

2007 Ronquest, R., & Díaz-Campos, M. (2007, March). A perceptual

discrimination study of pitch-accent alignment in Spanish. Paper presented at The Linguistic Symposium on Romance Languages XXXVII, Pittsburgh, PA.

Lab Talks and Invited Presentations 2011 Ronquest, R. (2011, November). Living in two languages: A

linguistic study of bilingual youths. Paper presented at the Latino Studies Program Brown Bag series. Indiana University, Bloomington, IN.

2010 Ronquest, R. (2010, December). An acoustic analysis of

Heritage Spanish vowels. Paper presented at the Hispanic Linguistics Brown Bag series. Department of Spanish and Portuguese, Indiana University, Bloomington, IN.

2008 Ronquest, R. (2008, February). Language identification from

visual-only speech signals. Paper presented at the Hispanic Linguistics Brown Bag series, Department of Spanish and Portuguese, Indiana University, Bloomington, IN.

2007 Ronquest, R. (2007, February). Examining pitch-accent

alignment in Spanish. Paper presented at the Hispanic Linguistics Brown Bag series. Department of Spanish and Portuguese, Indiana University, Bloomington, IN.

2006 Ronquest, R. (2006, December). Visual-only language

identification in monolinguals and bilinguals. Paper presented at lab meeting, Speech Research Laboratory, Indiana University, Bloomington, IN.

TEACHING EXPERIENCE Courses Taught 2002-2011 Associate Instructor, Indiana University

S326--Introduction to Hispanic Linguistics (Fall 2008, Spring 2010, Fall 2010, Spring 2011) S280--Spanish Grammar in Context (Fall 2009) S275--Introduction to Spanish History and Culture (Spring 2008, Fall 2008) S250--Intermediate Spanish II (Fall 2007) S200--Intermediate Spanish I (Fall 2003, Spring 2004) S105--Accelerated first year Spanish (Fall 2002, Spring 2003, Fall 2004, Spring 2007)

2001 Drill instructor, The College of William & Mary HISP 101 Drill--Beginning level Spanish (Fall 2001) Tutoring 2005-Present Tutored students enrolled in university-level Spanish courses at

Indiana University. Ability levels ranged from beginning to advanced.

2001-2002 Tutor employed by University Instructors, Inc, in Williamsburg,

Virginia. Tutored high school students enrolled in beginning-level Spanish and Advanced Placement Spanish.

SERVICE AND VOLUNTEER ACTIVITIES Teaching 2009 Participant in Language Teaching Share Fair. Presented original

warm-up activity entitled “A improvisar” to other graduate teaching instructors during a workshop on designing effective warm-up activities for foreign language teaching.

2001-2002 Volunteer Spanish teacher at Matthew Whaley Elementary School

in Williamsburg, Virginia. Taught Spanish to first graders one to two times per week during the Spring semesters of 2001 and 2002.

Professional 2011 Reviewer, Selected Proceedings from the Hispanic Linguistics Symposium. 2010 Chair of session on Heritage Speakers, and provided opening

remarks for keynote address at the Hispanic Linguistics Symposium at Indiana University.

2009 Reviewer, Indiana University Working Papers Online. 2001 Volunteer translator for Eastern Shore Rural Health, Inc., as part

of a service-learning program at The College of William & Mary. Performed medical translation for migrant workers who visited the rural health clinics, and also went to migrant camps to give short, educational presentations on basic health practices.

AWARDS AND HONORS 2012 Outstanding graduate student poster award at Current Approaches

to Spanish and Portuguese Second Language Phonology (conference)

2011-2012 Latino Studies Program Dissertation Year Fellowship 2009-2010 Distinguished Teaching Award at Indiana University 2002-2003 Distinguished Teaching Award at Indiana University 2002 R.Merrit Cox Memorial Fellowship at The College of William &

Mary for exceptional undergraduate student pursuing graduate study.

2002 Phi Beta Kappa Society, Alpha Chapter, The College of William & Mary

2002 Golden Key National Honor Society at The College of William & Mary

LANGUAGES Spanish Near-native fluency Italian Intermediate-level speaking, reading, and comprehension German Beginning-level speaking, reading, and comprehension Latin Reading knowledge

Date post:	26-Apr-2023
Category:	Documents
Upload:	ncsu
View:	0 times
Download:	0 times

An Acoustic Analysis of Heritage Spanish Vowels

Documents