Within-Year Retention Among Ph.D. Students: TheEffect of Debt, Assistantships, and Fellowships
Pilar Mendoza • Pedro Villarreal III • Alee Gunderson
Received: 21 March 2013� Springer Science+Business Media New York 2014
Abstract This study employs the 2007–2008 National Postsecondary Student Aid
Study and the National Research Center’s survey data, ‘‘A Data-Based Assessment of
Research-Doctorate Programs in the United States,’’ to investigate the (1) the effects of
debt in relation to tuition and fees paid and (2) the effects of teaching assistantships,
research assistantships, and fellowships on within year retention among Ph.D. students.
We created an innovative conceptual model for this study by merging a socioeconomic
model for graduate students and a graduate student socialization model. We used
propensity score weights for estimating average treatment effects and average treatment
effect on the treated as well as a series of control and balancing variables. This study
provides timely insights into which of these financial strategies are likely to improve
the already low doctoral retention rates nationwide. To the best of our knowledge, this
is the first study that includes proxies of socialization variables in examining the role of
various funding mechanisms in doctoral retention using a national representative
dataset.
Keywords Doctoral student retention � Doctoral assistantships � Fellowships �Graduate debt � Graduate students
P. Mendoza (&)Deparment of Educational Leadership & Policy Analysis, College of Education, University of MissouriColumbia, 202 Hill Hall, Columbia, MO 65211, USAe-mail: [email protected]
P. Villarreal IIICollege of Education, University of Florida, Gainesville, FL, USAe-mail: [email protected]
A. GundersonYouth Development and Agricultural Education, Purdue University, West Lafayette, IN, USAe-mail: [email protected]
123
Res High EducDOI 10.1007/s11162-014-9327-x
Introduction
Graduate education produces the world’s scholars, educators, innovators, and leaders
(Walker et al. 2008). Moreover, graduate and professional degrees are increasingly being
considered the entry-level credential in many professions. In fact, it is projected that about
2.5 million jobs will require either a master’s, doctoral, or professional degree between
2008 and 2018 in the United States (Wendler et al. 2010). However, about 40 % of all
students pursuing a doctorate degree do not finish their programs (Gardner 2009). Lack of
adequate financial resources can discourage students from continuous enrollment thereby
lengthening time to degree and ultimately decreasing the probability of completion (Bowen
and Rudenstine 1992; Kim and Otts 2010; Lovitts 2001; St. John and Andrieu 1995).
In a meta-analysis, Gururaj et al. (2010) found that every form of aid is significant in
promoting graduate student retention, especially aid in the form of grants. Additionally,
financial aid in the form of assistantships that allow students’ involvement through aca-
demic tasks with their advisors and peers have been shown to be critical in graduate
retention (Gardner and Barnes 2007; Kim and Otts 2010). This indicates that in the case of
graduate education, the socialization aspect of assistantships as a form of financial aid has
important implications to retention. Nettles and Millett (2006) found that 44 % of the
doctoral students surveyed in their study were offered research assistantships, 60 %,
teaching assistantships, and 48 % fellowships. Fellowships are considered the ‘‘top prize
because they often cover all student expenses and ordinarily come with no work
requirements. Research and teaching assistantships, however, which often require students
to work with faculty on research projects or instructional activities, can be most valuable
for their associations and the apprenticeships they provide to students in preparation for
professional careers’’ (Nettles and Millett 2006, p. 74). However, time intensive forms of
financial support, such as some teaching assistantships, may impede the degree progress of
doctoral students whereas research assistantships, which align with students’ dissertation
topics, may shorten their time to degree (Kim and Otts 2010).
Nonetheless, financial aid in the form of fellowships and assistantships has not kept pace
with the rising costs of attendance, and so student borrowing has increased, although, these
trends of borrowing vary significantly by field of study and race/ethnicity (Kim and Otts
2010; Hoffer et al. 2006). The Consumer Financial Protection Bureau reported that the
outstanding student loan debt is over $1 trillion, which surpasses the amount owned on all
credit cards in the United States. In 2010–2011, federal loans accounted for 69 % of all aid
given to graduate students, who received $16,423 on average in federal loans (College
Board 2011).
Problem Statement
Assistantships, fellowships, and loans are the main financial means for doctoral students.
While student advocacy groups have become increasingly vocal against loans and the costs
of higher education continue to rise, little academic inquiry has occurred to understand
how the changing mix of these financial resources is influencing doctoral students’ decision
to enroll from fall to spring in graduate school. Previous studies suggest that various forms
of aid have differential effects on doctoral retention; however, none of these studies have
been conducted with a national representative database and with special attention to the
socialization effects of research and teaching assistantships. Therefore, in this study we
integrate findings from previous qualitative work on doctoral socialization through
Res High Educ
123
assistantships and fellowships in relation to retention with quantitative studies on graduate
student debt to investigate the effect of debt, research assistantships, teaching assistantships
and fellowships on within year doctoral retention. We used within year retention given that
the dataset available is cross-sectional and collected data on students enrolled in the
academic year 2007–2008 only.
Research Questions
• What are the effects of debt on within year retention among Ph.D. students?
• What are the effects of teaching assistantships, research assistantships, and fellowships
on within year retention among Ph.D. students?
Dataset, Sample, and Population
The main sources of data employed include the 2007–2008 National Postsecondary Student
Aid Study (NPSAS: 08) and the National Research Center’s survey data (NRC: 06), ‘‘A
Data-Based Assessment of Research-Doctorate Programs in the United States.’’ NPSAS:08
is a nationally representative dataset with postsecondary student level characteristics related
to financial aid, demographics, and academic characteristics meant to address how students
enrolled in 2007–2008 and their families pay for postsecondary education. The NRC data is
based on a departmental level survey conducted in 2005–2006 and ranks departments by
field of study based on measures of research productivity, student outcomes, and diversity. It
contains information at the doctoral departmental level related to the graduate student
experiences as well as the overall productivity, culture, and climate of individual depart-
ments. We merged these two datasets using CIP and IPEDS codes, which allowed us to
obtain variables related to the departmental characteristics for each student that are likely to
impact their graduate experiences. Although the surveys were conducted 2 years apart, we
believe that the departmental variables from NRC are likely to remain the same or minimal
changed with little impact for the purpose of this study because change in higher education
tends to be slow (Braskman and Wergin 1998; Tierney 1992).
After merging these datasets, the population of inference for these analyses became
Ph.D. students enrolled the fall of 2007 in departments ranked by the NRC survey data in
the humanities, social/behavioral sciences, life sciences, math, engineering, computer
science, health sciences and other majors throughout the nation. The sample frame is the
same used in the data collection procedure of NPSAS: 08 (Cominole et al. 2010). Sub-
sequent to matching the datasets, the sample size was (n = 1,198).
Dependent Variable
Based on the definition established by St. John and Andrieu (1995), students who were
positively retained were defined as those who were enrolled in September 2007 and in
February 2008 at the same institution. A significant percentage of students, 90 % of cases,
indicate experiencing within year retention in this data. While some researchers may
question the ability of these analyses to detect an effect due to the limited variability in the
retention outcome, we concluded, consistent with previous research, that 10 % variability
Res High Educ
123
would be sufficient to produce results that generate consistent and precise estimates given
the standard 10 events per variable guideline recommended in the statistical research
literature (Peduzzi et al. 1996).
Treatment Variables of Policy Interest
First, we defined a relative measure of debt as the ratio between cumulative graduate and
undergraduate debt over tuition and fees paid during 2007–2008 (debt-to-price ratio). This
relative measure is a better indication of students’ financial need and willingness to borrow
than using the debt amount alone; it also controls for the price differences across insti-
tutions and fields of study. However, the distribution of this variable was substantially
irregular with a significant proportion (59.4 %) of cases without a positive debt-to-price
ratio. The zero-inflated, Poisson-like distribution of this variable indicated that use of this
variable as a continuous variable in the equation would violate standard assumptions in
future analyses. Consequently, we constructed a dichotomous variable that distinguished
between subjects who indicated no debt relative to costs (entered as 0) and those subjects
who had some debt relative to costs (entered as 1).
In relation to the second research question, we measured the impact of assistantships
and fellowships on the outcome variable through three dummy variables indicating whe-
ther students received primarily a research assistantship, teaching assistantship, or fel-
lowship. After conducting a series of crosstabs, we found that 8.1 % of students in the
dataset have a research and teaching assistantships, 9.0 % a research assistantship and a
fellowship, 9.9 % a teaching assistantship and a fellowship, and 2.2 % have the three of
them. Thus, as explained below, we applied robustness checks by analyzing the models
with and without these cases with multiple assistantships/fellowships. In addition, we
controlled for the dollar amount associated with these assistantships and fellowships as an
indicator of the prevalence of each of these among students with combinations of assis-
tantships and fellowships.
While these crosstabs may indicate that the treatments are not entirely isolated from
each other, the vast majority of subjects will experience only one of these treatments
during the doctoral experiences. To reduce the number of potential outcomes into a
manageable set of dependent variables, we collapsed students with two or all three types of
fellowships and assistantships into three constructs. The operationalization of the outcome
variables resulted in the following three categorical variables: primarily teaching assis-
tantships (25.5 %), primarily research assistantships (36.9 %), and primarily Fellowships
(14.1 %) which were used as dependent variables in the Propensity Score Models and as
independent variables in the main outcome models. We were interested in understanding
the effects of teaching assistantships without the potential influence of the research
assistantship experience. Consequently, those students who reported having both teaching
and research assistantships were listed as primarily research assistantships. This opera-
tionalization will allow the effect of teaching assistantships to be exposed without the
potential confounding that could be attributed to having a research assistantship. For those
Ph.D. fellows who reported having also either or both a teaching and/or research assis-
tantship, we considered several alternatives. Because students who have a fellowship are
likely to have no specific work or other institutional obligations, we wanted to understand
these fellows without the mixed effects that could potentially influence these Ph.D. stu-
dents’ outcomes due to an assistantship. As a result, those students reporting both fel-
lowships and teaching assistantships were combined with those students who reported
Res High Educ
123
primarily teaching assistantships. Likewise, students reporting both fellowships and
research assistantships were combined with the category of students reporting primarily
research assistantships. We decided to combine students who reported both research as-
sistantships and teaching assistantships with the category of primarily research assistant-
ships because we did not want the socialization characteristics particular to the research
assistantships affecting the effects of the models of teaching assistantships. Finally, sub-
jects who reported all three were collapsed into the primarily research assistantship cat-
egory. While this recoding created a substantially greater number of subjects in the
research assistantship group, we reasoned that this operationalization was likely to have a
less deleterious effect on either outcome as explained above.
Finally, almost a quarter (23.5 %) of Ph.D. students in the sample reported not having
an assistantship or fellowship. In most analyses, this group served as the reference category
for interpretation of the treatment effects on retention. While this may seem as an
unusually small proportion of students, the sample is delimited to a select number of
doctoral universities that are nationally ranked by the National Research Council (NRC),
and the sample was also delimited to only those doctoral program areas for which the NRC
compiles a ranking. Given these delimitations, we are confident that this operationalization
of each of the outcome variables would provide the most informative and reliable con-
clusions regarding our research questions of interest.
Covariates Used as Balancing and Control Variables
The covariates included in the models were selected according to each of the six constructs in
the theoretical model developed for this study and the availability of data. We created an
innovative conceptual model for this study by merging the retention model for graduate
students as defined by St. John and Andrieu (1995) and the graduate student socialization
model created by Weidman et al. (2001). By augmenting the socioeconomic framework of
St. John and Andrieu (1995) with the socialization framework created by Weidman et al.
(2001), we were able to include a more detailed account of the factors associated with
doctoral within-year retention, especially in relation to the experience in graduate school,
which has been proven to be a critical factor in doctoral attrition in qualitative work (Gardner
2009; Golde 2000, 2005; Lovitts 2001) and correspond to one of the six constructs of St. John
and Andrieu’s model. In particular, St. John and Andrieu (1995) argue that within year
retention decisions among graduate students are a function of the following six constructs.
Social and Economic Background
A series of studies have demonstrated that the background characteristics of students such
as age, gender, race/ethnicity, SES, marital, dependent, and immigration status impact their
graduate experience and retention (Gardner and Mendoza 2010). As such, the variables
used in this construct account for the individual SES, demographics as well as immigration
and dependency status of students.
Aspirations
The highest degree expected by students influence their level of commitment to the goal of
obtaining a degree (Burton and Ramist 2001; Kim 2007). However, given that all students
in the dataset are pursuing the highest level of educational attainment, we assume that all
Res High Educ
123
students have the same aspiration, to obtain their Ph.D. Therefore there is no need to
control for this construct.
Expected Earnings Upon Graduation
This construct is based on human capital theory and relates to the labor market conditions
upon graduation that influence the cost-benefit analysis to stay enrolled (DesJardins and
McCall 2010). We obtained data from 2009 Survey of Earned Doctorates to create a
variable with the average basic annual salary for doctorate recipients for employment in the
US by field of study.
Prices
A large body of research confirms the impact of prices and subsidies in educational
attainment at all levels (Hossler et al. 2009; Kim and Otts 2010; Nettles and Millett 2006).
We controlled for the cost of attendance using two measures, the tuition and fees amount
paid and other costs besides tuition and fees paid. We used these two variables in the
models addressing the second research questions, but only other costs in the model
addressing the first question, given that the treatment variable in that case already accounts
for tuition and fees.
Price Subsidies
This construct refers to any type of discount students receive in the forms of financial aid
(grants, loans, fellowships, work-study, and waivers) and assistantships. We used two
variables to control for these subsidies, loan-based amount and non-loan based amount,
given that the first one needs to be repaid after graduation. The amount of non-loan based
subsidies was included in all analyses. Graduate loans was not included in the analysis
addressing the first research question given that the associated variable of interest already
deals with the issue of loan debt.
Graduate Experiences
The socialization of doctoral students is determined by the particular circumstances of the
student related to their socialization experiences, institutional climate and culture, disci-
plinary norms, and role of the advisor (Gardner 2009; Lovitts 2001). Likewise, the model
by Weidman et al. (2001) emphasizes the graduate experience in student socialization and
retention. Researchers have asserted that the graduate student experience occurs mainly at
the departmental level (Gardner and Mendoza 2010; Golde 2005; Tinto 1993; Weidman
et al. 2001). These graduate experiences become a series of socialization processes by
which students learn the robes of the profession and become members of a community of
scholars/practice for the rest of their careers upon graduation. In this context, doctoral
retention partially depends on the socialization processes associated with the normative
and structural character of the field of study manifested at the departmental level. Given
that socialization processes are compounded by the specific characteristics of students’
universities and departments, we used variables that as a whole relate the culture, climate,
and interactions of the students’ surroundings. However, a limitation of this study is the
lack of variables that directly measures socialization. At the very least we were able to use
Res High Educ
123
proxies of socialization related to the departmental climate of each student in the dataset.
We believe that merging two datasets to add departmental climate is an important and new
contribution of this study. These departmental characteristics are the ones included in the
NRC ranking of doctoral departments and include size and three dimensions measured by a
number of variables in each category: research activity, students support and outcomes,
and diversity. We also used several variables indicative of the particular experience of each
student such as years in the program, enrollment patterns, distance education, employment
while enrolled, and field of study.
For the main models, we used the dimensional variables constructed from a variety of
measures in order to minimize the number of variables to include in the main models.
However, for calculating the propensity score weights we used the variables listed as
variables used in ‘‘PS Model’’ or ‘‘All’’ provided in Table 1. Years in the program,
enrollment patterns, distance education, employment while enrolled, and field of study
influence how students engage and integrate into their academic, professional and personal
communities. For example, years in the program shape the experiences of doctoral students
before and after the qualifier examination (Tinto 1993). Whether students attend part time
or take distance education courses signals different ways of involvement. Disciplinary
cultures are a major source of socialization for doctoral students, and so, field of study is an
influential factor in doctoral retention (Mendoza 2007). The relationship with advisor has
been documented as a critical aspect of the doctoral experience, socialization, and attrition
(Lovitts 2001). Unfortunately, we do not have data that could account for this construct,
which constitutes a limitation of the study.
According to Malcolm and Dowd (2012) and others there are many factors, both
observable and unobservable, that contribute to a student’s decision to finance their edu-
cation through loans. In addition, a student’s decision to take out a loan for education is not
a random phenomenon; essentially, students self-select themselves into loan programs.
Similarly, according to Nettles and Millett (2006), graduate student finances are complex
involving both tangibles, such as tuition, lost wages, personal income, type of assistant-
ships, and intangibles, such as attitudes toward debt and other psychological approaches to
acquiring and using money. Additionally, selection bias to the different types of assis-
tantships is likely driven by departmental and disciplinary characteristics as well as the
educational background and predispositions of the individual student. To damper self-
selection bias on observables, we employed counterfactual methodologies as explained in
the next section.
Analysis
Stage 1: Data Augmentation, Cleaning, and Transformations
Various data conditioning and transformations were performed on the variables during this
stage to generate more normally distributed data in some variables and to test curvilinear or
nonlinear relationships in other variables. We transformed the age of subjects variable by
taking the inverse function to create a more normally distributed variable of age. Due to
considerations of time and the dimensionality of age as a construct that parallels time, we
also generated squared and cubic forms of the inverse of age to test the likely nonlinear
functional relationships between age and the various response variables included in the
models and equations. We tested various transformations of AGI and found that the square
root of AGI is most normally distributed and would perform best in latter stages of
Res High Educ
123
Table 1 List of variables, data source, and models
Outcome variable Independent variables Source Model
Within year retention:2007–2008
Variables of interest
Cumulative debt/tuition and fees NPSAS Model 1
Primarily research assistantship NPSAS Model 2
Primarily teaching assistantship NPSAS Model 3
Primarily fellowship NPSAS Model 4
Construct Covariates
Social and economicbackground
Gender NPSAS All
Race/Ethnicity NPSAS All
Age NPSAS All
Dependent and civil status NPSAS All
Family size NPSAS All
Disability status NPSAS All
Immigration status NPSAS All
English primary language NPSAS All
Income NPSAS All
Home ownership NPSAS All
Parental education NPSAS All
Expected earnings National average salary for Ph.D.s by field SED All
Prices Cost other than tuition and fees amount NPSAS All
Tuition and fees amount NPSAS M 2, 3 & 4
Prices subsidies Non-loans subsidies NPSAS All
Graduate loans amount NPSAS M 2, 3 & 4
Graduate experiences/institutionalcharacteristics
Private/public NPSAS All
Urban/suburban NPSAS All
% Minority students NPSAS All
Graduate experiences/enrollment
Attendance pattern NPSAS All
Years in the program NPSAS All
Field of study NPSAS All
Took distance ed classes NPSAS All
Graduate experiences/employment
Hours work outside assistantships NPSAS All
Teaching assistantship primarily NPSAS M 1
Research assistantship primarily NPSAS M 1
Graduate experiences/dimensional measuresof departmental culture,climate, andsocialization
Program size quartile NPSAS M 1, 2, 3, 4
95th Percentile research activity NPSAS M 1, 2, 3, 4
95th Percentile student support & outcomes NPSAS M 1, 2, 3, 4
95th percentile diversity NPSAS M 1, 2, 3, 4
Graduate experiences/professoriatecharacteristics andresearch outcomes
Total number of faculty NRC PS Model
% Assistant professors NRC PS Model
% Tenured faculty NRC PS Model
% of Interdisciplinary faculty NRC PS Model
% Non-Asian Minority faculty NRC PS Model
% Female faculty NRC PS Model
Awards per faculty member NRC PS Model
Res High Educ
123
Table 1 continued
Outcome variable Independent variables Source Model
% of Faculty with grants NRC PS Model
Average number of publications per faculty NRC PS Model
Average citations per publication NRC PS Model
Graduate experiences/
student bodycharacteristics
Number of students enrolled NRC PS Model
Average annual first year enrollment NRC PS Model
% Non-asian minority students NRC PS Model
% Female students NRC PS Model
% International students NRC PS Model
Average GRE scores NRC PS Model
Graduate experiences/
student outcomes
Avg. completion percentage NRC PS Model
Median time to degree NRC PS Model
Average number Ph.D.s graduated NRC PS Model
% of Students with academic plans NRC PS Model
Collects data about post-graduation employment NRC PS Model
Graduate experiences/
student financial and
assistantship support
% First year stdts w full financial support NRC PS Model
% First-year stdts w external fellowships NRC PS Model
% Stdts w research assistantships NRC PS Model
% Stdts w teaching assistantships NRC PS Model
% First year stdts w institutional fellowships alone NRC PS Model
% First year stdts w multiple fellowships NRC PS Model
% First year stdts w internal fellowships &assistantships
NRC PS Model
% First year stdts w multiple internal assistantships NRC PS Model
Graduate experience/student support
Student work space provided NRC PS Model
Student health insurance provided NRC PS Model
Regular graduate programs directors/coordinatorsmeetings
NRC PS Model
Annual review of all enrolled doctoral students NRC PS Model
On-campus graduate research conferences NRC PS Model
Travel support to attend professional meetings NRC PS Model
Orientation for new graduate students NRC PS Model
International student orientation NRC PS Model
Language screening/support prior to teaching NRC PS Model
Instruction in writing NRC PS Model
Instruction in statistics NRC PS Model
Organized training to help students improve teachingskills
NRC PS Model
Prizes/awards for teaching or research NRC PS Model
Assistance/training in proposal writing NRC PS Model
Formal training in academic integrity NRC PS Model
Posted academic grievance procedure NRC PS Model
Dispute resolution procedure NRC PS Model
Active graduate student association NRC PS Model
Staff assigned to the graduate student association NRC PS Model
Financial support for the graduate student
association
NRC PS Model
Res High Educ
123
analyses. Additionally, we tested polynomial functions of the variable, Years in the Pro-
gram, which measured the number of years subject had been a doctoral student at the time
of data collection. In testing the curvilinear form of the variable, we found that linear,
quadratic, cubic, and quartic forms of the variable improved the fit of certain models.
Conducting similar tests on the variable percentage minority at university; we transformed
this variable using the logarithm to create a more normally distributed variable. The
relative improvement from variable transformations may have failed to outweigh the
benefit of maintaining the variables in their original over preferred scales, especially to be
able to make clear statements regarding variables of policy interest including debt-to-price
ratio. Thus, we took into consideration the benefit of transformations for each variable
independently.
Stage 2: Missing Data
Only five variables in NPSAS:08 (parent education level, non-tuition and fees costs of
attendance, tuition and fees expenses, percent of minority students, and debt-to-price)
contained missing values ranging from less than 1 % to as much as 5 % of cases per
variable. Missing data in the NRC data ranged from less than 1 % to as much as 37 %.
Variables used in the propensity score stages of analysis included missing values that
ranged from 0 to 37 % of cases; however, variables that were used in the final outcome
models stage of analysis contained missing values that ranged from 0 to 5 % of cases. For
an examination of the percentage of missing values in each variable used in the final
outcome regression models, examine Table 2. Missing data was assumed to be related to
characteristics for which variables have been included in the datasets, which allows for the
imputation of the variables; this condition is referred to as Missing at Random (MAR)
according to Little and Rubin (2002). After evaluating the amount and type of missing
data, we concluded that using multiple imputations would produce the least biased results
for these analyses (Little and Rubin 2002) using Stata’s mi command for multiple impu-
tations (Press 2011) using a series of chained equations for a set of twenty imputations
(M = 20).
We applied multiple imputation techniques available a set of routines within Stata/MP
12 including conditional imputations. Using these procedures gave us the ability to impute
cases as needed and generate variables prior to imputation but impute from their origi-
nating variables prior to data transformations and conditioning. mi estimate, a command in
Stata, applies Rubin’s combination rules when combining results from estimates of the 20
datasets. These analyses may include descriptive statistics, propensity score regression
models, and final outcome regression models. Consistent with Rubin’s combination rules,
the following equation was used:
�Q ¼ 1
m
Xm
i¼1
Q̂ðiÞ;
Q represents the average of the individual results for the 20 imputed datasets, m, summed,
R and divided 1m
by the total number of multiply imputed datasets. Additionally, according
Rubin’s formulas, the estimates of variances were used to arrive at one overall estimate of
the coefficients and their related means, proportions, and odds-ratios. The formula used to
arrive at these estimates of the total variance was:
Res High Educ
123
Table 2 Distributions, means, and standard errors of, and percentage missing data for variables used inanalyses (n = 1,198)
Variable name %Missing
Mean/proportion
Std. Err. Minimum Maximum
Outcome variable
Within year retention 2008 0 0.899 0.009 0 1
Treatment variables
Graduate debt over tuition & fees cont. 3 0.717 0.045 0 18.210
Graduate debt over tuition & fees dummy 3 0.433 0.145 0 1
Research assistantship 0 0.369 0.014 0 1
Teaching assistantship 0 0.255 0.013 0 1
Fellowship 0 0.141 0.010 0 1
No assistantships/fellowships 0 0.235 0.012 0 1
Control and balancing variables
Female 0 0.391 0.014 0 1
Minority 0 0.381 0.014 0 1
Inverse of age 0 0.035 0.000 0.016 0.048
Inverse of age squared 0 0.001 0.000 0.001 0.001
Inverse of age cubed 0 0.000 0.000 0.000 0.000
Year in program 0 3.324 0.054 0 7
Year in program squared 0 14.569 0.408 13.770 15.369
Year in program cubic 0 74.092 2.780 68.637 79.547
Dependency status
Indep., married, no dependents 0 0.172 0.011 0 1
Indep., unmarried/separated, w/depends. 0 0.042 0.006 0 1
Indep., married, w/dependents 0 0.154 0.010 0 1
Family size 0 0.654 0.031 0 8
Subject has a disability 0 0.056 0.007 0 1
Other than english language primary 0 0.356 0.014 0 1
Immigration status
Resident alien 0 0.043 0.006 0 1
Foreign 0 0.299 0.013 0 1
US citizen (reference)
Square root of the adjusted gross income 0 155.271 2.051 0 424.764
Student owns home/pays mortgage 0 0.241 0.012 0 1
Parent’s highest level of education 5 5.739 0.073 1 9
Expected earnings 0 69,520.03 440.891 45,000 95,000
Nontuition expense budget [adjusted] 3 20,002.58 301.620 -9,581 74,123
Tuition and fees paid 3 14,964.48 322.985 -15,266 47,050
Total subsidies [non-loan] 0 21,870.86 482.445 0 81,002
Total loans [excluding parent plus] 0 2,433.611 171.225 0 40,134
Private sector, institution 0 0.321 0.014 0 1
Log of percentage minority at institution \1 2.973 0.013 1.386 4.500
Graduate class level 0 1.894 0.034 0 3
Urbanicity of location
Rural 0 0.059 0.007 0 1
Res High Educ
123
T ¼ �U þ 1þ 1
m
� �B;
where T is the total variance and U is the average within-imputation variance and B rep-
resents the between-imputation variance of the estimate while m is the number of imputed
data sets.
Stage 3: Propensity Score Analysis
During this stage, consistent with recommendations for dealing with selection bias based
on observables (Schneider et al. 2007) we modeled characteristics that influence the
variables of policy interest in separate equations. These models generated scores used in
balancing cases in each treatment condition. We used the following equation to estimate
the treatment selection models for the three types of assistantships:
P WijXi ¼ xið Þ ¼ E Wið Þ ¼eXibi
1þ eXibi¼ 1
1þ e�Xibi;
where P is the probability of being in the treatment group W, given X which is a vector of
conditioning variables, and b is a vector of regression parameters (Guo and Fraser 2010) to be
estimated. Traditional logistic regression models were employed at the propensity score
analysis stage with each of the treatments included as dependent variables in the equations. At
this stage, we used variables that were theoretically assumed to influence the probability each
Ph.D. student to receive the treatment: (1) have a high debt-to-price ratio, (2) be a research
Table 2 continued
Variable name %Missing
Mean/proportion
Std. Err. Minimum Maximum
Suburban 0 0.226 0.012 0 1
Urban (reference)
Attendance intensity
Half-time 0 0.120 0.009 0 1
Less than half-time 0 0.073 0.008 0 1
Full-time (reference)
Graduate majors by disciplinary field
Humanities majors 0 0.109 0.009 0 1
Social and behavioral science majors 0 0.180 0.011 0 1
Math, engineering, and computer sciences 0 0.292 0.013 0 1
Health majors 0 0.023 0.004 0 1
Life sciences (reference)
Distance education courses taken 0 0.059 0.007 0 1
Job Hours/Week [excluding work-study] 0 7.867 0.445 0 80
Program size quartile \1 3.070 0.030 1 4
95th Percentile research activity 0 60.463 1.261 1 236
95th Percentile student support & outcomes 0 81.927 1.243 2 236
95th Percentile diversity 0 77.706 1.248 1 236
Descriptive results for variables used only in the propensity score models stage of the analyses are availablefor request from the authors
Res High Educ
123
assistant, (3) be a teaching assistant, or (4) have earned a fellowship. A full description and
defense of the use of each of predictors for these propensity score models is beyond the scope
of this manuscript. However, the variables included in each of the propensity score models are
subsumed within the following broad categories of variables: graduate experiences and
professional characteristics, student body characteristics, average student outcomes, student
financial supports, and general student support structures. Also included were variables
related to social and economic background as well as institutional characteristics. A complete
list of the conditioning variables used at this stage is included in Table 1.
Stage 4: Propensity Score Weight Creation Stage
We developed propensity score weights for each treatment variable of policy interest that is
used in later regression models as a means of creating a balanced group of subjects in each
of the treatment and untreated categories; this creates a quasi-experimental condition
among the treatment and control conditions. Accordingly, we were interested in estimating
the Average Treatment Effect (ATE) and the Average Treatment Effect on the Treated
(ATT). A number of texts and articles (Morgan and Winship 2007; Guo and Fraser 2010;
Murnane and Willett 2011) have articulated cogent rationales why naive regression models
may under or overestimate the true effect in a causal relationship. This literature is con-
cerned with the process of making causal claims regarding the relationships of interven-
tions on outcomes of interest.
The naive regression model estimates the average difference, typically a mean differ-
ence, between treatment and control groups. This represents the net effect expected after
controlling for other confounding characteristics. However, this regression estimate may
not accurately measure the true effect because the treatment and control groups may be
dissimilar due to selection bias (Shadish et al. 2002), which arises when subjects are not
randomly assigned to participate in the treatment and control groups. To damper selection
bias, we estimate an ATE, which uses one of several techniques including propensity score
analysis to balance the cases on characteristics assumed to influence selection into the
treatment and control groups. The result provides an estimate on the effect or benefit of the
program or policy intervention, on average, for all people. However, it may be of more
substantive interest to examine whether the program or policy intervention has an effect on
only those who participate in the program or policy intervention, or in essence, on those
who self-select or are assigned by some external process to only the treatment condition. In
this case, we estimated the ATT treatment effect.
In estimating the two treatment effects we used the following two formulas commonly
applied in the research literature (Guo and Fraser 2010). The formula used in calculating
the propensity score weight for estimating the ATE is 1P
for treated participants and 1ð1�PÞ for
the control participants, where P is the probability of being in the treated group. Alter-
natively, the formula used in calculating the propensity score weight for estimating the
ATT is 1 for the treated participants and P1�P
for the controls participants.
Stage 5: Sensitivity Analyses Using Checks for Imbalance
Following recommendations offered in Guo and Fraser (2010), we checked for balance in
the covariates conducting simple weighted logistic regression models with the treatment
variable included as the sole predictor while each of the predictors originally used in the
propensity score models entered as the outcome variable in separate models. Tests of
Res High Educ
123
significance were used to assess whether the treatment and control groups were balanced
on the predictors included in the propensity score models. Assuming that the predictors
used in the propensity score models represent the selection process well, a balanced
propensity score weight is preferable. Checks for balance for the four treatment variables in
the ATE and ATT conditions indicated that the weights for ATE had no propensity score
model covariate imbalance for the teaching assistantship treatment variable. The same
check found one variable and two variables to be imbalanced for the fellowships treatment
variable and the research assistantship treatment variable respectively; however, the
imbalance in one or two variables is ignorable (Hahs-Vaughn and Onwuegbuzie 2006),
thus, we proceeded to use these propensity score weights in the outcome regression
analyses. The check for balance in the debt-to-price treatment variable resulted in 26
variables being imbalanced in the weight. Checks for balance in the ATT condition found
no covariate imbalance with the teaching assistantships, fellowships, and debt-to-price
propensity score weights. However, the check of balance for the research assistantship
treatment variable resulted in 7 variables being imbalanced in the ATT propensity score
weight. These checks for balance indicated that we can proceed with main outcome models
for each of the treatment variables except for the debt-to-price treatment under the ATE
and the research assistantship variable under the ATT conditions. See Table 3 to examine
the results of checks for balance in the treatment variables.
Stage 6: Main Outcome Models Stage
We balanced doctoral students on characteristics germane to decisions regarding the
variables of policy interest. Then, we controlled for systematic differences between those
that persist and those who do not persist in a doubly robust analysis approach (Schafer and
Kang 2008). The rationale for using the doubly robust approach rather than solely pro-
pensity score matching, matching, or regression is that only one of the models (propensity
score model or outcome regression model) needs to be properly specified for the ATE or
ATT models to accurately estimate the true treatment effect. We estimated the outcome
models using a weighted logistic regression model in the final stage to estimate the ATE
and ATT for the various treatments studied. We also estimated traditional regression
models, referred to as naive regression models, which do not balance the data using
propensity scores. Rather, these naive regression models use controls as the means of
balancing the data. The following equation represents the weighted logistic regression
model used in estimating Naive Regression Models, ATE Models, and ATT Models:
Table 3 Checks for covariate imbalance in the treatment variables by treatment effects
Treatment variable Number of variables remaining imbalanced in treatment effects
ATE ATT
Debt-to-price ratio 26 0
Research assistantships 2 7
Teaching assistantships 0 0
Fellowships 1 0
A value of ‘‘0’’ reflects balance. A value [ ‘‘0’’ represents the number of covariates that remain imbalancedafter creating and testing the propensity score weights
The ATE weight variable for debt-to-price remains significantly imbalanced
The ATT weight variable for research Assistantships remains significantly imbalanced
Res High Educ
123
P Yi ¼ 1jXi;Kið Þ ¼ pi ¼eðboþb1Xiþb2X2iþ���þbkXkiÞ
1þ eðboþb1Xiþb2X2iþ���þbkXkiÞ ;
where Y is the probability that studenti will re-enroll within the 2007–2008 academic year
while Xi represents a vector of covariates included in the equation and e represents taking
the exponential function of the cross product of the variables and estimated coefficients
biX. The Ki represents the treatment variables of interest (assistantships, fellowships, and
debt-to-price ratio).
Stage 7: Additional Robustness Checks
We conducted additional robustness checks by specifying the assistantships and fellow-
ships simultaneously and independently. In the outcome modeling results, we transformed
the debt-to-price treatment variable into a dichotomous variable where those who report a
positive ratio value on debt-to-price are compared as a group to those having no debt.
Additionally, we generated reduced models (eliminating non-significant variables from the
equations) comparing the full saturated, theoretically derived model results reported in the
appendices against the reduced (statistically derived) models. These robustness model
results are available from the authors upon request. Testing these variations in the final
models did not change the substance of the conversation provided below.
Results
Before presenting the results of the analyses, we highlight informative trends in the
variables of interest found in the dataset as useful background information related to the
research questions and interpretation of results as well as suggestions for future inquiry.
Table 2 presents the distributions, means, standard errors, and percentage of cases with
missing data.
Descriptive Results
The mean distribution of research assistantships, teaching assistantships and fellowships
varies across fields. In particular, the smallest proportion of teaching assistantships was
found in the health sciences (11 %) and the highest in the social/behavioral sciences
(40 %). Students in the humanities had the smallest share of research assistantships (10 %)
while student in math/engineering/computer science had the largest proportion (53 %). The
variation of proportions of fellowships across fields hovered in the teens except for two
fields, math/engineering/computer science (9 %) and life sciences (24 %). These variations
are indicative of the funding patterns of doctoral education. Those fields with higher
proportions of fellowships and research assistantships are likely to enjoy of higher levels of
external funding while those with high proportions of teaching assistantships depend more
on the labor of graduate students for teaching.
International students, mainly from Asia, have a significant presence in American
doctoral education in certain STEM fields (National Science Board 2012), which ulti-
mately represents inexpensive labor for the research and development capacity of the US
(Slaughter et al. 2002). In particular, in the dataset of this study, math/engineering/com-
puter science had the largest proportion of foreign students of all fields (54.8 %) with
research assistantships. Also, all international students with research assistantships in
Res High Educ
123
health sciences were Asian. Research assistantships in the humanities were held by 46.2 %
of international students whereas the proportion of international students with research
assistantships in the social/behavioral sciences was the smallest of all fields (20.8 %)
followed by the life sciences (24.1 %).
Different patterns were found among teaching assistantship holders. In the humanities
and social/behavioral sciences, most of the teaching assistantships were held by domestic
students (85.1 and 76.5 % respectively). In the health sciences, 11.1 % of students had a
teaching assistantship, all of whom were females, both domestic and foreign. This high-
lights the presence of gender gaps commonly found in doctoral education. The life sciences
had the smallest proportion of foreign students with teaching assistantships (15.2 %).
Nevertheless, we found that the largest concentration of foreign students with teaching
assistantships was in math/engineering/computer science (55.9 %). In the math/engineer-
ing/computer science, humanities, life sciences, and social/behavioral sciences most of the
fellowships were held by domestic students (59.6, 90, 82.5 and 76.1 % respectively).
Fellowships were nonexistent in the health sciences among international students. Fel-
lowships are normally granted to US citizens only, which explains these disparities.
Patterns related to debt were also found across fields and demographics of students.
About 43 % of students in the dataset had a student debt higher than the price paid in
tuition and fees. For example, the proportion of students with a debt higher than the tuition
and fees paid (debt-to-price ratio) in math/engineering/computer science was 26 % com-
pared to 69 % in the health sciences, 57 % in the humanities, 54 % in the social behavioral
sciences, 45 % in the life sciences. In all cases, the majority of students with a positive debt
ratio in relation to tuition and fees paid were White domestic students, which correlates
with the notion of higher debt aversion among minority students and the associated issues
around access and equity (Dowd 2008).
Complex and acute gender gap variations in all groups of students, national and
international, were found across all the variables of interest and dependent variables,
sometimes favoring males and at other times favoring females. We believe that these
variations require careful analysis in future investigations. These variations are indicative
of the different types of experiences students obtained across disciplines, which are ulti-
mately connected with the norms and expectations of each field, as well as issues of
retention.
Model Results
The naive regression models are based on traditional regression techniques under the
assumptions that predicate the use of logistic regression models on multiply imputed data.
These models report the expected average effect for a typical Ph.D. student in the sample.
The models do this by controlling for variables that are theoretically assumed to influence
the outcome, and is achieved only if, the assumptions for its use are met including but not
limited to the following: the variables included are measured without error, no collinearity
exists between covariates, and no important variables are excluded from the outcome
model (Long 1997; Little and Rubin 2002). Even if all assumptions underlying the use of
logistic regression are met, these models may under- or overestimate the true effects of the
treatment variable on the outcome if there are characteristics that make treatment partic-
ipants different from treatment nonparticipants; this concept is referred to, in the coun-
terfactual and propensity score literature, as selection bias (Morgan and Winship 2007;
Schneider et al. 2007).
Res High Educ
123
To account for selection bias or the potential threat to validity through selection, we
implement propensity score techniques. The results of Average Treatment Effects (ATE)
models ameliorate this potential threat of selection on observables by balancing the
treatment and non-treatment groups on characteristics that are thought to influence
selection into the treatment prior to fitting the final outcome models. In fact, this technique
is designed to create functionally equivalent groups between the treatment and non-
treatment participants within the sample. In simpler terms, these models report the average
expected effect for the treatment variable on the outcome among all doctoral students
included in the sample. Stated differently, it indicates what would have been the effect, on
average, if all Ph.D. students would have been offered that particular treatment. However,
there might still be effects due to unobservable, and so, bias might still be present, even
when using counterfactual models.
Finally, Average Treatment Effect on the Treated models examine Ph.D. students who
have received the treatment against those who would have received the treatment but
refused the treatment. It is conceivable to have Ph.D. students rejecting fellowships or
assistantships for a variety of reasons, such as desire to work with a research group, have
teaching experience, and prestige associated with certain fellowships, or simply better
salary and benefits. This is referred to in the counterfactual and propensity score literature
as the problem of compliers and non-compliers. To investigate this effect, the Average
Treatment Effect on the Treated modeling approach reduces the sample to those subjects
that received the treatment and those who would have received but did not receive the
treatment. Thus, the results of these models indicate the average expected effect on one-
year retention for treatment participants. Stated differently, this effect indicates, of those
who participated in the treatment and those who should have participated, what is their
probability in experiencing one-year retention. Unlike the ATE and naive regression model
results, the ATT results are no longer concerned with an overall average effect across the
sample or with the typical Ph.D. student, but for only those students who received or would
have received the fellowship or assistantships.
Teaching Assistantships
The results for the logistic naive regression models indicate that teaching assistantships are
positively related to within-year retention outcomes (see Table 4). A typical Ph.D. student
included in the sample has an increased probability of being retained than that of a Ph.D.
student who does not receive an assistantship or fellowship. This result indicates that
teaching assistantships in this sample increase the within-year retention of students relative
to students who do not have assistantships or fellowships. Controlling for selection bias,
the Average Treatment Effects model indicates the same directional and a similar statis-
tical significance level effect for teaching assistantships on within-year retention rates.
While the logistic naive regression model slightly underestimates the effect compared to
the Average Treatment Effect model, in this case, both models render the same conclusion:
Ph.D. students who participate in teaching assistantships are more likely to be retained
within an academic year. When examining the Average Treatment Effect on the Treated
model, teaching assistants also have a similar directional and significance level effect as
the naive regression and Average Treatment Effects models. In this model, the comparison
is between teaching assistants and students who were not teaching assistants but possess all
the characteristics that would exist in a typical teaching assistant in this sample. Among
this subsample, teaching assistantships are important to Ph.D. students’ within-year
retention. In fact, this result suggests that teaching assistants were more likely to persist
Res High Educ
123
Tab
le4
Eff
ects
of
Tea
chin
gA
ssis
tants
hip
sT
reat
men
ton
Wit
hin
-yea
rR
eten
tion:
Log-o
dds
and
Robust
Sta
ndar
dE
rrors
(n=
1,1
98
)
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Tre
atm
ent
var
iab
le
Tea
chin
gas
sist
ants
hip
s1
.27
0[0
.43
]**
1.3
11
[0.4
3]*
*1
.068
[0.5
3]*
Res
earc
has
sist
ants
hip
s0
.59
8[0
.43
]0
.72
8[0
.54
]-
0.0
18
[0.6
7]
Fel
low
ship
s0.5
82
[0.4
7]
0.7
50
[0.5
3]
0.6
63
[0.7
1]
So
cial
and
econ
om
icb
ack
gro
un
d
Fem
ale
0.2
85
[0.2
8]
0.2
37
[0.3
0]
0.0
38
[0.3
8]
Min
ori
ty-
0.0
43
[0.3
0]
-0
.097
[0.3
6]
-0
.40
5[0
.48
]
Inv
erse
of
age
41
0.7
0[8
50
.69
]-
33
5.8
21
[97
9.3
5]
51
7.7
50
[1,2
08.8
7]
Inv
erse
of
age
qu
adra
tic
-1
3,2
16
.96
0[2
7,8
44
.84]
13
,76
0.9
40
[32
,533
.87
]-
11
,33
6.3
80
[39
,938
.80
]
Inv
erse
of
age
cub
ic1
54
,29
4.5
00
[29
5,2
88
.40
]-
14
5,7
08
.10
0[3
49
,65
9.3
0]
99
,39
3.9
50
[42
6,4
74
.20
]
Yea
rin
pro
gra
m-
0.2
72
[0.1
2]*
-0
.141
[0.1
4]
2.5
68
[0.9
9]*
*
Yea
rin
pro
gra
mq
uad
rati
cN
/AN
/AN
/AN
/A-
0.7
70
[0.2
8]*
*
Yea
rin
pro
gra
mcu
bic
N/A
N/A
N/A
N/A
0.0
63
[0.0
2]*
Ind
ep.,
mar
ried
,n
od
epen
den
ts-
0.3
39
[0.5
0]
-0
.508
[0.5
5]
-0
.13
5[0
.70
]
Indep
.,unm
arri
ed/s
epar
ated
,w
/dep
ends
-0
.448
[0.8
7]
-1
.180
[0.9
0]
-0
.86
5[1
.13
]
Ind
ep.,
mar
ried
,w
/dep
end
ents
-0
.626
[1.0
7]
-2
.307
[1.0
4]*
-1
.07
6[1
.34
]
Fam
ily
size
0.0
60
[0.4
0]
0.1
41
[0.4
0]*
*0
.744
[0.4
9]
Su
bje
cth
asa
dis
abil
ity
1.7
52
[0.7
9]*
2.0
71
[0.7
2]*
*4
.757
[1.2
8]*
**
Res
iden
tal
ien
-0
.060
[0.4
8]
0.0
46
[0.5
6]
-0
.69
3[0
.69
]
Fo
reig
n-
0.1
25
[0.3
8]
-0
.524
[0.5
3]
0.1
39
[0.6
3]
Oth
erth
anen
gli
shla
ngu
age
pri
mar
y-
0.4
03
[0.3
5]
-0
.426
[0.4
4]
-0
.36
3[0
.59
]
Sq
uar
ero
ot
of
the
adju
sted
gro
ssin
com
e-
0.0
01
[0.0
0]
-0
.004
[0.0
0]#
-0
.00
6[0
.00
]*
Stu
den
to
wn
sh
om
e/p
ays
mort
gag
e-
0.0
67
[0.3
2]
0.1
37
[0.3
3]
0.7
17
[0.4
3]#
Par
ent’
sh
igh
est
lev
elo
fed
uca
tio
n-
0.0
29
[0.0
5]
-0
.040
[0.0
6]
0.0
24
[0.0
8]
Res High Educ
123
Tab
le4
con
tin
ued
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Ex
pec
ted
earn
ings
afte
rg
rad
uat
ion
Nat
ion
alav
erag
esa
lary
by
fiel
do
fst
ud
y-
0.0
00
[0.0
0]
-0
.000
[0.0
0]
-0
.00
0[0
.00
]
Pri
ces
No
ntu
itio
nex
pen
seb
ud
get
[adju
sted
]0
.00
0[0
.00
]**
*0
.00
0[0
.00
]**
*0
.000
[0.0
0]*
**
Tu
itio
nan
dfe
esp
aid
0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
Pri
cesu
bsi
die
s
To
tal
sub
sid
ies
[no
n-l
oan
]-
0.0
00
[0.0
0]*
**
-0
.000
[0.0
0]*
**
-0
.00
0[0
.00
]**
*
To
tal
loan
s[e
xcl
ud
ing
par
ent
plu
s]-
0.0
00
[0.0
0]
-0
.000
[0.0
0]
-0
.00
0[0
.00
]
Gra
duat
eex
per
ience
s/in
stit
uti
onal
char
acte
rist
ics
Pri
vat
ese
cto
r,in
stit
uti
on
-0
.395
[0.3
8]
-0
.037
[0.4
8]
-0
.24
9[0
.59
]
Rura
l-
0.4
18
[0.4
4]
-0
.762
[0.4
9]
-0
.00
9[0
.66
]
Su
bu
rban
-0
.155
[0.3
1]
-0
.532
[0.3
6]
-0
.11
7[0
.42
]
Log
of
per
centa
ge
min
ori
tyat
inst
ituti
on
-0
.229
[0.2
8]
-0
.410
[0.3
2]
-0
.24
7[0
.37
]
Gra
duat
eex
per
ience
s/en
roll
men
t
Hal
f-ti
me
0.5
36
[0.3
5]
0.3
23
[0.4
2]
0.6
24
[0.6
3]
Les
sth
anh
alf-
tim
e1
.02
3[0
.43
]*0
.73
1[0
.45
]1
.021
[0.6
9]
Gra
du
ate
clas
sle
vel
0.3
17
[0.2
2]
-0
.004
[0.2
6]
-0
.08
1[0
.46
]
Hu
man
itie
sm
ajo
rs0
.97
2[0
.49
]*1
.11
4[0
.56
]*1
.734
[0.6
4]*
*
So
cial
and
beh
avio
ral
scie
nce
maj
ors
0.8
10
[0.4
3]#
1.1
26
[0.5
9]#
0.9
65
[0.6
2]
Mat
h,
eng
inee
rin
g,
and
com
pu
ter
scie
nce
s0
.60
2[0
.47
]1
.51
5[0
.57
]**
*1
.057
[0.6
6]
Hea
lth
maj
ors
-0
.034
[0.7
1]
0.8
32
[0.7
6]
1.1
42
[0.9
2]
Oth
erm
ajo
rs0
.20
1[0
.42
]0
.97
1[0
.55
]#0
.753
[0.5
9]
Dis
tan
ceed
uca
tio
nco
urs
esta
ken
-0
.315
[0.4
2]
-0
.964
[0.5
1]#
-0
.51
6[0
.65
]
Res High Educ
123
Ta
ble
4co
nti
nu
ed
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Gra
du
ate
exp
erie
nce
s/em
plo
ym
ent
Job
ho
urs
/wee
k[e
xcl
udin
gw
ork
-stu
dy
]0
.005
[0.0
1]
0.0
14
[0.0
1]
0.0
05
[0.0
1]
Dep
artm
enta
lcu
lture
,cl
imat
e,an
dso
cial
izat
ion
Pro
gra
msi
zeq
uar
tile
-0
.36
7[0
.15
]*-
0.4
86
[0.0
1]*
**
-0
.507
[0.2
1]*
95
thP
erce
nti
lere
sear
chac
tivit
y-
0.0
11
[0.0
0]*
*-
0.0
13
[0.1
5]*
*-
0.0
17
[0.0
1]*
*
95
thP
erce
nti
lest
ud
ent
sup
po
rt&
ou
tco
mes
0.0
15
[0.0
0]*
**
0.0
17
[0.0
0]*
**
0.0
17
[0.0
1]*
*
95
thP
erce
nti
led
iver
sity
-0
.00
2[0
.00
]-
0.0
01
[0.0
0]
0.0
02
[0.0
1]
Co
nst
ant
-3
.77
0[8
.54
]4
.152
[9.7
3]
-9
.207
[11
.89]
#p\
0.1
0;
*p\
0.0
5;
**
p\
0.0
1;
**
*p\
0.0
01
Res High Educ
123
than students who were offered and did not complete their teaching assistantships or did
not accept teaching assistantships altogether.
Research Assistantships
The logistic naive regression models indicate that while there appears to be a positive
relationship between research assistantships and within-year retention among Ph.D. stu-
dents, this result was statistically non-significant (see Table 5). While Ph.D. students who
have a research assistantship have a higher probability of being retained in comparison to
Ph.D. students who do not have either form of fellowship or assistantships, the general
relationship is not significantly different from the null. Controlling for selection bias, the
Average Treatment Effect model indicates that research assistants are slightly more likely
to experience within-year retention, although it remains statistically non-significant. This
particular result does indicate that the naive logistic regression model does slightly
overestimate the relationship. The two models do suggest that research assistants on
average do not appear to be retained at significant levels greater than their peers. While it is
likely that the within-year retention outcome during any academic year in a graduate
student’s career is not likely to be an important outcome for Ph.D. research assistants, it
remains important to continue to monitor and understand any potential changes to these
trends that may merit further investigation.
Fellowships
The naive logistic regression model results indicate there is a slightly positive though non-
significant relationship between fellowships and within-year retention (see Table 6). This
non-significant finding at first glance is unexpected. It was expected that Ph.D. fellows
would have a higher probability of retention than their Ph.D. peers who do not have either a
fellowship or assistantship. Controlling for selection bias, however, changes the effect of
fellowships from non-significance to marginal significance. This finding indicates that the
naive logistic regression model grossly underestimates the relationship. This finding
illustrates the importance of applying propensity score methods in higher education
research. Whereas the naive logistic regression model was not able to uncover the rela-
tionship, the Average Treatment Effect model results indicated that for functionally
equivalent groups, Ph.D. students on average are likely to have a higher probability of
within-year retention if they are on a fellowship. This finding is more consistent with
general conclusions that having a fellowship or assistantship improves the odds of reten-
tion. Additionally, the Average Treatment Effect on the Treated, which compares Ph.D.
Fellows with Ph.D. students who should have received a Fellowship but did not, indicates
that the mere act of participating in a Fellowship does increase the likelihood of experi-
encing within-year retention.
Debt-to-Price Ratio
The logistic naive regression model results indicate that there is no significant relationship
between the debt-to-price ratio, which represents how much indebtedness in relation to
tuition and fees paid a Ph.D. student is carrying on within-year retention (see Table 7).
After preforming checks for imbalance, we found that there was a sizable imbalance in the
debt-to-price ratio for both low debt and high debt groups; therefore, we could not proceed
Res High Educ
123
Ta
ble
5E
ffec
tso
fre
sear
chas
sist
ants
hip
str
eatm
ent
var
iab
leo
nw
ith
in-y
ear
rete
nti
on
:lo
g-o
dd
san
dro
bu
stst
andar
der
rors
(n=
1,1
98
)
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Tre
atm
ent
var
iab
le
Res
earc
has
sist
ants
hip
s0
.598
[0.4
3]
0.5
37
[0.4
5]
No
tA
pp
lica
ble
Tea
chin
gas
sist
ants
hip
s1
.270
[0.4
3]*
*1
.427
[0.5
1]*
*
Fel
low
ship
s0582
[0.4
7]
0.2
58
[0.5
4]
So
cial
and
eco
no
mic
bac
kg
roun
d
Fem
ale
0.2
85
[0.2
8]
0.6
87
[0.3
4]
Min
ori
ty-
0.0
43
[0.2
9]
-0
.386
[0.3
7]
Inv
erse
of
age
41
0.7
03
[85
0.6
9]
1,5
08
.96
7[1
,00
1.4
3]
Inv
erse
of
age
qu
adra
tic
-1
3,2
16
.96
0[2
7,8
44
.84]
-5
0,3
65
.20
0[3
3,4
25
.14
]
Inv
erse
of
age
cub
ic1
54
,29
4.5
00
[29
5,2
88
.40
]5
56
,35
8.1
00
[36
2,3
32
.20
]
Yea
rin
pro
gra
m-
0.2
72
[0.1
2]*
-0
.348
[0.1
5]*
Ind
ep.,
mar
ried
,n
od
epen
den
ts-
0.3
39
[0.5
0]
-0
.119
[0.6
2]
Ind
ep.,
un
mar
ried
/sep
arat
ed,
w/d
epen
ds.
-0
.44
8[0
.87
]-
0.4
44
[1.1
4]
Ind
ep.,
mar
ried
,w
/dep
end
ents
-0
.62
6[1
.07
]-
0.0
66
[1.2
9]
Fam
ily
size
0.5
61
[0.4
0]
0.6
32
[0.5
1]
Su
bje
cth
asa
dis
abil
ity
1.7
52
[0.7
9]*
2.0
69
[0.7
5]*
*
Res
iden
tal
ien
-0
.06
0[0
.48
]-
0.0
64
[0.5
2]
Fo
reig
n-
0.1
25
[0.3
8]
-0
.300
[0.4
7]
Oth
erth
anen
gli
shla
ng
uag
ep
rim
ary
-0
.40
3[0
.35
]-
0.3
46
[0.4
4]
Sq
uar
ero
ot
of
the
adju
sted
gro
ssin
com
e-
0.0
01
[0.0
02]
-0
.005
[0.0
0]*
Stu
den
to
wn
sh
om
e/pay
sm
ort
gag
e-
0.0
67
[0.3
2]
0.2
81
[0.3
5]
Par
ent’
shig
hes
tle
vel
of
educa
tion
-0
.02
9[0
.05
]-
0.0
05
[0.0
7]
Ex
pec
ted
earn
ing
saf
ter
gra
du
atio
n
Nat
ional
aver
age
sala
ryby
fiel
do
fst
udy
-0
.00
0[0
.00
]0
.000
[0.0
0]
Res High Educ
123
Ta
ble
5co
nti
nu
ed
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Pri
ces
Nontu
itio
nex
pen
sebudget
[adju
sted
]0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
Tu
itio
nan
dfe
esp
aid
0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
Pri
cesu
bsi
die
s
To
tal
sub
sid
ies
[no
n-l
oan
]-
0.0
00
[0.0
0]*
**
-0
.000
[0.0
0]*
**
To
tal
Lo
ans
[Ex
clu
din
gP
aren
tP
lus]
-0
.00
0[0
.00
]-
0.0
00
[0.0
0]#
Gra
duat
eex
per
ience
s/in
stit
uti
onal
char
acte
rist
ics
Pri
vat
ese
cto
r,in
stit
uti
on
-0
.39
5[0
.38
]0
.180
[0.4
5]
Ru
ral
-0
.41
8[0
.44
]-
0.4
25
[0.5
2]
Su
bu
rban
-0
.15
5[0
.31
]-
0.0
69
[0.3
5]
Log
of
per
centa
ge
min
ori
tyat
inst
ituti
on
-0
.22
9[0
.28
]-
0.1
83
[0.3
3]
Gra
du
ate
exp
erie
nce
s/en
roll
men
t
Hal
f-ti
me
0.5
36
[0.3
5]
0.2
01
[0.4
8]
Les
sth
anh
alf-
tim
e1
.023
[0.4
3]*
1.4
62
[0.5
6]*
*
Gra
du
ate
clas
sle
vel
0.3
17
[0.2
2]
0.4
03
[0.2
7]
Hum
anit
ies
maj
ors
0.9
72
[0.4
9]*
1.7
44
[0.6
0]*
*
So
cial
and
beh
avio
ral
scie
nce
maj
ors
0.8
10
[0.4
2]
0.3
48
[0.5
8]
Mat
h,
eng
inee
rin
g,
and
com
pu
ter
scie
nce
s0
.602
[0.4
7]
0.6
22
[0.5
6]
Hea
lth
maj
ors
-0
.03
4[0
.71
]0
.235
[0.8
5]
Oth
erm
ajo
rs0
.602
[0.4
2]
-0
.023
[0.5
8]
Dis
tance
educa
tion
cours
esta
ken
-0
.20
1[0
.42
]-
0.2
62
[0.6
0]
Gra
duat
eex
per
ience
s/em
plo
ym
ent
Job
ho
urs
/wee
k[e
xcl
udin
gw
ork
-stu
dy
]0
.005
[0.0
1]
0.0
09
[0.0
1]
Dep
artm
enta
lcu
lture
,cl
imat
e,an
dso
cial
izat
ion
Res High Educ
123
Ta
ble
5co
nti
nu
ed
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Pro
gra
msi
zeq
uar
tile
-0
.03
7[0
.15
]*-
0.3
15
[0.1
6]*
95
thP
erce
nti
lere
sear
chac
tiv
ity
-0
.01
1[0
.00
]**
-0
.012
[0.0
0]*
*
95
thP
erce
nti
lest
ud
ent
sup
po
rt&
ou
tco
mes
0.0
15
[0.0
0]*
**
0.0
16
[0.0
0]*
**
95
thP
erce
nti
led
iver
sity
-0
.00
2[0
.00
]-
0.0
01
[0.0
0]
Const
ant
-3
.77
0[8
.54
]-
15
.83
9[9
.94
]
#p\
0.1
0;
*p\
0.0
5;
**
p\
0.0
1;
**
*p\
0.0
01
Res High Educ
123
Tab
le6
Eff
ects
of
fell
ow
ship
str
eatm
ent
var
iab
leo
nw
ith
in-y
ear
rete
nti
on
:lo
g-o
dd
san
dro
bu
stst
and
ard
erro
rs(n
=1
,198
)
Var
iab
les
Nai
ve
reg
ress
ion
AT
Ere
gre
ssio
nA
TT
reg
ress
ion
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Tre
atm
ent
var
iab
le
Fel
low
ship
s0.5
82
[0.4
7]
0.9
12
[0.4
8]#
1.3
56
[0.7
8]#
Tea
chin
gas
sist
ants
hip
s1.2
70
[0.4
3]*
*1.8
94
[0.5
8]*
**
2.0
412.0
41
[0.9
2]*
Res
earc
has
sist
ants
hip
s0
.598
[0.4
3]
1.2
48
[0.5
2]
1.3
42
[0.8
3]
So
cial
and
eco
no
mic
bac
kg
roun
d
Fem
ale
0.2
85
[0.2
8]
-0
.267
[0.3
9]
-0
.73
8[0
.49
]
Min
ori
ty-
0.0
43
[0.2
9]
0.2
21
[0.3
3]
0.1
70
[0.5
2]
Inv
erse
of
age
41
0.7
03
[85
0.6
9]
1,6
54
.485
[1,1
26.7
0]
92
6.4
9[2
,19
3.1
6]
Inv
erse
of
age
qu
adra
tic
-1
3,2
16
.96
0[2
7,8
44
.84]
-5
6,6
09
.10
0[3
6,8
39
.59
]-
39
,66
4.3
50
[68
,82
0.0
8]
Inv
erse
of
age
cub
ic1
54
,29
4.5
00
[29
5,2
88
.40
]6
45
,12
1.8
00
[39
1,4
41
.40
]#5
29
,33
6.9
00
[70
3,5
81
.90
]
Yea
rin
pro
gra
m-
0.2
72
[0.1
2]*
-0
.466
[0.1
6]*
*-
0.8
05
[0.2
4]*
**
Ind
ep.,
mar
ried
,n
od
epen
den
ts-
0.3
39
[0.5
0]
0.5
12
[0.5
5]
1.1
45
[0.9
6]
Ind
ep.,
un
mar
ried
/sep
arat
ed,
w/d
epen
ds.
-0
.448
[0.8
7]
1.1
73
[1.2
7]
0.7
87
[1.9
0]
Ind
ep.,
mar
ried
,w
/dep
end
ents
-0
.626
[1.0
7]
0.6
78
[1.0
6]
2.2
80
[2.0
8]
Fam
ily
size
0.5
61
[0.4
0]
-0
.049
[0.3
7]
-0
.64
5[0
.74
]
Su
bje
cth
asa
dis
abil
ity
1.7
52
[0.7
9]*
2.2
70
[0.9
0]*
5.2
85
[1.5
1]*
**
Res
iden
tal
ien
-0
.060
[0.4
8]
0.7
00
[0.6
5]
-0
.20
3[1
.27
]
Fo
reig
n-
0.1
25
[0.3
8]
-0
.075
[0.4
4]
-0
.20
5[0
.64
]
Oth
erth
anen
gli
shla
ngu
age
pri
mar
y-
0.4
03
[0.3
5]
-0
.743
[0.4
3]#
-0
.38
0[0
.72
]
Sq
uar
ero
ot
of
the
adju
sted
gro
ssin
com
e-
0.0
01
[0.0
0]
-0
.001
[0.0
0]
0.0
01
[0.0
0]
Stu
den
to
wn
sh
om
e/pay
sm
ort
gag
e-
0.0
67
[0.3
2]
0.3
18
[0.3
8]
0.9
89
[0.7
6]
Par
ent’
sh
igh
est
lev
elo
fed
uca
tio
n0
.029
[0.0
5]
-0
.063
[0.0
6]
-0
.03
2[0
.11
]
Ex
pec
ted
earn
ing
saf
ter
gra
du
atio
n
Nat
ion
alav
erag
esa
lary
by
fiel
do
fst
ud
y-
0.0
00
[0.0
0]
-0
.000
[0.0
0]
-0
.00
0[0
.00
]#
Res High Educ
123
Tab
le6
con
tin
ued
Var
iab
les
Nai
ve
reg
ress
ion
AT
Ere
gre
ssio
nA
TT
reg
ress
ion
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Pri
ces
Nontu
itio
nex
pen
sebudget
[adju
sted
]0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
*0.0
00
[0.0
0]#
Tu
itio
nan
dfe
esp
aid
0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
0.0
00
[0.0
0]*
**
Pri
cesu
bsi
die
s
To
tal
sub
sid
ies
[no
n-l
oan
]-
0.0
00
[0.0
0]*
**
-0
.000
[0.0
0]*
**
-0
.00
0[0
.00
]*
To
tal
loan
s[e
xcl
udin
gp
aren
tp
lus]
-0
.000
[0.0
0]
-0
.000
[0.0
0]
0.0
00
[0.0
0]
Gra
duat
eex
per
ience
s/in
stit
uti
onal
char
acte
rist
ics
Pri
vat
ese
cto
r,in
stit
uti
on
-0
.395
[0.3
8]
-0
.257
[0.4
7]
0.0
35
[0.6
5]
Ru
ral
-0
.418
[0.4
4]
0.1
23
[0.4
9]
1.0
79
[0.8
6]
Su
bu
rban
-0
.155
[0.3
1]
0.5
80
[0.4
2]
0.9
46
[0.6
2]
Lo
go
fp
erce
nta
ge
min
ori
tyat
inst
itu
tio
n-
0.2
29
[0.2
8]
0.1
38
[0.3
5]
0.8
22
[0.5
8]
Gra
du
ate
exp
erie
nce
s/en
roll
men
t
Hal
f-ti
me
0.5
36
[0.3
5]
0.5
17
[0.4
3]
0.9
57
[0.6
6]
Les
sth
anh
alf-
tim
e1
.023
[0.4
3]*
0.8
18
[0.4
2]*
0.6
12
[0.8
5]
Gra
du
ate
clas
sle
vel
0.3
17
[0.2
2]
0.9
06
[0.3
1]*
*1
.435
[0.4
2]*
**
Hu
man
itie
sm
ajo
rs0
.972
[0.4
9]*
-0.0
14
[0.5
6]
-0
.76
0[0
.76
]
So
cial
and
beh
avio
ral
scie
nce
maj
ors
0.8
10
[0.4
2]#
1.5
89
[0.5
5]*
*2
.153
[0.8
4]*
*
Mat
h,
engin
eeri
ng,
and
com
pute
rsc
ience
s0.6
02
[0.4
7]
0.5
24
[0.5
3]
1.7
59
[0.8
5]*
Hea
lth
maj
ors
-0
.034
[0.7
1]
-0
.022
[0.8
2]
0.7
58
[1.7
7]
Oth
erm
ajo
rs0
.201
[0.4
2]
-0
.694
[0.5
1]
-0
.54
0[0
.64
]
Dis
tance
educa
tion
cours
esta
ken
-0
.315
[0.4
2]
-1
.029
[0.5
2]*
-1
.55
4[0
.97
]
Gra
du
ate
exp
erie
nce
s/em
plo
ym
ent
Job
ho
urs
/wee
k[e
xcl
ud
ing
wo
rk-s
tud
y]
0.0
05
[0.0
1]
0.0
04
[0.0
1]
0.0
03
[0.0
2]
Dep
artm
enta
lcu
lture
,cl
imat
e,an
dso
cial
izat
ion
Res High Educ
123
Tab
le6
con
tin
ued
Var
iab
les
Nai
ve
reg
ress
ion
AT
Ere
gre
ssio
nA
TT
reg
ress
ion
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Pro
gra
msi
zeq
uar
tile
-0
.367
[0.1
5]*
-0
.466
[0.1
8]*
*-
0.5
40
0.5
40
[0.3
0]#
95th
Per
centi
lere
sear
chac
tivit
y-
0.0
11
[0.0
0]*
*-
0.0
19
[0.0
1]*
**
-0
.02
7[0
.01
]**
*
95
thP
erce
nti
lest
ud
ent
sup
po
rt&
ou
tco
mes
0.0
15
[0.0
0]*
**
0.0
07
[0.0
0]
0.0
04
[0.0
1]
95
thP
erce
nti
led
iver
sity
-0
.002
[0.0
0]
-0
.002
[0.0
0]
0.0
01
[0.0
1]
Co
nst
ant
-3
.770
[8.5
4]
-1
4.1
77
[11
.06]
-5
.55
3[2
2.4
5]
#p\
0.1
0;
*p
\0
.05;
**
p\
0.0
1;
**
*p
\0
.001
Res High Educ
123
Ta
ble
7E
ffec
tso
fd
ebt-
to-p
rice
rati
otr
eatm
ent
on
wit
hin
-yea
rre
ten
tio
n:
log
-od
ds
and
rob
ust
stan
dar
der
rors
(n=
1,1
98
)
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Tre
atm
ent
var
iab
le
Gra
du
ate
deb
to
ver
tuit
ion
&fe
es0
.092
[0.2
6]
No
tA
pp
lica
ble
-0
.021
[0.2
9]
So
cial
and
eco
no
mic
bac
kg
roun
d
Fem
ale
0.1
05
[0.2
6]
0.1
57
[0.3
3]
Min
ori
ty-
0.0
80
[0.2
6]
-0
.261
[0.3
0]
Inv
erse
of
age
-1
38
.72
2[7
91
.28
]-
38
1.0
66
[1,1
64.5
8]
Inv
erse
of
age
qu
adra
tic
4,3
42
.440
[25
,777
.36
]5
,581
.08
[37
,237
.77]
Inv
erse
of
age
cub
ic-
26
,36
1.4
80
[27
2,5
18
.60
]2
4,0
99
.05
0[3
86
,56
8.5
0]
Yea
rin
pro
gra
m-
0.3
39
[0.1
2]*
*-
0.6
81
[0.1
8]*
**
Ind
ep.,
mar
ried
,n
od
epen
den
ts-
0.5
42
[0.5
0]
-0
.551
[0.6
3]
Ind
ep.,
un
mar
ried
/sep
arat
ed,
w/d
epen
ds.
-0
.15
5[0
.75
]-
0.5
30
[1.0
0]
Ind
ep.,
mar
ried
,w
/dep
end
ents
-0
.89
3[0
.94
]-
0.9
60
[1.2
7]
Fam
ily
size
0.6
25
[0.3
5]#
0.8
23
[0.4
7]#
Su
bje
cth
asa
dis
abil
ity
1.7
94
[0.7
6]*
3.7
75
[1.1
2]*
**
Res
iden
tal
ien
-0
.11
4[0
.46
]0
.679
[0.6
6]
Fo
reig
n0
.300
[0.3
8]
1.4
18
[0.5
7]*
Oth
erth
anen
gli
shla
ngu
age
pri
mar
y-
0.1
97
[0.3
2]
-0
.961
[0.4
0]*
Sq
uar
ero
ot
of
the
adju
sted
gro
ssin
com
e-
0.0
02
[0.0
0]
-0
.003
[0.0
0]
Stu
den
to
wn
sh
om
e/pay
sm
ort
gag
e-
0.0
61
[0.2
9]
-0
.120
[0.3
5]
Par
ent’
shig
hes
tle
vel
of
educa
tion
-0
.02
0[0
.05
]-
0.0
73
[0.0
6]
Ex
pec
ted
earn
ing
saf
ter
gra
du
atio
n
Nat
ion
alav
erag
esa
lary
by
fiel
do
fst
ud
y-
0.0
00
[0.0
0]
-0
.000
[0.0
0]
Res High Educ
123
Tab
le7
con
tin
ued
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Pri
ces
Nontu
itio
nex
pen
sebudget
[adju
sted
]0.0
00
[0.0
0]*
*0.0
00
[0.0
0]*
Pri
cesu
bsi
die
s
To
tal
sub
sid
ies
[no
n-l
oan
]-
0.0
00
[0.0
0]
-0
.000
[0.0
0]*
Pri
vat
ese
cto
r,in
stit
uti
on
0.9
99
[0.3
2]*
*0
.690
[0.4
0]#
Ru
ral
-0
.599
[0.4
0]
-1
.415
[0.4
7]*
*
Su
bu
rban
0.2
85
[0.2
8]
0.0
79
[0.3
6]
Log
of
per
centa
ge
min
ori
tyat
inst
ituti
on
-0
.291
[0.2
4]
-0
.125
[0.2
9]
Gra
duat
eex
per
ience
s/en
roll
men
t
Hal
f-ti
me
-0
.002
[0.3
1]
-0
.490
[0.3
7]
Les
sth
anh
alf-
tim
e0
.04
0[0
.39
]-
0.1
06
[0.5
4]
Gra
du
ate
clas
sle
vel
0.3
32
[0.2
0]#
1.0
75
[0.3
3]*
**
Hu
man
itie
sm
ajo
rs0
.67
9[0
.44
]1
.150
[0.5
6]*
So
cial
and
beh
avio
ral
scie
nce
maj
ors
0.6
51
[0.3
8]#
0.9
57
[0.5
2]#
Mat
h,
eng
inee
rin
g,
and
com
pu
ter
scie
nce
s0
.96
2[0
.44
]*1
.277
[0.7
3]#
Hea
lth
maj
ors
0.4
37
[0.6
6]
0.4
33
[0.8
6]
Oth
erm
ajo
rs0
.29
7[0
.39
]-
0.0
79
[0.5
2]
Dis
tance
educa
tion
cours
esta
ken
-0
.506
[0.3
9]
-0
.399
[0.4
9]
Gra
duat
eex
per
ience
s/em
plo
ym
ent
Job
ho
urs
/wee
k[e
xcl
ud
ing
wo
rk-s
tud
y]
-0
.002
[0.0
1]
0.0
00
[0.0
1]
Tea
chin
gas
sist
ants
hip
0.7
82
[0.3
8]*
0.7
12
[0.4
8]
Res
earc
has
sist
ants
hip
0.1
28
[0.3
7]
-0
.005
[0.5
0]
Res High Educ
123
Tab
le7
con
tin
ued
Var
iable
sN
aive
regre
ssio
nA
TE
regre
ssio
nA
TT
regre
ssio
n
Lo
g-o
dd
sS
EL
og
-od
ds
SE
Lo
g-o
dd
sS
E
Fel
low
ship
0.2
85
[0.2
9]
0.0
00
[0.4
8]
Dep
artm
enta
lcu
ltu
re,
clim
ate,
and
soci
aliz
atio
n
Pro
gra
msi
zeq
uar
tile
-0
.369
[0.1
3]*
*-
0.4
77
[0.1
7]*
*
95th
Per
centi
lere
sear
chac
tivit
y-
0.0
12
[0.0
0]*
**
-0
.014
[0.0
0]*
95
thP
erce
nti
lest
ud
ent
sup
po
rt&
ou
tco
mes
0.0
14
[0.0
0]*
**
0.0
11
[0.0
0]*
*
95
thP
erce
nti
led
iver
sity
-0
.001
[0.0
0]
0.0
01
[0.0
0]
Co
nst
ant
5.2
42
[7.9
4]
9.8
02
[12
.08
]
#p
\0
.10;
*p\
0.0
5;
**
p\
0.0
1;
**
*p\
0.0
01
Res High Educ
123
with an analysis of the Average Treatment Effect model. However, the Average Treatment
Effect on the Treated, which compares the high debt ratio Ph.D. students with those Ph.D.
students who should have a high debt ratio but do not, finds that the level of indebtedness
would have had no different effects for these Ph.D. students. In essence, whether or not a
Ph.D. student has a higher ratio of indebtedness would not have had a differing effect on
Ph.D. students’ likelihood of being retained.
Discussion and Interpretation of Results
Although past studies suggest that financial aid in general facilitates doctoral retention,
there is no consensus on the effects of different forms of aid on doctoral outcomes
(Mwenda 2010). On the one hand, some have argued that fellowships allow students to be
more focused and efficient towards degree completion and relieve students from the stress
of financing their education and so, these students tend to experience shorter trajectories
time to degree (Herzig 2004). Another positive outcome of fellowships is that by providing
students with financial resources and greater availability of time, they are more likely to
attend conferences and other professional development events (Mwenda 2010). Others
have mentioned that the positive effect of fellowships takes place mainly in the humanities
because students tend to embark on research in isolation and that in STEM fields, fel-
lowships are detrimental towards degree progress because they do not enable students’
ability to join research groups as it is the case with research assistantships. In STEM fields,
joining a research group is normally the foundation for dissertation work and the means for
proper socialization conducive of doctoral retention. Therefore, some argue that research
assistantships are the most effective financial support for doctoral completion, especially in
STEM fields (Ampaw and Jaeger 2012; Bair and Haworth 1999; Bowen and Rudenstine
1992). Some have argued that teachings assistantships are also positive by facilitating the
integration of students into the academic communities increasing contact with faculty and
peers, despite the lack of attention to research (Austin 2002; Girves and Wemmerus 1988;
Golde 2005; Herzig 2004). Moreover, Mwenda (2010) indicates that teaching assistant-
ships are useful in training students to become teachers and helping students learn foun-
dational knowledge in their disciplines. On the contrary, some have expressed concerns
related to teaching assistantships by arguing that these can take away time and effort
towards degree completion, especially during the dissertation stage of doctoral programs
(Ampaw and Jaeger 2012; Herzig 2004). In sum, past studies indicate that research as-
sistantships and fellowships are stronger predictors of retention than teaching assistant-
ships, although the three of them exhibit positive effects.
In this context, one of the research questions of this study sought to answer was the
impact of teaching assistantships, research assistantships, and fellowships on within year
retention for Ph.D. students in departments ranked by the National Research Council.
Results indicate that both teaching assistantships and fellowships have a significant and
positive effect on within year retention. Fellowships are normally competitive on a merit-
based basis, which means that those with fellowships are awarded to students who have
been selected based on their potential to succeed in doctoral work. Therefore, fellowships
tend to be good predictors of retention. This study indicates that traditional analyses
underestimate the relationship of fellowships on doctoral retention. However, after con-
trolling for self-selection bias on the observables, we found that the mere participation in
fellowships aids retention (see Table 8). This finding sheds light on the value of using
propensity score analyses in higher education research.
Res High Educ
123
On the other hand, teaching assistantships are normally assigned based on departmental
needs rather than the qualifications or promise of doctoral success of students. Therefore,
as we found in the models, teaching assistantships help all students in general staying
enrolled due to the funding that teaching assistantships provide but also because it helps
students socialize with faculty and peers. It is likely that teaching assistantships increase
the retention of students due to the long-term planning and preparation that can occur in
teaching assistantships. Such examples include course preparation and course teaching
schedules, which are established in advanced, even during at least the previous academic
year. This early determination of teaching schedules and formal participation in class preps
for the following semester or term is likely to create a condition of stability that may
increase the student’s confidence throughout the academic school year. Additionally,
teaching assistants at certain institutions are required to undergo training and development
in instruction. Consequently, these students may feel as if they have spent significant time
developing skills that they wish to hone further during the ensuring academic semester as
they continue to progress through the program. We offer these interpretations as a possible
foundation for future inquiry.
The effect of research assistantships on retention in this study was non-significant,
contrary to what previous work has suggested. This non-significance can be attributed
partially to the inability of isolating the effect of research assistantships in the models due
to the need to collapse several groups of students into the primarily research assistantship
category. In addition, the dataset used in this study did not include variables measuring the
quality of the student-advisor/faculty relationships. Faculty advisors are crucial for the
socialization, development of competencies, and ultimately the retention of doctoral stu-
dents (Nettles and Millett 2006; Lovitts 2001; Golde 2005). Ill-natured student-advisor/
faculty relationships can damper the positive effects of research assistantships. Qualitative
studies have demonstrated the importance of these relationships (Mwenda 2010), however,
Table 8 Treatment variable effects on within-year retention based on fully specified theoretical models fornaive regression models, average treatment effects models, and average treatment on the treated effectsmodels: log-odds and robust standard errors (n = 1,198)
Treatment variables Naive regression ATE ATT
Log-odds SE Log-odds SE Log-odds SE
1. Research assistantships 0.598 (.43) 0.537 (.45) Not applicablea
2. Teaching assistantships 1.270** (.43) 1.311** (.43) 1.068* (.53)
3. Fellowships 0.582 (.47) 0.912# (.48) 1.356# (.78)
4. Debt-to-price 0.095 (.26) Not applicablea 0.021 (.29)
Reference category for the research assistantships, teaching assistantships, and fellowships models is Ph.D.students with no assistantships or fellowships
Reference category for all debt-to-price models is no debt or debt less than the price of attending graduateschool for a Ph.D
All models above included a vector of predictor variables as provided in Table 1 under the model column.They were listed as models 1, 2, 3, 4, and All
All model results including the control and balancing variables are available by request from the authors# p \ 0.10; *p \ 0.05; ** p \ 0.01;a The checks for covariate balance conducted after generating the propensity score weight indicated that theweight under the ATE treatment for Debt-to-Price and Research Assistantships did not balance the data wellenough to allow estimation of the regression results using propensity score weights
Res High Educ
123
quantitative accounts of this dimension are still lacking in the literature. This indicates a
need for including measures of student-advisor/faculty relationships in future quantitative
research. Despite the statistical efforts undertaken to control for selection bias on
observables, this finding serves as an example of the limitations that occur due to the
inability to control for unobservable, such as the student-advisor relationship. Also, it is
possible that research assistantships offer less financial stability given the fact that they are
normally part of research grants, which vary in duration and in some cases, suffer from
abrupt termination. In other words, research assistantships can be short-term and limited in
scope providing a greater sense of employment ambiguity to the Ph.D. student. Clearly, to
answer these questions, future research should further investigate the role of research
assistantships in doctoral retention.
Doctoral students are increasingly relying on loans to afford the price of attendance
(Hoffer et al. 2006). According to the National Science Foundation (NSF), 48 % of doctoral
graduates in 2009–2010 owed over $20,400 on average in education-related debt and 16 %
owed over $50,000 in combined undergraduate and graduate debt. However, to the best of
our knowledge, this is the first quantitative study investigating the role of debt on doctoral
retention. Gardner and Holley (2011) in a qualitative study tapped into the issue of loan debt
among doctoral students, indicating that ‘‘students had faith that the attainment of the doc-
toral degree would eventually mitigate these financial concerns, as they expected their
professional careers would bring financial stability to their lives’’ (p. 87). This attitude might
provide insights to the non-significance of debt in relation to price of attendance found in this
study, in that students are confident about their employability and earnings after graduation
to pay their loans and so, debt is not a central consideration for leaving the program. In fact,
debt might have the opposite effect to some students by motivating them to graduate, earn at
the expected level for Ph.D.’s in their field, and pay their loans. Thus, given the current levels
of debt of doctoral students and favorable labor market conditions, debt might not be a
critical factor in doctoral attrition. This finding does point to a need to monitor this effect as
more students enroll in Ph.D. programs with greater levels of debt. As a matter of policy
concern, it will be incumbent of institutions to be proactive in ensuring that Ph.D. students do
not take too much debt without understanding the potential consequences for indebtedness
on other important outcomes such as type of employment after degree completion. A perhaps
more relevant question might be related to the effects of debt on doctoral enrollment. As with
the case in undergraduate education, the prospect of high levels of debt my discourage
students with high debt aversion to enroll in doctoral programs.
This study shows that retention and the role of teaching assistantships, research assis-
tantships, fellowships, and debt manifest differently by field of study, gender, race/eth-
nicity, and immigration status. This study shows that the patterns associated with gender,
race/ethnicity, and immigration status varies dramatically by field of study. Therefore,
future studies, when possible, should not aggregate fields of study in their analysis.
Another notable theme is that the gender gap exists across fields and within racial/ethnic
groups except for a few exceptions among US Whites in the life sciences, humanities, and
social/behavioral sciences. In some instances, the gender gap favors females in some fields,
especially in health and life sciences. Asian foreigners, especially males, dominate the
math, engineering, and computer science fields. US minority students are significantly
underrepresented except for Asians in some fields. In general, minority women tend to
perform better than their male counterparts. Most notable is the fact that US Black males
are the most underrepresented group in all fields. These results support previous work
developing the concept of ‘‘rejection of academic careers’’ and the lack of feelings of
Res High Educ
123
belonging as common reasons for the underrepresentation of minorities in advanced
degrees (Glazer-Raymo 1999; Winkle-Wagner 2009).
As the higher education financial aid policy landscape continues to emphasize education
as a private good, student debt will continue to be a source of concern as it will likely have
differential effects by student demographics and field of study. Concomitantly, it is
expected that the federal government and institutions will continue to offer graduate as-
sistantships and fellowships to doctoral students. This study provides timely insights into
which of these financial strategies are likely to improve the already low doctoral retention
rates nationwide. To the best of our knowledge, this is the first quantitative study that
includes socialization variables in examining the role of various funding mechanisms in
doctoral retention using a national representative database. Given the essential role of the
experience within each academic unit in doctoral education, much more than in under-
graduate education, this study promises to shed light not only to the role of finances in
doctoral retention, but also to the role of academic integration and socialization in com-
bination to funding in doctoral outcomes. Clearly, future studies should examine the role of
the student-advisor/faculty relationship in doctoral retention as well as disaggregated data
by field of study, gender, race/ethnicity, and immigration status.
References
Ampaw, F. D., & Jaeger, A. J. (2012). Completing the three stages of doctoral education: An event historyanalysis. Research in Higher Education, 53, 640–660.
Austin, A. E. (2002). Preparing the next generation of faculty: graduate school as socialization to academiccareers. The Journal of Higher Education, 73(1), 94–122.
Bair, C. R. & Haworth, J.G. (1999). Doctoral student attrition and persistence: A metasynthesis of research.Paper presented at the annual meeting of the Association for the Study of Higher Education, SanAntonio, TX. (ERIC Document Reproduction Service No. ED437008).
Bowen, W. G., & Rudenstine, N. L. (1992). In pursuit of the Ph.D.. Princeton: Princeton University Press.Braskman, L., & Wergin, J. (1998). Forming new social partnerships. In W. Tierney (Ed.), The responsive
university: Restructuring for high performance (pp. 62–91). Baltimore: Johns Hopkins UniversityPress.
Burton, N. W., & Ramist, L. (2001). Predicting success in college: SAT studies of classes graduating since1980. New York: College Entrance Examination Board.
College Board. (2011). Trends in student aid. (Trends in Higher Education Series). Washington: Author.Cominole, M., Riccobono, J., Siegel, P., & Caves, L. (2010). 2007–08 National Postsecondary Student Aid
Study (NPSAS:08) Full-scale Methodology Report (NCES 2011-188). Washington: US Department ofEducation, National Center for Education Statistics.
DesJardins, S. L., & McCall, B. P. (2010). Simulating the effects of financial aid packages on collegestudent stop out, reenrollment spells, and graduation chances. Review of Higher Education, 33(4),513–541.
Dowd, A. C. (2008). Dynamic interactions and intersubjectivity: Challenges to causal modeling in studies ofcollege student debt. Review of Educational Research, 78(2), 232–259.
Gardner, S. K. (2009). Student and faculty attributions of attrition in high and low-completing US doctoralprograms. Higher Education, 58, 97–112.
Gardner, S. K., & Barnes, B. J. (2007). Graduate student involvement: Socialization for the professionalrole. The Journal of College Student Development, 48, 369–387.
Gardner, S. K., & Holley, K. A. (2011). Those invisible barriers are real: The progression of first-generationstudents through doctoral education. Equity and Excellence in Education, 44(1), 77–92.
Gardner, S. K., & Mendoza, P. (2010). On becoming a scholar: Socialization and development in doctoraleducation. Sterling: Stylus.
Girves, J. E., & Wemmerus, V. (1988). Developing models of graduate student degree progress. The Journalof Higher Education, 59(2), 163–189.
Glazer-Raymo, J. (1999). Shattering the myths: Women in academe. Baltimore: Johns Hopkins UniversityPress.
Res High Educ
123
Golde, C. M. (2000). Should I stay or should I go? Student descriptions of the doctoral attrition process.Review of Higher Education, 23(2), 119–227.
Golde, C. M. (2005). The role of the department and discipline in doctoral student attrition: lessons fromfour departments. Journal of Higher Education, 76, 669–700.
Guo, S., & Fraser, W. M. (2010). Propensity score analysis: Statistical methods and applications. ThousandOaks: Sage.
Gururaj, S., Heilig, J. V., & Somers, P. (2010). Graduate student persistence: Evidence from three decades.Journal of Student Financial Aid, 40(1), 31–46.
Hahs-Vaughn, D. L., & Onwuegbuzie, A. J. (2006). Estimating and using propensity score analysis withcomplex samples. The Journal of Experimental Education, 73(1), 31–65.
Herzig, A. H. (2004). Becoming mathematicians: Women and students of color choosing and leavingdoctoral mathematics. Review of Educational Research, 74(2), 171–214.
Hoffer, T. B., Welch, V., Webber, K., Williams, K., Lisek, B., Hess, M., et al. (2006). Doctorate recipientsfrom United States universities: Summary report 2005. Chicago: National Opinion Research Center.
Hossler, D., Ziskin, M., Gross, J. P., Kim, S., & Cekic, O. (2009). Student aid and its role in encouragingpersistence. Higher education: Handbook of theory and research (pp. 389–425). Netherlands:Springer.
Kim, D. (2007). The effect of loans on students’ degree attainment: differences by student and institutionalcharacteristics. Harvard Educational Review, 77(1), 64–100.
Kim, D., & Otts, C. (2010). The effect of loans on time to doctorate degree: Differences by race/ethnicity,field of study, and institutional characteristics. Journal of Higher Education, 81(1), 1–32.
Little, R. J. A., & Rubin, D. B. (2002). Statistical analysis with missing data (2nd ed.). Hoboken: Wiley.Long, J. S. (1997). Regression models for categorical and limited dependent variables. Thousand Oaks:
SAGE.Lovitts, B. E. (2001). Leaving the ivory tower: The causes and consequences of departure from doctoral
study. Lanham: Rowman & Littlefield.Malcolm, L. E., & Dowd, A. C. (2012). The impact of undergraduate debt on the graduate school enrollment
of STEM baccalaureates. The Review of Higher Education, 35(2), 265–306.Mendoza, P. (2007). Academic capitalism and doctoral student socialization: A case study. Journal of
Higher Education, 78(1), 71–96.Morgan, S. L., & Winship, C. (2007). Counterfactuals and causal inference: Methods and principles for
social science research. New York: Cambridge University Press.Murnane, R. J., & Willett, J. B. (2011). Methods matter: Improving causal inference in educational and
social science research. New York: Oxford University Press.Mwenda, M. N. (2010). Underrepresented minority students in STEM doctoral programs: The role of
financial support and relationships with faculty and peers. Dissertation, University of Iowa.National Science Board. (2012). Science and Engineering Indicators 2012. Arlington: National Science
Foundation (NSB 12-01).Nettles, M. T., & Millett, C. M. (2006). Three magic letters: Getting to Ph.D (1st ed.). Baltimore: The Johns
Hopkins University Press.Peduzzi, P., Concato, J., Kemper, E., Holford, T. R., & Feinstein, A. R. (1996). A simulation study of the
number of events per variable in logistic regression analysis. Journal of Clinical Epidemiology, 49,1372–1379.
Schafer, J. L., & Kang, J. (2008). Average causal effects from non-randomized studies: A practical guideand simulated example. Psychological Methods, 13(4), 279–313.
Schneider, B., Carnoy, M., Kilpatrick, J., Schmidt, W. H., & Shavelson, R. J. (2007). Estimating causaleffects using experimental and observational designs: A think tank white paper. Washington: AmericanEducational Research Association.
Shadish, W. R., Campbell, D. T., & Cook, T. D. (2002). Experimental and quasi-experimental designs forgeneralized causal inference. Boston: Houghton Mifflin.
Slaughter, S., Campbell, T., Hollernan, M., & Morgan, E. (2002). The ‘‘traffic’’ in graduate students:Graduate students as tokens of exchange between academe and industry. Science, Technology andHuman Values, 27(2), 282–313.
St. John, E. P., & Andrieu, S. C. (1995). The influence of price subsidies on within-year persistence bygraduate students. Higher Education, 29, 143–168.
Stata Press. (2011). Stata multiple imputation reference manual, release 12. College Station: Author.Tierney, W. (1992). Cultural leadership and the search for community. Liberal Education, 78(5), 16–21.Tinto, V. (1993). Leaving college: Rethinking the causes and cures of student attrition (2nd ed., p. 296).
Chicago: University of Chicago Press.
Res High Educ
123
Walker, G. E., Golde, C. M., Jones, L., Conklin Bueschel, A., & Hutchings, P. (2008). The formation ofscholars: Rethinking doctoral education for the twenty-first century. San Francisco: Jossey-Bass.
Weidman, J. C., Twale, D. J., & Stein, E. L. (2001). Socialization of graduate and professional students inhigher education: A perilous passage?. San Francisco: Jossey-Bass.
Wendler, C., Bridgeman, B., Cline, F., Millett, C., Rock, J., Bell, N., et al. (2010). The path forward: Thefuture of graduate education in the United States. Council of Graduate School: Educational TestingService.
Winkle-Wagner, R. (2009). The unchosen me: Race, gender, and identity among Black women in college.Baltimore: The Johns Hopkins University Press.
Res High Educ
123