+ All Categories
Home > Documents > Estimating Downside Tax Risk: Exploration of Unfavorable Tax Settlements · 2014. 6. 4. · narrow...

Estimating Downside Tax Risk: Exploration of Unfavorable Tax Settlements · 2014. 6. 4. · narrow...

Date post: 05-Feb-2021
Category:
Upload: others
View: 7 times
Download: 0 times
Share this document with a friend
55
Estimating Downside Tax Risk: Exploration of Unfavorable Tax Settlements Andrew M. Bauer Department of Accountancy The University of Illinois at Urbana-Champaign Kenneth J. Klassen School of Accounting and Finance University of Waterloo This draft: April 2014 Keywords: tax risk, risk management, tax settlement, extreme downside risk * Corresponding author: Andrew M. Bauer, Department of Accountancy, The University of Illinois at Urbana- Champaign, 1206 S. Sixth Street, Champaign, IL 61822, tel: +1-217-244-4505, email: [email protected]. We appreciate the comments received by Katharine Drake, Tom Omer and Kathleen Powers. We also appreciate the comments from workshop and/or conference participants at Northeastern University, the University of Tennessee, the University of Texas-Austin, the University of Utah and the 2014 ATA Mid-Year Meeting. We thank Qin Cao, Christina Chen, Armando Espinal and Diana Rechenmacher for excellent research assistance. We gratefully acknowledge the financial support provided by the University of Illinois at Urbana-Champaign.
Transcript
  • Estimating Downside Tax Risk:

    Exploration of Unfavorable Tax Settlements

    Andrew M. Bauer∗ Department of Accountancy

    The University of Illinois at Urbana-Champaign

    Kenneth J. Klassen School of Accounting and Finance

    University of Waterloo

    This draft: April 2014

    Keywords: tax risk, risk management, tax settlement, extreme downside risk

    *Corresponding author: Andrew M. Bauer, Department of Accountancy, The University of Illinois at Urbana-Champaign, 1206 S. Sixth Street, Champaign, IL 61822, tel: +1-217-244-4505, email: [email protected]. We appreciate the comments received by Katharine Drake, Tom Omer and Kathleen Powers. We also appreciate the comments from workshop and/or conference participants at Northeastern University, the University of Tennessee, the University of Texas-Austin, the University of Utah and the 2014 ATA Mid-Year Meeting. We thank Qin Cao, Christina Chen, Armando Espinal and Diana Rechenmacher for excellent research assistance. We gratefully acknowledge the financial support provided by the University of Illinois at Urbana-Champaign.

    mailto:[email protected]

  • Estimating Downside Tax Risk: Exploration of Unfavorable Tax Settlements

    Abstract

    In this paper, we examine downside tax risk as the likelihood that a firm will face a large and negative tax outcome. To achieve this objective, we develop a model to predict the probability that a firm will have an unfavorable settlement with tax authorities, a “tax loss event.” We combine an industry-adjusted measure of high cash effective tax rate (Cash ETR) values with financial statement verification of a tax settlement to identify a sample of tax loss event firms. We develop a logistic model from a parsimonious set of factors that are associated with downside tax risk to generate a scaled probability of the likelihood of these negative events, the DTR-score. Empirical tests show that our model provides strong predictions of a tax loss event. We further consider whether market participants price this risk and the timing of any reaction. We show that the abnormal returns are negative in the event year for firms with a tax loss event and non-positive in the prior year as well. These results do not appear to vary with the volatility of ETR, a proxy for tax risk management. Our research adds to the emerging literature that examines the tax risk within firms. Specifically, we develop a prediction model useful for various firm stakeholders, as well as provide additional insight into the factors that create downside risk for the firm and how the market reacts to the revelation of such risks.

  • 1. Introduction

    In recent years, the concept of tax risk within corporations has received much attention.

    The Organization for Economic Cooperation and Development (OECD), as well as major

    accounting firms, highlight the links between tax risk and uncertainty, and publicize the role of

    tax risk management within strong corporate governance structures (PricewaterhouseCoopers,

    2004; OECD, 2009). Corporate managers and boards of directors have begun to recognize tax

    risk management as a separate component of enterprise risk management (Wunder, 2009). Yet,

    what does it mean to say that a firm has higher tax risk? According to Neubig and Sangha

    (2004), tax risk is the possibility that the firm will have an unexpected tax outcome. This

    definition implies that tax risk involves both unexpected positive and negative outcomes.

    However in this paper, we focus our analysis of tax risk on loss events (i.e., events in the extreme

    negative tail of a distribution) and examine whether these events are identifiable ex-ante. We

    further examine whether the equity market reacts to these events, our proxy for downside tax

    risk.

    Examining these questions is important for four reasons. First, despite the recent attention

    on tax risk, useful, public information about firms’ tax risk is sparse. Hanlon (2003) and Lenter

    et al. (2003) acknowledge that although tax disclosures in financial statements are meant to be

    informative, they are often lacking. Gleason and Mills (2002) find that U.S. firms infrequently

    disclose tax contingencies despite the substantial future payments tied to those contingencies. In

    a more recent study, Raedy et al. (2011) conclude that investors fail to use the information

    provided in tax footnotes due to a general lack of informativeness. Thus, stakeholders would

    benefit from a method to identify tax risk. Second, the question of whether tax risk can be

    assessed separately from overall firm risk is unclear. Gleason and Mills (2002) describe the

    1

  • current state of disclosure around negative tax outcomes as one which lacks value-relevant

    information for investors. If tax risk management is a component of enterprise risk management

    (Wunder, 2009) then investor assessments of firm risk could subsume value-relevant assessments

    of tax risk. Our model allows us to create a proxy for downside tax risk ex-ante and also to

    examine, if stakeholders react to such information, whether stakeholders value tax risk separately

    from other risk factors. Third, prior literature documents that the distribution of risk is

    asymmetric and skewed to the downside (e.g. Ang et al., 2006; Huang et al. 2012), thus our

    focus on unfavorable tax settlements is consistent with this view. Fourth, tax risk management is

    a related concept that the extant literature has not explored. We provide a first step by

    investigating whether some managers adjust their GAAP effective tax rates (ETRs) in

    anticipation of potential tax loss events.

    We address our research questions in the following ways. First, we conceptualize

    downside tax risk as the probability that a firm suffers a large tax loss – i.e., an unfavorable

    settlement with a tax authority. This concept of tax risk complements existing research that

    views tax risk as a form of volatility (e.g., Guenther et al., 2013; Neuman et al., 2013). However,

    rather than focus simply on cross-sectional variation in cash effective tax rates (Cash ETR), we

    narrow in on the most extreme negative portion of the volatility distribution. Defining downside

    tax risk as the probability of a loss event is consistent with other concepts of risk in accounting

    and finance, such as extreme downside risk (e.g., Huang et al., 2012), crash risk (e.g., Jin and

    Myers, 2006; Hutton et al., 2009; Kim et al., 2011) and operational risk (e.g., Chernobai et al,

    2011; Brown et al., 2012). We believe this design choice provides a more powerful setting to

    examine the economic forces that determine tax risk because it allows us to identify actual

    negative tax outcomes. Second, we identify firms in the tail of our Cash ETR distribution with

    2

  • disclosed, unfavorable tax settlements as “event” observations. We investigate whether ex-ante

    characteristics associated with tax risk can predict these tax loss events using a logistic model.

    Third, we examine the abnormal stock market returns associated with our proxy for downside tax

    risk, as well as a proxy for GAAP ETR management. We investigate abnormal returns before,

    during, and after the year of our tax loss events to determine the extent to which the market

    anticipates and/or reacts to the disclosure of tax risk, and whether these reactions differ for firms

    that do or do not appear to manage the financial reporting of the tax risk through tax expense.

    In order to identify our potential tax loss event sample, we measure whether a firm’s

    observed annual Cash ETR value exceeds its ten-year firm-specific mean by more than two

    standard deviations for the industry (i.e., 4-digit NAICS classification), yielding 284 firm-year

    observations over a period from 2002 to 2012. We recognize that an extreme Cash ETR is a

    noisy proxy because this ratio may vary for many reasons, including mismatched numerator and

    denominator. To address this concern, we explore the annual reports for observations above our

    firm-specific Cash ETR value threshold to determine whether available disclosures support the

    presence of a loss (i.e., disclosure of a settlement with tax authorities). We identify an event

    sample of 52 observations for firms that unfavorably settle with tax authorities.

    We estimate a logistic regression on a parsimonious set of tax risk factors to predict the

    likelihood of a tax loss event in both a broad sample and a one-to-five matched sample (based on

    fiscal year and four-digit industry classification). We document that, as expected, this likelihood

    is increasing in higher Cash ETR volatility, the presence of foreign (i.e., multinational)

    operations, mergers and acquisitions activities, the number of business segments, and the

    existence of SOX 404 internal control weaknesses, and the likelihood is decreasing in the

    magnitude of R&D and book-tax differences. The negative relation between book-tax differences

    3

  • and the probability of a loss event suggests that, on average, firms begin to accrue the loss in the

    year prior to its payment.

    Overall, we find that the probability of an unfavorable tax settlement is indeed greater for

    our event sample and that the factors in our multivariate model predict this probability with

    reasonable accuracy. To provide further clarity regarding the predictive ability of our model,

    given the rarity of tax loss events across our full sample, we construct and analyze a “downside

    tax risk” DTR-score. Similar to Dechow et al. (2011), we scale the predicted probability of a tax

    loss event by the unconditional probability of that event to create our DTR-score. DTR-scores

    above 1 reflect higher downside tax risk. We find that, for our full sample, 63% of our event

    observations are in the highest quintile of DTR-score and that we correctly classify 72% of our

    event and non-event firm-year observations as either above 1 or below 1, respectively.

    Our final set of analyses is designed to test our hypotheses that the market does not price

    the tax loss event firms negatively prior to the event year and that in the event year, the market

    reacts most negatively for firms that do not manage that risk well. To do so, we examine the

    abnormal returns of our event and control firms in a four factor Fama-French (1993) model

    around the years our tax loss events are disclosed. We rely on a 2x2 matrix of firms – those in

    our tax loss event sample or our control sample interacted with those above or below the median

    GAAP ETR Volatility. ETR Volatility is measured as the ten-year firm average standard deviation

    of ETR less the ten-year industry average standard deviation of ETR. It reflects the degree to

    which firms manage the financial reporting of tax risk. Our evidence supports the hypothesis that

    the market anticipates the impending tax loss events, but the negative market reaction continues

    in the year of the extreme negative tax event. We fail to find a difference across firms with

    differing ETR Volatility. Ongoing work will continue to improve the specification of these tests.

    4

  • Our study complements Gleason and Mills (2002), who focus on the likelihood that a

    firm discloses a contingent tax liability. We move beyond the contingency to predict which firms

    with high cash effective tax rates actually settle contingent liabilities, and whether and how the

    market reacts. Furthermore, our prediction of tax loss events does not necessitate that a firm pre-

    disclose or accrue any contingent tax liability; it is sufficient for the firm to acknowledge an

    unfavorable settlement with tax authorities in the year of settlement only. To the extent any lack

    of disclosure or accrual is considered poor tax risk management, our subsequent market tests

    speak to that argument. Put simply, our model predicts significant, unfavorable tax contingencies

    – recognized or unrecognized, disclosed or undisclosed – that come to bear on the firm.

    More broadly, our study sheds light on the usefulness of financial statement disclosures

    for assessing downside tax risk (DTR). We supplement our statistical measure of a tax loss event

    with information about the outcome found in annual reports. Our loss measure is accurately

    predicted by publicly available, prior year information. The results from our prediction model

    suggest that annual reports do provide information regarding tax strategies in general and

    regarding tax risk more specifically. However, although our model is a useful predictor of ex-

    ante tax risk, results from our market tests suggest that investors typically do not react fully to

    these cues prior to the disclosure of a tax settlement.

    Finally, this research contributes to the relatively recent literature that examines the tax

    risk of the firm. While the practice community has been concerned with, and encouraged clients

    in, managing tax risk (see for example, Herskovitz, 1997), little empirical research predates

    Guenther et al. (2013) and Neuman et al. (2013). We add to this literature by exploring the

    extreme downside tail of tax risk. By focusing on the significant negative risk ex-post, we are

    able to develop a prediction model that can be readily applied by market participants and in

    5

  • future research. Furthermore, the variables that aid in the prediction of downside tax risk provide

    new insight into the factors that create risk for the firm. This research complements concurrent

    research that tries to understand this underexplored element of firm risk.

    The remainder of the paper is organized as follows. Section 2 provides background for

    our focus on tax loss events. Section 3 provides details of our research design, including our

    identification of tax volatility and tax settlements, our prediction model and our abnormal returns

    model. Section 4 reports the univariate and multivariate analyses of our prediction model.

    Section 5 reports tests of the market reaction to tax loss events and our downside tax risk proxy.

    Finally, Section 6 concludes with a summary of the paper.

    2. Background

    2.1 Large Tax Payments as a Proxy for Downside Tax Risk

    Managing corporate tax exposures has become increasingly important within firms’

    governance practices. Recent regulation, legislation and standard-setting – such as the Sarbanes-

    Oxley Act and FIN 48/ASC 740 – have responded to greater public desire for scrutiny of

    corporations, corporate managers and directors with respect to their tax choices. Surveys of tax

    executives and managers show an increasing percentage of firms are implementing tax risk

    management practices to facilitate informed decision-making (Wunder, 2009; Lavermicocca,

    2011). Corporate decision-makers have heightened concern about the risks involved in their tax

    strategies. Furthermore, Wunder (2009) documents that U.S. and foreign-owned multinational

    corporations report higher aversion to tax risk as a result of the increased focus on tax risk

    management.

    While there seems to be increased attention, how do external stakeholders view risk,

    generally, and how do they conceptualize tax risk, specifically? Further, how can one identify

    6

  • varying tax risks across firms? McGuire et al. (2013) consider the mean and variability of tax

    payments of their sample; these values suggest that some firms maintain a consistently low level

    of taxes paid while others maintain a consistently high level of taxes paid. Whether high or low

    tax, firms appear to have consistent tax volatility. Does maintaining low tax volatility ex post

    equate to low tax risk? Guenther et al. (2013) also examine tax rate variability and compute the

    standard deviation of cash effective tax rates (Cash ETR). They show that this measure of risk is

    related to the firms’ overall stock price volatility, and that the level of Cash ETR and the level of

    book tax differences are not related to stock price volatility. Both Guenther et al. (2013) and

    McGuire et al. (2013) advance our understanding of tax risk by incorporating the second moment

    of the Cash ETR distribution.

    Empirical work on asset pricing has reached beyond the assumption of a normal

    distribution for stock returns. Dittmar (2002) and Moreno and Rodriguez (2009) incorporated

    third and fourth moments into asset pricing models and showed that these were important to

    understanding pricing as well. Building on the notion that significant declines are not viewed by

    market participants symmetrically to significant gains, a Value at Risk (VaR) metric was

    developed to specifically capture the lower tail risk. Bali et al. (2009) finds that such a measure

    of risk is positively related to expected returns. Huang et al. (2012) extends this literature by

    developing an empirical proxy for the extreme downside risk (EDR) that captures both the

    skewness and the fat tails. They show that both the VaR measure and their EDR measure are

    idiosyncratic risks that are positively and mutually related to expected returns.

    Thus, recent empirical efforts suggest that markets price the risk of unusual negative

    events, in spite of opportunities to diversify many firm-specific risks. While there are many

    external forces that may lead to a negative event for the firm, its tax positions can expose it to

    7

  • significant, unforeseen tax payments, both as a result of payments for past transactions, and the

    required changes that are now necessary in the future. For example, international transfer pricing

    cases can result in very significant settlements (e.g., GlaxoSmithKline; Reed, 2006). In addition,

    if the transfer prices used in the past are not acceptable, the required change will lead to higher

    future taxes as well.

    Therefore, we consider tax risk from this perspective. We explore tax risk as the

    probability of a loss event—that is, an identifiable event in which a negative tax outcome occurs.

    This conceptualization of tax risk is akin to extreme downside market risk (Ang et al., 2006;

    Mitton and Vorkink, 2007; Bali et al., 2009; Boyer et al., 2010; Huang et al., 2012). These

    studies emphasize that the negative side of the distribution (e.g., the negative stock returns in the

    extreme downside risk literature) are more common than expected based on traditional assumed

    distributions. In particular, the distributions are skewed and negative tails are fatter than

    expected, and that such risk is priced by market participants. The focus of study on so-called tail

    events is also evident in the literatures on crash risk (e.g., Jin and Myers, 2006; Hutton et al.,

    2009; Kim et al., 2011) and certain forms of operational risk (e.g., Chernobai et al, 2011; Brown

    et al., 2012). Each of these areas of inquiry are consistent with anecdotal evidence that managers

    are most concerned about large negative outcomes (e.g., March and Shapira, 1987).

    In our setting, management establishes their tax planning strategy, which is private

    information. Through financial reporting choices around their tax expenses, these managers can

    mask the exposure to large tax payouts they undertake, or may simply be unaware of them.

    However, risky strategies can eventually yield a significant, unfavorable tax payment. To the

    extent that the payment is significant, the amount of taxes paid in the year is high relative to a

    8

  • joint firm-industry average (e.g., n standard deviations from the mean). This occurrence gives

    rise to our proxy for a potential tax loss event.

    To confirm that the extreme outcomes are indeed loss events, we consider whether firms’

    disclosures are relatedly informative. If management holds news until a negative tax settlement is

    likely and additional tax payments are made, then disclosures in financial reports should include

    discussion of the tax event. Investors can use this information, regardless of its timing, to adjust

    their value of the firm. On the other hand, Gleason and Mills (2002) find that U.S. firms

    infrequently disclose tax contingencies. Thus if management views the cost of disclosure too

    high, perhaps due to potential future dealings with tax authorities, then the related disclosure

    could be purposefully less informative, if informative at all, about the settlement. Alternatively,

    the event may not be a tax loss at all. Considering that the annual value of taxes paid and pre-tax

    book income are inherently volatile (Dyreng et al., 2008), an increase in taxes paid during a

    given year could be due to a number of non-events, including taxes on a substantial divestiture or

    simply the timing of installment payments. Ultimately, if investors recognize differences in the

    information content of tax disclosures that accompany unusually large tax payments, they should

    assess the underlying tax risk differently. Furthermore, if factors that imply a high probability of

    a tax loss event are evident ex-ante, it is possible to use that information to predict tax risk even

    in the absence of a disclosed loss event.

    The emerging literature, however, calculates tax risk as a symmetric construct. Measures

    of volatility (or tax aggressiveness) are evaluated at the firm-year level using cross-sectional tests

    (e.g., McCarty 2012; Guenther et al., 2013; Hutchens and Rego, 2013; Neuman et al., 2013), and

    often with firm fixed effects. In our research design, we consider implications based on an

    asymmetric form of tax risk: downside tax risk. How the market reacts to this risk of negative

    9

  • outcomes is unclear. If the risk is known, it will require a greater return, even though it is

    idiosyncratic (consistent with the VaR and EDR literature). Alternatively, if the risk is largely

    hidden by management, then its revelation will yield a negative return as the additional risk is

    incorporated into the firm’s value. We predict that there is a relation between the market returns

    and downside tax risk.

    H1: Abnormal stock returns are related to the realization of tax loss events.

    2.2 Downside Tax Risk and Tax Risk Management

    In this section, we consider how firms manage downside tax risk, specifically through

    their financial reporting risk. According to Neubig and Sangha (2004), the sources of tax risk

    include compliance, legislation, technical tax issues, financial reporting, etc. Similar risks are

    identified in the tax risk management guide of PricewaterhouseCoopers (2004). Yet despite

    documentation that firms are developing specific tax risk management policies (Wunder, 2009),

    generally corporate outsiders cannot evaluate the individual components of these policies.

    Stakeholders may be able to evaluate effective tax rate management using ETRs that are

    readily available in financial statements. Tax professionals interviewed in a study by Mulligan

    and Oats (2009) identified tax reserves as a clear example of risk management. Thus managing

    an ETR is a form of tax risk management because the ETR is a function of the firm’s tax

    reserves, tax accruals and adjustments to valuation allowances. Our predictions of how ETR

    management interacts with the probability of a tax loss event follow.

    First, consider more generally the relation between hedging and risk documented in the

    literature (e.g., Froot et al., 1993; Tufano, 1996; Guay, 1999; Zhang, 2009). These studies

    document that firms use derivatives and other financial instruments to hedge business risks and

    ultimately maintain or enhance firm value. Similarly, we would expect firms to use tax reserves

    10

  • and accruals to manage the volatility of their ETR and underlying tax risks. Firms that maintain a

    consistent level of ETR should be able to avoid unwanted shocks to earnings, whereby smoother

    earnings provide value to stakeholders (Grant et al., 2009).1

    Nevertheless, we expect there is cross-sectional variation in whether firms with a tax loss

    event will effectively anticipate and manage that risk through their ETR. Although Gleason and

    Mills (2002) find that both the disclosure of and accrual for tax loss contingencies increase with

    the size of the expected loss, such actions are observable for only the largest claims. They find

    that many claims that would be material under SFAS No. 5 contingent loss reporting rules are

    infrequently disclosed, potentially leading investors to assume the absence of significant

    contingencies. Thus, there are two possible predictions. One, shareholders are unaware of the

    downside risk that they bear until the tax loss event (and settlement amount) is disclosed by the

    firm. Two, shareholders are aware of the risk, but the disclosure resolves the uncertainty and the

    price reacts to the change in probability of the loss (to 1, in this case). Alternatively, to the

    extent that some firms effectively manage this risk through their ETRs, we expect that the market

    will value firms with a tax loss event and weak tax risk management (of the ETR) negatively

    relative to firms with a tax loss event and strong tax risk management. If the market participants

    are simply updating their probability of the loss, then tax risk management would not

    differentially affect returns. More specifically, we state the following hypotheses in null form:

    H2: Firms with a tax loss event and volatile firm-level ETR have the same abnormal stock returns as firms with a tax loss event and nonvolatile firm-level ETR in the event period, compared to non-event firms.

    1 We view the volatility of ETR as tax risk management, rather than earnings management (for a review of the latter, see Hanlon and Heitzman, 2010). Like McGuire et al. (2013), who compare concepts of sustainable tax avoidance and tax minimization, we are most interested in a consistent strategy that persists over time rather than a decision that affects primarily a given period. ETR volatility (over a ten-year period) is a proxy for this consistent strategy.

    11

  • 3. Research Design

    3.1 Identifying Downside Tax Risk as a Loss Event

    In order to identify firms with downside tax risk and thus the potential for a tax loss

    event, we begin by assessing extreme values of Cash ETR, similar to Guenther et al. (2013).2

    First, we require that the firm has ten consecutive years of ROA percentages that are greater than

    or equal to ½ percent using rolling estimation windows from t-9 to t. This restriction helps to

    ensure that we have an economically meaningful sample of annual Cash ETR observations and to

    mitigate issues related to small denominators or firms with losses. Second, we calculate the firm-

    specific mean and standard deviation of annual Cash ETR for the ten-year window. Ten-year

    estimates are consistent with the long-run approach to tax planning. We require ten consecutive

    years of data in order to retain these statistics for year t of the rolling window. Third, we use the

    firm-specific standard deviation to calculate the rolling ten-year industry average standard

    deviation (based on four-digit NAICS industry codes) for each industry and fiscal year of our

    sample. Fourth, we calculate our measure of extreme cash payments, the Cash ETR threshold,

    for each firm-year as an observed annual Cash ETR value that exceeds the ten-year firm-specific

    mean by more than two standard deviations for the industry. Thus any firm-year with a Cash

    ETR significantly greater than its mean (e.g., exceeding two industry standard deviations) is

    considered a potential tax loss event observation.

    3.2 Overall and Tax Loss Event Samples

    Our sample selection process begins with all public firms available in Compustat during

    the period 2002 to 2012. As reported in Table 1, this initial step yields 99,446 firm-year

    observations across 13,982 firms. Next, we require that annual Cash ETR is measureable for

    2 We calculate Cash ETR annually as taxes paid divided by pre-tax income less special items (Dyreng et al., 2008). If the denominator is less than zero, we set the observation to missing. We also truncate the distribution between zero and one to reduce the influence of undue outliers that are more likely to represent noise than a tax loss event.

    12

  • each firm-year observation, resulting in a reduction of sample to 47,356 firm-year observations

    across 8,644 firms. As we require that firms have an ROA percentage greater than or equal to ½

    percent, our sample is further reduced to 41,793 firm-year observations across 8,264 firms after

    imposing this restriction. Finally, we ensure that each firm-year observation has ten years of

    consecutive data to construct our volatility of Cash ETR measure. This restriction yields a

    sample of 8,572 firm-year observations across 1,705 firms that we use to compute our Cash ETR

    threshold. Of these firm-year observations, 321 (or 3.7% of the sample) fall in the extreme high

    tail and thus are considered potential tax loss events. Consistent with asymmetric risk, we note

    conversely that only 132 firm-year observations (or 1.5% of the sample) fall in the extreme low

    tail of our Cash ETR threshold distribution.

    For each potential tax loss event, we further examine the annual reports of the firm within

    the estimation window to determine if a tax settlement occurs or not.3 If the firm has suffered a

    loss event we expect to find disclosure about settlements with tax authorities or other relevant

    information; these observations represent our tax events. If the firm has not suffered a loss event

    we expect to find adequate disclosure or information to validate the high tax payments (e.g., a

    substantial divestiture, catch-up for a previous year’s under-installment, etc.). It is possible also

    that a firm is not forthcoming or is vague about the reasons for the high tax payments; we

    classify these vague observations as a sub-group of the non-event observations. Firms are also

    permitted to have multiple tax loss events identified. Our annual report validation reveals 55 tax

    loss events. After a final restriction of our sample for observations missing control variable data

    we report, in Table 1, 7,438 firm-year observations across 1,507 firms. Of those firm-year

    3 We also examine news releases of the firm during the estimation window through Lexis-Nexis. Generally, news releases do not provide identification of settlements with tax authorities that are not disclosed within annual reports.

    13

  • observations, 284 are above our extreme Cash ETR threshold and 52 represent tax loss events for

    use in our subsequent prediction model.

    Finally, note that our tax loss event identification does not rely on information from

    unrecognized tax benefit (UTB) settlements, which are required disclosures under FIN 48 (ASC

    740) for fiscal years beginning after December 15, 2006 (and available in Compustat). As we

    discuss in greater detail in the Appendix, we do not rely on UTB settlements to identify tax loss

    events for two reasons. First, our chosen procedure is applicable in both the pre and post FIN 48

    periods, thus using UTB settlements would result in a significant loss of data and

    generalizability. Second, the settlement amount reported in a firm’s UTB reconciliation

    represents a reduction of a tax liability, yet is does not necessarily represent a current outlay of

    cash to tax authorities or an unfavorable settlement. If a firm has remitted tax payments (to

    mitigate interest or penalties) in advance of a subsequently favorable settlement then the UTB

    settlement disclosure could reflect simply the recovery of that deposit. Such is the case in the

    example of International Speedway Corp. in our Appendix. Thus the UTB settlement amount on

    its own does not reflect whether a settlement is favorable or unfavorable, or tied to the extreme

    Cash ETR value, and relying on it without further, manual validation would introduce noise into

    our proxy of downside tax risk.

    3.3 Factors that Predict a Tax Loss Event

    The probability that a firm incurs a tax loss event – an unfavorable settlement with tax

    authorities – is a function of the firm’s tax risk. Recent practitioner literature and academic

    literature suggest that tax risk is a function of numerous categories of risk and/or individual

    14

  • factors of risk.4 We develop a parsimonious set of variables using a combination of step-wise

    analysis that considers univariate statistics, the fit of the various model iterations and inter-

    correlations of variables, and researcher judgment (see Altman, 1968).5 By ensuring parsimony,

    our model is easily replicable. All tax risk factors are measured one year prior to the tax loss

    event year.

    Variables included in the final estimation include the variance of the firm’s annual Cash

    ETR, computed over a five-year period. Higher variance in this measure implies more

    uncertainty in tax outcomes. We expect a positive relation between Cash ETR VARIANCE and

    the likelihood of a tax loss event.

    We also measure a firm’s return on assets (ROA) to control for operating performance.

    On the one hand, firms with more profitable operations could be expected to undertake tax

    planning with a higher level of uncertainty and thus potentially higher payoffs. Yet profitable

    firms that manage their tax planning well would not be considered risky. Relatedly, Hutton et al.

    (2009) and Kim et al. (2011) show that profitability is negatively related to crash risk, indicating

    that loss events in general are more likely for less profitable firms. Therefore, we expect a

    negative relation between ROA and the likelihood of a tax loss event.

    We include an indicator of foreign operations (FOROPS) in our model for two reasons.

    One, the presence of foreign operations suggests a firm is multinational, and we expect

    multinational firms to have more tax planning opportunities – and opportunities for risk – than

    domestic firms. Two, Wilson (2009) and Lisowsky (2010) find that foreign income is positively

    4 For example, see PricewaterhouseCoopers (2004) and OECD (2009) for practitioner literature. See Wunder (2009), McCarty (2012), Guenther et al. (2013), Hutchens and Rego (2013) and Neuman et al. (2013) for academic literature. 5 In addition to other excluded factors discussed in this section, we exclude proxies for tax cushions from our model. Inclusion of a tax cushion estimate, based on the IRS estimate in Gleason and Mills (2002), can explain variation between our event and non-event samples, but drastically reduces our sample because of data limitations.

    15

  • related to the use of tax shelters.6 The inherent likelihood of tax shelters increases downside tax

    risk because tax shelters are targeted by tax authorities. For these reasons, we expect FOROPS to

    be positively related to the likelihood of a tax loss event.

    Firm size (SIZE), the log of total assets, is included as a proxy for the potential scale of

    tax planning activities. Including SIZE is consistent also with the contingent tax disclosure model

    of Gleason and Mills (2002). However, like Gleason and Mills (2002), our sample firms are

    inherently large (we require ten consecutive years of data to determine our high Cash ETR

    threshold) and thus we may lack sufficient variation on this factor in our multivariate model.

    The ratio of research and development expense to total lagged assets (RND) is in our

    prediction model because tax legislation provides favorable credits to research and development

    expenditures, but tax authorities heavily scrutinize these claims. Furthermore, as noted in Wilson

    (2009), firms can create intangible assets through their research and development activities and

    Hanlon et al. (2007) find a positive association between intangible assets and tax non-compliance

    (i.e., tax deficiencies). For these reasons, we expect RND to be positively associated with the

    likelihood of a tax loss event and thus tax settlements.

    We include measures for merger and acquisition activity (MNA) as well as the number of

    business segments (NSEG) in our model. Business combinations reflect tax risk because they can

    increase the complexity of operations, the tax planning scope and opportunities of the firm, and

    the compliance risk surrounding tax deductions and benefits tied to the business combination.

    Similarly, a higher number of business segments can increase complexity and opportunities also,

    6 Although we measure foreign income (Compustat variable PIFO), including it and a foreign operations indicator (1 if foreign income is positive, 0 otherwise) in our model significantly increases multicollinearity. Thus we include only the foreign operations indicator. Note also that controlling for tax havens does not improve the fit of our model, nor are tax havens a significant predictor, likely due to the amount of missing observations tied to this variable.

    16

  • thus increasing the potential downside tax risk of the firm. Therefore we expect both MNA and

    NSEG to be positively associated with the likelihood of a tax loss event.

    We control for internal control weaknesses also. Bauer (2013) documents that certain

    ICW disclosures reflect weak risk management practices and thus we expect ICW to be

    positively associated with the likelihood of a tax loss event. Although both ICWs and tax-related

    ICWs are significant predictors of a tax loss event in univariate analysis, we include the more

    general ICW proxy in our model. As tax-related ICWs are more infrequent than ICWs in our

    sample, they can predict a tax loss event perfectly after matching of control firms on industry,

    and would be dropped subsequently from our prediction model. We utilize matched-control

    samples to help examine our DTR-score and the abnormal returns of tax loss event firms.

    Finally, we include a measure of book-tax differences (BTD) in our model. Per Mills

    (1998), proposed IRS audit adjustments increase as BTDs increase. Thus BTDs can act as a

    signal to the IRS to investigate firm tax positions, and so we would expect BTD to be positively

    associated with the likelihood of a tax loss event.

    3.4 Prediction Model and DTR-score

    Our primary analysis of asymmetric or downside tax risk predicts the likelihood of a tax

    loss event. The output from our prediction model is then adjusted to a scaled probability that acts

    as a proxy for downside tax risk, creating a “downside tax risk” score, or DTR-score.

    Conceptually, our prediction method is inspired by Altman’s study (1968) that predicts

    bankruptcy (i.e., discriminant analysis). Empirically, however, our method more closely follows

    the tax liability prediction model of Gleason and Mills (2002) and tax shelter prediction models

    in studies by Wilson (2009) and Lisowsky (2010) (i.e., logistic regression models). The

    17

  • construction of a DTR-score is consistent with research by Dechow et al. (2011) on the

    likelihood of fraud and earnings misstatements.

    Furthermore, our empirical model reflects a discrete hazard model; essentially, we

    analyze the time until a tax loss event (i.e. a failure) occurs. Consistent with Beck et al. (1998),

    and as explained below, we control for temporal dependence in our data to complete the

    identification of the hazard model and to prevent violation of the independence assumptions of a

    standard logistic model. This concern arises because our sample of firm-year observations allows

    for variation in both the time-series and cross-section.

    We predict the probability of a tax loss event in year t using firm i characteristics

    measured in the prior year. For ease of illustration, our following logistic hazard prediction

    model does not include firm or time subscripts:

    ln PTaxEvent1- PTaxEvent

    = α+βX+ ε (1)

    where PTaxEvent =1

    1 + 𝑒−(α+βX+ ε) = the probability the firm will have an unfavorable tax settlement

    with the tax authorities; and

    βX = β1CASH_ETR_VARIANCE + β2ROA + β3FOROPS + β4SIZE + β5RND + β6MNA +

    β7NSEG + β8ICW + β9BTD + β10DURATION + β11DURATION2 (2).

    See Table 2 for full details of our variable definitions. The variables DURATION and

    DURATION2 are included to control for temporal dependence, where DURATION reflects the

    time trend of our sample period (i.e. DURATION equals 1 for 2002 and equals 11 for 2012). The

    quadratic specification uses fewer degrees of freedom than the inclusion of dummy variables and

    is thus also more parsimonious and more easily replicable. We also adjust the standard errors by

    clustering by firm.

    18

  • In order to construct our DTR-score, we scale PTaxEvent by the unconditional probability of

    a tax loss event occurring in our sample period. As described previously, our event sample

    comprises 52 observations determined by examining annual reports for disclosures about tax

    settlements, and we estimate our prediction model on a full sample of 7,438 firm-year

    observations. Thus the unconditional probability of a tax loss event for our full sample is 0.70%.

    Although our tax loss events are rare, the use of prediction models and/or hazard models in such

    contexts is not uncommon. For example, Shumway (2001) uses a discrete logistic hazard model

    to improve on Altman’s (1968) prediction of bankruptcy events. Also, Dechow et al. (2011)

    develop a logistic model to predict the likelihood of financial misstatement; the unconditional

    probability of misstatement in their sample is 0.37%.

    Our full sample tests demonstrate the broad applicability of our model. However, to

    facilitate our analysis of the market reaction to tax loss event firms, and to perform subsequent

    out-of-sample robustness analysis, we also estimate our prediction model and DTR-score based

    on a matched control sample. To identify this control sample, we utilize fiscal year and 4-digit

    NAICS industry to determine a maximum one-to-five match for our event sample observations.

    3.5 Market Reaction – Abnormal Returns

    To test Hypotheses 1 and 2, and examine the market reaction of stakeholders to tax loss

    events, we perform tests of monthly abnormal returns in a four-factor Fama-French (1993)

    model. We assess the market reaction to our tax loss event and control firms conditional on the

    strength of ETR risk management, which we proxy for using a firm-level, industry-adjusted

    measure of ETR volatility. We perform separate regressions for the years preceding, including,

    following and surrounding our event year. Our empirical model, based on Wilson (2009), is as

    follows:

    19

  • XRit = β1TaxEvent*WeakRMi + β2Control*WeakRMi + β3Control*StrongRMi +

    β4Tax_Event*StrongRMi + bMKTRFt + sSMBt + hHMLt + wUMDt + εit (3).

    See Table 2 for full details on our variable descriptions. XR and MKTRF represent

    abnormal monthly firm and market returns, respectively, and SMB, HML and UMD represent

    size, book-to-market and momentum premia, respectively. We interact a dichotomous measure

    of tax loss events (TaxEvent for event firms, Control for selected control firms) with a

    dichotomous measure of ETR risk management (StrongRM and WeakRM) and include all four

    two-way interaction variables in the model (and omit the intercept).

    Our ETR risk management proxy is based on our measure of ETR Volatility. ETR

    Volatility is measured as the standard deviation of a firm’s annual ETR less the four-digit,

    NAICS industry standard deviation of annual ETR, where standard deviation is constructed over

    a ten year period. Firms with lower volatility are considered stronger risk managers. Thus we

    label firms with below-median ETR Volatility as StrongRM and those with above-median ETR

    Volatility as WeakRM. We then interact these conditions with our tax loss event conditions to

    construct our four two-way interaction variables of interest.

    4. Results for Tests of the Prediction Model

    4.1 Descriptive Statistics – Sample Distributions

    As a first set of analyses, we examine the distribution of observations by our firm-level

    Cash ETR threshold. We compare our overall control sample of observations (7,154 firm-years,

    with four-digit industry) to both our aggregate extreme Cash ETR sample (i.e., 284 potential tax

    loss event firm-years) and our validated extreme Cash ETR sample (i.e., 52 event firm-years, 192

    non-event firm-years and 40 vague, non-event firm-years). Table 3, Panels A through D

    20

  • respectively, shows distributions across fiscal year, 2-digit industry (rather than 4-digit, for

    brevity), deciles of size and deciles of profitability.

    Table 3, Panel A, shows that all years from 2002 through 2012 have a potential tax loss

    event. However, most likely due to the economic downturn, a disproportionately larger number

    of potential tax loss events occur in the middle period of our sample years.7 Our tax event sample

    rises through 2007 and then declines again. Table 3, Panel B, shows that every industry has a

    firm with a tax loss event; we retain industries with at least one extreme Cash ETR observation.

    Also, relatively high proportions of large observations occur in agriculture (code 11; 10.5% of

    the total sample), construction (code 23; 9.4% of the total sample) and extractive (code 21; 8.4%

    of the total sample) industries. Our tax event sample also has a high number of events from the

    agriculture industry (4.8% of the total sample).

    Table 3, Panel C, gives some indication that the extreme Cash ETR values are found

    among firms that are smaller. The lowest three deciles of size have over 35 observations each.

    However, the distribution of our tax event sample is much more even across the size deciles; in

    fact, the highest number of tax event observations (i.e., 7) occurs in the smallest and largest

    deciles. Thus, size does not appear as significant a factor of tax loss events as of high Cash

    ETR.8 On the other hand, profitability does appear to be an important factor to tax loss events.

    Table 3, Panel D, shows that 131 of the 284 potential tax loss events (i.e., 46%) are reported in

    the lowest two deciles of profitability. Similarly, 26 of the 52 tax events (i.e., 50%) are reported

    7 The economic downturn would affect the Cash ETR values by decreasing the denominator, making the observed ratio higher. While we verify each observation, the larger initial sample due to this statistical phenomenon will yield a larger number of real events. 8 A chi-square test fails to reject the null hypothesis of no association between size decile and loss event classifications at the 10% level of statistical significance.

    21

  • in the lowest two deciles of profitability also. As described in Sections 3.2 and 3.3, we control

    for firm size (i.e., SIZE) and profitability (i.e., ROA) in our prediction model.

    4.2 Descriptive Statistics – Univariate Analysis

    Building on the sample distribution analysis, we provide traditional descriptive statistics

    for each independent variable in our model in Table 4. These statistics are provided for variables

    measured in the year prior to the event period. We include both the full sample of non-tax event

    observations, as well as the matched sample, as described above. Finally, we report the p-value

    from a t-test of means for each variable.

    The variance of cash ETR, the presence of ICWs and book-tax differences have

    differences in means at the 1% level of statistical significance. In comparison to the full sample,

    the number of segments is also different at the 5% level of statistical significance, but is very

    similar in comparison to the matched sample. On the other hand, the mean return on assets and

    mean R&D are different in comparison to the matched sample at the 5% level, whereas these

    means are different from the respective means of the full sample at only the 10% level of

    statistical significance. The differences in means are generally in the direction we predicted,

    except RND and BTD are lower for the tax events, contrary to our prediction of a positive

    relation.9

    4.3 Logistic Regression Results

    In this section we report the results from our primary logistic regression models that

    predict a tax loss event. Table 5 contains two columns of results based on which sample is used

    as the control: the first column presents results based on the full sample of observations; the

    second column presents results using the matched sample as a control. Within each column, by

    9 In untabulated analysis, no significant difference exists, statistically or in magnitude, between event firms and control firms on the basis of intangible assets. Intangible assets are frequently linked to R&D activities.

    22

  • variable, the first value reported represents the coefficient; the second value (in parentheses)

    represents the z-statistic, and; the third value (in straight brackets) represents the coefficient

    interpreted as a marginal effect.

    The first column of results in Table 5 demonstrates that our logistic model provides a

    reasonable prediction of the likelihood that a firm has a tax loss event (i.e., a settlement with tax

    authorities). The pseudo R2 value of 0.12 and chi2 statistic of 107.4 imply that the model is well

    fit. Furthermore, the area under the Receiver Operator Characteristic (ROC) Curve is 0.80,

    further validating the fit of the model. This ROC Curve statistic implies that for a random pair of

    firms – one with a tax loss event and one without– our model would correctly identify the firm

    with a tax loss event 80% of the time (see Hosmer and Lemeshow, 2000). In contrast, a ROC

    Curve statistic of 0.50 would indicate that our model has no greater predictive power than chance

    (i.e., 50%).

    In the first column of Table 5, using a 5% cut off on two-tailed tests of statistical

    significance, seven of the nine firm characteristics are statistically significant; ROA and SIZE are

    not statistically significant. Cash ETR VARIANCE, FOROPS, MNA, NSEG, ICW, and

    DURATION are positively related to the likelihood of a tax loss event, as expected. Thus firms

    with greater multinational scope and firms undergoing a business combination, both of which

    can imply complexity and more uncertain tax opportunities, are more likely to have an

    unfavorable settlement with tax authorities.

    The other three statistically significant coefficients in the first column of Table 5 are

    negatively related with our dependent variable. DURATION_SQ is negative, as predicted,

    consistent with the effect of DURATION declining as the duration grows. Consistent with

    univariate results, firms with lower R&D expense are more likely to have a tax loss event. It is

    23

  • possible that these firms, on average, do not have the tax shield created by R&D and so take on

    riskier tax activities. BTD is also estimated to have a negative relation with the tax event

    indicator. Although this result is unexpected, evidence in Table 3 Panel D documents that a high

    proportion of firms with tax loss events have low profitability. If profitability is low, then book

    income may be low relative to taxable income, particularly if tax expense (or ETR) is higher

    pursuant to an accrual for settlement. Untabulated analysis documents that 46% of the tax loss

    event firm-years in the bottom quartile of the range of BTD (i.e. BTD < 0.004) have below-

    median levels of profitability.

    Looking at the second column of Table 5, the pseudo R2 value indicates that the

    explanatory power of the model improves with the matched sample. The area under the ROC

    Curve increases slightly, to 82%, also. The individual coefficients differ across the two

    regressions, but are generally similar in sign and statistical significance. However, MNA and the

    two DURATION coefficients are no longer statistically significant at the 10% level using a two-

    tailed test.

    Broadly speaking, the evidence in Table 5 demonstrates that the firm characteristics we

    have chosen provide reasonable predictions of the likelihood that a firm experiences a significant

    tax payment in the year following. These firms have encountered an unfavorable tax outcome,

    evidence of the downside risk inherent in their tax planning decisions.

    4.4 Examining the Estimated Predictions In-Sample: DTR-score

    As we view the existence of a tax event to be an extreme negative event, consistent with

    higher probability of such outcomes, we use the prediction model in the previous section to

    potentially identify other firm-years in which such risk existed, but the extreme tax payment is

    not observed. To achieve this objective, we follow Dechow et al. (2011) and compute the

    24

  • predicted probability from our fitted values for our full and matched-sample model. These

    predicted values are then deflated by the unconditional probability of the event to yield a ratio,

    our DTR-score. The ratio is greater than 1 if the predicted probability exceeds the unconditional

    predicted probability.

    Table 6 tabulates these scores by quintile for both the full sample and the matched

    sample. Each quintile is segmented into the observed tax events and the observations without an

    observed tax event. The table reveals that the observed tax events are increasing across the

    quintiles, with 33 and 26 events (63% and 50% of the events) found in the highest quintile when

    using the full sample and matched control sample, respectively. The sensitivity, that is, the

    observed events correctly identified using a cut-off score of 1, equals 73% and 75%,

    respectively, for the two analyses. These sensitivities are similar to those reported by Dechow et

    al. (2011). The type I error rates (that is, non-event observations with a score greater than 1) are

    28% and 24%, respectively. This compares favorably to the rates in Dechow et al. of 36%. One

    would expect a reasonable proportion of Type I errors since many firms that have a high risk of

    the extreme tax payment are not caught by tax authorities.

    The evidence in Table 6 suggests that the prediction model provides meaningful

    separation of firms that have a tax event and firms that are in the control sample. These analyses

    support the conclusions drawn from the ROC curve described above. As further corroboration of

    our DTR-score, we provide Figures 1 and 2. These figures show the distribution of DTR-score

    for event and matched firms in the t-4 to t+4 window around the event (match) year. Figure 1

    reports the median DTR-score for each sample, and Figure 2 reports the percentage of DTR-

    score values above 1.0.

    25

  • Figure 1 shows that the median DTR-score is higher for event firms relative to matched

    firms in the t-1 through t+2 period. Given that DTR-score is calculated from firm characteristics

    measured in the prior period, these scores reflect higher median downside tax risk in the t-2

    through t+1 period. Thus the downside risk associated with the likelihood of tax loss events

    appears to persist over at least four years, and is highest in the period of the event. Similarly,

    Figure 2 shows that the percentage of DTR-scores above 1.0 is also near its peak in the event

    period, and that the separation between event and match firms persists over a similar four-year

    period.

    5. Market Reaction Tests

    5.1 Abnormal Returns with Tax Event Firms matched to Primary Control Sample

    In this and the following section, we report the results of our market reaction tests of

    Hypotheses 1 and 2. That is, we look at the abnormal returns from a four factor Fama-French

    (1993) model for a sample of event and non-event firms to determine whether the risk of a tax

    loss event is valued in the pre-event period, and whether tax loss event firms with high ETR

    volatility have lower returns than tax loss event firms with low ETR volatility in the event

    period. We also examine the post-event period and aggregate period for completeness.

    In columns (1) through (4) of Table 7, Panel A, we report results from the abnormal

    returns test where our tax loss event firms are matched with control firms based on year and

    industry (as identified in Section 4.3). Column (1), the pre-period, shows significant (1% level,

    two-tailed), positive coefficients for Control*WeakRM and Control*StrongRM. This suggests

    that for all control firms, the returns are positive, whereas the returns are not positive for the

    event firms. An F-test that compares the two coefficients for event firm-years to those of control

    26

  • firm-years show a difference that is statistically different at the 10% level using a two-tailed test.

    This provides support for Hypothesis 1 using data from years before the event.

    Using data for the year of the event, the coefficient on TaxEvent*WeakRM is negative

    whereas the coefficient on Control*WeakRM is zero. The coefficients for the StrongRM

    interactions are positive for the Control sample and not statistically significant for the TaxEvent

    sample. Like the year prior, the test of Hypothesis 1 rejects the null at the 5% level of statistical

    significance in the event period.

    In a test of Hypothesis 2, the coefficients of TaxEvent*StrongRM and Control*StrongRM

    are significantly different from one another at the 10% level using a two-tailed test. The

    difference in coefficients across the WeakRM groups is not statistically significant at

    conventional levels. Thus, using data from t=0, the negative response is evident within the strong

    risk management firms whereas it is not evident in the weak risk management firms.

    Using data from the prior year, the returns for the TaxEvent sample is negative relative to

    the Control sample only for the WeakRM group (at the 10% level of statistical significance, one-

    tailed); the difference across the StrongRM group is not statistically significant.

    Results reported in columns (3) and (4) of Table 7, Panel A, show that in the post-period

    and for all years for the firms, there are no differences across the coefficients.

    5.2 Abnormal Returns with Predicted Values in the Broader Sample

    To assess the ability of our model to predict the out-of-sample likelihood of a tax event

    we also re-estimate our abnormal returns model where our tax loss event is defined using our

    prediction model. Specifically, we estimate PTaxEvent for all of the firms not matched to our event

    sample, using the coefficient values from the second column of Table 5. We label this predicted

    27

  • probability PDTR. Firm-year observations in the highest decile of the distribution of PDTR are also

    labeled as predicted TaxEvent observations (i.e. PrTaxEvent, otherwise labeled PrControl).

    In column (1) of Table 7, Panel B, we report the re-estimation of Equation (3) after the

    interaction of PrTaxEvent and PrControl with StrongRM and WeakRM. In column (2), we adjust

    the model to include PDTR, PrTaxEvent and the interaction of these two variables (as well as an

    intercept). In column (3), we supplement the column (2) model by adding StrongRM and the

    respective two-way interactions and three-way interactions between StrongRM, PDTR and

    PrTaxEvent. As no actual event date exists for this sample, we re-estimate the models across the

    full sample period (i.e. 2002-2012), rather than in the t-1 to t+1 window around an actual event.

    The results from column (1) of Table 7, Panel B provide evidence that across the broad

    hold-out sample, observations with a higher predicted downside tax risk generate higher returns

    for investors. The returns for the predicted TaxEvent sample are positive relative to the Control

    sample for both the WeakRM and StrongRM groups (at the 5% level of statistical significance,

    two-tailed). The difference within the predicted TaxEvent group across WeakRM and StrongRM

    is not statistically significant, however. Broadly, this evidence is consistent with Hypothesis 1, in

    that greater risk requires a greater return.

    The results from columns (2) and (3) of Table 7, Panel B are weaker. None of the

    coefficients of interest are statistically significant in column (2). In column (3), the return to

    investors appears to increase as the probability of downside tax risk increases for firms with low

    ETR volatility (i.e. StrongRM*PDTR is positive and statistically significant at the 5% level).

    6. Conclusion

    This study contributes to the literature on tax risk, yet from a different perspective than

    extant research. We consider downside tax risk as the probability of a loss event, in which an

    28

  • unfavorable and perhaps unexpected tax settlement occurs. Like studies of extreme downside

    risk (e.g. Huang et al., 2012), crash risk (e.g., Jin and Myers, 2006; Hutton et al., 2009; Kim et

    al., 2011) and operational risk (e.g., Chernobai et al, 2011; Brown et al., 2012), we focus on the

    observations in the extreme negative tail of the distribution, where the likelihood of a loss is

    significantly greater. With this conceptualization in mind, we identify firms with extreme and

    high values of Cash ETR (relative to an industry benchmark) and a disclosed tax settlement in

    their financial statements as tax loss event firms. Furthermore, we develop a logistic prediction

    model from firm characteristics on which we measure a scaled probability (DTR-score) that

    reflects the likelihood of a tax loss event and is a signal of downside tax risk. Finally, we

    examine the extent to which the market reacts to this downside tax risk.

    Empirical results from tests of our prediction model demonstrate that the likelihood of a

    tax loss event (i.e., unfavorable settlement with tax authorities) is positively associated with tax

    risk factors such as higher Cash ETR volatility, foreign operations, M&A activity, business

    segments (i.e. complexity) and SOX internal control weaknesses. This likelihood is also

    negatively associated with R&D expenditures and book-tax differences. Using our DTR-score,

    we demonstrate that in a broad sample our model correctly classifies the likelihood of a tax loss

    event (i.e. DTR-score above 1) for observed events with 73% accuracy. Also, our classification,

    Type 1 and Type II error rates compare favorably to other studies that examine the likelihood of

    relatively rare events in broad samples (e.g. financial misstatements; Dechow et al., 2011).

    Similar classification rates result from a matched sample analysis of our DTR-score, further

    validating the strength of our prediction model and our proxy for downside tax risk.

    Empirical results from tests of the market reaction to these tax loss events indicate that

    the shareholders of these tax loss event firms anticipate the tax loss events, although the negative

    29

  • market reaction appears highest in the year of the extreme negative event. We fail to find a

    differential response in the market to firms with high GAAP ETR volatility as compared to low

    ETR volatility. In a holdout sample examination of the predicted probability of tax loss events,

    we find some evidence that firms with higher downside tax risk generate higher returns for

    shareholders. Again, we do not find a differential response across ETR volatility, a proxy for the

    strength of tax risk management.

    We believe that our study provides valuable insight regarding the nature of tax risk within

    firms. In future research, our model could be used not only to predict downside tax risk in a

    broader sample but also to help evaluate the extent to which firms effectively manage their tax

    risk. More generally, our model and its predicted probabilities should be useful for market

    participants and other firm stakeholders in their assessments of firm risk.

    30

  • Appendix Discussion of Tax Loss Event firms and UTB Settlement Data from the post-FIN 48 Period

    Our identification of Tax Loss Event firms relies on manual verification of information (i.e. disclosure of unfavorable settlements with tax authorities) contained in 10-K reports. Our sample period spans 2002 to 2012, and thus nearly half of it falls in the post-FIN 48 period – when firms disclosure UTB reserves and settlements in their 10-K reports. Compustat collects and compiles information from UTB disclosures, yet we do not rely on UTB settlements (i.e. TXTUBSETTLE per Compustat) for two reasons. One, observations from the pre-FIN 48 period would be lost, reducing the generalizability and scope of our prediction model. Two, the value of TXTUBSETTLE does not necessarily represent an unfavorable settlement with tax authorities, if it is disclosed or compiled at all. For example, as with International Speedway Corp. below, the value of TXTUBSETTLE could simply represent the financial statement reversal of a previous accrual in which no additional taxes paid are required upon settlement. Thus without additional, manual verification, the UTB settlement data represents a noisy proxy for our categorization of tax loss events, a proxy for asymmetric, downside tax risk.

    The following table lists information for firms above our Cash ETR threshold to highlight four different situations that could arise in the post-FIN 48 period, some of which represent tax loss events and some of which represent noise (i.e. non-events). We explain each of these four situations following the table.

    GVKEY FYEAR Company Name TXPD TXTUBSETTLE Tax Loss Event 5680 2009 Lowe’s Companies Inc. $1,157M $1M 0 6116 2009 International Speedway Corp. $36.297M $121.532M 0 8530 2010 Pfizer Inc. $11,775M $0M 1

    12403 2009 Fred’s Inc. $22.999M $8.6M 1 Lowe’s Companies Inc. Lowe’s reports a UTB settlement but we label it as a non-event observation. Furthermore, this observation is also part of the group with “vague” disclosures. We consider the disclosure to be indicative of a vague non-event because certain information implies a settlement; however it is unclear whether any related amount was paid during the year (and thus contributes to the extreme Cash ETR value observed). The 10-K contains the following excerpt:

    “The Company is subject to examination by various foreign and domestic taxing authorities. During 2009, the IRS completed its examination of the Company’s 2004 and 2005 income tax returns, with the exception of certain issues that are presently under appeal.”

    International Speedway Corp. International Speedway reports a UTB settlement but we label it as a non-event observation. As indicated in the disclosure below, the majority of this settlement amount represents a return of deposit made by International Speedway in a previous year. Thus the settlement amount that firms disclose represents a reduction of the UTB reserve amount but it does not necessarily represent taxes paid during the year. In this particular case, the settlement is favorable and no additional taxes paid are required. An excerpt from the 10-K is as follows:

    31

  • “The company entered into a definitive settlement agreement with the Internal Revenue Service in connection with the federal income tax examination for the 1999 through 2005 fiscal years on May 28, 2009. As a result of the Settlement, on June 17, 2009, the company received approximately $97.4 million of the $117.9 million in deposits that it had previously made with IRS. In addition, it received approximately $14.6 million in cash for interest earned on the deposited funds.”

    Pfizer Inc. Pfizer does not report a UTB settlement but we label it as a tax loss event observation. In its UTB reconciliation (untabulated in this Appendix), Pfizer does not include a line specifically labeled as for “settlements”. Therefore Compustat does not recognize any amount for its TXTUBSETTLE variable. The following excerpt, however, clearly identifies a settlement with tax authorities:

    “During the fourth quarter of 2010, we reached a settlement with the U.S. Internal Revenue Service (IRS) related to issues we had appealed with respect to the audits of the Pfizer Inc. tax returns for the years 2002 through 2005, as well as the Pharmacia audit for the year 2003 through the date of merger with Pfizer (April 16, 2003). The IRS concluded its examination of the aforementioned tax years and issued a final Revenue Agent’s Report (RAR). We have agreed with all of the adjustments and computations contained in the RAR.”

    Fred’s Inc. Fred’s reports a UTB settlement and based on additional information in its 10-K we classify it as a tax loss event observation. The additional information indicates that taxes paid during the year reflect, in part, amounts paid in settlement. The 10-K disclosure includes the following excerpt:

    “…the Internal Revenue Service [finalized] an exam of the Company during 2009 covering fiscal years 2004 through 2007. The Company recorded a reduction to unrecognized tax benefits for settlement of the IRS exam which included tax and interest in the amount of $8.6 million.”

    32

  • References

    Altman, E. I., 1968. Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. Journal of Finance 23(4), 589-609.

    Ang, A., Chen, J., Xing, Y., 2006. Downside risk. The Review of Financial Studies 19(4), 1191-1239.

    Bali, T. G., Demirtas, K. O., Levy, H., 2009. Is there an intertemporal relation between downside risk and expected returns? Journal of Financial and Quantitative Analysis 44(4), 883-909.

    Bauer, A., 2013. Tax Avoidance and the Implications of Weak Internal Controls. Working paper, University of Illinois at Urbana-Champaign.

    Beck, N., Katz, J.N., Tucker, R., 1998. Taking Time Seriously in Binary Time-Series-Cross-Section Analysis. American Journal of Political Science 42(4), 1260-1288.

    Boyer, C. M., Ciccone, S. J., Swicegood, P., 2010. The search for A dog-free portfolio: Why unions matter to investors. Journal of Business & Economics Research 8(6), 67-71.

    Brown, S., Goetzmann, C., Liang, B., Schwarz, C. 2012. Trust and Delegation. Journal of Financial Economics 103, 221-234.

    Chen, S., Chen, X., Cheng, Q., Shevlin, T., 2010. Are family firms more tax aggressive than non-family firms? Journal of Financial Economics 95(1), 41–61.

    Chernobai, A., Jorion, P., Yu, F., 2011. The Determinants of Operational Risk in U.S. Financial Institutions. Journal of Financial and Quantitative Analysis 46(6), 1683-1725.

    Dechow, P.M., Ge, W., Larson, C.R., Sloan, R.G., 2011. Predicting Material Accounting Misstatements. Contemporary Accounting Research 28(1), 17-82.

    Dittmar, R. F., 2002. Nonlinear pricing kernels, kurtosis preference, and evidence from the cross section of equity returns. The Journal of Finance 57(1), 369-404.

    Dyreng, S., Hanlon, M., Maydew, E., 2008. Long-Run Corporate Tax Avoidance. The Accounting Review 83(1), 61-82.

    Dyreng, S., Hanlon, M., Maydew, E., 2010. The Effects of Executives on Corporate Tax Avoidance. The Accounting Review 85(4): 1163-1189.

    Fama, E. F., French, K. R., 1993. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics 33(1), 3-56.

    Frank, M., Lynch, L., Rego, S., 2009. Tax Reporting Aggressiveness and Its Relation to Aggressive Financial Reporting. The Accounting Review 84(2), 467-496.

    Froot, K. A., Scharfstein, D.S., Stein, J. C., 1993. Risk management: Coordinating corporate investment and financing policies. The Journal of Finance 48(5), 1629–1658.

    Gleason, C.A., Mills, L.F., 2002. Materiality and Contingent Tax Liability Reporting. The Accounting Revnew 77(2), 317-342.

    Graham, J. R., Raedy, J. S., Shackelford, D. A., 2012. Researching in accounting for income taxes. Journal of Accounting and Economics 53(1-2), 412–434.

    Grant, J., Markarian, G., Parbonetti, A., 2009. CEO Risk-Related Incentives and Income Smoothing. Contemporary Accounting Research 26(4), 1029–1065.

    Guay,W.,1999.The impact of derivatives on firm risk: an empirical examination of new derivative users. Journal of Accounting and Economics 26, 319–351.

    33

  • Guenther, D. A., Matsunaga, S. R., Williams, B. M., 2013. Tax Avoidance, Tax Aggressiveness and Firm Risk. Working paper, University of Oregon.

    Hanlon, M., 2003. What Can We Infer about a Firm’s Taxable Income from Its Financial Statements? National Tax Journal, 56(4), 831-863.

    Hanlon, M., Mills, L., Slemrod, J., 2007. An empirical examination of corporate tax noncompliance. In Taxing Corporate Income in the 21st Century, edited by A. Auerbach, J. R. Hines, Jr., and J. Slemrod. Cambridge, U.K.: Cambridge University Press.

    Hanlon, M., Heitzman, S., 2010. A Review of Tax Research. Journal of Accounting & Economics 50 (2-3), 127-178.

    Herskovitz, D.L., 1997. Managing an IRS audit: Practical insights and techniques. A dialogue. Tax Executive 49, 375-385.

    Higgins, D., 2013. Communicating Information Beyond the Tax Footnote: Effective Tax Rate Forecasts in the MD&A. Working paper, University of Connecticut.

    Hosmer, D., Lemeshow, S., 2000. Applied Logistic Regression. 2nd edition. New York, NY: John Wiley & Sons, Inc.

    Huang, W., Liu, Q., Rhee, S.G., Wu, F., 2012. Extreme downside risk and expected stock returns. Journal of Banking and Finance 36(5), 1492-1502.

    Hutchens, M., Rego, S., 2013. Tax Risk and the Cost of Equity Capital. Working paper, Indiana University.

    Hutton, A.P., Marcus, A.J., Tehranian, H., 2009. Opaque financial reports, R2, and crash risk. Journal of Financial Economics 94, 67–86.

    Jin, L., Myers, S.C., 2006. R2 around the world: new theory and new tests. Journal of Financial Economics 79, 257–292.

    Kim, J-B., Li, Y., Zhang, L., 2011. Corporate tax avoidance and stock price crash risk: Firm-level analysis. Journal of Financial Economics 100(3), 639–662.

    Lavermicocca, C., 2011. Tax Risk Management Practices and their Impact on Tax Compliance Behaviour – The Views of Tax Executives from Large Australian Companies. eJournal of Tax Research 9(1), 89-115.

    Lenter, D., Shackelford, D. A., Slemrod, J., 2003. Public Disclosure of Corporate Tax Return Information: Accounting, Economics, and Legal Perspectives. National Tax Journal, 56(4), 803-830.

    Lisowsky, P., 2010. Seeking Shelter: Empirically Modeling Tax Shelters Using Financial Statement Information. The Accounting Review 85(5), 1693-1720.

    March, J. G., Shapira, Z., 1987. Managerial perspectives on risk and risk taking. Management Science 33(11), 1404-1418.

    McCarty, R.D., 2012. Optimal Tax Risk and Firm Value. Working paper, Northeastern University.

    McGuire, S. T., Neuman, S. S., Omer, T. C., 2013. Sustainable Tax Strategies and Earnings Persistence. Working paper, Texas A&M University.

    Mills, L.F., 1998. Book-Tax Differences and Internal Revenue Service Adjustments. Journal of Accounting Research 36(2), 343-356.

    Mitton, T., Vorkink, K., 2007. Equilibrium underdiversification and the preference for skewness. The Review of Financial Studies 20(4), 1255-1288.

    34

  • Moreno, D., Rodríguez, R., 2009. The value of coskewness in mutual fund performance evaluation. Journal of Banking & Finance, 33(9), 1664-1676.

    Mulligan, E., Oats, L., 2009, Tax Risk Management: Evidence from the United States. British Accounting Review 6, 680-703.

    Neubig, T., Sangha, B., 2004. Tax risk and strong corporate governance. The Tax Executive 56(2), 114–119.

    Neuman, S. S., Omer, T. C., Schmidt, A. P., 2013. Risk and Return: Does Tax Risk Reduce Firms’ Effective Tax Rates? Working paper, Texas A&M University, University of Nebraska-Lincoln, North Carolina State University.

    OECD, 2009. Corporate Governance and Tax Risk Management. Forum on Tax Administration, Organisation for Economic Co-operation and Development, Paris.

    PricewaterhouseCoopers, 2004. Tax risk management. London: PWC.

    Raedy, J. S., Seidman, J. K., Shackelford, D. A., 2011. Is There Information Content in the Tax Footnote? Working paper, University of North Carolina and University of Texas-Austin.

    Reed, K., 2006. “GSK Pays ‘Biggest Tax Settlement in U.S. History.” Accountancy Age, September 9, 2006. Retrieved October 8, 2013 from http://www.accountancyage.com/aa/news/1768893/gsk-pays-biggest-tax-settlement-us-history.

    Shumway, T., 2001. Forecasting Bankruptcy More Accurately: A Simple Hazard Model. The Journal of Business 74(1), 101-124.

    Tufano, P., 1996. Who manages risk? An empirical examination of risk management practices in the gold mining industry. The Journal of Finance 51(4), 1097–137.

    Wilson, R., 2009. An Examination of Corporate Tax Shelter Participants. The Accounting Review 84(3), 969-999.

    Wunder, H., 2009. Tax risk management and the multinational enterprise. Journal of International Accounting Auditing & Taxation 18(1), 14-28.

    Zhang, H., 2009. Effect of derivative accounting rules on corporate risk-management behavior. Journal of Accounting and Economics 47, 244-264.

    35

    http://www.accountancyage.com/aa/news/1768893/gsk-pays-biggest-tax-settlement-us-historyhttp://www.accountancyage.com/aa/news/1768893/gsk-pays-biggest-tax-settlement-us-history

  • Figure 1 Median DTR-score

    Figure 1 shows the median DTR-score in each year around the t=0 event year for our event and match sample observations. We include all firm-year observations related to both our event firms and our matched control firms in our sub-sample. A maximum of five control firms are matched on fiscal year and four-digit industry classification for each event observation. We estimate our prediction model and scale the predicted probability of a tax loss event by the unconditional probability of an event for the sub-sample to generate the DTR-score for each observation. For each year in the t-4 to t+4 window around the event year or match year, we report the median DTR-score by event firms and match firms.

    36

  • Figure 2 Percentage of DTR-score > 1.0

    Figure 2 shows the percentage of observations with a DTR-score above 1.0 in each year around the t=0 event year for our event and match sample observations. We include all firm-year observations related to both our event firms and our matched control firms in our sub-sample. A maximum of five control firms are matched on fiscal year and four-digit industry classification for each event observation. We estimate our prediction model and scale the predicted probability of a tax loss event by the unconditional probability of an event for the sub-sample to generate the DTR-score for each observation. For each year in the t-4 to t+4 window around the event year or match year, we report the percentage of observations with a DTR-score above 1.0 by event firms and match firms.

    37

  • Table 1 Sample Selection Procedure

    Criteria Firm-Year

    Observations Firm-Level

    Observations Public firms in Compustat, 2002-2012 99,446 13,982 Less: observations without an annual Cash ETR value (52,090) (5,338) 47,356 8,644 Less: observations with an annual ROA value less than 1/2% (5,563) (380) 41,793 8,264 Less: observations without ten years of consecutive data to measure Cash ETR benchmarks (33,221) (6,559) Remaining sample to estimate Cash ETR threshold, 2002-2012 8,572 1,705 Less: observations missing data for control variables (1,134) (198) Final sample for prediction model, 2002-2012 7,438 1,507

    38

  • Table 2 Variable Definitions

    Panel A – Prediction Model Variables

    Dependent Variable Constructs

    Criteria

    Tax Loss Event = 1 if an observation has an above-threshold value of Cash ETR and financial statements disclose an unfavorable settlement with tax authorities, 0 otherwise. Measured at time t.

    Cash ETR threshold = 1 if the difference between the firm-year annual Cash ETR and firm-level ten-year average Cash ETR exceeds 2 standard deviations of the ten-year, four-digit NAICS industry average Cash ETR, 0 otherwise. Measured in rolling windows at time t.

    Cash ETR = Per Dyreng et al. (2008), Taxes paid divided by (Pre-tax income less special items); txpd / (pi – spi), per Compustat. Set to missing if the denominator is negative and truncated between 0 and 1. For the ten-year rolling averages of Cash ETR, the sum of the numerator is divided by the sum of the denominator. Annual and rolling average values are measured at time t.

    DTR-score = The predicted probability of a tax loss event PTaxEvent (e.g. constructed using the coefficient values from Table 5) scaled by the unconditional probability of a Tax Loss Event in the sample data. Measured at time t, based on data at time t-1.

    Independent Variables Cash ETR Variance = The variance of annual Cash ETR over rolling five-year windows per

    firm. Measured at time t-1. ROA = Return on Assets; (pi – xi) / lag(at), per Compustat. Restricted to include

    values that are greater than or equal to ½% only. Measured at time t-1. FOROPS = 1 if foreign income is positive, 0 otherwise; Foreign income = pifo, per

    Compustat. Measured at time t-1. SIZE = Natural log of total assets; Total Assets = at, per Compustat. Measured at

    time t-1. RND = R&D expense divided by lagged total assets; xrd / lag(at), per

    Compustat. Set to zero if R&D expense is missing. Measured at time t-1. MNA = 1 if observation has mergers and acquisitions (M&A) activity during the

    year, 0 otherwise; M&A = aqp, per Compustat. Set to zero if M&A data is missing. Measured at time t-1.

    NSEG = Number of segments reported; segments reported = ptis, per Compustat segment file. Set to zero if segment data is missing or firm is not in Compustat segment file. Measured at time t-1.

    ICW = 1 if firm-year reports a material internal control weakness (ICW), 0 otherwise; ICW = IC_IS_EFFECTIVE (“N” or “Y”), per Audit Analytics. Measured at time t-1.

    BTD = Per Wilson (2009), Book-tax difference is book income less taxable income scaled by lagged assets; [(pi-(txfed+txfo)/stxr)-(tlcf-lag(tlcf))] / lag(at), per Compustat. If txfed is missing, taxable income = (txt-txdi-txs-txo) / stxr, where stxr is the statutory tax rate. Measured at time t-1.

    DURATION = A time trend representing the years elapsed from the beginning of the sample period (i.e. 2002 = 1, 2003 = 2, etc.).

    DURATION2 = The square of DURATION to complete a quadratic time trend.

    39

  • Table 2 – continued

    Panel B – Abnormal Returns Model Variables

    Dependent Variable Criteria XR = Firm i’s excess stock return during month t, the stock return minus risk-

    free rate [rit – rft]; ret – rf, per CRSP. Independent Variable Constructs

    TaxEvent (PrTaxEvent) = 1 if firm i has a Tax Loss Event value of 1 (i.e., above-threshold Cash ETR value and financial statements disclose an unfavorable settlement with tax authorities), 0 otherwise. For holdout sample tests, PrTaxEvent equals 1 if a firm-year observation is in the top decile of DTR-score, 0 otherwise.

    Control (PrControl) = 1 if firm i has a Tax Loss Event value of 0 and has been matched to a Tax Loss Event firm (based on event year and four-digit NAICS industry), 0 otherwise. For holdout sample tests, PrControl equals 1 if a firm-year observation is outside the top decile of DTR-score, 0 otherwise.

    WeakRM = 1 if firm i has an above-median value of ETR Volatility, 0 otherwise. StrongRM = 1 if firm i has a below-median value of ETR Volatility, 0 otherwise. ETR Volatility = The standard deviation of firm i’s annual ETR less the four-digit, NAICS

    industry standard deviation of annual ETR, where standard deviation is constructed over a ten year period. Per Dyreng et al. (2008), ETR equals tax expense divided by (Pre-tax income less special items); txt / (pi – spi), per Compustat. Set to missing if the denominator is negative.

    TaxEvent*WeakRM = Interaction of TaxEvent and WeakRM variables. Control*WeakRM = Interaction of Control and WeakRM variables. Control*StrongRM = Interaction of Control and StrongRM variables. TaxEvent*StrongRM = Interaction of TaxEvent and StrongRM variables. PDTR = Predicted downside tax risk, based on coefficients from the matched-

    sample prediction model in Table 5 column 2. For use in holdout sample tests.

    PDTR*PrTaxEvent = Interaction of PDTR and PrTaxEvent for use in holdout sample tests. StrongRM*PDTR = Interaction of StrongRM and PDTR for use in holdout sample tests. StrongRM*PrTaxEvent = Interaction of StrongRM and PrTaxEvent for use in holdout sample tests. StrongRM*PDTR* PrTaxEvent

    = Interaction of StrongRM, PDTR and PrTaxEvent for use in holdout sample tests.

    MKTRF = Excess market return in month t, the value-weighted market return minus risk-free rate [rmt - rft]; mktrf, per CRSP.

    SMB = Size premium for month t, the difference in returns bet


Recommended