Essays in Financial Economics Lira Rocha da Mota Mertens

Essays in Financial Economics

Lira Rocha da Mota Mertens

Submitted in partial fulfillment of therequirements for the degree of

Doctor of Philosophyunder the Executive Committee

of the Graduate School of Arts and Sciences

COLUMBIA UNIVERSITY

2021

© 2021


All Rights Reserved

Abstract

Essays in Financial Economics


This dissertation studies topics in financial economics. In the first chapter, The Corporate

Supply of (Quasi) Safe Assets, I examine whether the demand for safe assets affects nonfinancial

corporations in the US. Investors value safety services in financial assets, such as the ability to serve

as a store of value, to serve as collateral, or to meet mandatory capital and liquidity requirements.

I present a model in which investors value safety services not only in traditional safe assets such

as US Treasuries, but also in corporate debt. Shareholders thus maximize the value of the firm

by complementing standard business operations with safe asset creation. Based on this theoretical

framework, I use the CDS-bond basis to derive a measurement of the safety premium of corporate

bonds. I document substantial cross sectional variation in the safety premium of corporate bonds,

which allows me to test the model’s predictions. I show that a high safety premium leads to

a marked increase in debt issuance by relatively safer firms. These debt proceeds have a small

impact on real investment and are largely used instead for equity payouts. This mechanism can

explain why, in the aftermath of the financial crisis, non-financial investment grade companies

significantly increased their debt issuance and equity payout while investment remained weak.

The second chapter, The Cross-Section of Risk and Return, focuses on a common practice

in the finance literature which is to create characteristic portfolios by sorting on characteristics

associated with average returns. We show that the resultant portfolios are likely to capture not only

the priced risk associated with the characteristic but also unpriced risk. We develop a procedure

to remove this unpriced risk using covariance information estimated from past returns. We apply

our methodology to the five Fama-French characteristic portfolios. The squared Sharpe ratio of the

optimal combination of the resultant characteristic efficient portfolios is 2.13, compared with 1.17

for the original characteristic portfolios.

In the third chapter, Should Information be Sold Separately? Evidence from MiFID II, we ex-

amine whether selling information separately improves its production. We use a recent regulation

in Europe (MiFID II) that unbundles research from transactions to investigate this question. We

show that unbundling causes fewer research analysts to cover a firm. This decrease does not come

from small- or mid-cap firms but is concentrated in large firms. Contrary to conventional wis-

dom, the reduction in the coverage quantity is accompanied by an increase in the coverage quality.

Further analyses suggest that the enhancement of analyst competition could drive the results: inac-

curate analysts drop out (extensive margin) and analysts who stay produce better-quality research

(intensive margin). Our findings suggest that selling information separately improves information

quality at the cost of reducing information quantity.

Table of Contents

List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii

List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x

Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

Chapter 1: The Corporate Supply of (Quasi) Safe Assets . . . . . . . . . . . . . . . . . . . 1

1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1.2 Related Literature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

1.3 Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1.3.1 Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

1.3.2 Model Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

1.3.3 Comparative Statics and Empirical Predictions . . . . . . . . . . . . . . . 15

1.3.4 Bringing the Model to the Data . . . . . . . . . . . . . . . . . . . . . . . . 18

1.4 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

1.5 Measuring the Corporate Safety Premium . . . . . . . . . . . . . . . . . . . . . . 21

1.5.1 Empirical construction of the CDS-bond basis . . . . . . . . . . . . . . . . 23

1.5.2 The Validity of Cross-Basis as the Relative Safety Premium Measure . . . . 25

1.6 Firms’ Response to Safety Premium . . . . . . . . . . . . . . . . . . . . . . . . . 27

1.6.1 Impact of Safety Premium on Net Debt Issuance . . . . . . . . . . . . . . 27

i

1.6.2 Impact of Safety Premium on Firms’ Real Decisions . . . . . . . . . . . . 32

1.6.3 Heterogeneity across Firms’ Characteristics . . . . . . . . . . . . . . . . . 34

1.6.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

1.7 Robustness and Additional Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

1.7.1 Is the Cross-basis Really Capturing Variation in the Safety Premium? . . . 36

1.7.2 Delayed Reactions: Effect of Safety Premium in Different Time Horizons . 39

1.7.3 Limits to Arbitrage and the Demand for Safety . . . . . . . . . . . . . . . 40

1.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

1.9 Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

Chapter 2: The Cross-Section of Risk and Return . . . . . . . . . . . . . . . . . . . . . . . 63

2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

2.2 Theory and examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

2.2.1 A simple example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

2.2.2 The general case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78

2.3 Empirical results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

2.3.1 Hedge portfolio construction . . . . . . . . . . . . . . . . . . . . . . . . . 84

2.3.2 Description of the sorted portfolios . . . . . . . . . . . . . . . . . . . . . . 88

2.3.3 Pricing results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

2.3.4 Industry-neutral characteristic portfolios . . . . . . . . . . . . . . . . . . . 95

2.4 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

Chapter 3: Should Information be Sold Separately? Evidence from MiFID II . . . . . . . . 116

3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

ii

3.2 Regulation background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122

3.3 Hypotheses development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

3.4 Empirical design and data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128

3.4.1 Empirical design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128

3.4.2 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

3.5 Investigating the impact of unbundling: firm level analyses . . . . . . . . . . . . . 132

3.5.1 The impact of unbundling on analyst coverage . . . . . . . . . . . . . . . . 134

3.5.2 The impact of unbundling on analysts forecast quality . . . . . . . . . . . . 135

3.6 Investigating the channel: analyst level analyses . . . . . . . . . . . . . . . . . . . 137

3.6.1 Intensive margin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137

3.6.2 Extensive margin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

3.7 Robustness check and extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

3.7.1 Firm’s brokerage house coverage . . . . . . . . . . . . . . . . . . . . . . . 145

3.7.2 Other quality measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146

3.7.3 Alternatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150

3.7.4 Brokerage house level employment . . . . . . . . . . . . . . . . . . . . . . 152

3.7.5 Capital market effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154

3.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156

3.9 Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

3.10 Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162

References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187

Appendix A: Chapter 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188

iii

A.1 Data Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188

A.1.1 Variable definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188

A.1.2 Measuring SG&A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188

A.1.3 FISD Cleaning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189

A.1.4 Merge FISD to Compustat . . . . . . . . . . . . . . . . . . . . . . . . . . 190

A.2 Measures of Aggregate Safety Premium . . . . . . . . . . . . . . . . . . . . . . . 191

A.3 Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192

A.3.1 Price of the Bond . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192

A.3.2 Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195

A.4 Infinite Horizon Dynamic Model . . . . . . . . . . . . . . . . . . . . . . . . . . . 195

A.4.1 Simulation Strategy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195

A.4.2 State Space Discretization . . . . . . . . . . . . . . . . . . . . . . . . . . 196

A.4.3 Main Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197

A.5 Additional Empirical Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198

A.5.1 Cash Flow Identity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

A.6 CDS-bond Basis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201

A.7 Alternative Model with Limits to Arbitrage . . . . . . . . . . . . . . . . . . . . . 203

A.7.1 Explicit Model of Arbitrageurs . . . . . . . . . . . . . . . . . . . . . . . . 203

A.7.2 An Example of a Negative CDS-bond Trade . . . . . . . . . . . . . . . . . 205

Appendix B: Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

B.1 Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

B.2 Proofs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210

iv

B.2.1 The characteristic efficient portfolios . . . . . . . . . . . . . . . . . . . . . 210

B.2.2 Proof of Proposition 2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 212

B.2.3 Proof of Proposition 2.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 212

B.2.4 The optimal hedge ratio . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214

B.3 Empirical details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215

B.3.1 Empirical definition of main variables . . . . . . . . . . . . . . . . . . . . 215

B.3.2 Loading estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217

B.3.3 Dealing with missing prices . . . . . . . . . . . . . . . . . . . . . . . . . 217

B.4 Supplemental results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

B.4.1 Portfolio bin population . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

B.4.2 High power vs. low power . . . . . . . . . . . . . . . . . . . . . . . . . . 219

Appendix C: Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224

C.1 Data construction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224

C.1.1 Firm level observations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224

C.1.2 Choice of data frequency . . . . . . . . . . . . . . . . . . . . . . . . . . . 225

C.1.3 Variable definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226

C.2 Additional figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228

C.3 Other summary statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232

C.4 Additional regression results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

C.4.1 Dynamic coefficients point estimates . . . . . . . . . . . . . . . . . . . . . 237

C.4.2 Additional results for firm level coverage . . . . . . . . . . . . . . . . . . 239

C.4.3 Additional results for analyst informativeness . . . . . . . . . . . . . . . . 242

v

C.5 Propensity score matching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243

vi

List of Tables

1.1 Calibration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

1.2 Number of Bonds and Firms with Valid CDS-Bond Basis . . . . . . . . . . . . . . 52

1.3 Firm-Level Summary Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

1.4 CDS-bond Basis Summary Statistics . . . . . . . . . . . . . . . . . . . . . . . . . 54

1.5 CDS-Bond cross section Analyses . . . . . . . . . . . . . . . . . . . . . . . . . . 55

1.6 Treasury Safety Premium Proxies . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

1.7 CDS-Bond Time Series Analyses . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

1.8 Impact of Safety Premium on Net Debt Issuance . . . . . . . . . . . . . . . . . . . 58

1.9 Impact of Safety Premium on Firms’ Decisions . . . . . . . . . . . . . . . . . . . 59

1.10 Heterogeneous Impact of Cross-Basis on Firms’ Decisions . . . . . . . . . . . . . 60

2.1 Low book-to-market stocks in the Money industry as of June 2008 . . . . . . . . . 105

2.2 Average monthly excess returns for the sorted portfolios . . . . . . . . . . . . . . . 106

2.3 Alphas and loadings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

2.4 Results of time-series regressions of hedge-portfolios . . . . . . . . . . . . . . . . 112

2.5 Sharpe Ratio improvement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

2.6 Ex-post optimal Markowitz weights . . . . . . . . . . . . . . . . . . . . . . . . . 114

2.7 Spanning tests for HML . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

vii

2.8 Correlations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115

3.1 Firm Level Summary Statistics (Sample for Coverage Quantity) . . . . . . . . . . 162

3.2 Firm Level Summary Statistics (Sample for Coverage Quality) . . . . . . . . . . . 163

3.3 Analyst Level Summary Statistics (Sample for Intensive Margin) . . . . . . . . . . 164

3.4 Analyst Level Summary Statistics (Sample for Extensive Margin) . . . . . . . . . . 165

3.5 One Difference (Pre and Post) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166

3.6 Firm Level Outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167

3.7 Analyst Level Outcomes (Intensive Margin) . . . . . . . . . . . . . . . . . . . . . 168

3.8 Analyst Level Outcomes (Extensive Margin). Probability of Stop or Dropping Outafter Unbundling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169

3.9 Analyst Level Outcomes (Extensive Margin). Probability of Stop or Dropping OutBefore and After Unbundling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

3.10 Placebo Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171

3.11 Firms’ Brokerage House Coverage . . . . . . . . . . . . . . . . . . . . . . . . . . 172

3.12 Other Measures of Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173

3.13 Abnormal Return and Analyst Informativeness . . . . . . . . . . . . . . . . . . . . 174

3.14 Learning Effect (Lagged Forecast Error as Control) . . . . . . . . . . . . . . . . . 175

3.15 Brokerage House Employment . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176

3.16 Capital Market Effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177

A.1 Impact of Cross-basis on Firm’s Decisions Conditional on Positive Debt IssuanceIncluding Ratings FE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198

A.2 Impact of Cross-basis on Firm’s Decisions Conditional on Positive Debt IssuanceIncluding Ratings FE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

A.3 Negative CDS-bond Basis Cash Flow Diagram . . . . . . . . . . . . . . . . . . . . 205

viii

A.4 A Negative CDS-bond Basis Example . . . . . . . . . . . . . . . . . . . . . . . . 207

A.5 Implicit Funding Cost . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208

B.1 Number of firms in each portfolio . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

B.2 Results of time-series regressions on characteristic-balanced hedge-portfolios . . . 223

C.1 Summary Statistics (Country Information) . . . . . . . . . . . . . . . . . . . . . . 232

C.2 Summary Statistics (Small vs. Large in Sample for Quantity) . . . . . . . . . . . . 233

C.3 Summary Statistics (Small vs. Large in Sample for Quality) . . . . . . . . . . . . . 235

C.4 Firm Level Outcomes (Dynamic Coefficients) . . . . . . . . . . . . . . . . . . . . 237

C.5 Intensive Margin (Dynamic Coefficients) . . . . . . . . . . . . . . . . . . . . . . . 238

C.6 Firm Level Coverage (Sample for Coverage Quality). . . . . . . . . . . . . . . . . 239

C.7 Firm Level Coverage (Results in logs). . . . . . . . . . . . . . . . . . . . . . . . . 240

C.8 Firm Level Coverage (EU Small vs. US Small and EU Large vs. US Large) . . . . 241

C.9 Analyst Informativeness (Small vs. Large). . . . . . . . . . . . . . . . . . . . . . . 242

C.10 Covariates Balance Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246

C.11 Firm Level Outcomes (Propensity Score Matching). . . . . . . . . . . . . . . . . . 247

ix

List of Figures

1.1 Corporate Safety Premium and Bond Price . . . . . . . . . . . . . . . . . . . . . . 44

1.2 Firm’s Total Value as a Function of Debt Issuance . . . . . . . . . . . . . . . . . . 45

1.3 Response to Perceived Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

1.4 Response to Aggregate Safety Premium . . . . . . . . . . . . . . . . . . . . . . . 47

1.5 Interaction Effects of Bond Issuance Response to Safety Premium . . . . . . . . . 48

1.6 Heterogeneous Effects on Bond Issuance Due to Initial Net Worth . . . . . . . . . 48

1.7 The CDS-bond Basis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

1.8 US Treasury Safety Premium. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

1.9 Effect of Cross-Basis on Firms’ Decisions for Different Time Horizons. . . . . . . 50

2.1 Six assets in the space of loadings on priced and unpriced factors. . . . . . . . . . . 72

2.2 Rolling regression R2s – HML returns on industry returns . . . . . . . . . . . . . . 99

2.3 HML loadings on industry-portfolios . . . . . . . . . . . . . . . . . . . . . . . . . 100

2.4 Volatility of the money industry-portfolios . . . . . . . . . . . . . . . . . . . . . . 101

2.5 Rolling regression R2s – HML returns on Money industry returns . . . . . . . . . . 101

2.6 Ex-post HML/RMW/CMA loadings vs. characteristics . . . . . . . . . . . . . . . 102

2.7 Ex-post MktRF loadings vs. characteristics . . . . . . . . . . . . . . . . . . . . . 103

2.8 Ex-post SMB loadings vs. characteristics. . . . . . . . . . . . . . . . . . . . . . . 104

x

3.1 Illustration of Bundling and Unbundling . . . . . . . . . . . . . . . . . . . . . . . 157

3.2 Average Analyst Coverage Over Time (EU vs. US) . . . . . . . . . . . . . . . . . 158

3.3 Average Analyst Coverage Over Time between Small Firms and Large Firms inthe EU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158

3.4 Empirical Cumulative Distribution of Forecast Error at the Firm Level . . . . . . . 159

3.5 Firm Level Outcomes (Dynamic Coefficients) . . . . . . . . . . . . . . . . . . . . 160

3.6 Analyst Level Forecast Error (Dynamic Coefficients) . . . . . . . . . . . . . . . . 161

A.1 Successful Mergers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190

A.2 Firm’s Total Value as Function of Capital and Debt Issuance . . . . . . . . . . . . 195

A.3 Issuance Response to Perceived Safety . . . . . . . . . . . . . . . . . . . . . . . . 197

B.1 Ex-post loading vs. characteristic . . . . . . . . . . . . . . . . . . . . . . . . . . . 222

C.1 Geographic Composition of Stocks in Analyst Portfolio. . . . . . . . . . . . . . . . 228

C.2 Average Analyst Coverage (EU vs. US) . . . . . . . . . . . . . . . . . . . . . . . 229

C.3 Firm Level Forecast Error Distribution . . . . . . . . . . . . . . . . . . . . . . . . 230

C.4 Average Number of Analysts per Brokerage House Over Time in the EU, Small vs.Large . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231

C.5 Distributions of Propensity Scores after Matching. . . . . . . . . . . . . . . . . . . 245

xi

Acknowledgements

I am grateful to my advisers Kent Daniel, Olivier Darmouni, Tano Santos, and Jesse Schreger.

They provided me with safe ground to discuss ideas, guided me through the process of shaping

raw ideas into developed papers, and always offered me support when needed. Kent and Tano

made me believe in my potential as a researcher. They were my mentors and source of inspiration

throughout the PhD. Tano always encouraged me to be ambitious in my research endeavors and

gave me the support to do so. He followed my steps from the idea generation to the writing of

this dissertation, always with wit and wisdom, making me laugh even in the darkest hour. Kent,

always nice and sharp, pushed my research to new heights. Olivier was always present, giving

timely and detailed feedback. I would never have survived the Job Market had he not listened to

my presentation countless times and encouraged me to do better. Jesse always reinvigorated my

energies by being optimistic about my progress, while also provided precise and wise comments

that helped me write a better dissertation.

I want to thank my coauthors, Yifeng Guo and Simon Rottke. Cooperating with them taught

me not only about finance but also about friendship and partnership. Simon had endless patience

with my ups and downs with research, and no matter how many times he could have been mad, he

always found a way to say something to cheer us up. Yifeng was my companion from homeworks

in the first year to dissertation in the later years. I thank Yifeng for making the journey easier and

more joyful.

I am also thankful to other faculty members in Columbia for their advice and help. I am

xii

particularly grateful to Jose Scheinkman, who opened the Columbia doors when I came as a visitor

from Brazil. This was the first step of the journey that changed the course of my life. I am

also thankful to my friends, especially Amanda dos Santos, Polina Dovman, Kerry Siani, Melina

Papoutsi, Cristina Tessari, and Tuomas Tomunen. I was lucky to have them by my side to share

coffees, gossips, runs, and finance ideas.

I am deeply thankful to my husband, Luca Mertens. He was the first to support my decision

to start the PhD at Columbia. Luca was my best company, holding my hand and working with me

when the moment was difficult and celebrating when the victories arrived. I am also indebted to my

family. To my parents, Luzanira Mota and Jesus Mota, for being my inspiration of perseverance

and always making clear they were proud of my achievements. To my brother, Tomaz Mota, for

always having a positive word to say. And to my parents-in-law, Bertilla Diquigiovanni and Daniel

Mertens, for moving mountains to help in whatever was needed.

xiii

To Luca, per aspera ad astra.

xiv

Chapter 1

The Corporate Supply of (Quasi) Safe Assets 1

1.1 Introduction

There is a particular class of financial assets that can serve as a store of value, be used as

collateral, and be used to meet mandatory capital and liquidity requirements. The stability and

safety of their cash flows are what makes these assets ideal vehicles for the provision of these

safety services, and as a result, some refer to these securities as safe assets. When these assets are

in limited supply, they might command a safety premium, i.e., a value investors are willing to pay

above the discounted cash flows because of the aforementioned services they provide. Some have

argued that, since the global financial crisis, the demand for safety services has increased above

the supply of assets that can meet this demand, leading to an increase in the safety premium.2 The

literature has traditionally relied on a pragmatic and narrow definition of safe assets, which include

1This chapter is based on Mota (2021). I am grateful to my advisors Kent Daniel, Olivier Darmouni, Tano San-tos, and Jesse Schreger for their invaluable guidance and continuous support, and to Simona Abis, Patrick Bolton,Douglas Diamond, Jason Donaldson, Amanda Dos Santos, Polina Dovman, Philip Dybvig, Yifeng Guo, HarrisonHong, Jeremias Huber, Philip Kalikman, Yiming Ma, Harry Mamaysky, Daniel Mertens, Luca Mertens, Vrinda Mittal,Melina Papoutsi, Giorgia Piacentino, Simon Rottke, Jose Scheinkman, Kerry Siani, Suresh Sundaresan, Cristina Tes-sari, Tuomas Tomunen, Stijn Van Nieuwerburgh, Neng Wang, seminar participants at Macro and Finance at ColumbiaBusiness School and the Finance Colloquium at Columbia University Economics Department, and conference partic-ipants at the Diamond-Dybvig 36th Anniversary Conference, the 2020 AFA PhD Poster Session, and the Macro Fi-nance Research Program of the Becker Friedman Institute Summer Session for their helpful discussions and sugges-tions. I acknowledge the support of the Chazen Institute for Global Business at Columbia Business School. I welcomecomments, including references to related papers that I have inadvertently overlooked. All errors are my own.

2See inter alia Krishnamurthy and Vissing-Jørgensen (2012a), Gorton et al. (2012), Caballero et al. (2017),Caballero and Farhi (2018).

1

liabilities issued by developed countries or by the financial sector.3 In this paper, I argue that

non-financial corporations can act as a class of safety service suppliers, and I study their supply

responses to changes in the safety premium.

The starting point of this paper is the observation that safety is not a binary characteristic

of an asset. Different assets can provide different amounts of safety services, depending on the

underlying characteristics of their cash flows. I first show that US corporate bonds have earned a

safety premium in recent years, with safer firms earning a higher premium. Corporate bonds are

not entirely insulated from credit risk, but they can still function as a store of value, as collateral,

or as regulatory capital, thereby providing an imperfect substitute for traditional safe securities.

Thus, in the presence of a positive safety premium, corporate managers may create shareholder

value by issuing debt. I show that firms do indeed respond to an increase in their relative safety

premia by issuing more debt. In my sample, rather than using the new funds to invest, they pay out

the borrowed money to shareholders. The demand for safe assets therefore has direct implications

for corporate borrowing behavior and capital structure.

I begin by presenting a model that fleshes out the safety-creation mechanism in non-financial

firms. The model fulfills two purposes. First, it defines the safety premium component in corpo-

rate debt prices. The safety premium varies across both time and firms, and maps to an empirically

observable measure that I call the cross-basis. Second, it generates predictions about firms’ re-

sponses to shocks in the safety premium. I then use data on US non-financial corporations to test

the model’s predictions.

Following Krishnamurthy and Vissing-Jørgensen (2012a), I take a reduced-form approach and

model the demand for safety services as a primitive. The model innovates in assuming that all debt

securities can potentially provide some safety services.4 In equilibrium, two securities with the

same cash flows can have different prices simply because they provide different safety services.

3See inter alia Caballero et al. (2016), Gorton (2017), Caballero et al. (2017), Caballero and Farhi (2018), Farhiand Maggiori (2018), or more broadly, works that study the non-pecuniary value of US Treasuries or money-likesecurities such as Sunderam (2015), Nagel (2016), Du et al. (2018), Binsbergen et al. (2019), He et al. (2019), Jianget al. (2020), Diamond (2020).

4For the micro-foundations for investor’s demand for securities with stable cash-flows, such as debt, the readershould refer to Gorton and Pennacchi (1990), Dang et al. (2015), Dang et al. (2019).

2

The difference in prices between two such assets measures the relative safety premium between

them. This conceptual innovation allows me to use cross sectional data to bypass the traditional

difficulty faced by the literature of estimating the aggregate safety premium, the market price of

one unit of safety services. As a result, I can use the cross section of bond prices to identify the

effect of fluctuations of the relative safety premium on corporate debt issuance, as well as on other

firm policies such as capital and intangible investment and payout policy.

In my model, the firm-specific safety premium depends on both the aggregate safety premium,

which I assume to be exogenous, and on firms’ decisions, which are endogenous. In particular,

as firms issue more debt, the firm-specific safety premium goes down. Three main empirical

predictions emerge from the model. First, firms respond to an increase in the safety premium

by issuing more debt. Second, for financially constrained firms, an increase in the relative safety

premium relaxes this constraint and leads to more investment. Third, for financially unconstrained

firms, variation in debt issuance due to fluctuations in the safety premium mostly results in equity

payout.

To test the model predictions, the first empirical challenge is to measure the safety premium

of corporate bonds. The safety premium of a particular bond is the difference in the prices of the

bond and of a benchmark asset with the same cash flows but no safety services. This benchmark

is not readily available. I overcome this issue by constructing, for each bond in the sample, a

synthetic asset, which is a portfolio that combines the corporate bond and the maturity-matched

credit default swap (CDS). I call this synthetic asset the hedged bond. Under the assumption that

the CDS perfectly hedges the credit risk, for all bonds the hedged bond’s cash flows are the same.

Then, by comparing the yields of the hedged bonds, I am able to quantify the relative safety premia

among those assets.

Specifically, I introduce the cross-basis as a measure of relative safety premium in corporate

bonds. Theoretically, the cross-basis measures the safety premium of a specific firm relative to

the average firm. Empirically, I construct the cross-basis by first calculating the CDS-bond ba-

sis for each bond, which is the difference in yields between the hedged bond and the maturity-

3

and coupon-matched US Treasury, and then define the cross-basis as the firm-specific CDS-bond

basis minus the basis index, which is the face-value weighted average CDS-bond basis in the sam-

ple. There is a direct connection between the investors’ Euler equation from the model and the

cross-basis, which yields three predictions. First, in the cross section, cross-basis is monotonic

in perceived safety, i.e., the amount of safety services an asset provides. Second, one factor, the

aggregate safety premium (normalized to be the US Treasury safety premium), fully explains the

time series variation of cross-basis. Third, loadings on the aggregate safety premium are mono-

tonic in perceived safety. I find strong support for all three predictions in the data, which validates

the connection between the cross-basis and firm-specific safety premium.

Equipped with an empirical measurement of the safety premium in corporate bonds, I test how

firms react to variations in the safety premium. I find that in the cross section, firms with higher

cross-basis in a given quarter issue more debt in the following quarter. A one percentage point

increase in the cross-basis forecasts an increase of net debt issuance of 10 basis points as a share of

total assets, which translates to an increase of $27.4 million in debt issuance for the average firm or

a 31% increase relative to their quarterly average debt issuance. On the other hand, the cross-basis

has a small impact on real investment, measured by capital investment, intangible investment, or

acquisitions. Instead, the safety premium strongly forecasts equity payouts in the form of dividends

and equity repurchases. A one percentage point increase in the cross-basis forecasts a 8 basis points

increase in firms’ net payout as a share of total assets, or in dollars, $23.2 million, a number very

close to the effect of the cross-basis on net debt issuance. In light of the corporate bond supply

model I introduced, this is evidence that companies with relatively high safety premium are not

financially constrained, consequently almost all debt issued in response to the safety premium is

converted into equity payouts.

I conduct a battery of additional tests to confirm these findings. A possible worry with the

cross-basis as a measure of the relative safety premium is that it may be capturing frictions not

related to safety, such as counterparty risk, mis-measurement due to bond illiquidity, restructuring

uncertainty, or mismatch in the payoff structure of bonds. However, one important feature of the

4

cross-basis is that any friction that affects all bonds or CDS equally does not affect the cross-

basis. Moreover, only frictions that are systematically correlated with firms’ fundamentals are of

concern when assessing the impact of the cross-basis on firms’ choices, such as debt issuance. I

show that none of these confounders are large enough to explain the cross sectional variation of

the cross-basis5 or to be of concern regarding the interpretation of the results.

The results presented here give a new perspective on the impact of the demand for safety on

the overall economy. By using a narrow definition of safe assets, Caballero et al. (2017) predicted

that the high demand for safety would decrease risk-free interest rates, but would have the opposite

effect on risk premia. This is at odds with the observed historical low credit-spreads in corporate

bonds in recent years, since corporate bonds are intrinsically not risk-free. The fact that the safety

premium can affect yields of imperfect substitutes to traditional safe assets is one likely explanation

for the recent low yields, and it opens the possibility for a broader impact of the demand for safety,

one that affects firms’ cost of capital, capital structure, and business operation.

Most importantly, my paper sheds light on the determinants of the supply of safety services.

Historically, the government and the financial sector have been the main producers of safe assets.

The common feature of the public sector is that their debt issuance does not seem to respond

to demand pressures, even if this could be associated with economic gains (Jiang et al. (2020)).

Furthermore, given the high public indebtedness and the sluggish growth of most developed coun-

tries, many have argued that the public sector has exhausted its ability to expand the production of

safe assets (Caballero et al. (2017)). Likewise, in the aftermath of the financial crisis, the finan-

cial sector’s ability to engineer safe substitutes for Treasuries has been diminished by regulation

and constrained capital. In a landscape of limited alternative safe assets, non-financial firms with

ratings comparable to a sovereign are viable alternatives.

The findings in this paper also shed light on recent macroeconomic trends. In the aftermath of

the financial crisis, the aggregate amount outstanding of non-financial corporate bonds almost dou-

5For instance Arora et al. (2012) show that counterparty risk in CDS contracts is “vanishingly small," and Baiand Collin-Dufresne (2019) show that only bonds’ collateral value is persistently important in explaining the crosssectional variation of the CDS-bond basis.

5

bled, while aggregate investment remained weak (Gutiérrez and Philippon (2017) and Crouzet and

Eberly (2019)). Although investment grade (IG)6 firms are the largest bond issuers, their average

investment in capex and R&D together represents only 60% of their total net profits, suggesting

that all investment in the last ten years could have been financed with retained earnings alone. An-

other important empirical observation is that payouts have been largely financed by debt issuance

(Farre-Mensa et al. (2018)). In the last ten years, 90% of IG firms that issued debt in a given quarter

also engaged in positive distribution to shareholders in that same quarter. For every dollar issued in

IG bonds, 0.80 cents were paid back to shareholders within the same quarter by the median firm.7

In summary, evidence suggests that IG firms are changing their capital structure by issuing debt to

repurchase their own equity and pay dividends, but not necessarily to invest more. The theory of

corporate safety creation presented in this paper provides a coherent explanation for these facts.

1.2 Related Literature

This paper relates to and draws from several strands of literature. I build on the safe-assets

literature that shows that investors value safe assets with stable cash-flows and that there is a

shortage in supply of asset with this characteristic.8 The contribution of my paper is to study a

new class of safe-asset supplier, non-financial corporations, and to show that the response to the

safety premium is not concentrated on the safest firms in the US, as discussed in Krishnamurthy

and Vissing-Jørgensen (2012b), but rather is pervasive across the corporate sector.

Much like banks, I show that non-financial corporations respond to the discount in their lia-

bilities by issuing debt, even if this tends to be a purely financial operation unrelated to the firms’

business operations. The elasticity of the supply of corporate bonds to changes in safety premium

is akin to the creation of money-like short-term claims by the financial sector in response to shocks

in the safety and liquidity premia (see, e.g., Sunderam (2015), Nagel (2016), Krishnamurthy and

6Bonds rated BBB or more.7All summary statistics were calculated using the sample of Compustat firms described in Section 1.4.8A non-exhaustive list of related paper is Gorton et al. (2012), Krishnamurthy and Vissing-Jørgensen (2012a),

Caballero et al. (2016), Caballero et al. (2017), Caballero and Farhi (2018), Lenel (2020), Diamond (2020).

6

Vissing-Jorgensen (2015) and Kacperczyk et al. (2020)). The difference is that, instead of pro-

viding substitutes to money claims, highly rated corporate bonds can be good substitutes for the

safety properties of long-term Treasuries. Hence, my paper shows evidence that the private sector

elasticity of supply of safe assets is not concentrated in highly liquid short-term liabilities.

This paper relates to the literature that investigates the interconnection between Treasury bond

issuance and corporate decisions. I show that corporate bonds are imperfect substitutes to Treasury

bonds not only with respect to maturity, like in Greenwood et al. (2010) and Badoer and James

(2016), but also in providing safety services. Furthermore, an innovation with respect to this

literature is to measure the firm-level safety premium, which allows me to find variation in firms’

behavior that is not captured by the variation in the supply of US Treasury bonds, as is explored

in the aforementioned papers and in Graham et al. (2014), Liu et al. (2020) and Giambona et al.

(2020). This also allows for leveraging the cross section of firms to alleviate endogeneity concerns

related to the time series of the supply of Treasury bonds.

Kacperczyk et al. (2020) also look at the cross section to identify how fluctuations in the safety

premium affect issuance. Their focus is on short-term liabilities, certificate of deposits issued

by commercial banks and commercial paper issued by non-financial firms. They find that the

financial sector issues certificate of deposits in response to an increase in the safety premium, but

that non-financial corporations do not issue more commercial paper in response to this increase.

By focusing on longer maturity debt,9 I find that non-financial corporations actively respond to

an increase in the own safety premium by issuing more debt. I interpret these results as evidence

that while banks and other intermediaries are probably the marginal producers of short-term safe

assets, non-financial corporates have a role in producing safe assets in longer maturities.

Also related to this paper are the works by Lenel (2020) and Diamond (2020) who consider the

collateral value of corporate bonds for financial intermediaries and its interaction with the demand

for safe assets. Lenel (2020) studies how the variation in the supply of US Treasury bonds affects

asset prices and lending volumes in the financial sector, and Diamond (2020) presents a theory

9The empirical part of this paper looks at non-financial corporate debt with 1-year or more time to maturity.

7

on how the financial sector is organized to engineer safe assets backed by bonds, and how this

mechanism affects firms’ capital structure. I contribute to these works by using market prices to

construct a firm-level measure of the corporate safety premium, and provide empirical evidence

that variation in the safety premium, which include compensation to the collateral value corporate

debt, has a direct impact on firms’ cost of borrowing and therefore leverage decisions.

Finally, this paper also connects to the literature on the market timing of firms’ capital structure.

One explanation for the existence of market timing is information asymmetry, i.e., managers have

better information than the market about firm fundamentals (Baker and Wurgler (2002), Baker

et al. (2003), Warusawitharana and Whited (2016) and Ma (2019)). Another reason for corporate

market timing is precautionary savings, firms raise external finance when it is cheap, guaranteeing

liquidity for periods when external finance is expensive (Bolton et al. (2013) and Eisfeldt and Muir

(2016)). My model does not rely on information asymmetry, as the safety premium is observable to

all agents. Furthermore, I do not find that fluctuations in the relative safety premium are associated

with corporate savings, but instead, they are associated with equity payouts. Thus, my paper

presents a new channel explaining why we might observe corporate debt issuance when corporate

debt yields are depressed. That is, if the safety premium is positive, some firms are able to create

value to shareholders by issuing debt because in doing so, they increase the amount of safety

services in the economy.

1.3 Model

In this section, I present a model of safety creation by non-financial corporations. Section

1.3.1 presents the model, and the results are found in Section 1.3.2. Section 1.3.3 summarizes the

testable predictions derived from the model, and section 1.3.4 discusses how to empirically test

them. The empirical tests are performed in Section 1.6.

8

1.3.1 Setup

There are three sets of agents in the model: representative investors, a set of heterogeneous

firms, and an exogenous supplier of safe assets, like US Treasury bonds.

Investors

The key innovation of this paper is to allow for the possibility that different assets provide

different degrees of safety services. More formally, I normalize to one the amount of safety services

provided by one dollar of face value of US Treasuries (henceforth UST). Let αi,t+1 be the safety

services by unit of face value provided by any other bond i. Thus, if qT,t+1 and qi,t+1 are the total

face value of UST and bond i respectively, the total supply of safety services in the economy, St,

is given by

St = qT,t+1 +N∑i=1

αi,t+1qi,t+1. (1.1)

I refer to αi,t+1 as perceived safety throughout.

I follow Krishnamurthy and Vissing-Jørgensen (2012a) and take the demand for safety services

as a primitive. I assume investors derive utility from safety services in a way that is separable from

standard utility from consumption. Specifically, the problem of the representative investor is

max{ct,Qt+1}∞t=0

Et

[∑t

βtu (ct) + Θtv(St)

]

s.t ct + Qt+1Pt ≤ ωt + QtXt,

(1.2)

where Pt is a column vector of asset prices, Xt are realizations of the corresponding payoffs at time

t, Qt is a row vector of portfolio positions, ct is consumption, ωt is an exogenous wealth flow, and

β is the standard discount rate. The assumptions on the functions u and ν are standard: u′ > 0,

u′′ < 0, v′ > 0 and v′′ < 0. Finally, given the normalization of the safety services provided by

UST, Θt is a demand shifter that allows us to identify the impact of an aggregate shock in the

demand for safety services supplied by UST.

9

From the consumers’ Euler equation, the price for each asset i is

Pi,t = Et [Mt+1Xi,t+1] + αi,t+1ϕt, (1.3)

where Mt+1 := β u′(ct+1)u′(ct)

, θt := Θtβu′(ct)

and ϕt := θtv′(St).

Prices consist of the standard discounted cash flows component and the safety premium com-

ponent, measured by αi,t+1ϕt. I refer to ϕt as the aggregate safety premium; it is the market price

of one unit of safety services. Given that I have normalized to one the safety services associated

with one dollar of face value of UST, ϕt is also the safety premium of UST. In what follows, I refer

to ϕt as the aggregate safety premium or the UST safety premium interchangeably.

In this model, the law of one price does not hold: two assets can have the same cash flows X ,

but have different prices precisely because they provide different safety services. The difference

is measured by the perceived safety, αi,t+1. Perceived safety can be correlated with Xi, to allow

for the possibility that the amount of safety services an asset provides relates to its cash flow

characteristics, but in a way that is not fully captured by the correlation between M and Xi.

Environment

Firms’ production is subject to aggregate productivity shocks xt and idiosyncratic productivity

shocks zi,t. The two stochastic processes are described by

xt+1 = ρXxt + σXηxt+1, (1.4)

zi,t+1 = ρZzi,t + σZηzi,t+1. (1.5)

where ηxt and ηzi,t are IID standard normal random variables. The parameters ρX and ρZ repre-

sent the persistence of the aggregate and the idiosyncratic shocks, and σX and σZ the respective

volatilities.

The aggregate safety premium, ϕt, depends on preference shock θt, and the exogenous supply

of UST bonds and of other assets that provide safety services. I assume that the total amount of

10

safety services provided by the non-financial corporate sector is small, therefore ϕt is exogenous

in my model. The stochastic process that drives ϕt is described by

logϕt+1 = logϕ+ ρϕ logϕt + σϕηϕt+1, (1.6)

where ηϕ is an IID standard normal random variable; ϕ, σϕ, and ρϕ represent respectively the

median, volatility, and persistence of the aggregate safety premium.

I assume that the stochastic discount factor is given by

logMt,t+1 = log β − γ(xt+1 − xt), (1.7)

where β is the rate time of preference, γ is the relative risk aversion, and xt is the economy’s

aggregate productivity shock. Equation (1.7) follows from the reduced-form assumption that the

aggregate consumption ct is an affine function of the aggregate productivity ct = a + bxt with

b > 0. Notice then that this model is not a general equilibrium one, because there are sources of

consumption other than the dividends paid by the non-financial corporations.

Firms

There is a set of heterogeneous firms, indexed by i ∈ I. The operating profits of a firm are

determined by

Πi,t(ki,t;xt, zi,t) = exp(xt + zi,t)kζi,t, (1.8)

where xt and zi,t are the aggregate and idiosyncratic productivity shocks defined in (1.4) and (1.5),

ki,t is the capital of the firm, and 0 < ζ < 1 is the capital share of production. ζ < 1 implies firms’

technology exhibits decreasing return-to-scale.

Firms may need external funds to finance dividends or investment. To allow for a firm to be

financially constrained, equity issuance must be costly. Thus, for simplicity, I assume all external

finance is in the form of one-period debt. Let bi,t+1 ≥ 0 be the face value of debt issued at time

11

t, which is due at time t + 1. Pi,t is the market price for one dollar of face value of debt. At time

t, a solvent firm pays bi,t and receives bi,t+1Pi,t from bondholders. Shareholders are protected by

limited liability, and distributions to shareholders are denoted by di,t. Investment, equity payout,

and financing decisions must satisfy the budget constraint

di,t = Πi,t + (1− δ)ki,t − bi,t − ki,t+1 + Pi,tbi,t+1 ≥ 0 (1.9)

where δ > 0 is the depreciation rate of capital.

Let time t net worth be wi,t := Πi,t+ lv(1− δ)ki,t− bi,t and lv ∈ (0, 1) be the liquidation value

of capital. I assume that debt is protected by a positive net worth covenant, as in Brennan and

Schwartz (1984). Thus, default occurs whenever wi,t first becomes negative,10,11 and the default

region can be written as

∆{xt, zi,t} = {xt + zi,t < ln(wi,t)} where wi,t =bi,t − lv(1− δ)ki,t

kζi,t.

In a default scenario, the bondholders receive a default payoff equal to

L(bi,t, ki,t;xt, zi,t) = (1− ξ)Πi,t + lv(1− δ)ki,tbi,t

,

where ξ ∈ (0, 1) is the bankruptcy cost.

Let I∆i,t+1be an indicator variable for the default region of firm i, at time t + 1. The market

price of debt Pi,t is given by12

Pi,t = Et[Mt+1

((1− I∆i,t+1

)+ I∆i,t+1

L(bi,t+1, ki,t+1;xt+1, zi,t+1))]

+ αi,t+1ϕt. (1.10)

10For examples of other work that uses a solvency default definition see Crouzet (2018) Elenev et al. (2020).11The advantage of this approach, compared to the traditional strategic default, is that it allows, for each action and

state, a closed form solution for debt prices, as shown in A.3.1. This avoids the computational cost of solving jointlythe value of the firm and the default threshold. As discussed in Strebulaev and Whited (2012), the “loop-within-a-loop” algorithms to solve dynamic models with strategic defaults can be very computationally costly, in particularwhen we have many state variables, as my model does.

12I calculate the debt price in closed form for each (kt+1, bt+1; ∫t). This calculation can be found in A.3.1.

12

A key ingredient of the model is that, while the aggregate safety premium ϕt is exogenous,

firm’s decisions regarding debt issuance, payout, and investment affect the firm’s perceived safety,

αi,t+1. For most of the empirical results of the paper, the specific functional form and even the

determinants of αi,t+1 do not need to be specified. The reason is that, as shown later, my approach

allows for an estimate of αi,t+1 (up to a scalar) directly. However, the theoretical characterization

of this problem requires a functional form of αi,t+1. For this reason, I assume that

αi,t+1 = αµi,t exp(−ψ bi,t+1

EtΠi,t+1

), (1.11)

where the parameter µ represents the memory of investors and ψ is a positive scalar.

Expression (1.11) links the perceived safety to the inverse of the interest coverage ratio ( bi,t+1

EtΠi,t+1),

one of the key determinants of bond ratings.13 In addition, perceived safety is persistent, as mea-

sured by the parameter µ. These choices are made to match the empirical observations discussed

in Section 1.5, that perceived safety is monotonic in ratings and ratings are persistent in time.

Furthermore, equation (1.11) captures in a tractable way an important feature about safe assets:

“When it comes to forming beliefs about which assets are safe, reputation and history matter"

(Caballero et al. (2017)). Moreover, there is an additional benefit of specifying the functional

αi,t+1, which is that it allows for the structural estimation of the UST safety premium, ϕt, as

discussed in Section 1.5.2.

Finally, a firm’s value is defined as the discounted value of all future distributions to sharehold-

ers and can be written recursively as:

V (∫i,t) = maxki,t+1,bi,t+1

d(ki,t+1, bi,t+1; ∫i,t) + Et [Mt,t+1Vi,t+1(∫i,t+1)] (1.12)

where Mt,t+1 is the one-period stochastic discount factor in (1.7), the dividends dt satisfy the

budget constraint (1.9) and ∫i,t = {wi,t, αi,t, xt, zi,t, ϕt} are the state variables. The expectation

13A common practice is to measure the interest coverage ratio is the ratio between company’s earnings beforeinterest and taxes (EBIT) during a given period divided and the company’s interest payments due within the sameperiod. To see its importance for ratings, see Moody’s credit rating methodology here.

13

https://www.moodys.com/researchandratings/methodology/003006001/rating-methodologies/methodology/003006001/003006001/-/0/0/-/0/-/-/en/global/rr

is taken over the joint conditional distribution of aggregate, idiosyncratic, and aggregate safety

premium shocks. Note that the limited liability assumption is naturally met by the positive dividend

constraint.

The firm’s model deviates from the Modigliani-Miller benchmark in two dimensions: (1) there

are financial constraints related to the solvency default and bankruptcy cost ξ, and (2) bonds carry

a safety premium αi,t+1ϕt. The underlying assumption is that firm managers can create value to

shareholders by changing their firm’s capital structure to produce safety services that are valuable

to investors.

1.3.2 Model Results

To allow for a sharp characterization of the results of the model, I restrict myself to a two-period

version of the model, and leave a complete treatment of the fully dynamic model for the appendix.

Qualitatively, both versions of the model deliver similar results; though obviously, quantitative

statements rely on the fully dynamic model.

Bond Prices

A firm’s safety premium depends on the aggregate safety premium, ϕt, and the firm-specific

perceived safety, αi,t+1. A positive safety premium acts as a discount in a firm’s cost of borrowing

by increasing the price of their debt. Furthermore, since perceived safety depends on the debt

coverage ratio, as bi,t+1 increases, the safety premium converges to zero.

Figure 1.1 illustrates how a firm’s safety premium and debt prices depend on the level of debt

issued for different levels of the aggregate safety premium for a fixed set of state variables. Ceteris

paribus, the larger the amount of debt issued, the smaller the debt price. There are two reasons for

this. First, there is the traditional risk mechanism in which an increase in debt issuance increases

the probability of default and lowers debt prices. Second, if the aggregate safety premium is

positive, larger issuance decreases the firm’s perceived safety and thereby decreases its safety

premium and price.

14

Model Mechanics

To illustrate the mechanics of the model, Figure 1.2 shows how the total value varies with the

amount of bonds issued for different levels of the aggregate safety premium. Consider first the

case in which ξ = 0, i.e., there is no bankruptcy cost. If the aggregate safety premium is also zero,

ϕt = 0, corporate debt does not carry a safety premium either, and the value of the firm would be

invariant to debt issuance. This is the Modigliani-Miller benchmark.

If safety services are valuable in this economy, ϕt > 0, the total value of the firm has two

components: the value of the normal business operation and the value due to safe-asset creation.

In Figure 1.2a, the green area shows the value of a firm’s business operation, which is the firm’s

value if the aggregate safety premium is zero. The blue areas represent that additional value created

by issuing debt that carries a safety premium for different levels of aggregate safety premium ϕt.

The maximum value is achieved when the marginal safety premium received by issuing debt is

equal to the marginal decrease in safety value for making the firm more leveraged.

The second step is to introduce a bankruptcy cost greater than zero. In this case, the firm faces

an additional trade-off for issuing bonds: On one hand, issuing bonds creates value for the firm due

to the extra creation of safety services; on the other hand, it increases the probability of bankruptcy,

which increases expected bankruptcy cost. Again, for each level of ϕt, there is a unique investment-

and debt-level pair that maximizes the value of the firm. Figure 1.2b shows how the total value

of the firm varies with issuance for different levels of aggregate safety premium. The dotted lines

represent the optimal debt level b∗i,t+1 that maximizes the value of the firm for each ϕt.

1.3.3 Comparative Statics and Empirical Predictions

I want to understand how firms react to variations in the safety premium, both in the time

series and in the cross section. In the cross section, the objective is to understand how cross

sectional differences in initial perceived safety, {αi,t}i∈I , affect the optimal policy. In the time

series, instead, the objective is to understand how a single firm optimally adjusts its policy in

15

response to fluctuations in the aggregate safety premium, ϕt.

Armed with the main intuition of the model, I turn next to illustrate the numerical solutions the

model. Parameter choices are described in Table 1.1. Parameters, such as the decreasing returns to

scale, depreciation, the idiosyncratic and aggregate shock parameters, and the stochastic discount

factor parameters are standard and follow the literature closely in selecting them.

The model predictions depend on whether the firm is financially constrained or not. I define a

firm to be unconstrained at time t whenever the dividend constraint is slack, i.e., d∗i,t > 0.14 Notice

that I use the word dividends, but the reader should understand “payouts," since it is immaterial

whether the firm distributes these payouts in the form of dividends, equity repurchases, or some

combination of both. As an illustration, Figure A.2 plots the valuation as function of the actions

{ki,t+1, bi,t+1} and the optimal policy for an unconstrained and a constrained firm.

Consider first the implications of the model for the cross section. Figure 1.3 shows optimal

bond issuance, investment, and payouts as a function of the initial perceived safety, αi,t, for two

different firms, one constrained and another unconstrained, for three different levels of the ag-

gregate safety premium, ϕt. As shown in panels (a) and (b), for both the unconstrained and the

constrained firms, as long as ϕt > 0, the higher the initial perceived safety, the higher the bond

issuance. The difference between these two firms lies on how they use the proceeds from debt

issuance. Unconstrained firms, even if the aggregate safety premium is zero, can always achieve

their optimal capital scale. Therefore, the money borrowed mostly translates into higher payouts.

Instead, constrained firms with higher perceived safety are less financially constrained than oth-

erwise identical companies with lower perceived safety, thus the higher investment of the former

relative to the later. Intuitively, payouts are equal to zero in this case. I summarize these results in

the following prediction.

Prediction 1.1 (Cross Section) In the cross section, if the aggregate safety premium is positive

(ϕt > 0), ceteris paribus, optimal bond issuance, b∗i,t+1, is increasing in the firm’s initial perceived

14Strictly speaking, there could be firms that have d∗i,t = 0 and are not financially constrained, because theyachieved their optimal scale exactly at the boundary. A small perturbation in the parameters, of course, places this firmat one side or the other of the constraint.

16

safety, αi,t. Furthermore, if a firm is financially unconstrained, investment increases by less than

the proceeds from bond issuance, with the remainder being distributed to equity holders.

The predictions are similar for the time series. Figure 1.4 shows the effect of the aggregate

safety premium on bond issuance, investment, and payouts for two different firms, one constrained

and another unconstrained, for three different levels of the firm-specific perceived safety, αi,t. I

summarize these predictions next.

Prediction 1.2 (Time Series) In the time series, for a firm with positive perceived safety, αi,t > 0,

ceteris paribus, optimal bond issuance, b∗i,t+1, is increasing in the aggregate safety premium, ϕt.

Furthermore, if the firm is financially unconstrained, investment increases by less than the proceeds

from bond issuance, the remainder being distributed to equity holders.

There are obviously interaction effects between the two components of the safety premium.

For instance, debt issuance of firms with higher initial level of perceived safety will respond more

to variation in the aggregate safety premium than otherwise equal firms with lower initial level of

perceived safety. In Figure 1.5, I plot the first derivative of optimal bond issuance with respect

to ϕt as a function of initial perceived safety. An equivalent effect exists for variation of the first

derivative of optimal bond issuance with respect to αi,t as a function of aggregate safety premium.

This result thus generates the next prediction.

Prediction 1.3 (Heterogeneous Effects A) Firms with higher perceived safety respond to an in-

crease in the aggregate safety premium by increasing bond issuance more than firms with lower

perceived safety. Furthermore, the difference in bond issuance among firms with different levels of

initial perceived safety is larger when the aggregate safety premium is higher.

Finally, there are also interactions between the two components of the safety premium and

firms’ initial net worth, wi,t. Consider two firms that are financially constrained, one with higher

net worth than the other. Then, the marginal cost of issuing an additional dollar of debt in response

to an increase in, say, the aggregate safety premium is lower for the company with higher net worth.

17

The reason is that the expected bankruptcy cost is lower for this company than for the company

with lower net worth. Clearly this effect disappears when the net worth is sufficiently high and

the firms are no longer financially constrained. This phenomenon is illustrated in Figure 1.6 and

generates the following empirical prediction.

Prediction 1.4 (Heterogeneous Effects B) Firms that are financially unconstrained respond to a

higher perceived safety or the aggregate safety premium by increasing bond issuance more than

financially constrained firms.

There are implications of this model regarding the interaction between safety premium and

bankruptcy cost. Testing these implications requires proper time or cross sectional varying mea-

sures of the bankruptcy cost, which are notably hard to estimate, and is therefore out of the scope

of this paper.

1.3.4 Bringing the Model to the Data

The model yields several predictions regarding firms’ responses to variations in the safety pre-

mium. The rest of the paper is dedicated to testing them using data on US non-financial corpora-

tions. With the goal of mapping the predictions described in the previous subsection to reduced-

form linear regressions, I calculate a policy function linearization.

Let the state space be S := {wi,t, αi,t, xt, zi,t, ϕt ∈ R+ × (0, 1)×R×R×R+} and the action

space be A := {bi,t+1, ki,t+1 ∈ R+ ×R+}. Note that in order to be feasible, an action must satisfy

the budget constraint. Let π∗ be the optimal policy function, π∗ : S → A.

Consider the Taylor expansion of the policy function around the point ∫0 = {wt, αt, x, zt, ϕ},

where wt, αt, and zt are the time t cross sectional averages of firms’ net worth, perceived safety,

and idiosyncratic productivity shock, respectively; and x and ϕ are time series averages of the

aggregate productivity shock and the aggregate safety premium.

As will be discussed in the next section, it is difficult to observe αi,t or ϕt directly in the data.

18

A quantity that turns out to be empirically important is the cross-basis, defined as

CrossBasisi,t := (αi,t − αt)ϕt. (1.13)

Hence, the optimal policy around ∫0 is approximately

∆bi,t+1 ≈ β1,t1

ϕt−1

CrossBasisi,t−1 + β2,tzi,t + β3,twi,t + δt + δi, (1.14)

where βs are the partial derivatives evaluated in ∫0, and δt and δi collect variables that are constant

in the cross section and in the time series. An equivalent specification can be written for investment

ii,t and payout di,t.

Empirically estimating how bond issuance, investment, and payouts vary as a function of the

cross section allows me to identify how a firm’s decision changes as a function of perceived safety,

conditional to a constant aggregate safety premium, ϕt.

1.4 Data

I study the impact of variation in the safety premium on the behavior of non-financial firms in

the United States. To this end, I use four main data sets: (i) bond characteristics, from the Mergent

Fixed Income Securities Database (FISD) for academia, (ii) Treasury yields and corporate bond

yields, respectively from the CRSP US Treasury Fixed-Term Indexes and WRDS Bond Returns

databases (iii) CDS spreads for single-name CDS composites from Markit, and (iv) firm balance

sheet data from Compustat quarterly data. The empirical tests focus on the period from January

2003 to September 2020. The starting date coincides with Markit’s CDS data starting date and the

end data with the last updated of WRDS Bond Returns database.

The bond-level data set is created by merging corporate bond characteristics, bond prices data,

and single-name CDS data. For bonds, I apply standard filters and exclude bonds not listed or

traded in the US public market, not traded in US dollars, whose issuers are not in the jurisdiction

19

of the United States, with convertible or floating rates, with non senior-unsecured claims, with

issuance size of less than 100 thousand dollars, or with time to maturity less than 1 year. For CDS

spreads, I select only CDS spreads for US dollar contracts that refer to senior unsecured debt and

with the documentation clause type “No Restructuring" (XR).15 The final data set contains infor-

mation about corporate yields, duration, bid and ask spreads, volume traded, bond characteristics,

and CDS spreads quoted in basis points per annum for a notional value of $10 million.

The original Markit data set provides a CDS-spread term structure incorporating maturities of

1y, 2y, 3y, 4y, 5y, 7y, 10y, 15y, 20y, and 30y. I use a locally constant hazard rate interpolation

of CDS spreads to match the bonds’ duration. To mitigate the concern that the 5y maturity is the

most liquid, I only consider CDS spreads that have valid quote rating; this guarantees that the CDS

spread has passed several tests ran by Markit to assure the quality of the quote.16

The firm-level data set is formed by merging the bond-level data set with firms’ balance sheet

data from Compustat quarterly data. The sample includes all Compustat firms that have publicly

traded equity in one of the three main stock exchanges (NYSE, NASDAQ, and Amex), except

regulated utilities (SIC codes 4900-4999), financial firms (SIC codes 6000-6999), and firms cate-

gorized as public service, international affairs, or non-operating establishments (SIC codes 9000+).

Finally, following standard data-cleaning methodology applied in the literature, I also exclude firms

with missing or non-positive book value of equity or sales and firms with less than $1 million in

total assets. For firm-level analyses, I aggregate yields, CDS spreads, and bases at the firm level

by calculating the face-value weighted averages.

All the empirical results use the universe of US firms in the Compustat sample with successful

merges with non-missing CDS spread, yields, and bonds outstanding. I winsorize all yields, CDS

15This option excludes restructuring altogether from the CDS contract, eliminating the possibility that the Protec-tion Seller suffers a “soft” Credit Event that does not necessarily result in losses to the Protection Buyer. Since theimplementation of SNAC in 2009, CDS with this clause are the most traded in North America. For more detail see,e.g., Boyarchenko et al. (2019)

16The documentation on Markit quote ratings can be found at Markit.com Guide Version 14.4 (here). The manualsays: “Ratings are assigned based on both quantitative criteria - of which the most important is the number of distinctpassing contributions - and qualitative measures: how competitive, liquid, and transparent the market is; and whetherthe trades are time stamped, frequently updated tradable quotes. To achieve a rating at all, our composites must havepassed stringent standards on these criteria."

20

https://wrds-www.wharton.upenn.edu/documents/729/Markit_CDS__Bonds_User_Guide.pdf

spreads, CDS-bond bases, and financial ratios at 1% and 99% to remove outliers. The final data set

for bond prices is monthly, the same frequency as the corporate bond prices data set. The firm-level

data is quarterly, the same frequency as Compustat.

The coverage per year is reported in Table 1.2. The final sample covers on average 2793

bonds and 395 firms per year. It is worth noting that coverage is not concentrated in a few rating

buckets and is rather representative of the credit-risk spectrum. Table 1.3 reports the quarterly data

summary statistics.

1.5 Measuring the Corporate Safety Premium

The first empirical challenge to test the model of corporate safety creation is to measure the

safety premium in corporate liabilities. So far I have been speaking of corporate liabilities or debt,

but for measuring non-financial corporate safety premia, I focus on corporate bonds. This is due to

data availability. Data on corporate bond instruments is more readily available and comprehensive

than other forms of corporate debt. Moreover, for simplicity in notation, while explaining the

safety premium measurement construction, I assume that each firm in the sample has only one

bond outstanding. I discuss the empirical implementation of multiple bonds per firm in the next

subsection.

From equation (1.3), the safety premium of a particular bond is the difference in prices of this

bond and of a benchmark asset with the same cash flows, but no safety services. This benchmark

is not readily available. I overcome this issue by constructing, for each bond in the sample, a

synthetic asset which is a portfolio that combines the corporate bond and the maturity-matched

credit default swap (CDS). I call this synthetic asset the hedged bond. Under certain assumptions,

which are discussed below, the cash flows of the hedged bonds are the same among all bonds. Then,

by comparing the yields of the hedged bonds, I am able to quantify the relative safety premium

between different bonds.

The building blocks of the hedged-bond construction are the CDS contracts. A CDS contract

21

is a derivative instrument that can be used to hedge against default risk of the reference entity. If

there exists a CDS contract that perfectly hedges the default risk of a corporate bond, investors can

create a hedged bond that closely replicates the cash flow of a risk-free bond by buying a corporate

bond and the maturity-matched CDS.

To formally see how one can use the difference in yields of hedged bonds to measure the

relative safety premium, let yi,t = − ln(Pi,t) ≈ 1−Pi,t be the yield of bond i at time t. From (1.3),

yields are approximately

yT,t = 1− Et [Mt+1XT,t+1]− ϕt for Treasuries,

yi,t = 1− Et [Mt+1Xj,i,t+1]− αi,tϕt for bond i.

I make three assumptions, which are discussed further below. First, the CDS perfectly hedges

against credit risk; second, the CDS does not provide any safety services; and third, the US Trea-

sury bond is risk-free. Consider a hedged bond constructed by buying a bond j and a corresponding

CDSj . Under the three assumptions, the hedged bond has exactly the same payoff as the Treasury.

I define the CDS-bond basis as the difference in yields between the UST bond and the hedged

bond. Hence, the CDS-bond basis is

Basisi,t = yT,t − (yi,t − CDSi,t) = (αi,t+1 − 1)ϕt ∀i, (1.15)

where the bond i and UST bond T are of equal maturity and coupons. The basis is thus a measure

of the safety premium differential between the UST bond and the bond i. If the bond offers safety

services similar to UST, then αi,t+1 is close to one and the basis will be close to zero.

As discussed in Section 1.3, the variables of interest for testing the model of corporate safety

creation are αi,t+1 and ϕt, which are not observable directly in the data. Therefore, I define cross-

basis, which is the dependent variable of interest in equation (1.13). The cross-basis of bond i is

the difference in the yields of this hedged bond and of the average hedged bond in the sample,

22

which I denote by Basist.

CrossBasisi,t = Basisi,t −Basist = (αi,t+1 − αt+1)ϕt (1.16)

Equation (1.16) yields three testable predictions for the cross-basis and its connection with the

safety premium. First in the cross section, the cross-basis is monotonic in perceived safety (αi,t+1);

second, one factor, the US Treasury safety premium (ϕt), fully explains the time series variation

of the cross-basis; third, loadings on the US Treasury safety premium are monotonic in bonds’

perceived safety. I test these predictions below and find considerable support for them, which

validates my interpretation of the cross-basis as a measure of relative safety premium.

1.5.1 Empirical construction of the CDS-bond basis

I now map the theoretically defined CDS-bond basis, cross-basis and basis index to the non-

financial US corporate bond data. I begin by following the standard methodology (Oehmke and

Zawadowski (2017), Kim et al. (2016), among others), and for each bond j with maturity τ of firm

i, I calculate the CDS-bond basis at time t as

Basisj,τ,i,t = CDSτ,i,t − CreditSpreadj,τ,i,t (1.17)

where CDSi,τ,t is the CDS spread for company i, interpolated to have the same maturity τ as the

bond. CreditSpreadj,τ,i,t is the difference in yields between the bond j and the duration-matched

Treasury yield. The firm-level CDS-bond basis at each time t is the face-value weighted average

Basisj,i,τ,t across all bonds j outstanding of firm i in the sample.

The cross-basis calculation follows directly from equation (1.16), and it is defined as

CrossBasisi,t = Basisi,t −Basisindex,t, (1.18)

where Basisi,t is firm i CDS-bond basis and Basisindex,t is the face-value weighted bases for

23

all companies in the sample. Considering the value weighted average instead of simple averages

reduces the influence of small issuers with extreme basis values.

Table 1.4 reports summary statistics for cross-basis, CDS-bond basis, yields, and CDS spreads

in my sample; and Figure 1.7 shows the face-value weighted average CDS-bond basis by rating

category for corporate bonds of US non-financial firms. The CDS-bond basis is on average neg-

ative, reaching points as low as −10% during the peak of the financial crisis. This indicates that

investors are willing to forgo yields to hold a US Treasury rather than a hedged bond. In light of

the model of demand for safety, I interpret this result as evidence that US Treasuries provide more

safety services than corporate bonds.

In the absence of frictions, the CDS-bond basis and the cross-basis should be zero. The preva-

lence of negative CDS-bond bases in the market is a well-known empirical regularity in the cor-

porate bond literature (e.g. Longstaff et al. (2005), Oehmke and Zawadowski (2017), Bai and

Collin-Dufresne (2019)). The cross-basis captures the cross sectional dispersion in the CDS-bond

basis.

I interpret the cross-basis as capturing the premium attached to the services associated with the

ownership of high-quality bonds, such as collateral value, regulatory relief, or simply psychologi-

cal relief. All these interpretations are consistent with the model in Section 1.3. But it also may be

capturing frictions not associated with safety such as counterparty risk (Giglio (2014)), illiquidity

(Longstaff et al. (2005), Oehmke and Zawadowski (2015, 2017)), restructuring uncertainty (Berndt

et al. (2007)), or mismatch in the payoff structure of bonds (Duffie (1999)). In Section 1.7.1, I ar-

gue that none of these confounders are large enough to explain the cross sectional variation of the

cross-basis.

The innovation of this paper is, first, to show that cross sectional variation in the cross-basis

captures cross sectional variation in the safety premium of different corporate bonds, and, sec-

ond, that firms respond to variation in the cross-basis by issuing bonds. I turn next to the first

contribution and leave the second to Section 1.6.

24

1.5.2 The Validity of Cross-Basis as the Relative Safety Premium Measure

To validate the connection between the CDS-bond basis and the safety premium, I test the three

predictions from the model. First, I run the following Fama-MacBeth regressions to test whether

the cross-basis is monotonic in firms’ perceived safety

CrossBasisi,t = β0 + β1αi,t + εi,t, (1.19)

where αi,t are proxies for the perceived safety of firm i at time t. I measure safety in several ways:

rating category buckets (AA and above, A, BBB, BB, B and below); rating rank (1 for AAA,

2 for AA+, 3 for AA, etc.); interest coverage ratio (ICRi,t), measured as interest expenses over

EBITA; and the structurally estimated αi,t.17

Column (1) and (2) of Table 1.5 shows that, on average, the basis is monotonic on ratings.

Moreover, ratings alone are able to explain 25% of the cross sectional variation of cross-basis. I

also test the explanatory power of ICR and the structurally estimated αi,t. The results are reported

in columns (3) and (4). The highly statistically significant coefficient for ICR means that the

cross-basis is monotonically increasing in the inverse of the ICR. The highly significantly signifi-

cant coefficient for αi,t is yet another evidence that the cross-basis is monotonically increasing in

perceived safety. I interpret these results as supporting evidence for the predictions that the cross-

basis is monotonic in firms’ perceived safety and that perceived safety explains a lot of the cross

sectional variation in of the cross-bases.

Next, I turn to time series analyses to test the two remaining predictions. I run the following

regression

CrossBasisr,t = β0,r + β1,rUST SPt + εr,t, (1.20)

where CrossBasisr,t is the value-weighted average of the basis for the rating category r and

UST SPt is a proxy for the safety premium of Treasury.

17Details of this estimation are in A.2.

25

There is no consensus in the literature about how to measure the US Treasury safety premium.

I use four distinct measures: two measures inspired by Krishnamurthy and Vissing-Jørgensen

(2012a) (henceforth KV-J): (i) AAA−Treasury and (ii) BBB−(AAA and AA) basis spread;18 (iii)

the box-trade spread as in Binsbergen et al. (2019); and (iv) the structurally estimated safety pre-

mium based on the model in this paper. A more detailed discussion about each of the measures is

presented in A.2.

Figure 1.8 shows the time series of each one of the safety premium measures, and Table 1.6

shows the summary statistics and correlation matrix between all these measures. They are all

closely related. All of the measures of the US Treasury safety premium, except the one estimated

from the model, are a relative premium of Treasuries with respect to a benchmark. The only

measure that attempts to properly capture the correct level is ϕt, and this is the reason why it is

relatively higher than the others.

Table 1.7 shows the estimation results for equation (1.20). Results are qualitatively similar

across all measures. The loading on the safety premium should capture (αi − α), therefore we

expect it to be monotonically decreasing in how safe the bonds are. The loadings are indeed

monotonic in ratings for all four proxies of the US Treasury safety premium. Furthermore, the

UST safety premium should explain a large fraction of the time series variation of the cross-basis.

As shown in Panel A, for ϕt, the r-squared of the regressions range between 43% and 90%, for a

mean of 60%. This means that one factor explains a large portion of the time series variation of the

cross-basis. Results are similar for the two K-VJ safety premium measures. The lowest r-squared

is reported for the box-trade spread. This is expected at least for one reason: the box-trade is

calculated from the put-call parity relationship for European-style option contracts and the longest

18Krishnamurthy and Vissing-Jørgensen (2012a) make the simplifying assumption that the AAA−Treasury spreadreflects liquidity premium whereas the BBB−AAA spread reflects safety premium. Their assumption is that, AAA-rated securities and UST have the same safety but different liquidity, whereas AAA- and BBB-rated securities havedifferent safety but roughly the same liquidity. In my paper, I do not focus on differentiating liquidity and safetypremium. Nevertheless, to get the best possible measures of the Treasury safety premium, I also construct the twomeasures. There are two issues in using the BBB−AAA spread: (i) In recent years, there are only two US corporationsthat are rated AAA, Johnson & Johnson and Microsoft; (ii) a large component of the BBB−AAA spread is due tocredit risk. In order to minimize the influence of company-specific shocks, I use the face-value weighted CDS-bondbasis spread between “BBB” and “AAA and AA.” The difference in credit risk is controlled by the CDS correction.

26

maturity available is 18 months, whereas the average maturity of the corporate bond is 7 years in

my sample. It could be that the drivers of short-term UST convenience yield are different the the

long-term safety premium. In summary, these are evidences in support for the close connection

between the basis and corporate bonds safety premium.

1.6 Firms’ Response to Safety Premium

This section studies how firms’ decisions relate to the safety premium in corporate bonds. I

present evidence that supports the predictions of the model of corporate safety creation presented in

Section 1.3. In the cross section of US non-financial firms, high perceived safety, measured by high

cross-basis, forecasts high net debt issuance. Net debt issuance is not reverted into real investment

but is instead used for net payouts, measured as the sum of dividends and equity repurchases.

These results suggest that companies that can create safety services are, on average, not financially

constrained, and that bond issuance can be a financial maneuver that benefits equity holders.

1.6.1 Impact of Safety Premium on Net Debt Issuance

The first set of empirical tests aims to identify how variation in a firm’s safety premium affects

net debt issuance. Since αi,t is not directly observed in the data, the first empirical specification

includes the cross-basis, which is a proxy of a firm’s relative safety premium with respect to the

market and the dependent variable of interest after the linearization described in (1.14). For each

time t, the cross-basis captures cross section variation in perceived safety across companies.

Prediction 1 from the model says that firms with higher perceived safety issue more debt than

otherwise similar firms with low perceived safety. I test this prediction by running the following

regression

NDIi,t+1 = β1CrossBasisi,t + β2Xi,t + δi + δt + εi,t, (R1)

where NDIi,t+1 is long-term net debt issuance as a share of lagged total assets, CrossBasisi,t

is the CDS-bond basis minus the basis index, the basis index is the corporate bond market value-

27

weighted average of Basis, Xi,t are controls, and δi and δt are firm and time fixed effects, respec-

tively.

Based on equation (1.14), it is clear that controls should capture firm net worth (wi,t) and

investment opportunities (zi,t). In the baseline specification, the controls are log(TotalAssets) and

cash normalized by assets, to account for firms’ net worth, and Tobin’s Q to control for firms’

investment opportunities. Due to the limitations of those control variables, I also include the CDS

spread as a control. The CDS price captures any factor that affects a firm’s credit risk in a way that

is not related to the safety provision of the firm’s bonds. Net debt issuance is measured in quarter

t + 1, and all independent variables are measured in quarter t in order to avoid the “bad controls

problem" (Angrist and Pischke (2009a)).

The identification assumption to correctly estimate the impact of firms’ safety premium on debt

issuance is that the error term in (R1), εi,t, is orthogonal to the measure of relative safety premium,

CrossBasisi,t. This assumption is violated if there are factors correlated to the cross-basis that

affect a firm’s net debt issuance and are not fully captured by the controls. The most salient

confounder is the possibility that the controls are not good enough to fully account for investment

opportunities. If investment opportunities are the main confounder, cross-basis should then be a

strong predictor of investment. As will become clear in the next subsection, on average, firms do

not use the funds raised from issuing safe debt for investment. The results are robust to different

measures of investment: capital investments, intangible investments, and acquisitions. Therefore,

not properly controlling for investment opportunities should not be the major concern. Still, the

lack of a natural experiment or a valid instrument leaves space for possible confounders that can

threaten the identification. I alleviate this concern by showing that the regression results are robust

to the inclusion of several extra controls, and in Section 1.7, I develop a myriad of tests to rule out

other possible alternative hypotheses.

Table 1.8 shows the regression estimated for model R1. Column (1) shows that the cross-

basis forecasts a positive net debt issuance for the average firm. The effect of cross-basis on debt

issuance is statistically and economically significant. The point estimate means that a 1 percentage

28

point increase in the cross-basis forecast an increase of net debt issuance by 10 basis points as

a percentage of total assets. This represents an increase of $27.4 million (= β2× mean(atq)) in

debt issuance, which is a 31%(

= β2mean(NDI) × 100

)increase relative to the quarterly average debt

issuance.

To assess the magnitude of the results, it is useful to compare the elasticity of leverage to

the cross-basis with capital structure literature. Heider and Ljungqvist (2015) estimate that a 1

percentage point increase in taxes leads to a 40 basis points increase in the long-term leverage ratio.

The semi-elasticity coefficient estimated in Column (1) of Table 1.8, suggests that 1 percentage

point increase in the cross-basis forecasts a 35 basis points(

= β2mean(long-term leverage) × 104

)increase

in long-term leverage ratio. The two estimates are in the same order of magnitude, but it is worth

noting that the quarterly standard deviation in the cross-basis is 0.96%, whereas tax changes are

relatively infrequent. The magnitude of the point estimates is evidence that the safety premium is

an important economic force in determining firms’ leverage ratios.

To ensure results are robust, I consider three extra controls: ratings fixed effects, leverage

ratios, and return on assets (ROA). Credit ratings are intrinsically related to firm default probability

and expected cost of bankruptcy, forces that impact firm leverage decisions in many traditional

corporate theories, such as the trade-off theory. Although the CDS spread should largely capture

this effect, there could be time-varying frictions, systematically correlated with ratings, that affect

a firm’s access to credit and are not related to the firm’s perceived safety. Since credit rating is one

of the main drivers of cross sectional variations in the cross-basis, those omitted factors could be of

concern. I control for credit ratings by re-estimating (R1) including rating buckets fixed effects.19

The results are reported in column (2) of Table 1.8. The cross-basis has a significant impact on

net debt issuance even within a rating bucket. The point estimate is 8 basis points. As expected,

the point estimate is slightly smaller than the baseline specification, reflecting the fact that ratings

are intrinsically related to perceived safety, as shown in Section 1.5.2, and the rating fixed effects

subsume this variation. Nevertheless, the results are qualitatively unchanged.

19I consider 5 rating buckets: AAA and AA, A, BBB, BB, and B and below. All notches are included in theassociated bucket.

29

In dynamic models of firm capital structure, leverage can be path dependent (see, e.g., Hen-

nessy and Whited (2005) and Admati et al. (2018)). One concern is that the cross-basis is capturing

variation in the leverage ratio in the cross section. To alleviate this concern, in column (3) of Table

1.8, I include leverage ratio as one of the control variables. After controlling for the leverage ratio,

the impact of cross-basis on net debt issuance remains significant. The point estimate is 8 basis

points, which means that a 1 percentage point increase in the cross-basis forecasts a 24% increase

in debt relative to the average quarterly net issuance. The slightly smaller point estimate is ex-

pected because, like ratings, leverage ratio is one of the drivers of the cross sectional variation in

the cross-basis; consequently, it subsumes part of the firms’ safety premium variation.

One interesting difference between the baseline specification and the one that includes leverage

ratio as a control is the impact of the CDS spread on net debt issuance. Whereas a decrease in

the CDS spread forecasts larger issuance in the main specification, this is not robust to including

leverage ratio as a control. The credit premium measured by the CDS spread has a small impact

on debt issuance after controlling for leverage ratios. This result is probably related to the strong

correlation between the leverage ratio and the CDS spread.

Previous literature has documented the impact of profitability on leverage. Theoretically, how

profitability should impact net debt issuance is not straightforward. Under the trade-off theory,

profitable firms face lower expected costs of financial distress and find interest tax shields more

valuable, therefore high profitability should be related to higher debt issuance (Myers (1984)).

Under the pecking order theory of capital structure, firms should rely on external finance only if

the internal funds are not enough to meet the financing needs (Myers and Majluf (1984)). In this

case, everything else constant, firms with high profitability should issue less debt. In any case,

the source of concern for identifying the impact of the cross-basis on net debt issuance is that the

cross-basis is systematically correlated with firms’ profitability in a way that is not captured by

the controls and is not related to firms’ safety premium. To alleviate this concern, I control for

return on assets as a measure of profitability. Column (4) reports the results. The effect issuance

response to cross-basis variation remains almost unaltered. The ROA regression coefficient is not

30

statistically significant. Due to the relatively recent period that my sample covers compared to

other studies of firms’ capital structure, this result is consistent with Goyal and Frank (2009), who

show that the impact of profitability on leverage has been diminishing over time.

All together, the results reported in Columns (1) to (4) of Table 1.8 provide strong support for

the response in net debt issuance due to cross sectional variation in perceived safety, as described

in Prediction 1.1.

I now turn to explore the direct effect of the aggregate safety premium on issuance. Prediction

2 says that net debt issuance should respond to variation in the the aggregate safety premium.

Moreover, Prediction 3 says that this response should be stronger for firms with higher perceived

safety. I test these predictions by running the following regression

NDIi,t+1 = β1UST SPt × αHi,t + β2αHi,t + β3Xi,t + δi + δt + εi,t, (R2)

where USTSPt is the UST safety premium and αHi,t is a dummy variable equal to 1 if the firm has

high αi,t and 0 otherwise. I construct αHi,t in two ways. First, I set αHi,t to one if the firm belongs to

the highest quintile of the quarterly cross-basis distribution. Second, I set αHi,t to one if the firm is

rated A- and above in the quarter. “UST SP" is a proxy for the US Treasury safety premium. The

baseline estimation uses the structurally estimated US Treasury safety premium.20 The controls

are the same as in (R1).

The results are reported in columns (5)-(8) of Table 1.8. In all specifications, firms with high

safety premium respond to an increase in the aggregate safety premium by issuing relatively more

debt. This result is robust to including rating buckets fixed effects and/or extra controls as shown

in columns (6) and (8). It is noteworthy that estimating the aggregate safety premium coefficient is

not possible in this specification, since it is subsumed by the time fixed effects. Nevertheless, the

interaction coefficient provides evidence in support to Prediction 2 and 3.

This specification also alleviates concerns that noise in the cross-basis measure is systemat-

ically correlated with firm decisions and could be biasing the results from regression R1. The

20Results considering other proxies for the aggregated safety premium are reported in the appendix.

31

results in columns (5)-(8) of Table 1.8 provide strong evidence for the cross sectional effects of

safety premium variation on the net debt issuance, that they are not driven by idiosyncratic noise

in the firm-level cross-basis measure.

In summary, I find strong evidence that firms respond to higher safety premia by issuing more

debt than their comparable peers with lower safety premia. This result is robust to the inclusion

of ratings fixed effects and extra controls. The impact of the cross-basis on net debt issuance is

statistically and economically significant.

1.6.2 Impact of Safety Premium on Firms’ Real Decisions

The second set of empirical tests aim to identify how firms use the proceeds from issuing safe

debt. Let DebtIssuance be an indicator variable equal to 1 if the firm has strictly positive net debt

issuance value and 0 otherwise. I run the following empirical model

yi,t+1 =β1CrossBasisi,t ×DebtIssuancei,t+1 + β2CrossBasisi,t+

β3DebtIssuancei,t+1 + β4Xi,t + δi + δt + εi,t,

(R3)

The cross-basis and controls are the same as in regression (R1). The main y-variables of interest are

net payouts and investment. I measure payouts as the sum of dividends and equity repurchases. The

proper measure of investment must consider the increasing share of intangible investment among

corporate investment (Crouzet and Eberly (2020)) and also acquisitions. To this end, I measure

investment in three different ways: (1) the traditional capital investment, defined as CAPEX plus

net PPE bought; (2) intangible investment, measured as the sum of R&D and 30% of SG&A

expenses, as in Peters and Taylor (2017); and (3) acquisitions.

Although the model presented in Section 1.3 does not cover the impact of safety premium vari-

ation on liquid assets accumulation, this is an important margin to be empirically tested. To this

end, I include ∆Cash and financial investment in the set of y-variables. Using financial invest-

ments instead of Compustat CHE changes is important. It captures the increasing relevance of the

financial assets portfolio in non-financial firms, which is often not properly accounted for in the

32

Compustat CHE variable. Those issues are described in Darmouni and Mota (2020).

Table 1.9 shows the regression results. Interestingly, only payouts respond to the cross-basis

conditional on positive debt net issuance. The impact on payouts is statically and economically

significant. The point estimate signifies that 1 percentage point increase in the cross-basis forecasts

8 basis points (= β1 + β2) increase in total net payouts as a percentage of total assets. This is

an increase of 8%(

= β1+β2mean(payouts | DebtIssuance)

)to firms’ average quarterly net payout conditional

on positive net debt issuance. In dollar amounts, 1 percentage point increase in the cross-basis

forecasts $23.2 million, a number very close to the effect of the cross-basis on net debt issuance,

$27.4 million.

Investment and liquid assets accumulation do not respond to the cross-basis conditional on

positive issuance. The proceeds from issuing safe debt are largely transformed into equity payouts

rather than investment. This result is consistent with the hypothesis that firms that benefit from a

safety premium are not likely to be financially constrained. It is also consistent with Stein (1996),

who says that firms with decreasing returns to scale will not respond to cheap financing by investing

more as long as they are already at their optimal scale.

The lack of response in cash accumulation or financial investment differentiates the safety

premium mechanism from the market timing described in Bolton et al. (2013) and Eisfeldt and

Muir (2016). In case of a stochastic cost of external finance, firms should raise external finance

when it is cheap, guaranteeing liquidity if future adverse shocks happen in periods when external

finance is expensive. Under this theory, firms should issue debt in times of high safety premium

and accumulate liquid assets. The results in this paper suggest instead that firms that benefit from

a high safety premium are not likely to face financial constraints in raising funds, thus the liquidity

accumulation response is not observed.

In summary, the results of this subsection show that the proceeds from issuing debt in response

to variation in the safety premium are most converted into equity payout, rather than real invest-

ments. Based on Predictions 1 and 2, this is strong evidence that companies most affected by the

safety premium are not likely to be financially constrained. Furthermore, the results described in

33

this subsection are robust to including rating fixed effects, leverage ratio, and past profitability as

controls. The results are reported in Table A.1 in A.5.

1.6.3 Heterogeneity across Firms’ Characteristics

According to the model’s predictions, responses to variation in the safety premium vary across

firms’ initial conditions. First, according to Prediction 4 the extent to which a firm can increase

its leverage in response to an increase in the safety premium depends on its debt capacity and its

expected bankruptcy cost. Effectively, its default risk acts as a constraint on its ability to take

advantage of the discount in debt financing provided by the safety premium. Second, accord-

ing to Predictions 1 and 2, the impact on investment or payouts is related to the firm’s financial

constraints. Unconstrained firms are more likely to operate at their optimal scale. Therefore, an

increase in the safety premium should have a stronger payout response than in real investment. The

opposite should be true for constrained firms.

I examine the heterogeneity across different firms by running the following regression

Yi,t+1 = β1CrossBasisi,t×UNCi,t+β2CrossBasisi,t+β3UNCi,t+β4Xi,t+δi+δt+εi,t, (R4)

where UNC is a dummy variable equal to one if the firm is likely to be unconstrained and 0

otherwise. I group firms by five relevant characteristics: ratings, profitability, cash, payouts, and

size. For each characteristic and quarter, I set UNCi,t = 1 for firms that are in the largest 20%

quantile. For ratings, I consider investment grade ratings (BBB− and above) to be unconstrained.

To study a firm’s response conditional on issuing, I select the sample for which net debt issuance

is strictly positive.21

Table 1.10 reports the results for each one of the five characteristics. Column (1) shows that

the issuance response of investment grade and large firms to the cross-basis is stronger than other

firms, suggesting that they have a smaller marginal cost of increasing leverage. Investment grade

21Ideally, the triple interaction of CrossBasis × UNC × DebtIssuance should be considered. Due to samplesize restrictions, I consider instead the selected sample in which net debt issuance is strictly positive.

34

firms also engage in higher payouts in response to the cross-basis, as shown in column (2) of Panel

A. As predicted by the model, in all specifications, the impact on capital investment is smaller for

unconstrained firms, as shown in column (3). The differential impact in intangible investment for

investment grade, profitable, and high-payout firms is solely driven by SG&A expenses. Columns

(8) and (9) show that there are no differential effects in liquidity accumulation. Finally, one in-

triguing result emerges from the heterogeneous analysis, reported in column (7). Investment grade

and large firms engage in acquisitions in response to an increase in the cross-basis. Although spec-

ulative, this last result suggests that the safety premium can act as a financial advantage for firms

that can benefit from it, potentially having consequences for product market competition.

In summary, the results presented in this subsection are largely consistent with the corporate

safety-creation model. Firms that are likely to be financially unconstrained, in the sense that they

could achieve their optimal investment and scale even in the absence of a safety premium, respond

to the cross-basis by issuing debt and engaging in payouts. The impact of the cross-basis on capital

investment is smaller than for the contained group. These are in accordance with Predictions 1 and

2. Furthermore, in accordance with Prediction 4, unconstrained firms respond to higher cross-basis

by issuing more debt than constrained firms.

1.6.4 Summary

The results in this section show two notable patterns governing firms’ activities. First, the

cross-basis, a measure of relative safety premium in corporate bonds, strongly forecasts net bond

issuance. The results are robust to adding rating fixed effects and several extra controls. Second,

the proceeds from issuing safe debt are not converted into investment measured either as capital

investment or intangible investment. Instead, the proceeds from issuing debt in response to the

cross-basis are used to finance payouts in the form of equity repurchases or dividends. The results

suggest that firms that benefit from a safety premium embedded in the price of their liabilities are

likely to be financially unconstrained.

35

1.7 Robustness and Additional Tests

1.7.1 Is the Cross-basis Really Capturing Variation in the Safety Premium?

To validate the cross-basis as measure of the relative safety premium in corporate bonds, it is

important to consider factors that could influence the cross-basis that are not related to the aggre-

gate safety premium or the firm specific perceived safety. The first point worth noting is that any

frictions that affect all bonds or CDSs equally affect the CDS-bond basis, but they do not affect

the cross-basis. Second, frictions that are not systematically correlated with firms’ fundamentals

would make the cross-basis measurement noisy, but would not jeopardize the validity of the anal-

yses on how firms respond to safety premium variation. The frictions of concern are then those

likely to exhibit strong cross sectional correlation with the reference firm’s fundamentals, such as

credit quality.

Bai and Collin-Dufresne (2019) present a list of factors that could explain the cross sectional

variation of CDS-bond basis. From this list, the factors of concern for my study are counterparty

risk and bond liquidity. I add to that the restructuring uncertainty and mismatch in the payoff

structure of bonds. In this section, I discuss in detail why each one of these factors is unlikely to

be a first-order concern in validating cross-basis as a measure of the safety premium.

Before I start, it is worth noting that, after a thorough study of the cross section of the CDS-bond

basis, Bai and Collin-Dufresne (2019) conclude that only credit rating is consistently significant

in explaining the cross sectional variations of the basis, indicating that collateral quality is always

relevant in explaining the cross sectional variation in the CDS-bond basis. Since collateral value is

one of the safety services of interest, this result is reassuring of the interpretation of the cross-basis

as a relative safety premium measure.

36

Can it be counterparty risk?

In case of a credit event, the protection seller of the CDS contract must deliver the face value

of the bond in exchange for, typically, a bond from an eligible pool. Though, if the protection

seller counterparty defaults (or has defaulted) when the underlying firm defaults, then the CDS

protection expires worthless. This is called counterparty risk.

The evidence is that counterparty risk is small in CDS contracts. Arora et al. (2012) use the

Lehman default to assess the premium associated with counterparty risk. In the authors’ own

words, they find that counterparty effects on CDS prices are “vanishingly small." The modest

size of counterparty risk is also documented by Du et al. (2017). There are some reasons that

rationalize why the counterparty risk is likely to be small. First, CDS are highly collateralized;

second, derivative contracts are senior liabilities in case of counterparty default; third, the typical

counterparty is a large bank, or more recently, very often a central counterparty,22, all of which have

a low probability of default. The combination of a high concentration of counterparty entities for

single-name CDS, together with the small counterparty risk, makes it unlikely that the counterparty

risk is the main driver of the cross sectional variation in the CDS-bond basis.

Can it be restructuring risk?

Bond investors might incur losses due to credit restructuring events that do not trigger the CDS

payment under the “No Restructuring" (XR) CDS contract. This friction would make the hedged

bond carry a credit risk. The evidence though is that restructuring risk is also small. I am able

to measure the premium that investors are willing to pay for restructuring risk by comparing the

prices for different restructuring clauses. By comparing the difference in prices to buy protection

for the same entity at the same time, I find that the difference in prices accounts for less than few

basis points of the CDS spread, and moreover it has small explanatory power for cross sectional

regressions. The small premium paid for restructuring risk is consistent with the magnitude found

22Following the great financial crisis, regulatory changes moved the market to become more centrally clearedthrough a CCP, as recommended by the Dodd-Frank Act in 2009.

37

by Berndt et al. (2007).

Can it be mismatch between the payoff structure of the bond and that of the CDS?

Another issue that potentially affects the CDS-bond basis is the mismatch between the payoff

structure of the bond and that of the CDS. For instance, the typical corporate bond has a fixed

coupon with semiannual coupon payments, whereas a CDS has fixed spreads paid quarterly. It

is also typical that in case of default, the CDS protection buyer has to pay the accrued interest

from the last coupon payment to the CDS protection seller, whereas the holder of the bond would

not necessarily receive the last coupon. Furthermore, the credit spread might not be the ideal

comparison to the CDS spread, since the typical bond is fixed rate rather than floating.23

The CDS-spread Par Equivalent CDS (PECS) method to calculate the CDS-bond basis is a

well-known methodology that accommodates the differences in payoff structure of the bond and

the CDS.24 In the appendix, I recalculate the basis using the PECS methodology and I show the

assumptions necessary to achieve the equivalence between equation (1.15) and the PECS basis.

The result is that the CDS-bond basis calculated with the PECS methodology is very similar to

the simplified method described in equation (1.15). The disadvantage of the PECS methodology

is that it is more vulnerable to outliers. For this reason, I use the CDS-bond basis introduced in

equation (1.17) as my benchmark measure. As shown in A.6, results are qualitatively unchanged

when using PECS.

Can it be liquidity?

Finally, differences in the cross-basis could be driven by differences in market liquidity of the

underlying bond. One could think that bond yields are noisy proxies for the true price because of

lack of trading or high bid-ask spreads, and that this noise is correlated with bond characteristics

such as ratings. I deal with this concern by testing how much of the cross sectional variation of

23Duffie (1999), the ideal corporate spread to compute the CDS-bond basis would be that of a floating rate bond.This issue is also studied by Longstaff et al. (2005).

24PECS was developed by JP Morgan. For further details see Elizalde, Doctor, and Saltuk (2009). PECS is alsothe methodology used in Bai and Collin-Dufresne (2019).

38

the CDS-bond basis is explained by bid-ask spread or turnover. As shown by the r-squares of

regressions in columns (5)−(7) of Table 1.5, the two variables together add very little explanatory

power. These results alleviate concerns related to the mis-measurement in the cross-basis due to

bond iliquidity.

Clearly, the notion of liquidity goes beyond measures of bid-ask spreads and turnover. For

example, in Holmstrom and Tirole (2001), assets earn a liquidity premium whenever they are good

“reserve assets," simply because they offer better insurance than others against income shortfalls

and other liquidity needs. Liquidity is also related to assets’ exposure to adverse selection, and

information-insensitive assets are likely to be more liquid (see, e.g. Dang et al. (2015)). In both

cases, assets with stable cash flows are more likely to be liquid.

Indeed the regulatory development of Dodd-Frank has induced a demand for safety directly

linked to liquidity considerations. Banking institutions are now required to hold in their balance

sheet a particular fraction of assets deemed to be liquid (the liquidity coverage ratio or LCR).

Amongst the assets that can meet the LCR is high quality corporate debt as level 2B assets.25

In my terminology this is akin to an increase in the demand for safety, which translates into an

increase in demand for corporate debt. Liquidity and safety are thus fundamentally intertwined.

1.7.2 Delayed Reactions: Effect of Safety Premium in Different Time Hori-

zons

There is large evidence that firms’ financial and investment decisions can be lumpy and de-

layed. In Section 1.6, I found that the cross-basis forecasts higher debt issuance and does not

forecast investment in the span of one quarter. It is interesting to check whether these results are

robust when considering longer response periods. To this end, I re-estimate regressions in Model

(R1) and Model (R3) considering different horizons of the y-variable.

25For details see the Federal Register Vol.79 No.197., Department of the Treasury (2014). The directive allows forcorporate debt of non-finanacial firms that meet the definition of “investment grade” under 12 CFR to be part of Level2B high-quality liquid assets (HQLA). According to the agencies, “meeting this standard is indicative of lower overallrisk and, therefore, higher liquidity for a corporate debt security." (pg. 61459)

39

Figure 1.9 shows regression coefficients and 5% confidence intervals for dependent variables 1

to 8 quarters ahead. Panel (a) plots β1 coefficient of Model (R1). This picture shows that the effect

of cross-basis on net issuance is concentrated in the first quarter, and there is no evidence of lagged

responses.

Panels (b)−(g) of Figure 1.9 show β1 coefficients for Model (R3). The dependent variables

are the same as in Table 1.9. In this case, coefficients are interpreted as firms’ future response to

a variation on the cross-basis conditional on positive net debt issuance at t. I do not find evidence

that firms that issue debt in response to a high cross-basis respond by investing more in any period

ahead. The response is concentrated in net payouts in the one quarter ahead.

In summary, by looking at different horizons, I do not find evidence of pronounced delay in

firms’ behavior that contradicts the previous section’s conclusions.

1.7.3 Limits to Arbitrage and the Demand for Safety

If we depart from the representative investor economy, there could be investors that value

safety services and investors that do not. Investors that do not value safety services are natural

arbitrageurs that can supply safety services by shorting safe assets. In this economy a positive

safety premium will only exist in equilibrium if there are arbitrage costs related with supplying

safe assets. Furthermore, the equilibrium aggregate safety premium must be both the marginal

utility of consuming one extra unit on safety services and the marginal cost of arbitrageurs of

supplying this unit.

Clearly violations of the law of one price cannot be too high. In A.7, I develop an alternative

model that explicitly incorporates arbitrageur along the lines of Gârleanu and Pedersen (2011). I

also develop an example on how an arbitrageurs can exploit the negative CDS-bond basis trade,

and specifically link it to the implicit cost of capital of the arbitrageur (see also Boyarchenko et al.

(2018) and Bai and Collin-Dufresne (2019)).26 As it is intuitive, this cost of capital places a upper

26I would like to thank Aref Bolandnazar for helpful discussions about the CDS contract margins requirements,which are necessary to properly calculate the cost of arbitrage of the negative CDS-bond basis trade.

40

bound on the CDS-bond basis, and thus how much prices can deviate from the discounted cash

flows on account of the demand for safety.

More broadly, the higher regulatory cost imposed on intermediaries since the Great Financial

Crisis have forced many of them on the side of the demand for safety rather than on the supply.

Intermediaries now have to hold particular securities in order to meet liquidity or capital require-

ments, for example. When these constraints bind, their activities as arbitrageurs are curtailed.27.

Thus the high correlation in the different measures of violations of arbitrage relations that have

been observed since the passage of the new, more stringent regulatory framework for financial in-

stitutions around the world. For instance, in my case the average cross-basis for IG bonds is highly

correlated with the magnitude of violations of other apparent arbitrage trades such as the dollar

CIP-deviation described in Du et al. (2018).

1.8 Conclusion

In this paper, I study the role of non-financial corporations in the supply of safe assets. I argue

that corporate debt of highly rated firms, while not completely insulated from credit risk, has some

of the safety features needed to function as a store of value, as collateral, or as regulatory capital.

Thus, corporate debt can serve as imperfect substitutes for traditional safe securities such as US

Treasuries, other developed countries’ sovereign debt, and highly rated asset-backed securities. In

this case, by issuing debt, corporate managers may capture the premium investors are willing to

pay for safe assets.

The paper introduces a model of the supply of safety services by non-financial corporates. The

model generates testable predictions regarding the safety premium component in corporate debt

prices and about the firm’s response to variation in the safety premium. The paper innovates in

modeling corporate debt as supplying safety services to varying degrees; that is, safety is not bi-

27A rapidly growing literature explores how the balance sheet cost of financial intermediaries can address violationsof the law of one price in a variety of markets, as well as the effect on market liquidity and volume. Some examplesare Du et al. (2018), Duffie (2018),Andersen et al. (2019), Fleckenstein and Longstaff (2020), Bolandnazar (2020),among others.

41

nary, as has been traditionally assumed in the literature, but rather it varies smoothly between the

safety provided by US Treasuries and that of an asset that provides no safety services whatsoever.

This innovation allows me to exploit cross sectional differences in bond prices to identify firm spe-

cific components of the safety premium without the need to estimate the aggregate safety premium,

an estimation which is often contentious.

I introduce a novel measure of this relative safety premium, the cross-basis, and I show that

firms with a higher cross-basis issue more debt. I argue that this is because firm managers cre-

ate additional shareholder value, above the value associated with standard business operations, by

engaging in the supply of safety services. The supply of safety services by non-financial corpo-

rates, through corporate debt issuance, only occurs when the supply of safety services is in limited

supply relative to the demand for safety. This paper is the first to offer a model of the supply of

safety services and to show the existence of an upward sloping supply curve among non-financial

corporates.

Much remains to be done to understand the market for safety services, in particular what de-

termines the time series variation in the demand and supply of safety. First, as already mentioned,

a key building block of my framework is the observation that different securities provide different

degrees of safety services. US Treasury officials are on record as stating that they do not take into

account the level of the demand for safety in their decisions regarding the supply of Treasuries.

Thus one can safely assume that the sources of variation in the supply of safety services is exoge-

nously driven by the funding needs of the US government, as has been traditionally assumed in the

literature. Moreover, informal accounts of the years leading up to the global financial crisis suggest

that the financial services sector was in the business of providing safety services. However, regu-

lation after the financial crisis seems to have greatly impaired the ability of the financial sector to

increase the supply of safety services. Is this in fact the case? And, is this the reason non-financial

corporations are in the business of supplying safety services? The image that my model suggests is

that there are multiple suppliers of safety services with different marginal costs of supplying these

services, and that the non-financial corporate sector is one such supplier.

42

In addition, progress needs to be made on the determinants of the demand for safety. The

literature, and this paper is no exception, takes the demand for safety services as a primitive, and

models it in reduced form. But these safety services are tangible. For example, the ability of

a particular asset to relieve liquidity or regulatory constraints, while challenging to measure, in

principle could be observed and measured. Thus, it would be helpful to link exogenous sources of

variation in liquidity or capital requirements to changes in asset premia in a way that establishes

the presence of a particular channel in the demand for safety. Moreover, it might be that agents

experience a particular form of psychological relief when holding certain assets that is not captured

by the cash flow characteristics of the asset in question. This consideration may suggest the need

for further development of a behavioral theory of taste for safety.

This paper shows that the safety premium affects the yields of a much broader class of financial

assets than has been recognized. Additionally, safety may explain part or all of a puzzling present

phenomenon. Yields on financial assets, across classes and jurisdictions, have been extraordinarily

low for years. Why should this be so? The safety premium may contribute to such consistently and

extraordinarily low yields, as investors who would otherwise have demanded higher risk premia

have instead been constrained in holding assets that offer some safety services. As markets and

contracts develop greater complexity in collateral requirements, insurance considerations, and reg-

ulation, we can expect the safety premium to be even more central in explaining the cross section

of asset prices.

43

Figures

Figure 1.1: Corporate Safety Premium and Bond Price

(a) Corporate safety premium (b) Corporate bond price

This figure shows how, for a fixed level of investment, the corporate safety premium and corporate bond price varywith the amount of bonds issued. Each line shows the value for different levels of aggregate safety premium denotedby ϕ.

44

Figure 1.2: Firm’s Total Value as a Function of Debt Issuance

(a) No bankruptcy cost (ξ = 0) (b) Positive bankruptcy cost (ξ = 0.3)

This figure shows how, for a fixed level of investment, the total value of the firm varies with the amount of bondsissued. Each area shows the value for different levels of UST safety premium denoted by ϕt. The dashed line is thevalue of b that maximizes the value of the firm for each value of ϕt.

45

Figure 1.3: Response to Perceived Safety

(a) Unconstrained optimal debt issuance (b) Constrained optimal debt issuance

(c) Unconstrained optimal investment (d) Constrained optimal investment

(e) Unconstrained optimal dividends (f) Constrained optimal dividends

This figure shows optimal debt issuance, optimal investment, and optimal dividends as a function of the firm-specificperceived safety, αi,t, for different levels of the aggregate safety premium, ϕt. The plots on the left are for a firm thatis financially unconstrained, and the plots on the right are for a firm that is financially constrained.

46

Figure 1.4: Response to Aggregate Safety Premium

(a) Unconstrained optimal debt issuance (b) Constrained optimal debt issuance

(c) Unconstrained optimal investment (d) Constrained optimal investment

(e) Unconstrained optimal dividends (f) Constrained optimal dividends

This figure shows optimal debt issuance, optimal investment, and optimal dividends as a function of the aggregatesafety premium, ϕt, for different levels of perceived safety, αi,t. The plots on the left are for a firm that is financiallyunconstrained, and the plots on the right are for a firm that is financially constrained.

47

Figure 1.5: Interaction Effects of Bond Issuance Response to Safety Premium

0.00 0.01 0.02 0.03 0.04 0.05φt

0.05

0.10

0.15

∂bt+1

∂αt

(a) Bond issuance response to αi,t

0.0 0.2 0.4 0.6 0.8 1.0αt0

2

4

6

8

∂bt+1

∂φt

(b) Bond issuance response to ϕt

This figure shows how optimal bond issuance responds to perceived safety αi,t as function of ϕt, and how optimalbond issuance responds to perceived safety ϕt as function of αi,t.

Figure 1.6: Heterogeneous Effects on Bond Issuance Due to Initial Net Worth

0.0 0.1 0.2 0.3 0.4 0.5wt

0.02

0.04

0.06

0.08

∂bt+1

∂αt

constrained firm unconstrained firm

(a) Bond issuance response to αi,t

0.0 0.1 0.2 0.3 0.4 0.5wt

2

4

6

8

∂bt+1

∂φt

constrained firm unconstrained firm

(b) Bond issuance response to ϕt

This figure shows how the optimal bond issuance responds to perceived safety αi,t and ϕt as a function of the initialnet worth wi,t. Firms with low net worth are financially constrained, and firms with high net worth are not financiallyconstrained. These curves are calculated from applying the implicit function theorem to firms’ first-order conditions.

48

Figure 1.7: The CDS-bond Basis

−10.0

−7.5

−5.0

−2.5

0.0

2004 2006 2008 2010 2012 2014 2016 2018 2020

CD

S−

bond

bas

is (

%)

AAA and AA A BBB BB B and Below

This figure shows the times series of the CDS-bond basis across ratings. The CDS-bond basis is the difference betweenthe CDS spread and the bond’s implied credit spread. For each rating class, the CDS-bond basis is the face-valueweighted average.

Figure 1.8: US Treasury Safety Premium.

0

1

2

3

4

5

6

2004 2006 2008 2010 2012 2014 2016 2018 2020

Val

ue (

%)

UST SP ( ϕt ) AAA credit−spread (K−VJ) BBB − (AAA and AA) basis−spread Box (BDM)

This picture shows the time series of different measures of the US Treasury safety premium. All measure definitionsare described in detail in A.2.

49

Figure 1.9: Effect of Cross-Basis on Firms’ Decisions for Different Time Horizons.

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(a) Net debt issuance

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(b) Payout

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(c) Capital investment

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(d) Intangible investment

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(e) Acquisitions

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(f) Financial investment

−0.1

0.0

0.1

1 2 3 4 5 6 7 8Quarter

Est

imat

e

(g) ∆Cash

This picture shows regression coefficients in the y-axis and 1 to 8 quarters ahead in the x-axis. Panel (a) plots the cross-basis regression coefficients, β1, of Model (R1). Panels (b) - (g) plots the interaction cross-basis and debt issuancedummy coefficient, β1, of Model (R3) for different dependent variables. The shaded area is the estimated coefficient95% confidence interval. Standard errors are clustered by both firm and time. Data is quarterly from 2003Q1 to2019Q3.

50

1.9 Tables

Table 1.1: Calibration

Parameter Description Value Target

ζ Decreasing returns to scale 0.65 Kuehn and Schmid (2014)

δ Depreciation rate 0.03 Kuehn and Schmid (2014)

ξ Bankruptcy costs 0.30 Crouzet (2018)

ρz Idiosy. shock persistence 0.85 Kuehn and Schmid (2014)

σz Idiosy. shock volatility 0.15 Kuehn and Schmid (2014)

ρx Agg. shock persistence 0.89 Begenau and Salomao (2019)

σx Agg. shock volatility 0.0093 Begenau and Salomao (2019)

β Rate of time preference 0.996 Kuehn and Schmid (2014)

γ Relative risk aversion 7.5 Kuehn and Schmid (2014)

51

Table 1.2: Number of Bonds and Firms with Valid CDS-Bond Basis

Number of Bonds Number of Firms

Year AAA and AA A BBB BB B and Below Total AAA and AA A BBB BB B and Below Total

2003 114 676 1054 301 197 2342 18 85 128 64 37 332

2004 103 634 1023 376 272 2408 16 92 146 79 60 393

2005 106 653 1006 485 317 2567 16 100 166 106 79 467

2006 117 641 900 413 361 2432 19 105 174 95 82 475

2007 144 620 896 363 408 2431 19 91 168 95 103 476

2008 130 609 886 348 352 2325 19 78 159 102 94 452

2009 128 637 979 324 335 2403 17 79 171 82 75 424

2010 166 648 1038 317 341 2510 18 79 171 79 75 422

2011 180 789 1078 362 311 2720 17 84 174 89 59 423

2012 253 922 1106 364 297 2942 21 90 167 87 63 428

2013 273 944 1213 347 267 3044 21 89 162 76 54 402

2014 288 931 1263 332 227 3041 21 85 159 71 40 376

2015 326 987 1435 360 212 3320 22 82 164 78 41 387

2016 365 936 1514 319 219 3353 23 73 166 71 46 379

2017 380 927 1550 275 203 3335 23 68 159 59 39 348

2018 339 951 1628 248 179 3345 22 69 153 52 34 330

2019 286 1010 1572 255 148 3271 19 73 138 50 32 312

2020 279 1076 1566 341 125 3387 15 70 131 47 26 289

avg 221 811 1206 341 265 2843 19 83 159 77 58 395

This table shows the number of distinct bonds and distinct firms in the sample per year and rating bucket. The last rowshows the average annual numbers during the sample. Data is from January 2003 to September 2020.

52

Table 1.3: Firm-Level Summary Statistics

Statistic N Mean St. Dev. Pctl(25) Median Pctl(75)

Basis (%) 19,227 −1.149 1.168 −1.393 −0.850 −0.536

Cross basis (%) 19,227 −0.228 0.942 −0.418 −0.052 0.220

Credit-spread (%) 19,227 2.590 2.391 1.191 1.839 3.251

CDS-spread (%) 19,227 1.461 1.792 0.482 0.874 1.724

Net Debt Issuance (% lag assets) 19,227 0.324 3.329 −0.555 −0.009 0.232

I{ DebtIssuance } 19,227 0.307 0.461 0 0 1

Net Payout (% lag assets) 19,227 0.012 1.362 0 0 0

ST Debt Net Issuance (% lag assets) 19,227 1.090 1.644 0.102 0.595 1.601

CAPEX (% lag assets) 19,227 1.287 1.321 0.496 0.899 1.576

Capital Investment (% lag assets) 19,227 1.224 1.255 0.464 0.864 1.529

R&D (% lag assets) 19,227 0.387 0.791 0.000 0.000 0.452

.3 × SGA (% lag assets) 19,227 1.057 1.046 0.362 0.733 1.362

Intangible Investment 19,227 1.444 1.300 0.463 1.026 2.187

Acquisitions (% lag assets) 19,227 0.515 2.289 0 0 0.1

Financial Investment (% lag assets) 19,227 0.016 1.145 −0.003 0.000 0.012

∆ Cash (% lag assets) 19,227 0.152 2.991 −0.869 0.044 1.094

Net Operating Cash Flows (% lag assets) 19,227 4.051 3.095 2.173 3.674 5.578

Other Cash Flows (% lag assets) 19,227 0.068 2.431 −0.174 0.004 0.214

Total Assets ($ Bi) 19,227 27.339 46.642 5.810 12.107 27.952

CHE (% assets) 19,227 8.718 8.807 2.547 5.782 11.986

ROA (%) 19,227 3.449 2.023 2.381 3.314 4.377

Leverage Ratio (%) 19,227 30.997 13.776 20.878 29.177 39.575

Long-term Leverage Ratio (%) 19,227 27.574 13.492 17.678 25.648 35.545

This table shows the firm-level quarterly data summary statistics. Mean, median, standard deviation, and selectedpercentiles are presented. For comparability, the statistics for the Compustat sample are presented based on the sametime period. Data is quarterly from 2003Q1 to 2019Q3. See A.1.1 for the details on the construction of each variable.

53

Table 1.4: CDS-bond Basis Summary Statistics

Panel A: Full Sample


Cross-basis 68,463 −0.256 1.183 −0.453 −0.052 0.239

Basis 68,463 −1.179 1.404 −1.417 −0.851 −0.513

Yield 68,463 5.231 3.130 3.395 4.806 6.267

Credit-spread 68,463 2.815 3.101 1.212 1.921 3.477

CDS-spread 68,463 1.666 2.416 0.491 0.914 1.924

Panel B: Pre-GFC (January 2003 – November 2007)


Cross-basis 19,026 −0.114 0.468 −0.295 −0.054 0.143

Basis 19,026 −0.887 0.501 −1.095 −0.810 −0.587

Yield 19,026 6.036 1.583 5.084 5.798 6.749

Credit-spread 19,026 2.055 1.526 1.055 1.562 2.599

CDS-spread 19,026 1.201 1.571 0.292 0.572 1.543

Panel C: During GFC (December 2007 – June 2009)


Cross-basis 6,535 −0.783 3.013 −1.085 −0.245 0.398

Basis 6,535 −3.067 3.255 −3.555 −2.191 −1.563

Yield 6,535 9.117 6.654 5.827 7.319 9.620

Credit-spread 6,535 6.544 7.018 2.859 4.546 7.250

CDS-spread 6,535 3.492 5.474 0.741 1.639 3.943

Panel D: Post-GFC (July 2009 – September 2020)


Cross-basis 42,902 −0.239 0.837 −0.504 −0.034 0.277

Basis 42,902 −1.021 0.904 −1.330 −0.775 −0.428

Yield 42,902 4.282 2.071 2.950 3.780 5.132

Credit-spread 42,902 2.585 2.094 1.213 1.868 3.373

CDS-spread 42,902 1.594 1.745 0.581 0.976 1.867

This table shows the summary statistics for the cross-basis, defined as the basis minus the basis index, where thebasis index is the face-value weighted average of all basis in the sample; the basis, defined as the difference betweencredit spread and maturity-matched CDS spread; corporate bond yield; credit spread (over US Treasury); and the CDSspreads. Data is monthly and the full sample is from January 2003 to September 2020. Panels B, C, and D shows thesame summary statistics for different sample windows. 54

Table 1.5: CDS-Bond cross section Analyses

Dependent variable:

Cross-basis

(1) (2) (3) (4) (5) (6) (7)

A −0.169∗∗∗ −0.165∗∗∗ −0.167∗∗∗ −0.162∗∗∗

(0.011) (0.011) (0.011) (0.010)

BBB −0.573∗∗∗ −0.538∗∗∗ −0.582∗∗∗ −0.547∗∗∗

(0.032) (0.031) (0.032) (0.031)

BB −0.947∗∗∗ −0.907∗∗∗ −0.977∗∗∗ −0.935∗∗∗

(0.045) (0.044) (0.047) (0.045)

B and Below −1.132∗∗∗ −1.060∗∗∗ −1.187∗∗∗ −1.114∗∗∗

(0.084) (0.080) (0.086) (0.082)

Rating rank −0.102∗∗∗

(0.007)

ICR −0.671∗∗∗

(0.056)

αi 1.159∗∗∗

(0.073)

Bid-ask spread −0.219∗∗∗ −0.213∗∗∗

(0.018) (0.019)

Turnover 0.008∗∗∗ 0.008∗∗∗

(0.001) (0.001)

Constant 0.349∗∗∗ 0.718∗∗∗ −0.113∗∗∗ −0.418∗∗∗ 0.451∗∗∗ 0.297∗∗∗ 0.397∗∗∗

(0.020) (0.055) (0.011) (0.023) (0.025) (0.018) (0.023)

Observations 66,549 66,549 62,964 62,964 66,549 66,549 66,549R2 0.255 0.249 0.119 0.105 0.269 0.264 0.278

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table presents Fama-MacBeth regression results of cross-basis on bond characteristics, as specified in equation(1.19). In Column (1), the proxy for safety is the rating buckets categorical variables: AAA and AA, A, BBB, BB,and B and below, the first category is omitted due to multicollinearity. In Column (2), the proxy for safety is ratingrank, which is equal to 1 if the bond is rated AAA, 2 if the bond is rated AA+, 3 if the bond is rated AA, etc. InColumn (3), the proxy for safety is the interest coverage ratio (ICR) at the firm level, which is the total long-term debtdivided by EBITA; both variables are quarterly and extracted from Compustat. In column (4), the proxy for safety isthe structurally estimated perceived safety αi,tas described in A.2. Data is monthly from January 2003 to September2020. Standard errors are shown in parenthesis.

55

Table 1.6: Treasury Safety Premium Proxies

(a) Summary Statistics


UST SP 213 1.299 0.845 0.859 1.060 1.394

AAA credit-spread (K-VJ) 213 0.800 0.331 0.617 0.718 0.857

AAA and AA - BBB basis-spread 213 0.440 0.324 0.255 0.338 0.506

Box (BDG) 171 0.371 0.204 0.250 0.325 0.420

(b) Correlation Matrix

UST SP AAA credit-spread (K-VJ) BBB - (AAA and AA) basis-spread Box (BDG)

UST SP 1

AAA credit-spread (K-VJ) 0.887 1

BBB - (AAA and AA) basis-spread 0.900 0.697 1

Box (BDG) 0.573 0.650 0.286 1

This table shows the summary statistics and correlation matrix of different proxies for US Treasury safety premium.A.2 describes the details of the estimation of each of those proxies. Data is monthly from January 2003 to September2020 if available.

56

Table 1.7: CDS-Bond Time Series Analyses

Panel A: UST SP (ϕT )

rating β0 βr t-stat(β0) t-stat(βr) R2

AAA and AA -0.146 0.393 -10.329 43.112 0.898

A -0.094 0.227 -7.045 26.397 0.768

BBB -0.028 -0.073 -3.355 -13.253 0.454

BB -0.092 -0.285 -2.842 -13.581 0.466

B and Below 0.131 -0.551 1.946 -12.689 0.433

Panel B: AAA−Treasury Spread


AAA and AA -0.294 0.823 -7.429 17.998 0.606

A -0.171 0.465 -6.080 14.332 0.493

BBB 0.020 -0.178 1.585 -12.362 0.420

BB 0.077 -0.674 1.560 -11.873 0.400

B and Below 0.384 -1.212 3.661 -9.989 0.321

Panel C: BBB−(AAA and AA) CDS-Bond Basis Spread


AAA and AA -0.070 0.989 -4.249 32.680 0.835

A -0.056 0.585 -4.410 25.058 0.748

BBB -0.039 -0.191 -5.025 -13.380 0.459

BB -0.154 -0.702 -4.929 -12.222 0.415

B and Below -0.029 -1.265 -0.432 -10.300 0.335

Panel D: Box-Trade Spread


AAA and AA 0.100 0.765 1.780 5.779 0.165

A 0.106 0.299 2.889 3.460 0.066

BBB -0.050 -0.206 -3.585 -6.282 0.189

BB -0.384 -0.212 -6.644 -1.556 0.014

B and Below -0.290 -0.776 -2.504 -2.837 0.045

This table shows the results of the time series regressions specified in equation (1.20). The dependent variable is theface-value weighted average cross-basis for each rating bucket. Each panel corresponds to a different proxy for theUS Treasury safety premium. A.2 describes the details of the construction of each of those proxies. Data is monthlyfrom January 2003 to September 2020 if available.

57

Table 1.8: Impact of Safety Premium on Net Debt Issuance

Dependent variable:

Net Debt Issuance(% of Lag Assets) at t+ 1

(1) (2) (3) (4) (5) (6) (7) (8)

Cross-basis 0.100∗∗∗ 0.090∗∗ 0.079∗∗ 0.077∗∗

(0.032) (0.035) (0.034) (0.034)

USTreas SP × HighCBB 0.120∗∗ 0.115∗∗

(0.051) (0.050)

USTreas SP × A and Above 0.134∗∗ 0.155∗∗∗

(0.055) (0.055)

HighCBB 0.041 −0.021(0.102) (0.100)

A and Above 0.383∗∗∗ 0.156(0.134) (0.133)

CDS-spread −0.106∗∗∗ −0.095∗∗∗ −0.043∗ −0.039 −0.112∗∗∗ −0.042 −0.084∗∗∗ −0.011(0.025) (0.029) (0.026) (0.026) (0.024) (0.026) (0.021) (0.022)

log(Total Assets) −0.905∗∗∗ −0.936∗∗∗ −0.829∗∗∗ −0.815∗∗∗ −0.892∗∗∗ −0.806∗∗∗ −0.887∗∗∗ −0.800∗∗∗

(0.154) (0.148) (0.138) (0.138) (0.151) (0.136) (0.151) (0.144)

CHE (% assets) −0.024∗∗∗ −0.025∗∗∗ −0.027∗∗∗ −0.027∗∗∗ −0.025∗∗∗ −0.027∗∗∗ −0.025∗∗∗ −0.027∗∗∗

(0.008) (0.008) (0.008) (0.008) (0.008) (0.008) (0.007) (0.008)

Tobin’s Q 0.005 0.006 0.028∗∗∗ 0.027∗∗∗ 0.006 0.027∗∗∗ 0.006 0.026∗∗∗

(0.005) (0.005) (0.007) (0.007) (0.005) (0.007) (0.005) (0.007)

Leverage Ratio (%) −0.059∗∗∗ −0.058∗∗∗ −0.058∗∗∗ −0.056∗∗∗

(0.008) (0.008) (0.008) (0.007)

ROA (%) 0.020 0.021 0.020(0.014) (0.014) (0.014)

Firm FE Yes Yes Yes Yes Yes Yes Yes YesTime FE Yes Yes Yes Yes Yes Yes Yes YesRating FE No Yes Yes Yes No Yes No No

Observations 19,227 19,227 19,227 19,227 19,227 19,227 19,227 19,227R2 0.065 0.066 0.077 0.077 0.065 0.077 0.066 0.077Adjusted R2 0.031 0.031 0.043 0.043 0.031 0.043 0.031 0.043

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table presents regression results of net debt issuance on different measures of the safety premium in corporatedebt, as described in (R1) and (R2). The y-variable is net debt issuance normalized by lag total assets in quarter t+ 1.The x-variables of interest are the cross-basis, defined as the CDS-bond basis minus the basis index; HighCBB, anindicator variable equal to 1 if the firm belongs to the highest quintile of cross-basis and 0 otherwise; “A and Above",an indicator variable equal to 1 if the firm is rated A- and above and 0 otherwise; and the US Treasury safety premium(UST SP). All independent variables are measured in quarter t. Standard errors are reported in parenthesis and areclustered by time. Data is quarterly from 2003Q1 to 2019Q3. See A.1.1 for the details on the construction of variablesand controls.

58

Table 1.9: Impact of Safety Premium on Firms’ Decisions

Dependent variable:

Net PayoutCapital

InvestmentR & D SG & A

Intangible

InvestmentAcquisitions

Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8)

Cross-basis × DebtIssuance 0.044∗∗∗ 0.018 −0.002 0.003 0.001 −0.080 0.015 −0.052(0.016) (0.016) (0.005) (0.004) (0.006) (0.056) (0.021) (0.051)

Cross-basis 0.041∗∗ 0.035∗∗ 0.002 0.005∗ 0.007∗ 0.035 0.008 −0.016(0.018) (0.015) (0.003) (0.002) (0.004) (0.023) (0.008) (0.022)

DebtIssuance 0.301∗∗∗ 0.193∗∗∗ 0.003 0.011∗∗ 0.014∗ 0.885∗∗∗ 0.114∗∗∗ 0.891∗∗∗

(0.031) (0.013) (0.007) (0.005) (0.008) (0.081) (0.024) (0.089)

CDS-spread −0.085∗∗∗ −0.085∗∗∗ −0.005∗∗ −0.002 −0.007∗∗ −0.033∗∗ −0.010∗ −0.007(0.019) (0.018) (0.002) (0.002) (0.003) (0.014) (0.005) (0.018)

log(Total Assets) −0.229∗∗∗ −0.124∗∗∗ −0.109∗∗∗ −0.341∗∗∗ −0.450∗∗∗ −0.274∗∗∗ −0.024 −0.651∗∗∗

(0.041) (0.023) (0.010) (0.008) (0.012) (0.077) (0.033) (0.095)

CHE (% assets) 0.024∗∗∗ 0.001 −0.0003 −0.004∗∗∗ −0.005∗∗∗ 0.083∗∗∗ −0.004 −0.180∗∗∗

(0.003) (0.001) (0.001) (0.0005) (0.001) (0.009) (0.003) (0.009)

Tobin’s Q 0.017∗∗∗ 0.008∗∗∗ 0.001 0.002∗∗∗ 0.002∗∗ 0.002 −0.002 0.009(0.004) (0.002) (0.001) (0.001) (0.001) (0.004) (0.002) (0.007)

Firms FE Yes Yes Yes Yes Yes Yes Yes YesTime FE Yes Yes Yes Yes Yes Yes Yes Yes

Observations 19,227 19,227 19,227 19,227 19,227 19,227 19,227 19,227R2 0.397 0.680 0.765 0.939 0.901 0.116 0.058 0.137Adjusted R2 0.375 0.668 0.757 0.937 0.897 0.083 0.023 0.105

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table presents regression results of firms’ decisions on the safety premium in corporate debt, measured by thecross-basis, as described in equation (R3). The y-variables are (1) net payout, defined as dividends plus net equityrepurchase, (2) capital investment, defined as CAPEX plus net PPE bought, (3) R&D expenses, (4) 30% of SG&Aexpenses, (5) intangible investment, defined as R&D plus 30% of SG&A expenses, (6) acquisitions, (7) financialinvestments, and (8) change in cash from the cash flow statement. All independent variables are measured in quartert+ 1 and normalized by lag total assets. The x-variables of interest are the cross-basis, defined as the CDS-bond basisminus the basis index; and DebtIssuance, an indicator variable equal to 1 if net debt issuance is strictly positive and0 otherwise. All independent variables are measured in quarter t. Standard errors are reported in parenthesis and areclustered by time. Data is quarterly from 2003Q1 to 2019Q3. See A.1.1 for the details on the construction of variablesand controls.

59

Table 1.10: Heterogeneous Impact of Cross-Basis on Firms’ Decisions

Panel A: Investment Grade

Dependent variable:

Net Debt

IssuanceNet Payout

Capital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8) (9)

Cross-basis × IG 0.145∗∗ 0.114∗ −0.104∗∗∗ 0.011 0.025∗∗ 0.036∗∗ 0.224∗ 0.021 −0.104

(0.057) (0.058) (0.030) (0.009) (0.012) (0.015) (0.115) (0.047) (0.152)

Cross-basis 0.048 0.046∗ 0.104∗∗∗ −0.008∗ −0.001 −0.009 −0.021 0.007 −0.067

(0.036) (0.023) (0.025) (0.004) (0.004) (0.006) (0.045) (0.022) (0.052)

IG 0.511∗∗∗ 0.483∗∗∗ 0.198∗∗ 0.021 0.051∗∗ 0.072∗∗ 0.396 0.072 0.293

(0.102) (0.123) (0.075) (0.025) (0.024) (0.036) (0.281) (0.081) (0.235)

Controls Yes Yes Yes Yes Yes Yes Yes Yes Yes

Firms FE Yes Yes Yes Yes Yes Yes Yes Yes Yes

Time FE Yes Yes Yes Yes Yes Yes Yes Yes Yes

Observations 19,227 5,905 5,905 5,905 5,905 5,905 5,905 5,905 5,905

R2 0.066 0.450 0.765 0.772 0.947 0.907 0.244 0.192 0.233

Adjusted R2 0.032 0.384 0.737 0.744 0.940 0.896 0.153 0.095 0.141

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel B: Cash Rich

Dependent variable:

Net Debt

IssuanceNet Payout

Capital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8) (9)

Cross-basis × CashRich −0.021 0.020 −0.073∗∗∗ 0.001 −0.007 −0.006 0.065 −0.088 −0.137

(0.077) (0.066) (0.026) (0.010) (0.012) (0.016) (0.194) (0.056) (0.156)

Cross-basis 0.104∗∗∗ 0.090∗∗∗ 0.095∗∗∗ −0.005 0.009∗∗ 0.004 0.041 0.031 −0.062

(0.038) (0.023) (0.021) (0.005) (0.004) (0.007) (0.059) (0.021) (0.053)

CashRich −0.296∗∗ 0.095 −0.071 0.005 −0.040∗∗ −0.035 −0.003 −0.123 −0.291

(0.140) (0.113) (0.048) (0.034) (0.018) (0.035) (0.315) (0.111) (0.299)




Observations 19,227 5,905 5,905 5,905 5,905 5,905 5,905 5,905 5,905

R2 0.066 0.448 0.764 0.772 0.947 0.907 0.243 0.192 0.233

Adjusted R2 0.031 0.381 0.736 0.744 0.940 0.896 0.152 0.095 0.141

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

60

Panel C: High Profitability

Dependent variable:

Net Debt

IssuanceNet Payout

Capital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8) (9)

Cross-basis × Profitable 0.087 −0.024 −0.149∗∗ 0.009 0.030∗ 0.039∗ −0.032 0.098 0.135

(0.094) (0.073) (0.067) (0.020) (0.016) (0.022) (0.150) (0.081) (0.160)

Cross-basis 0.087∗∗ 0.083∗∗∗ 0.085∗∗∗ −0.007∗ 0.003 −0.004 0.052 0.007 −0.106∗∗

(0.033) (0.020) (0.020) (0.004) (0.004) (0.006) (0.049) (0.024) (0.048)

Profitable 0.282∗∗∗ 0.465∗∗∗ 0.285∗∗∗ 0.054∗∗∗ 0.099∗∗∗ 0.153∗∗∗ 0.156 0.011 0.341∗∗

(0.086) (0.070) (0.062) (0.017) (0.013) (0.019) (0.162) (0.052) (0.144)




Observations 19,227 5,905 5,905 5,905 5,905 5,905 5,905 5,905 5,905

R2 0.066 0.453 0.767 0.772 0.948 0.908 0.243 0.192 0.233

Adjusted R2 0.031 0.387 0.739 0.745 0.941 0.897 0.152 0.095 0.141

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel D: High Payout

Dependent variable:

Net Debt

IssuanceNet Payout

Capital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8) (9)

Cross-basis × HighPayout 0.112 −0.070 −0.014 0.016 0.031∗∗ 0.048∗∗∗ 0.255∗ −0.029 −0.009

(0.087) (0.066) (0.034) (0.017) (0.014) (0.018) (0.145) (0.056) (0.204)

Cross-basis 0.087∗∗ 0.085∗∗∗ 0.082∗∗∗ −0.006 0.004 −0.002 0.037 0.019 −0.082

(0.034) (0.019) (0.021) (0.004) (0.004) (0.006) (0.049) (0.025) (0.053)

HighPayout 0.220∗∗∗ 0.720∗∗∗ 0.021 −0.006 0.028∗∗∗ 0.022 −0.420∗∗∗ −0.051 −0.200

(0.074) (0.062) (0.027) (0.014) (0.010) (0.016) (0.117) (0.047) (0.131)




Observations 19,227 5,905 5,905 5,905 5,905 5,905 5,905 5,905 5,905

R2 0.066 0.463 0.764 0.772 0.947 0.907 0.245 0.192 0.233

Adjusted R2 0.031 0.399 0.736 0.744 0.940 0.896 0.154 0.095 0.141

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

61

Panel E: Large

Dependent variable:

Net Debt

IssuanceNet Payout

Capital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8) (9)

Cross-basis × Large 0.165 −0.073 −0.184∗∗ 0.008 −0.002 0.006 0.431∗ 0.007 −0.054

(0.115) (0.100) (0.074) (0.025) (0.014) (0.030) (0.242) (0.085) (0.194)

Cross-basis 0.091∗∗∗ 0.099∗∗∗ 0.095∗∗∗ −0.005 0.008∗ 0.003 0.022 0.014 −0.083

(0.033) (0.023) (0.020) (0.004) (0.004) (0.006) (0.049) (0.022) (0.053)

Large −0.118 −0.136 −0.138∗ −0.011 −0.014 −0.025 0.128 0.037 −0.213

(0.106) (0.100) (0.074) (0.020) (0.018) (0.028) (0.203) (0.087) (0.271)




Observations 19,227 5,905 5,905 5,905 5,905 5,905 5,905 5,905 5,905

R2 0.065 0.448 0.765 0.772 0.947 0.907 0.244 0.192 0.233

Adjusted R2 0.031 0.382 0.737 0.744 0.940 0.896 0.153 0.094 0.140

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table presents regression results of firms’ heterogeneous response to the safety premium, measured by the cross-basis, as described in equation (R4). The y-variables are (1) net debt issuance, (2) net payout, defined as dividendsplus net equity repurchase, (3) capital investment, defined as CAPEX plus net PPE bought, (4) R&D expenses, (5)30% of SG&A expenses, (6) intangible investment, defined as R&D plus 30% of SG&A expenses, (7) acquisitions,(8) financial investments, and (9) change in cash from the cash flow statement. All independent variables are measuredin time t+ 1 and normalized by lag total assets. The x-variable of interest is the cross-basis, defined as the CDS-bondbasis minus the basis index. UNC is an indicator variable equal to one if the firm belongs the 20% highest quantileof each one of the firm characteristics: CHE, profits, payout (all three as share of the total assets), and size. IG is adummy variable equal to 1 if the firm is rated BBB- or above, 0 otherwise. Standard errors are reported in parenthesisand are clustered by time. Data is quarterly from 2003Q1 to 2019Q3. See A.1.1 for the details on the construction ofvariables and controls.

62

Chapter 2

The Cross-Section of Risk and Return 1

2.1 Introduction

A common practice in the academic finance literature has been to create characteristic portfo-

lios (CPs) by sorting on characteristics positively associated with expected returns. The resulting

set of zero-investment CPs, which go long a portfolio of high characteristic firms and short a port-

folio of low characteristic firms, then serve as a model for returns in that asset space. Fama and

French (1993, 2015) are prominent examples of this approach, but there are numerous others, de-

veloped both to explain the equity market anomalies, and also the cross-section of returns in other

asset classes.2

Consistent with this, Fama and French (2015, FF) argue that a standard dividend-discount

model implies that a combination of firm characteristics based on valuation, profitability and in-

vestment should forecast firms’ average returns. Based on this they develop a five factor model—

consisting of the MktRF, SMB, HML, RMW, and CMA characteristic portfolios—and argue that

this model does well in explaining the cross-section of average excess returns for a variety of test

1This chapter is based on Daniel et al. (2020). We thank Stefano Giglio, Ravi Jagannathan, Sonia Jimenez-Garcès,Ralph Koijen, Lars Lochstoer, Maurizio Luisi, Suresh Sundaresan, Paul Tetlock, Brian Weller, Michael Wolf, DachengXiu, Leifu Zhang, as well as the participants of seminars at Amsterdam, BI Oslo, CEMFI, Cincinnati, Columbia,Kellogg/Northwestern, Michigan, TU München, Münster, UCLA, Washington University, Zürich, AQR, Barclays,Bloomberg, and the participants of conferences at the AFA, AFFI, EFA, EEA, Fordham, HKUST, Imperial College,“New Methods for the Cross Section of Returns” in Chicago, and Villanova for helpful comments and suggestions,and Dan Mechanic for his support with the computing cluster.

2Examples are: UMD (Carhart, 1997); LIQ (Pastor and Stambaugh, 2003); BAB (Frazzini and Pedersen, 2014);QMJ (Asness et al., 2019); PMU (Novy-Marx, 2013); ISU (Daniel and Titman, 2006) and RX and HML-FX (Lustiget al., 2011).

63

portfolios, based on a set of time-series regressions like:

rp,t = αp + bp,mrMktRF,t + bp,HMLrHML,t + bp,SMBrSMB,t

+bp,CMArCMA,t + bp,RMW rRMW,t + εp,t (2.1)

SMB, HML, RMW, and CMA are characteristic portfolios, formed by sorting on various com-

binations of firm size, valuation ratios, profitability and investment respectively. Fama and French

(1993, 2015) refer to these characteristic portfolios as “mimicking portfolios” or “factors”.3

Standard projection theory shows that the intercepts (αs) from these regressions will be zero

for all test assets if and only if the mean variance efficient (MVE) portfolio is in the span of the

CPs, or equivalently, if the maximum Sharpe ratio in the economy is the maximum Sharpe ratio

achievable with the CPs alone. Despite several critiques of this methodology, it remains popular in

the finance literature.

This paper is concerned with the standard procedure employed when constructing these CPs.

We show that, if characteristics are a good proxy for expected returns, then forming CPs by sorting

on characteristics alone will generally not explain the cross-section of returns in the way proposed

in the literature. The argument is based on the early insights of Markowitz (1952) and Roll (1977).

Suppose a set of characteristics are positively associated with expected returns, and a correspond-

ing set of long-short CPs are constructed by buying high characteristic stocks and shorting low

characteristic stocks. This set of portfolios will explain the returns of portfolios sorted on the same

characteristics, but are unlikely to span the mean variance efficient frontier of all assets, because

they do not take into account the asset covariance structure.

A simple example with a single characteristic and a single priced factor helps to illustrate this

intuition (we describe this example in detail in Section 2.2.1). Consider an economy with N assets

3As emphasized by Cochrane (2005, p.174), the word “factor” is used with different meanings in the asset pricingliterature. In this paper we use “characteristic portfolio” to refer to a zero-investment portfolio formed on the basis ofone or more firm characteristics. We use the term “factor” to refer to a latent economic force (see, e.g., equation (2.2))but not the return on an investment portfolio. Fama and French (1996, p.57) refer to these latent economic forces asrisk factors or “state variables of special hedging concern to investors.”

64

and let µµµ be the N × 1 vector of expected excess returns on these assets. Premia are driven by

exposure to a single risk factor, which is unobserved. Assume finally that there is a linear relation

between expected excess returns and a single characteristic, that is, µµµ = xxxλc, where xxx is the

corresponding N × 1 vector of stock characteristics and λc is some constant. Researchers are

interested in identifying the underlying risk factor. To do so they construct a CP, a zero investment

portfolio that goes long high characteristic stocks and short low characteristic stocks. Since the

CP earns a large excess return, it must also have a large exposure to the risk factor. Further, the

literature effectively argues that exposure to the CP should price the cross-section, in the spirit of

regressions like the one in equation (2.1). Of course, this will only be the case if the resulting CP

is mean variance efficient.

We show that, in general, this standard procedure will not produce a mean variance efficient

portfolio. The reason is that, while variation in the characteristic does indeed pick up variation

in the loading with respect to the priced risk factor, it also captures variation in the loadings with

respect to unpriced sources of common variation in returns. As a result, exposure to the CP com-

mands premium but the volatility of returns is too high because the portfolio also loads on these

unpriced sources of common variation. It follows that the Sharpe ratio of the CP is lower than the

Sharpe ratio of the projection of the risk factor on the space of returns: the CP is not mean-variance

efficient. We show how to improve on the CP by removing from it unpriced sources of common

variation in returns.

We extend these ideas to the empirically relevant case where average excess returns are ex-

plained by many characteristics, such as size, book-to-market, profitability, and investment. We

introduce the concept of a characteristic efficient portfolio (CEP), which has the smallest return

variance amongst the portfolios that has characteristic equal to one for one characteristic and zero

to all other characteristics. We further show that the complete set of CEPs spans the mean variance

frontier.

The CEPs are the solution to an optimization problem that takes into account the covariance

matrix of returns. Were the covariance matrix known, the calculation of the CEP weights would

65

be straightforward. However, there are numerous problems associated with using an estimated

covariance matrix to construct portfolios and no accepted way of correcting a sample covariance

matrix to fully resolve the problems associated with optimal portfolio construction.4 We instead

develop an empirically feasible strategy to get close to the CEPs. Our starting point is a set of

CPs—in our empirical implementation we start with the five FF CPs.5 We then introduce the

concept of a hedge portfolio: a characteristic balanced portfolio designed to pick up variation

in the loadings with respect to unpriced sources of common variation in returns, and with zero

loading on the priced factors. Characteristic balanced here means that the long- and the short-side

of the hedge portfolio have identical characteristics, and therefore according to our characteristic

model, also have zero expected return. In our theory section, we show how to select the optimal

hedge portfolio: It is the characteristic balanced portfolio that maximizes the loading with respect

to the CP. Finally we show how an optimal combination of the original CPs and the optimal hedge

portfolios delivers the CEPs.

Based on this theoretical development, we construct hedge portfolios for each of the five FF

CPs. There are two key empirical challenges: the construction of the optimal hedge portfolios, and

the calculation of the optimal hedge ratios. For the construction of the optimal hedge portfolios

we build on Daniel and Titman (1997), but improve on their procedure on multiple dimensions.

Through the use of higher frequency data and differential windows for calculating volatilities and

correlations, we are able to construct hedge portfolios that are highly correlated with the FF CPs,

but which have approximately zero expected returns. Importantly, like the FF CPs themselves, our

hedge portfolios are highly tradable: we form value-weighted portfolios once per year, at the end

of June, and hold their composition fixed for 12 months. Based on robustness considerations, we

also calculate the hedge ratios only once per year, also at the end of June, and based only on data

that are in the investor’s information set at that time. Thus our procedure is out-of-sample in the

4Black and Litterman (1991) describe this problem and a suggest shrinking the expected return estimates towardan equilibrium-based prior as a partial solution. Ledoit and Wolf (2003),Ledoit and Wolf (2004),Ledoit and Wolf(2012),Ledoit and Wolf (2017) each propose alternative covariance matrix estimators.

5We concentrate on the factors of Fama and French (2015). However, the critique we develop in Section 2.2applies to any factors constructed using this method.

66

sense that, given knowledge of the five FF characteristics, an investor could have hedged the FF

portfolios in exactly the way we do here.

Empirically, our hedge portfolios behave in a way that is consistent with our theory: except for

the size (SMB) hedge portfolio, they all earn economically and statistically significant five-factor

alphas.6 We combine each of the original five FF CPs with our hedge portfolios in an ex-ante

optimal way, i.e., we forecast the optimal hedge ratio, and generate the five FF CEPs. The optimal

combination of the five CEPs yields a squared Sharpe ratio of 2.13 versus 1.17 for the optimal

combination of the five FF CPs.

Our procedure has the important advantage that it does not require us to identify the sources of

unpriced risk. In fact, we are completely agnostic as to what these unpriced sources of common

variation in returns represent. We do argue, though, that one source of unpriced common variation

may be industry exposure, and present evidence that the standard FF CPs load on industry returns.

We compare the performance of the characteristic efficient FF portfolios with the performance of

a strategy in which we only hedge out the industry component of the original five FF CPs. The

squared Sharpe ratio of the ex-post optimal combination of the industry-neutral five FF CPs is

1.37, which is lower than what we obtain with our methodology, 2.13: There are unpriced sources

of common variation beyond industry and thus the industry-neutral CPs do not span the MVE

portfolio.

Our results are important for several reasons. First, they increase the hurdle for standard asset

pricing models. Following the logic of Hansen and Jagannathan (1991), the pricing kernel variance

that is required to explain the returns of our CEPs is 82% higher than what is required to explain

the returns of the FF CPs.

Second, in order to find economic explanations for the premia associated with characteristics

such as size and value, it is important to start out with portfolios that capture the factor premia with

the minimum possible return variance. In the context of rational models, recall that the ultimate

purpose behind this literature is to find the underlying risk factors that are the source of premia.

6Note that over this sample period, the SMB characteristic portfolio has the lowest Sharpe ratio of the five FF CPs.

67

As FF (page 3) suggest, building on the ICAPM logic of Merton (1973), “. . . the factors are just

diversified portfolios that provide different combinations of exposures to the unknown state vari-

ables” driving the marginal rate of substitution of the marginal investor. CPs then should correlate

with proxies for the marginal utility of the representative investor. But if, by construction, as we

argue, these CPs load on unpriced sources of common variation this correlation with the marginal

rate of substitution is bound to be biased towards zero and thus may lead to the wrong inferences

regarding the suitability of the proposed asset pricing model.7

Third, our CEPs are better benchmarks for the performance evaluation of managed portfolios.

While the characteristics approach to measure managed portfolio performance (see, e.g., Daniel

et al., 1997, DGTW) has gained popularity, the regression based approach initially employed by

Jensen (1968) (and later by Carhart (1997), Fama and French (2010) and numerous others) remains

the more popular. A good reason for this is that the characteristics approach can only be used

to estimate the alpha of a portfolio when the holdings of the managed portfolio are known, and

frequently sampled. In contrast, the Jensen-style regression approach can be used in the absence

of holdings data, as long as time series of portfolio returns are available.

However, as pointed out originally by Roll (1977), the regression approach requires that the

benchmark used in the regression test be efficient; otherwise the conclusions of the regression test

will be invalid. What we show in this paper is that, with the historical return data, efficiency of

the proposed CPs can be rejected. However, our CEPs incorporate the information both from the

characteristics and from the historical covariance structure and thus improve on their FF counter-

parts. Thus, if the CEPs are used as benchmarks and loadings can be estimated accurately, alphas

equivalent to what would be obtained with the DGTW characteristics-approach can be generated

with the regression approach without the need for portfolio holdings data.

Our paper connects to the recent vintage of papers that revisits the question of how to combine

characteristics into tradable portfolios (see Gu, Kelly, and Xiu, 2018; Huang, Li, and Zhou, 2018;

7We thank our discussant, Ralph Koijen, for emphasizing this point to us. Indeed, a recent literature studies theconnection between characteristic premia and risk. See Lewellen, Nagel, and Shanken (2010), and Daniel and Titman(2012) for summaries of the literature. One example is the study by Golubov and Konstantinidi (2019), which focuseson the value premium.

68

Freyberger, Neuhierl, and Weber, 2019; Kozak, Nagel, and Santosh, 2019; Liu, Tsyvinski, and

Wu, 2019). These papers all take as their starting point a set of characteristics that explain average

excess returns. Our focus instead is on improving the efficiency of the characteristic portfolios by

using individual asset loadings on the CPs. Another paper related to ours is Kelly, Pruitt, and Su

(2018), who develop a method they label Instrumented Principal Components Analysis (IPCA),

which they argue allows them to determine the set of priced factors that describe the returns of a

set of 36 characteristic-sorted portfolios with a small set of factors. Our empirical findings strongly

suggest that the characteristic-sorted portfolios employed by Kelly, Pruitt, and Su (2018), which

they refer to as latent factors, are inefficient as a result of ignoring information about the (future)

covariance structure that can be derived from historical covariances.

Finally, our work also connects to another set of papers which identify the priced components

of book-to-market CPs (see Gerakos and Linnainmaa, 2018; Golubov and Konstantinidi, 2019),

but our point is much broader and refers to the general procedure used to construct characteristic

portfolios.

2.2 Theory and examples

Fama and French (1993) and numerous subsequent studies construct characteristic portfolios

as a proxy for the priced risk associated with characteristic premia.8 These papers construct a

zero-investment CP by going long a unit-investment portfolio of high characteristic assets, and

short a unit-investment portfolio of low characteristic assets (where average returns are positively

correlated with the characteristic). Then, each paper proceeds to examine whether the returns of a

set of test assets are explained by a combination of well-known benchmark portfolios and the new

CP, often using regressions like the one in equation (2.1). The implicit argument here is that the

such a characteristic portfolio, in combination with the other benchmark portfolios, will span the

mean variance efficient portfolio.

8See footnote 2 for examples.

69

This paper makes two contributions. We first show that the standard characteristic portfolio

construction procedure is unlikely to yield the mean variance efficient (MVE) portfolio, because

the so-constructed CP will load on unpriced risk. Second, we show how to improve on this standard

procedure by constructing a hedge portfolio which captures the unpriced risk in the CP.

To illustrate, we start with a simple example. The example makes two key assumptions: ex-

cess returns are described by a two-factor structure and expected excess returns are linear in a

single characteristic. Section 2.2.1 further illustrates our results in the context of the popular HML

portfolio. Section 2.2.2 generalizes the simple example to the empirically relevant case in which

multiple characteristics are needed to fully describe the cross-section of average excess returns.

2.2.1 A simple example

Characteristic portfolios and mean variance efficiency

We consider a single period economy with N assets. Realized excess returns are determined

by a two-factor structure, so for asset i:

ri = βi (f + λ) + γig + εi, (2.2)

where E[f ] = E[g] = E[εi] = 0 for all i = 1, 2, · · · , N . Further, suppose that var (f) = σ2f ,

var (g) = σ2g , var (εi) = σ2

ε for all i = 1, 2, · · · , N , and that f , g, and εi are mutually orthogonal

for all i 6= j.

Let rrr denote the (N × 1) column vector of individual excess returns,

rrr> ≡ [r1 r2 · · · rN ] ,

where > denotes transpose. Taking expectations of equation (2.2) gives:

µµµ ≡ E [rrr] = βββλ, (2.3)

70

where βββ is the (N × 1) column vector of individual assets’ exposures to f .

The standard interpretation of f is that it is a proxy for shocks to the marginal rate of substi-

tution; the two canonical examples are that f is (some function of) consumption growth or that

(f + λ) is the return on the market portfolio (Cochrane, 2005, page 78). g is the unpriced source

of common variation. That there is only one factor that is the source of premia is without loss of

generality: For any factor structure there is always a rotation of the factor space in which there is

only one priced factor. Accurately determining f is important in assessing macroeconomic theo-

ries. Its projection onto the space of returns has the maximal Sharpe ratio, so financial economists

attempt to identify f by constructing portfolios with the highest possible Sharpe ratios.

Studies in this literature begin with the observation that expected excess returns in the cross-

section are a function of a set of characteristics. For instance, Fama and French (1993, page 4),

state that “two empirically determined variables, size and book-to-market equity, do a good job

explaining the cross-section of average returns on NYSE, Amex, and NASDAQ stocks for the

1963-1990 period,” and then build the characteristic portfolios SMB and HML based on the these

characteristics.

We follow the literature but go a step further. We assume that expected excess returns are

perfectly described by a linear function of characteristics. To make this example as simple as

possible we assume that expected excess returns are described by a single characteristic, xxx, an

N × 1 column vector and that this characteristic lines up perfectly with expected excess returns,

µµµ = xxxλc, (2.4)

where λc is the characteristic premium.

In order for equations (2.3) and (2.4) to hold simultaneously, it must be the case that:

βββ =

(λcλ

)xxx (2.5)

In this simple setting the characteristic is a perfect proxy for the exposure to the priced factor.

71

6

- γi

βi(=(λcλ

)xi)

A1, A2

��

A3

@@R

A4@

@I

A5, A6��

1−1

1

−1

Figure 2.1: Six assets in the space of loadings on priced and unpriced factors.

Thus, sorting on the characteristic will pick up variation in β. This is the motivation for the

standard procedure in the literature, first developed in Fama and French (1993), of constructing a

zero investment portfolio that goes long stocks with a high value of the characteristic x and short

stocks with low value of the characteristic.

A portfolio is defined by an N -dimensional column vector of portfolio weights,

www> ≡ [w1 w2 · · · wN ] , (2.6)

where wi is asset i’s weight in the portfolio for i = 1, 2, · · · , N .

To continue, suppose that there are only six stocks (N = 6) with equal market capitalizations.

The six stocks have characteristics and loadings on the priced and unpriced factors as illustrated in

Figure 2.1. Notice that assets 1 and 2 have identical loadings and characteristics. The same holds

for assets 5 and 6.

We now construct a specific characteristic portfolio, or CP, which we label c, on the basis of

72

characteristic x, by going long a value-weighted portfolio of the high characteristic stocks A1, A2,

and A3, and short a value-weighted portfolio of the low characteristic stocks A4, A5, and A6.9

Specifically, the weights on individual stocks in the characteristic portfolio c are given by:

www>c =

[1

3

1

3

1

3− 1

3− 1

3− 1

3

], (2.7)

and the return of portfolio c is:

rc = www>c rrr =1

3×

[3∑j=1

rj −6∑j=4

rj

]= 2(f + λ) +

2

3g +

1

3

[3∑j=1

εj −6∑j=4

εj

]. (2.8)

The CP’s return rc does indeed capture the common source of variation in expected excess

returns, since it loads on f . However, our point here is that CPs so constructed are likely to also

load on unpriced factors, and will therefore not be mean-variance efficient. This is the case in this

example: because of the cross-sectional correlation between the characteristic and the loading on

the unpriced factor—i.e., the fact that most high characteristic firms also have a high loading on

the unpriced factor—the constructed CP also loads on the unpriced source of common variation

g.10 Specifically, the CP loads on the factor f with βc = 2 and on g with γc = 23. The value of

the characteristic for this portfolio is xc = 2 λλc

and thus expected excess return is E [rc] = 2λ. The

variance of the returns is given by

σ2c = 4σ2

f +4

9σ2g +

2

3σ2ε , (2.9)

giving the CP a Sharpe ratio of

SRc =2λ√

4σ2f + 4

9σ2g + 2

3σ2ε

. (2.10)

9Because in this simple example all stocks have equal weight there is no difference between equal and valueweighting. The usual Fama-French construction uses value-weighted portfolios.

10Note that a cross-sectional correlation between the characteristic and the loading on the unpriced factor of exactlyzero constitutes the knife-edge case, i.e., it is extremely unlikely.

73

If σ2ε is small relative to the variance of the systematic factors f and g, then it is clear that the

CP is not MVE because it is exposed to both priced and unpriced risk. In this example then a single

characteristic lines up perfectly with expected excess returns and still the CP fails to deliver the

mean variance efficient portfolio. Can we improve on the CP?

Consider the following portfolio h with weights:

www>h =

[1

4

1

4− 1

2

1

2− 1

4− 1

4

]. (2.11)

This portfolio goes long stocks with high loadings on g and short stocks with low loadings on g.

The return of portfolio h is given by

rh =1

2

[r3 +

r5

2+r6

2

]− 1

2

[r1

2+r2

2+ r4

]= 2g +

1

2

[ε3 +

ε5

2+ε6

2

]− 1

2

[ε1

2+ε2

2+ ε4

].

(2.12)

The loading of the h portfolio on g is γh = 2. Notice that this portfolio is characteristic balanced

in that xh = www>hxxx = 0, so βh = 0 and E [rh] = 0 (see (2.4)).

We can use the portfolio h to improve on the CP by reducing its variance without changing its

expected excess returns. For this reason we refer to h as a hedge portfolio. Indeed, given that the

characteristic and hedge portfolios have loadings on g of γc = 23

and γh = 2, respectively, we can

form a portfolio for which for every dollar invested long in portfolio c we also invest $13

(short) in

the hedge portfolio h. This combined portfolio has the same expected return as the CP (which is

2λ) but its exposure to g is eliminated. The variance of this portfolio is

4σ2f +

3

4σ2ε (2.13)

11For a well-diversified portfolio for which residual variance is zero, this problem is the same as setting the loadingon the unpriced factor to zero; with residual risk it is not. In this example the Sharpe ratio is higher as long asσ2g >

(316

)σ2ε .

74

and thus it has a Sharpe ratio of2λ√

4σ2f + 3

4σ2ε

, (2.14)

which is higher than the Sharpe ratio of the CP, SRc (see equation (2.10)), whenever diversification

is large enough so that idiosyncratic risk vanishes.11

We can do better by combining the hedge portfolio with the characteristic portfolio in order

to maximize the Sharpe ratio. Given that the hedge portfolio has zero expected excess return this

is equivalent to finding the combination of the characteristic portfolio and the hedge portfolio that

minimizes the variance of the resulting portfolio, that is,

minδ

var (rc − δrh) ⇒ δ∗ = ρc,hσcσh, (2.15)

where σh is the standard deviation of the returns of the hedge portfolio, h, and ρc,h is the correlation

coefficient between the returns of the characteristic portfolio, c, and the hedge portfolio, h. We refer

to δ∗ as the optimal hedge ratio. It can be shown that this procedure improves the Sharpe ratio of

the characteristic portfolio bySR’SRc

=1√

1− ρ2c,h

, (2.16)

where SR’ is the Sharpe ratio of the improved characteristic portfolio.

Notice that there are several ways of constructing the hedge portfolio h so as to remove expo-

sure to g from the characteristic portfolio c. Equation (2.16) shows though that the optimal hedge

portfolio is the one that is maximally correlated with the CP. In Section 2.2.2 we extend these in-

sights to the empirically relevant case in which a full description of the cross-section of expected

excess returns requires multiple characteristics. Before showing these result formally, we illustrate

the ideas of the example in the context of a popular characteristic portfolio, the HML portfolio of

Fama and French (1993).

75

The simple example in practice: Industry portfolios as g

Book-to-market is one characteristic that has been shown to align with average returns in the

cross-section and HML is a popular characteristic portfolio.12 Asness et al. (2000), Cohen and

Polk (1995) and others13 have shown that if book-to-market ratios are decomposed into an across-

industry component and a within-industry component, then only the within-industry component—

that is, the difference between a firm’s book-to-market ratio and the book-to-market ratio of its

corresponding industry portfolio—forecasts future returns. This literature then suggests that the

exposure of HML to industry returns is unpriced, that is, that industry is one unpriced source of

common variation, g. Therefore, if the industry exposure of HML was hedged out, it would result

in a characteristic portfolio with lower risk, but the same expected return, i.e., with a higher Sharpe

ratio. But, does HML really load on industries?

Figure 2.2 plots the R2 from 126-day rolling regressions of daily HML returns on the twelve

daily Fama and French (1997) value-weighted industry excess returns. The time period is 1963/07

- 2019/12.14 The plot shows that, while there are short periods where the realized R2 dips below

50%, there are also several periods where it exceeds 90%. The R2 fluctuates considerably but the

average is well above 70%. The upper Panel of Figure 2.3 plots, for the same set of daily, 126-day

rolling regressions, the regression coefficients for each of the 12 industries. As it is apparent, these

coefficients display considerable variation: sometimes the HML portfolio loads more heavily on

some industries than on others.

The behavior of the ‘Money’ industry during and after the Great Recession of 2008 is a striking

example of the large industry effect on HML. The lower Panel of Figure 2.3 shows that the regres-

sion coefficient associated with ‘Money’ increased dramatically between 2007 and early 2009,

as stock prices for firms in this segment collapsed and those firms quickly became classified as

12Fama and French (1993, 2015) refer to HML as well as to the other portfolios, as factors or factor portfolios. Weinstead use the expression characteristic portfolios throughout, in order to further distinguish between the underlyingfactor, f , and the portfolio formed on characteristic sorts.

13See also Lewellen (1999), Cohen et al. (2003), and Golubov and Konstantinidi (2019).14The industry classification follows Ken French’s data library at http://mba.tuck.dartmouth.edu/

pages/faculty/ken.french/Data_Library.

76

http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/Data_Library.

http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/Data_Library.

value.15 As shown in Figure 2.4, the volatility of returns also increased dramatically. As a re-

sult of these two effects, ‘Money’ explained a substantial amount of the variation of HML returns

during those years. Indeed Figure 2.5 plots the R2 of a regression of the return on HML on the

‘Money’ industry excess returns alone. Between late 2008 and late 2010, the R2 was well above

60%. The reason for this was that as of December 2007, the top 4 firms by market capitalization

in the ‘Money’ industry were J.P. Morgan, Bank of America, Citigroup and Wells Fargo. Three

of these four were in the large value portfolio (Big/High-BEME to use the standard terminology).

While the market capitalization of these firms fell dramatically through 2008, they remained large

and, particularly as the volatility of the returns on the ‘Money’ industry increased, these firms and

others like them drove the returns both of the HML portfolio and the ‘Money’ industry portfolio.

However, there were firms in the ‘Money’ industry that did not have high book-to-market ratios,

even in the depths of the financial crisis. For example, in 2008 American Express (AXP) and

UnitedHealth Group (UNH) were both “L” (low book-to-market) firms. Yet both AXP and UNH

had large positive loadings on HML at this point in time (see Table 2.1). The reason is that, at this

time, both AXP and UNH covaried strongly with the returns on the ‘Money’ industry, as did HML.

We can exploit this variation within the ‘Money’ industry to construct a characteristic balanced

hedge portfolio wwwh as illustrated in the example in the previous section. The short side of the

characteristic balanced portfolio features firms with high loadings on HML and low and high book-

to-market, such as American Express and Citi, respectively. Loosely speaking, in the example in

Figure 2.1, Citi would be likeA1 (i.e., a value stock in the finance industry), and American Express

would be like asset A4 (a growth stock in the finance industry). The characteristic balanced hedge

portfolio goes long both value and growth stocks with low loadings on HML, and goes short value

and growth firms with high loadings, such as Citi and Amex.16

15As shown in Laeven and Huizinga (2009) banks during the crisis used accounting discretion to avoid writingdown the value of distressed assets. As a result the value of bank equity was overstated. The market knew better andas a result the book-to-market ratio of bank stocks shot up during the crisis.

16By maximizing the negative loading on HML, subject to the constraint that the the portfolio be book-to-marketneutral, we pick up the unpriced part of the HML-exposure. In this simple example the unpriced component of HMLis money-industry return.

77

2.2.2 The general case

We now show that the insights of the previous section extend to the more general case in which

there are multiple factors and characteristics that drive excess returns.

As in the example in Section 2.2.1, we consider a single period economy with a large number

of assets N but in which now realized excess returns are determined by a factor structure in which

the number of factors is K. Recall that for any multifactor representation there is always a rotation

with only one priced factor f and a vector of (K − 1) of unpriced factors ggg. We can then write

realized excess returns as

ri = βi (f + λ) + γγγiggg + εi, (2.17)

where E[f ] = E[gk] = E[εi] = 0 for all k = 1, 2, · · · , K − 1 and i = 1, 2, · · · , N . Further,

suppose that var (f) = σ2f , var (ggg) = Σg, cov (f,ggg) = 000, cov (εi, f) = 0, cov (εi, ggg) = 000 for

i = 1, 2, · · · , N and cov (εi, εj) = 0 for all i 6= j. γγγi is the (1 × (K − 1)) vector of asset i’s

exposure to the unpriced factors ggg.

As before we assume that there is a linear relation between expected excess returns and char-

acteristics

µµµ = Xλλλc, (A1)

where µµµ is again an (N × 1) vector of expected excess returns, X is now an (N ×M) matrix of

characteristics, and λλλc is an (M × 1) vector of characteristic premia.

Assumption (A1) is consistent with model (2.17) as long as

βββ =1

λXλλλc, (2.18)

where βββ is an (N × 1) vector of loadings on f . Thus, under assumption (A1), asset i’s exposure to

f is a linear combination of the M characteristics that describe expected excess returns.

This setting captures, in somewhat simplified form, the current state of the asset pricing litera-

ture, where a set of characteristics explains average returns. The asset pricing tests in the literature

78

(e.g., Fama and French, 1993, 2015) construct a set of CPs, one for each of the characteristics

that are shown to capture variation in average excess returns in the cross-section, and then examine

whether the cross-section of returns is explained by the CPs’ returns, for example using time-series

regressions like that in equation (2.1). The hope is that the projection of f on the space of returns

(the MVE portfolio) will be in the span of those CPs. If this were the case, then the factor model

with the CPs as factors would be a valid multi-factor asset pricing model. We now examine when,

for CPs constructed in this way, the MVE portfolio will indeed be in the span of the CPs.

Our starting point is the set of M CPs, one for each characteristic. The (N × 1) vector of

weights for the m-th CP, wwwc,m, will have positive values for firms with a high value of the m-th

characteristic, and negative values for firms with a negative characteristic. The return of the m-th

CP is then:

rc,m ≡ www>c,mrrr for m = 1, 2, · · · ,M. (2.19)

We further define the (N ×M) matrix of CP weights as:

Wc = [wwwc,1 wwwc,2 · · · wwwc,m] . (2.20)

This paper is concerned with, first, whether the CPs span the MVE frontier, and, second, if they

do not, whether and how we can improve on the CPs. A key concept in what follows is that of

characteristic efficient portfolios or CEPs.

Definition 2.1 (Characteristic efficient portfolios) The weight-vector of the m−th characteristic

efficient portfolio is the solution to the program

minwwwc,m

1

2www>c,mΣwwwc,m subject to www>c,mX = eeem. (Pm)

In program (Pm), Σ is the (N ×N) return covariance matrix and eeem is an (M × 1) vector with

the m−th entry equal to 1 and all others equal to 0. Intuitively the m−th CEP selects among the

portfolios that have the m−th characteristic equal to one and all other characteristics equal to zero,

79

the one with the minimum variance possible.

Our main results are that, first, the M CEPs span the mean variance efficient frontier and,

second, that the asset loadings with respect to CEPs line up perfectly with asset characteristics.

Finally we show how to adjust any set of CPs to transform them into CEPs.

Characteristic efficient portfolios

As shown in the Appendix B.2.1, the solution to program (Pm) is

www∗c,m = Σ−1X(X>Σ−1X

)−1eeem. (2.21)

Let

W ∗c ≡

[www∗c,1 www∗c,2 · · · www∗c,M

]= Σ−1X

(X>Σ−1X

)−1, (2.22)

be the (N ×M) matrix of CEP weights.

Let B∗, an (N × M) matrix, be the projection coefficient of rrr on rrr∗c , where each column

bbb∗m corresponds to the vector of individual stocks’ loadings on the m-th CEP, that is, B∗ ≡

[bbb∗1 bbb∗2 · · · bbb∗M ]. Armed with this we can prove the following:

Proposition 2.1 Under assumption (A1)

1. The returns of the CEPs span the mean-variance-efficient portfolio, that is,

SR∗2 = µµµ∗>c Σ∗−1c µµµ∗c = µµµ>Σ−1µµµ (2.23)

where

µµµ∗c ≡ W ∗>c µµµ = λλλc and Σ∗c ≡ var

(W ∗>c rrr)

=(X>Σ−1X

)−1. (2.24)

2. Asset loadings with respect to the CEPs line up with the characteristics

B∗ = X (2.25)

80

Proof: See Appendix B.2.2. �

Proposition 2.1 says that given (A1), there is an optimal way of constructing portfolios linked

to the characteristics so that they span the mean variance efficient frontier. One property of these

portfolios is that the loadings of any test portfolio on the CEPs will equal the vector of portfolio

characteristics for that test portfolio. That is, if the portfolios are the CEPs there is no distinction

between characteristics and covariances.

Notice that any rotation of the CEPs, W ∗c = W ∗

c A where A is (M ×M) and full rank, will also

span the MVE portfolio of excess returns. But for any such rotation the loadings will no longer be

the corresponding characteristic but a linear combination of them.17

We are interested in constructing CEPs rather than CPs. However, Σ is difficult to estimate,

which may justify the implicit choice in the literature to use simple characteristic sorting proce-

dures to construct CPs, but in general these CPs will not span the same space as the CEPs.

There is though a particular case of interest in which the CPs span the mean variance efficient

frontier. It is when K = M , that is, when the number of characteristics equals the number of

factors that explain the covariance matrix. In this case, adding the hedge portfolios would not

increase the maximum Sharpe ratio achievable with the CPs. Our view though is that this case is

of limited practical interest. The number of factors that capture common variation in stock returns

is likely large (at least the number of industries!) whereas the number of characteristics that have

been found to explain the cross-section of expected excess returns is smaller. In particular, it is

easy to show that as long as K >> M and the characteristics are correlated with the loadings on

the unpriced factors γγγ in the cross-section, the CPs will not span the MVE portfolio. We proceed

next by showing how to recover CEPs from the CPs.

17That the loadings are a linear function of the characteristics follows from the fact that the loadings with respectto any rotation of the CEPs are given by

B∗ = ΣW ∗c A(A>W ∗>c ΣW ∗c A

)−1= X

(A>)−1

,

where the last equality uses (2.22).

81

Characteristic portfolios and hedge portfolios

Our starting point is a set of CPs, for example the five portfolios in Fama and French (2015).

Let B, an (N × M) matrix, be the matrix of projection coefficients of rrr on rrrc, where each

column bbbm corresponds to the vector of loadings of individual assets on the m-th CP, that is,

B ≡ [bbb1 bbb2 · · · bbbM ]. The empirical counterpart of B is a matrix of a multivariate time series

regression coefficients of each asset’s excess return on rrrc.

We show next that there exists a set of optimal hedge portfolios, with weights given by the

columns of the (N × M) matrix W ∗h , that can be combined with the original CPs to obtain (a

rotation of) the CEPs. That is,

W ∗c A = Wc −W ∗

h∆∗, (2.26)

where A is an (M ×M) rotation matrix, and ∆∗ is an (M ×M) matrix of optimal hedge ratios.

Definition 2.2 (Optimal hedge portfolio) The weight-vector of the m−th characteristic hedge

portfolio is the solution to the program

maxwwwh,m

www>h,mbbbm subject to www>h,mX = 000 and1

2www>h,mΣwwwh,m = σ2 (Ph,m)

Program (Ph,m) delivers a portfolio weight-vector www∗h,m so as to maximize the correlation of

the returns of the hedge portfolio, rh,m ≡ www∗>h,mrrr, with the returns of the corresponding CP, rc,m,

conditional on having zero characteristic exposure. Define

W ∗h =

[www∗h,1 www∗h,2 · · · www∗h,M

].

Proposition 2.2

1. The weights of the optimal hedge portfolios are given by

W ∗h =

(Wc −W ∗

cX>Wc

)Σ−1c E−1 (2.27)

82

where E is an (M ×M) diagonal matrix specified in the Appendix B.2.3 and W ∗c is given

by (2.22).

2. A and ∆∗ in expression (2.26) are given by

A = X>Wc and ∆∗ = EΣc (2.28)

Proof: See Appendix B.2.3 �

To understand the intuition of Proposition 2.2 start by noticing that W ∗c A is a rotation of the

CEPs’ weights such that these rotated CEPs have the same characteristic as the original CPs. The

corresponding optimal hedge portfolios have zero expected excess returns,

Errrh = 0 where rrrh = W ∗>h rrr (2.29)

Finally, there is the interpretation of the optimal hedge ratios. A bit of algebra, reported in the

Appendix B.2.4, shows that18

∆∗ = Σ−1h W ∗>

h ΣWc, (2.30)

that is, for each CP, the optimal hedge ratios are the coefficients of a multivariate regression of

its return rc,m on the returns of all the optimal hedge portfolios. The intuition of this result is

straightforward. The hedge portfolios command zero premia, but their returns are correlated with

rrrc. Hence, the optimal hedge ratios are such that the return of the CEP is orthogonal to rrrh. That

is, consistent with exactly the intuition in the example in Section 2.2.1, each CEP is the residual

from a projection of the CP on the set of hedge portfolios, meaning that each CEP is equal to the

corresponding CP, orthogonalized to the set of hedge portfolios.

In sum, the optimal hedge portfolios do not remove any of the premia from the CPs but remove

sources of variation that do not command any premium. In other words, each of the optimal hedge

18The reason why we write ∆∗, rather than ∆, is because, as it was the case in the example (see equation (2.15)),the optimal hedge ratios are the solution of an optimization problem that results in an optimal combination of the CPsand the optimal hedge portfolios.

83

portfolios has zero expected excess return, and hence zero loading on the priced risk factor f .

The only reason that the optimal hedge portfolios load on the CPs is because they all load on the

unpriced risk factors.19 All optimal hedge portfolios together then form a basis for the unpriced

components in the CPs and can be used to hedge out all the exposure to unpriced risk factors. This

is the source of the improvement in the Sharpe ratio when going from CPs to CEPs.

An important implication of this analysis is that whenever we have a set of characteristics

that explain expected excess returns, we can always find a set of portfolios that span the MVE

portfolio. These portfolios lack economic content and thus so do the loadings with respect to

those portfolios, which are simply the characteristics (see Proposition 2.1). These CEPs though

can help in discriminating amongst alternative economic models: if a particular economic variable

is uncorrelated with the CEPs, then it cannot be a candidate for a state variable that drives the

marginal rate of substitution of the marginal investor. But the validity of such an inference depends

on using the CEPs rather than the CPs; the correlation of an economic variable with a CP could

result from a correlation of the variable with the priced factor, or with the unpriced factors on

which the CP loads.

2.3 Empirical results

2.3.1 Hedge portfolio construction

Recall that our starting point is any given set of CPs on which we want to improve. To do so

our methodology involves two steps. First, we construct the hedge portfolios. Second we find the

optimal hedge ratios.

In our empirical exercise, we focus on the FF five-factor model and we follow these authors in

the construction of their CPs. The empirical goal is to construct the best possible hedge portfolios,

as introduced in (Ph,m). The empirical procedure to construct the hedge portfolios builds on Daniel

and Titman (1997). The idea is to use ex-ante forecasts for the loadings for each stock i on the

19We assume that N is large enough and we can ignore idiosyncratic risk.

84

returns of the CPs, rc,m, in order to construct hedge portfolios with maximum loadings on the CPs.

At the same time, these hedge portfolios are constructed in such a way that they have characteristics

as close as possible to zero.

More precisely, our empirical approach is based on the first order conditions of the program (Ph,m).

We show in the Appendix B.2.3 that the weights of each optimal hedge portfolio are given by

www∗h,m =1

κ2,m

Σ−1(bbbm −Xκκκ>1,m

), (2.31)

them−th column ofWh in (2.27). In (2.31)κκκ1,m, which is (1×M), and κ2,m > 0 are the Lagrange

multipliers associated with the first and second constraints in program (Ph,m).

Roughly, equation (2.31) says, holding fixed the characteristics, the hedge portfolio goes long

stocks with high loadings and short stocks with low loadings with respect to the CP. Intuitively,

one can think about stocks’ loadings with respect to the CPs as a combination of exposure to priced

and unpriced risk. When we control for the characteristics, we control for cross-sectional variation

in the exposure to the CEPs, the sole source of variation in premia. Thus, holding characteristics

constant, sorting on CP loadings captures the remaining variation: the one with respect to the

unpriced sources of common variation.

Empirically, we construct the optimal hedge portfolios by sorting individual stocks into charac-

teristic bins and then within each bin sort again on the forecast loading with respect to a particular

CP. We form the portfolio by buying the high loading stocks and shorting the low loading stocks.

There are two challenges in constructing optimal hedge portfolios. First, our theory requires

that we control for all characteristics. Roughly, within each characteristic bin stocks have the same

characteristic values. But if there are, for example, five characteristics and we sort stocks into three

bins for each of them, this would result in 243 portfolios. Some of these portfolios would surely

contain very few stocks and thus would not be sufficiently diversified. It turns out that, empirically,

controlling for size and one additional characteristic at a time is enough to deliver hedge portfolios

that have close to zero exposure to all characteristics, while obtaining well-diversified portfolios.

85

We return to this issue when we discuss the characteristic properties of the hedge portfolios.

Second, notice that construction of the optimal hedge portfolios requires use of the full covari-

ance matrix (Σ; see equation (2.31)). We instead construct an approximation to the optimal hedge

portfolio by, for a set of firms with roughly equal characteristics, going long a value-weighted port-

folio of stocks with low loadings and short a value-weighted portfolio of stocks with high loadings

with respect to the CPs. This results in a hedge portfolio that we can combine with the CPs to get

close to the CEPs. Indeed, because we are not using the theoretically optimal hedge portfolios we

cannot exactly recover the CEPs from Definition 2.1. We still refer to these “approximate CEPs”

as CEPs in order to avoid the need to introduce additional terminology. Finally, value-weighting

stocks in each of the portfolios sorted on characteristics and loadings guarantees that our portfolios

do not overweight small stocks; this avoids the inherent difficulties in trading small stocks because

of a lack of liquidity.

We describe next the exact procedure to construct the hedge portfolios based on the example

of HML. We first calculate book-to-market (BEME) and market capitalization (ME) break points

at the marks of 33.3% and 66.7% based on NYSE firms. For BEME we use data from the end of

December of the previous year and for ME we use data from the end of June of each year. Then,

at the end of June of a given year, all NYSE, AMEX and NASDAQ stocks are placed into one of

the nine resulting bins. Next, within these nine bins, each of the stocks is sorted into one of three

additional bins formed based on the stocks’ forecast future loading on the HML CP. This last sort

results in portfolios of stocks with similar characteristics (BEME and ME) but different loadings on

HML.20 The firms remain in those portfolios between July and June of the following year. Finally,

we construct our hedge portfolio for HML by going long an equal weighted combination of all

high loading portfolios and short an equal weighted combination of all low loading portfolios.

The hedge portfolios for RMW and CMA are constructed in exactly the same way, simply by

replacing BEME with operating profitability (OP) and investment (INV). For SMB, we follow FF

20A potential concern with independently triple-sorted portfolios is sparse portfolio population. The number oftraded firms has varied substantially over time, reaching a “listing peak” in 1996 when it started declining (see Doidgeet al., 2017). We show in Appendix B.4.1 that even in periods of relatively few listed firms, the resulting portfolios aresufficiently diversified.

86

and construct three different hedge portfolios: the first sorts are based on BEME and ME, and then

within these 3x3 bins, we conditionally sort on the loading on SMB. The second and third versions

use OP and INV instead of BEME in the first sort. Then, an equal weighted portfolio of the three

different SMB hedge portfolios is used as the hedge portfolio for SMB. We do exactly the same for

the hedge portfolio for the market (MktRF), using forecast loadings on MktRF instead of forecast

loadings on SMB.

Clearly a key ingredient of the last step of the sorting procedure is the estimation of the future

loading on the corresponding characteristic portfolio. Our purpose is to obtain forecasts of loadings

in the five-factor FF model:21

ri,t = αp + bi,MktRF rMktRF,t + bi,HMLrHML,t + bi,SMBrSMB,t

+ bi,CMArCMA,t + bi,RMW rRMW,t + εp,t

(2.32)

For each stock, we instrument future loadings with pre-formation loading forecasts. The result-

ing estimation method is intuitive and is close to the method proposed by Frazzini and Pedersen

(2014) to estimate individual-firm market loadings. These authors build on the observation that

correlations are more persistent than variances22 and propose estimating correlations and variances

separately. They then combine these estimates to produce the pre-formation loadings. Specifi-

cally, correlations are estimated using a five-year window with overlapping log-return observa-

tions aggregated over three trading days, to account for non-synchronicity of trading. Variances

of characteristic portfolios and stocks are estimated on daily log-returns over a one-year horizon.

Furthermore, we introduce an additional intercept in the pre-formation regressions for returns in

the six months preceding portfolio formation, i.e., from January to June of the rank-year (see Fig-

ure 1 in Daniel and Titman (1997) for an illustration). Further, we use constant-allocation and

constant-weight pre-formation characteristic portfolio returns, as in Daniel and Titman (1997).23

The accuracy of loading forecast impacts the efficacy of the hedge portfolios. Intuitively, if

21We write, say, rHML,t instead of rc,BEME,t to simplify the notation.22See, e.g., De Santis and Gerard (1997)23See Appendix B.3.2 for details.

87

our forecasts of future loadings are very noisy, then sorting on the basis of forecast loadings will

not capture variation in the actual post-formation loadings of the sorted portfolios. In contrast, if

the forecasts are accurate, then our hedge portfolio—which goes long the high forecast loading

portfolio and short the low forecast loading portfolio—will indeed have large loading with respect

to the corresponding FF CP. Notice that this relates to statistical power of rejecting the benchmark

asset pricing model. Under the null hypothesis, αs are equal to zero for all stocks. Given our

theory, the alternative hypothesis is that the hedge portfolios have zero expected returns and strong

positive loadings, which translates into large negative αs. The larger the ex-port loadings, the

higher is the power of the test designed to reject the null hypothesis that the benchmark model is

true. We show in the Appendix B.4.2 that using a low power methodology that follows Daniel and

Titman (1997) and Davis et al. (2000) leads to different results than the ones presented in the next

section. Indeed, with hedge portfolios constructed using the low power method we are not able to

reject the FF five-factor model.

2.3.2 Description of the sorted portfolios

Table 2.2 presents average monthly excess returns for the portfolios sorted on characteris-

tics and forecast-loadings, which we combine to form our hedge portfolios. Each panel presents

a set of sorts with respect to size and one characteristic—either book-to-market, profitability or

investment—and the loading on HML (Panel A), RMW (Panel B), CMA (Panel C), MktRF (Pan-

els D-F), or SMB (Panels G-I).

For each of the 27 portfolios in each subpanel, we report value-weighted monthly excess re-

turns. The column labeled “Avg.” gives the average across the 9 portfolios for a given charac-

teristic. First, note that the average returns in the “Avg.” column are consistent with empirical

regularities well-known in the literature: the average returns of value portfolios are higher than

those of growth, historically robust profitability firms beat weak profitability firms, and histori-

cally conservative investment firms beat aggressive investment firms.

In Table 2.3 we present the post-formation loadings and αs. These are the coefficients from

88

regressing the monthly excess returns of the BEME/OP/INV×ME×loading sorted portfolios on

the excess returns of the five FF CPs, in the sample period from 1963/07 to 2019/12. We see

that there are large differences between the post-formation loadings of the low-forecast-loading

(“1”) and high-forecast-loading (“3”) portfolios For the value, profitability, and investment sorts,

the average post-formation differences in loading of the “3” and “1” portfolios are 0.8, 0.69, and

0.96 respectively. Given these large differences in loadings, it is remarkable that the difference in

the average monthly returns for the high- and low-loading portfolios are 6, 7, and -4 basis points

per month for the value, profitability and investment-loading sorts, respectively (see the last rows,

labeled “Avg.” of Panels A-C in Table 2.2).24 This is consistent with the Daniel and Titman (1997)

conjecture that average returns are a function of characteristics, and are unrelated to the loadings

on the FF-CPs after controlling for the characteristics.

In Figures 2.6, 2.7 and 2.8 we analyze the average characteristics of the sorted portfolios. Each

dot in these plots represents one of the 27 portfolios from the 3×3×3, BEME/OP/INV×ME×loading

sort. The dotted lines connect all portfolios within the same BEME/OP/INV×ME bucket. The x-

axis is the respective post-formation loading and the y-axis the average characteristic value. Figure

2.6 shows the portfolios sorted on HML loadings (Panel A), the portfolios sorted on RMW load-

ings (Panel B) and the portfolios sorted on CMA loadings (Panel C). Figure 2.7 shows portfolios

that form the market hedge portfolio and Figure 2.8 shows portfolios that form the SMB hedge

portfolio.

Ideally, all the dotted lines in Figures 2.6, 2.7 and 2.8 should be horizontal straight lines. This

would mean that, within each characteristic bucket, forecast loadings were uncorrelated with any

characteristic. However, our method uses coarse characteristic sorts and therefore we do not expect

the characteristics of the high and low loading portfolios to be identical. Rather, we expect that

the loading with respect to the CP will be correlated with the characteristic used to construct it.

For example, bHML is correlated with BEME in the cross-section. This correlation translates into

differences in the average characteristics of the low b and the high b portfolios. Last, we do not

24For comparison, the average excess returns of the HML, RMW, and CMA portfolios over the same period are 30,27, and 22 bp/month, respectively.

89

control for the two other characteristics, when sorting on a particular loading. For example, when

we form BEME×ME×bHML sorted portfolios, we do not control for OP or INV. To the extent

that OP and/or INV are cross-sectionally correlated with bHML, we might also pick up variation in

those characteristics.

One way to assess the magnitude of the deviation of our sorted portfolios from the ideal “char-

acteristic balanced” case, is to compare the spread in characteristics within and across characteristic

buckets. The former constitutes the vertical distance between red and green dots, that are connected

with a dotted line, which we call the “unintended characteristic spread”. The latter is the vertical

distance across the dotted lines for the cases where the characteristic on the y-axis corresponds to

the one used to form the CP (i.e. HML with BEME, RMW with OP, and CMA with INV). This is

the “intended spread” in the characteristic between high and low characteristic portfolios used to

form the CP. This gives us an idea of the magnitude of a large spread in the characteristic.

As one can see from the figures, in general, the unintended spreads are relatively small, com-

pared to the intended spreads. For example, in the case of portfolios sorted on bhml, the unintended

BEME spread within characteristic buckets is at most about 0.23 (among the small-value stocks).

When we compare that to the intended spread in BEME between small value and small growth

stocks, which amounts to 1.13, we can conclude that the unintended spread is relatively small.

We view these results as evidence that even a sorting procedure as simple and coarse as ours

does a reasonable job in forming hedge portfolios that are close to being characteristic balanced.

Furthermore, the small observed return differences presented in Table 2.2 may be related to the

characteristic spread observed above. In fact, our theory predicts that the characteristic spreads

across low loading and high loading portfolios should relate to the expected returns in the hedge

portfolios. For example, among the firms in the small-cap, low BEME bucket in Panel A of Fig-

ure 2.6, there is considerable variation in OP. This could partially explain the 19 bp difference in

returns between the high and low bHML portfolios, as documented in the top row in Panel A of

Table 2.2.

Finally, from Figures 2.6, 2.7 and 2.8 we can also see that we forecast future loadings quite

90

well. Ideally, the low forecast loading stocks also have a low post-formation loading on the CPs.

Therefore, the red dots should always be the farthest to the left, whereas the green dots (the high

forecast loading stocks) should end up to the right of those, within each characteristic bucket.

Indeed we observe this pattern for all portfolios. Moreover, the spread between the red and the

green dots, within a given characteristic bucket, should be as large as possible. Indeed the spread

generated here is far bigger, as compared to, e.g., the one that is generated using a loading forecast

methodology following Daniel and Titman (1997) or Davis et al. (2000).25

2.3.3 Pricing results

In this subsection we describe the two key empirical results of this paper. First, we show that

we can reject the FF five-factor model using the hedge portfolios as test assets. Second we show

how to improve the Sharpe ratios of the FF CPs by combining them optimally with the hedge

portfolios. We argue that such CEPs have a better chance of spanning the mean variance efficient

frontier than the standard CPs proposed in the literature.

Pricing the hedge portfolios

We run a single time series regression of the monthly excess returns of the hedge portfolios

rh,m, m ∈ {HML,RMW,CMA,SMB,MktRF}, on the excess returns of the five FF CPs.

Table 2.4 reports the average excess returns, alphas and loadings as well as the corresponding

t−statistics.

Two attributes are important to determine the ability of the hedge portfolios in hedging unpriced

risk: They must have zero expected excess returns and have large loadings with respect to the

corresponding CP.

We first assess the hedge portfolios’ expected excess returns. Column “Avg" in Table 2.4

reports the monthly average excess returns of all hedge portfolios. Ideally, all of these numbers

should be exactly zero. For all 5 hedge portfolios, average excess returns are slightly positive but

25We present evidence on this in Appendix B.4.2.

91

statistically indistinguishable from zero. This result mirrors the fact that the hedge portfolios have

close to zero characteristic exposures. The fact that the excess returns are all slightly negative,

albeit insignificantly so, reflects the earlier insight, that characteristics and corresponding loadings

are inherently cross-sectionally correlated. A coarse sort (such as 3 buckets) is thus always at risk

of picking up some variation in the corresponding characteristic. Hence, going long high (short

low) loading stocks mechanically also tends to slightly tilt towards high (low) characteristic values,

as pointed out in Section 2.3.2.

We then turn to the hedge portfolios’ ability to hedge out unpriced risk by looking at their

post-formation loading on their corresponding CP. As expected, each hedge portfolio exhibits a

strong significantly positive loading on their corresponding CP. For example, the hedge portfolio

for HML has a loading on HML of 0.8 with a t−statistic of 28.21.

This directly translates into pricing implications, as indicated by the alphas. The five FF CPs

fail to price four out of five long-short hedge portfolios, three of them (MktRF, RMW, and CMA)

at a significance level of 5%.26 The last lines of the Table constructs equal-weight combinations

of these portfolios. The alphas for all of them are strongly statistically significant. For instance,

when we consider the equal-weight combination of four hedge portfolios (the ones corresponding

to HML, RMW, CMA and MktRF), the monthly alpha is -0.18 with a t-statistic of -5.92.

Ex-ante determination of the optimal hedge-ratio

Having studied the hedge portfolios, the next step is to construct characteristic efficient portfo-

lios, i.e.,

r∗c,m,t = rc,m,t − rrrh,tδδδm,t−1 (2.33)

where m ∈ {HML,RMW,CMA,SMB,MktRF}.27

The optimal hedge ratio δδδm,t−1 is determined ex-ante, in the spirit of equation (2.15). We

26The only one for which the FF model cannot be rejected, even at the 10% level, is the SMB hedge portfolio. Thefact that the FF model succeeds in pricing the SMB hedge portfolio is consistent with the notion that there is little toprice there, as we know that the size premium has historically been relatively weak.

27To be consistent with our notation, the returns of, for example, the HML portfolio at time t should be denoted byrc,BEME,t. We simplify the notation by calling it rHML,t.

92

employ the same loading forecast techniques as described before to forecast loadings, i.e., we

first calculate five years of constant-weight and constant-allocation pre-formation returns of rc,m,t

and rh,t. We then calculate correlations over the whole five years of 3-day overlapping return

observations and variances by utilizing only the most recent 12 months of daily observations.

Note that this is done in a multi-variate framework, i.e., we consider the covariance of each CP

with all five hedge portfolios, to account for the correlation structure among the hedge portfolios.

Consequently, both δδδm,t−1 and rrrh,t are M−dimensional vectors, where M = 5 in the case of

the FF model examined here. Note further, that the returns of the CEPs r∗m,t are (approximately)

orthogonal to the returns of the hedge portfolios rrrh,t. The reason why they are only approximately

orthogonal is because the δδδm,t−1 is estimated ex-ante, i.e., up to t− 1.

Characteristic efficient Fama and French portfolios

The first column of Table 2.5 reports key statistics on the returns of each of the CPs (rc):

the annualized average returns in percentages, the annualized volatility of returns and the squared

annualized Sharpe ratio. The second column reports the same three quantities for the CEPs, r∗c .

These portfolios are constructed exactly as in expression (2.33).

When we move from rc,m to r∗c,m, we see that the mean return of all characteristic portfolios

decreases, but that the volatility decreases substantially more. This leads to an increase in the

Sharpe ratio for each of the individual Fama and French CPs. For example, the squared Sharpe

ratio of the improved version of CMA is 0.3, where the squared Sharpe ratio of the original CMA

is 0.16.

The right-side panel of Table 2.5 presents p-values for the differences in means based on a

t-test, and for the volatilities using a Levene (1961) test for equality of variances. To test for

differences in Sharpe-Ratios, we use a test based on Jensen (1968)’s alpha. Specifically, to assess

whether the portfolio performance increases when we move from the CP to the CEP, we run a

time-series regression of the returns of the CEP on the CP, and obtain a p-value for the regression

intercept. Consistent with the interpretation of Jensen (1968) we are testing whether there is a

93

statistically significant performance differential between the CP and the CEP.

While the result that we improve on each characteristic portfolio individually is promising, the

ultimate goal of the exercise was to construct a set of portfolios that gets closer to spanning the

mean variance efficient portfolio, as compared to the CPs. Hence, in the second-to-last panel of

Table 2.5, we compute the in-sample optimal combination of both the original FF CPs (column

rc,m) and the CEPs (r∗c,m). The maximum achievable squared Sharpe ratio with FF CPs in the

sample period covered in this paper (1963/07 - 2019/12) is 1.17. The squared Sharpe ratio of the

optimal combination of the CEPs is instead 2.13.

Notice that each individual CEP is perfectly tradable, as all information used to construct them

is known to an investor ex-ante. Only the weights of optimal combinations of the five CPs as well

as CEPs, as reported in the bottom panel of Table 2.5, are calculated in-sample. Additionally, we

want to emphasize that the way we construct our portfolios is very conservative, in that we only

rebalance once every year—in order to be consistent with the rules of the game set by Fama and

French.

We reiterate that our empirical approach does not in general deliver the theoretical CEPs. Com-

putation of the optimal hedge portfolios (see equation (2.31)) and optimal hedge ratios (described

in equation (2.30)) requires knowledge of Σ, which is difficult to estimate, as well as of the post-

formation loadings. Nevertheless, the CEPs deliver a large improvement over the CPs. Moreover,

our empirical method is robust and, at same time, delivers tradable hedge portfolios.

Redundancy of HML

FF find that HML is redundant, in that it is spanned by the other CPs. Table 2.6 shows that

we can replicate this result based on our extended sample. The weight of HML in the ex-post

optimal combination, based on Markowitz optimization, is -2.0 % when we use the original FF

CPs (column rc). However, if we use the CEPs (column r∗c ), the weight on HML increases to 8.0

%, close to the weight on MktRF .

We can confirm this result by running spanning regressions in Table 2.7. The return of HML,

94

rHML is indeed spanned by the other four CPs (column 1). It is similarly subsumed by the other

four CEPs (column 2). The return of the CEP version of HML, r∗HML (column 3), is not fully

spanned by the returns of the other four FF CPs: removing unpriced sources of variation from

HML makes it an unspanned portfolio, relative to the other four original FF CPs. The alpha of

a regression of the HML CEP on the other CEPs has a t-statistic of 1.72. We can thus reject

redundancy of HML∗ at the 10% significance level.

An important additional empirical finding is that the correlation between HML∗ and RMW ∗

is now strongly negative, −.52, whereas it was positive but very small for HML and RMW , .09

(see Table 2.8). This fact suggests that both CPs, HML and RMW , load on an unpriced source

of common variation. The implication is that an investor can capture the premium associated with

exposure to HML∗ and RMW ∗ while lowering the total variance of the portfolio.

2.3.4 Industry-neutral characteristic portfolios

In Section 2.2.1, we argued that industry was one potential source of common variation that

was likely to be unpriced. Since we know that there are periods in which the FF CPs strongly

load on industry portfolios, a natural exercise is to construct CPs that are industry-neutral. In this

section, we construct industry-neutral versions of CPs and compare their performance with the

performance of the CEPs constructed in this paper.

To construct industry-neutral CPs we ex-ante hedge any exposure to the 12 FF industries out of

the FF CPs, except for the market.28 Define the returns of the industry-neutral portfolio, rc−ind,m,t,

as:

rc−ind,m,t = rc,m,t − rrrind,tδδδindc,m,t−1 (2.34)

wherem ∈ {HML,RMW,CMA,SMB,MktRF}, rrrind,t is a (1×12) vector with excess returns

of all 12 industries, δδδindc,m,t−1 is the ex-ante optimal industry hedge-ratio. Analogous to the previous

exercises, δδδindc,m,t−1 is estimated every June 30th, using correlations over the previous five years of

28The market is a linear combination of all industries and thus, hedging out industries from the market using themethod described is not feasible.

95

3-day overlapping return observations and variances by using only the most recent 12 months of

daily observations.29

Table 2.5 reports the mean, volatility and squared Sharpe ratios for all rc−ind,m and the in-

sample optimal combination of the industry-neutral characteristic portfolios. Hedging out industry

risk leads to an improvement in the squared Shape ratio for HML, CMA and SMB, consistent

with the hypothesis that it is generally unpriced risk. However, the CEPs outperform the industry-

hedged CPs in the case of RMW and CMA, i.e., the use of our hedge portfolio results in a greater

Sharpe ratio improvement than simply hedging out industry exposure. In contrast, for HML the

industry-neutral version has a higher Sharpe ratio than CEP. A possible explanation is that, in

theory, unlike our procedure, the industry-hedging can change the characteristic of the resulting

portfolio and as a consequence the exposure to the priced factor. Recall that our CEPs have the

same characteristics as the CPs, but lower variance and therefore a higher squared Sharpe ratio.

Instead, industry-neutral portfolios do not need to have the same characteristic as the original CPs

and our theory does not have a prediction for this case.

Nevertheless, improving the squared Sharpe ratios for each of the CPs is not the goal. Rather

it is to construct CEPs that span the MVE portfolio. Indeed, the ex-post optimal combination

of the CEPs shows a far more dramatic improvement over the original FF CPs compared to the

ex-post optimal combination of industry-hedged portfolios. Based on our Jensen (1968) test, the

CEPs’ optimal combination significantly outperforms the industry-neutral one.30 Since the market

could not be included in the industry-hedging, we repeat the ex-post optimal combination exercise

excluding MktRF in the last panel of Table 2.5. The CEPs also achieve a higher Sharpe ratio

compared to the industry-neutral CPs in this specification, and that difference is also statistically

significant.

These results suggest that simply hedging out industry exposure is not optimal for two potential

reasons. First, some component of the industry factors might be priced. Second, there can be other

29We also employ constant-weight, constant-allocation (as of June 30th) pre-formation returns of the factor- andindustry-portfolios.

30Note that we test whether Jensen’s α of regressing the CEP on the industry-neutral portfolio is statistically largerthan zero. For CPs where the point estimate of that α is negative (such as HML and SMB), we report a “-”.

96

sources of common variation that are not related to industries and do not command a premium.

Our procedure is designed to only hedge out unpriced sources of common variation and does not

require us to identify those sources.

2.4 Conclusions

This paper makes two contributions to the asset pricing literature. First, we examine the stan-

dard procedure employed for constructing characteristic portfolios (CPs): zero-investment portfo-

lios for which the long side is a portfolio of high characteristic stocks, and the short side consists

of a portfolio of low characteristic stocks. This procedure, which has become standard since Fama

and French (1993), does not guarantee that the set of portfolios will span the mean variance ef-

ficient frontier. The reason is that, when sorting on a characteristic, the resulting portfolios are

likely to load on an unpriced sources of common variation. Our second contribution is to show

how to construct hedge portfolios that capture the unpriced risk in these portfolios, and which can

be combined with the CPs to form characteristic efficient portfolios (CEPs) that are free of expo-

sure to these unpriced sources of common variation. Our hedge portfolios are constructed to have

maximum loading on the CPs, subject to having zero characteristic. We show in particular that

if the model linking characteristics and average excess returns is correct, the CEPs will span the

mean variance efficient portfolio.

We illustrate the empirical relevance of our ideas in the context of the five-factor model of

Fama and French (2015). We take the five characteristic portfolios from that model and construct

hedge portfolios for each. Then we construct empirical counterparts to the CEPs, one for each

of the characteristics (market, size, market-to-book, profitability and investment) by optimally

combining the original CPs with our hedge portfolios. The in-sample squared Sharpe ratio of the

optimal combination of the FF CPs is 1.16 whereas it is 2.16 for the CEPs. Removing unpriced

sources of common variation from the original CPs is both empirically and economically relevant.

This paper sheds light on some of the debates in the cross-sectional asset pricing literature.

97

First, an important, if somewhat implicit, assumption in the literature is that there is a model

linking average excess return to characteristics. The existence of this model is also our starting

point. Our contribution is then to show how to construct the CEPs from the CPs so that they span

the MVE portfolio. It is a representation theorem: If a complete model of average excess returns

and characteristics is available, then average excess returns can be fully described with a “factor

model” in which the “factors” are the CEPs.

Economic theory is interested in understanding the economic forces that are the sources of

premia in asset returns. The CEPs we construct are important when discriminating between al-

ternative economic models of the marginal rate of substitution of the representative investor. The

reason is that only the return of the optimal combination of the CEPs—which is the maximum

Sharpe ratio portfolio—is maximally correlated with shocks to the marginal rate of substitution of

the representative investor. The returns of the CPs are not, as these portfolios also load on unpriced

sources of common variation. CEPs then provide a lens through which we can learn about the

economic shocks that matter for the representative investor.

Second, the loadings of any portfolio on the CEPs equal the portfolio characteristics. This

clarifies the characteristics versus covariance debate. To put it sharply: In the context of the asset

pricing models that spring from Fama and French (1993) there is no distinction between charac-

teristics and covariances when the portfolios that serve as factors are the CEPs.

Third, we emphasize the distinction between priced and unpriced sources of common variation.

A full description of the covariance matrix of returns requires at the very least the CEPs and

the hedge portfolios, but only the CEPs are needed for pricing. This speaks to the theoretical

distinction between the APT of Ross (1976) and the ICAPM of Merton (1973). The first is a

model of the covariance matrix of returns whereas the second is a model of sources of premia, that

is, of expected excess returns. Keeping in mind this distinction when building asset pricing models

out of characteristics is key to guaranteeing that the resulting portfolios span the MVE portfolio.

98

Figures

Figure 2.2: Rolling regression R2s – HML returns on industry returns

1970

1980

1990

2000

2010

2020

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

R2 (

126-da

y-rollin

g)

This figure shows the R2 from 126-day rolling regressions of daily HML returns on the twelve daily Fama and French(1997) industry excess returns. The time period is 1963/07 - 2019/12.

99

Figure 2.3: HML loadings on industry-portfolios

1970

1980

1990

2000

2010

2020

−0.6

−0.4

−0.2

0.0

0.2

0.4

0.6

Indu

stry lo

adings

(126

-day-ro

lling)

BusEqMoneyChemsDurblEnrgyHlthManufNoDurOtherShopsTelcmUtils

1970

1980

1990

2000

2010

2020

−0.4

−0.2

0.0

0.2

0.4

0.6

Money

-indu

stry lo

ading of H

ML (126

-day

-rollin

g)

The upper panel of this figure plots the loadings from rolling 126-day regressions of the daily returns to the HMLcharacteristic portfolio on the twelve daily Fama and French (1997) industry excess returns over the 1963/07-2019/12time period. The lower panel plots only the loading on the Money industry portfolio (including the 95% confidenceinterval) and hides the other 11 industry-portfolio loadings.

100

Figure 2.4: Volatility of the money industry-portfolios

1970

1980

1990

2000

2010

2020

0.0

0.2

0.4

0.6

0.8

1.0An

nualize

d Vo

latility

(126

-day

-rollin

g)

This figure shows 126-day volatility of the daily returns to the Money portfolio over the 1963/07 - 2019/12 timeperiod.

Figure 2.5: Rolling regression R2s – HML returns on Money industry returns

2002

2004

2006

2008

2010

2012

2014

2016

2018

2020

0.0

0.2

0.4

0.6

0.8

1.0

R2 (

126-da

y-rollin

g)

This figure shows the R2 from 126-day rolling regressions of daily HML returns on daily Money industry excessreturns from the 12 Fama and French (1997) industry returns. The time period is 2000/01 - 2019/12.

101

Figure 2.6: Ex-post HML/RMW/CMA loadings vs. characteristicsB

EM

EPanel A: HML

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5Post-formation bHML

0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME

Low bHMLMedium bHMLigh bHML

Panel B: RMW

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5Post-formation bRMW

0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME

Low bRMWMedium bRMWigh bRMW

Panel C: CMA

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5Post-formation bCMA

0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME

Low bCMAMedium bCMAigh bCMA

OP


−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP


INV


−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV


ME


2

4

6

8

10

12

14

log(ME)

Lo bHMLMedium bHMLigh bHML


2

4

6

8

10

12

14

log(ME)

Lo bRMWMedium bRMWigh bRMW


2

4

6

8

10

12

14

log(ME)

Lo bCMAMedium bCMAigh bCMA

This figure shows the time-series average of post-formation characteristic portfolio loading on the x-axis and the time-series average of each characteristic on the y-axis for each of the 27 portfolios formed on size, characteristic (book-to-market/operating profitability/investment) and HML/RMW/CMA-loading. The first column uses sorts on book-to-market and HML-loading, the second one operating profitability and RMW-loading and the last one investment andCMA-loading.

102

Figure 2.7: Ex-post MktRF loadings vs. characteristicsB

EM

EPanel A: MktRF (ME × BEME)

0.6 0.8 1.0 1.2 1.4Post-formation bMktRF

0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME

Low bMktRFMedium bMktRFigh bMktRF

Panel B: MktRF (ME × OP)


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


Panel C: MktRF (ME × INV)


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


OP


−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP

Low bMktRFMedium bMktRFHigh bMktRF


−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP


INV


−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV


ME


2

4

6

8

10

12

14

log(ME)



2

4

6

8

10

12

14

log(ME)



2

4

6

8

10

12

14

log(ME)


This figure shows the time-series average of post-formation characteristic portfolio loading on the x-axis and the time-series average of each characteristic on the y-axis for each of the 27 portfolios formed on size, characteristic (book-to-market/operating profitability/investment) and MktRF-loading. The first column uses sorts on book-to-market andMktRF-loading, the second one operating profitability and MktRF-loading and the last one investment and MktRF-loading.

103

Figure 2.8: Ex-post SMB loadings vs. characteristics.B

EM

EPanel A: SMB (ME × BEME)

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5Post-formation bSMB

0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME

Low bSMBMedium bSMBigh bSMB

Panel B: SMB (ME × OP)


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


Panel C: SMB (ME × INV)


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


OP


−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP


INV


−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV


ME


2

4

6

8

10

12

14

log(ME)

Lo bSMBMedium bSMBigh bSMB


2

4

6

8

10

12

14

log(ME)



2

4

6

8

10

12

14

log(ME)


This figure shows the time-series average of post-formation characteristic portfolio loading on the x-axis and the time-series average of each characteristic on the y-axis for each of the 27 portfolios formed on size, characteristic (book-to-market/operating profitability/investment) and SMB-loading. The first column uses sorts on book-to-market andSMB-loading, the second one operating profitability and SMB-loading and the last one investment and SMB-loading.

104

Tables

Table 2.1: Low book-to-market stocks in the Money industry as of June 2008

Firm BE/ME βHML-portfolioAmerican Express 0.19 3United Health 0.27 3Aflac 0.29 3Charles Schwab 0.13 3Franklin Resources 0.27 3Cme Group 0.34 3Aetna 0.36 2Express Scripts Holding 0.04 1Northern Trust 0.27 3Price T. Rowe 0.17 3TD Ameritrade 0.18 2Cigna 0.34 1Navient 0.36 3Humana 0.32 2Nasdaq 0.32 2

The first column reports the largest fifteen stocks in the Money industry in the low book-to-market bin, sorted bymarket capitalization. The second column reports the book-to-market and the third reports the HML loading-portfolioto which the stock belongs as of June 30th, 2008.

105

Table 2.2: Average monthly excess returns for the sorted portfolios

Panel A: HML

Char-Portfolio bHML-PortfolioBEME ME 1 2 3 Avg.1 1 0.49 0.65 0.68 0.59

2 0.52 0.61 0.753 0.49 0.55 0.55

2 1 0.86 0.91 0.91 0.752 0.71 0.79 0.843 0.57 0.58 0.57

3 1 1.02 0.98 1.02 0.872 0.91 0.86 1.013 0.76 0.66 0.63

Avg. 0.71 0.73 0.77

Panel B: RMW

Char-Portfolio bRMW -PortfolioOP ME 1 2 3 Avg.1 1 0.63 0.84 0.79 0.61

2 0.58 0.71 0.713 0.26 0.49 0.48

2 1 0.88 0.95 0.82 0.712 0.72 0.79 0.763 0.51 0.45 0.5

3 1 0.89 1.04 0.97 0.82 0.77 0.84 0.933 0.61 0.58 0.59

Avg. 0.65 0.74 0.73

Panel C: CMA

Char-Portfolio bCMA-PortfolioINV ME 1 2 3 Avg.1 1 0.97 1 0.9 0.8

2 0.81 0.85 0.733 0.73 0.64 0.6

2 1 0.95 0.91 0.98 0.782 0.97 0.87 0.743 0.57 0.51 0.57

3 1 0.59 0.76 0.59 0.622 0.62 0.73 0.713 0.52 0.55 0.55

Avg. 0.75 0.76 0.71

Mkt

RF

Panel D: MktRF (ME × BEME)

Char-Portfolio bMktRF -PortfolioBEME ME 1 2 3 Avg.1 1 0.51 0.68 0.57 0.58

2 0.64 0.66 0.63 0.57 0.53 0.47

2 1 0.76 0.92 0.97 0.742 0.7 0.8 0.863 0.49 0.62 0.59

3 1 0.96 1.03 1.01 0.872 0.84 0.92 1.023 0.57 0.65 0.82

Avg. 0.67 0.76 0.77

Panel E: MktRF (ME × OP)

Char-Portfolio bMktRF -PortfolioOP ME 1 2 3 Avg.1 1 0.54 0.75 0.84 0.6

2 0.56 0.75 0.733 0.36 0.43 0.46

2 1 0.85 0.93 0.86 0.712 0.7 0.76 0.813 0.45 0.5 0.55

3 1 1.03 0.96 0.95 0.812 0.8 0.85 0.873 0.65 0.56 0.66

Avg. 0.66 0.72 0.75

Panel F: MktRF (ME × INV)

Char-Portfolio bMktRF -PortfolioINV ME 1 2 3 Avg.1 1 0.75 1 1.04 0.8

2 0.73 0.78 0.923 0.63 0.67 0.7

2 1 0.85 0.94 1.02 0.782 0.79 0.83 0.963 0.48 0.51 0.63

3 1 0.69 0.69 0.57 0.622 0.6 0.77 0.673 0.57 0.51 0.53

Avg. 0.68 0.74 0.78

SMB

Panel G: SMB (ME × BEME)

Char-Portfolio bSMB -PortfolioBEME ME 1 2 3 Avg.1 1 0.51 0.66 0.57 0.6

2 0.59 0.66 0.683 0.49 0.61 0.6

2 1 0.78 0.86 1.01 0.772 0.67 0.78 0.943 0.47 0.66 0.73

3 1 0.96 0.98 1.09 0.892 0.82 0.91 1.13 0.53 0.72 0.93

Avg. 0.65 0.76 0.85

Panel H: SMB (ME × OP)

Char-Portfolio bSMB -PortfolioOP ME 1 2 3 Avg.1 1 0.66 0.77 0.73 0.62

2 0.65 0.72 0.673 0.29 0.49 0.65

2 1 0.81 0.93 0.89 0.742 0.66 0.77 0.883 0.42 0.62 0.67

3 1 0.96 1.04 0.9 0.822 0.77 0.85 0.943 0.58 0.66 0.73

Avg. 0.64 0.76 0.78

Panel I: SMB (ME × INV)

Char-Portfolio bSMB -PortfolioINV ME 1 2 3 Avg.1 1 0.89 0.95 1 0.82

2 0.69 0.82 0.963 0.55 0.74 0.79

2 1 0.89 0.97 0.99 0.812 0.78 0.82 1.013 0.46 0.62 0.76

3 1 0.6 0.71 0.58 0.642 0.63 0.69 0.763 0.46 0.64 0.67

Avg. 0.66 0.77 0.84

Stocks are sorted into 3 portfolios based on the respective characteristic — book-to-market (BEME), operating profitability (OP) or investment (INV) — andindependently into 3 size (ME) groups. These are depicted row-wise and indicated in the first two columns. Last, within each bucket, stocks are sorted into 3further portfolios based on the loading forecast. These portfolios are displayed column-wise and in Panels A-C for HML, RMW, CMA, and Panels D-F for MktRF,and Panels G-I for SMB. The last column shows average returns of all 9 respective characteristic-portfolios. The last row shows averages of all 9 respectiveloading-portfolios. The sample period is 1963/07 - 2019/12.

106

Table 2.3: Alphas and loadings

Panel A: HML

Char-Portfolio pre-formation bHML-sorted portfolios

BEME ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 0.00 -0.03 -0.14 0.15 0.04 -0.47 -2.15 1.24

2 0.09 -0.09 -0.13 0.23 1.16 -1.43 -1.87 2.07

3 0.05 0.03 -0.02 0.07 0.83 0.48 -0.24 0.62

2 1 0.12 0.09 -0.02 0.13 1.79 1.68 -0.30 1.51

2 -0.11 -0.06 -0.04 -0.07 -1.56 -1.02 -0.62 -0.68

3 -0.06 -0.13 -0.14 0.08 -0.66 -1.85 -1.69 0.67

3 1 0.22 0.10 -0.07 0.29 3.08 2.07 -1.21 3.02

2 0.01 -0.04 0.01 -0.01 0.08 -0.63 0.15 -0.05

3 0.02 -0.16 -0.11 0.13 0.27 -2.04 -1.08 0.83

Avg. 0.04 -0.03 -0.07 0.11 1.07 -1.20 -2.12 1.86

post-formation bHML t(bHML)

1 1 -0.71 -0.23 0.08 -0.79 -15.22 -6.98 2.40 -13.93

2 -0.74 -0.19 0.29 -1.03 -19.12 -6.73 8.52 -19.71

3 -0.48 -0.18 0.17 -0.65 -16.67 -7.23 5.49 -12.98

2 1 -0.16 0.15 0.50 -0.66 -5.34 5.92 19.17 -15.97

2 -0.04 0.28 0.62 -0.66 -1.20 9.50 19.19 -14.59

3 -0.10 0.27 0.56 -0.66 -2.58 7.92 14.56 -11.56

3 1 0.07 0.45 0.84 -0.78 2.08 20.15 30.99 -17.32

2 0.24 0.51 1.01 -0.78 6.09 17.24 23.61 -12.09

3 0.16 0.65 1.33 -1.17 3.81 17.13 28.04 -15.66

Avg. -0.20 0.19 0.60 -0.80 -11.54 14.28 37.00 -28.21

Panel B: RMW

Char-Portfolio pre-formation bRMW-sorted portfolios

OP ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 0.03 0.07 -0.13 0.16 0.40 1.31 -2.22 1.61

2 0.22 0.05 -0.12 0.33 2.16 0.73 -1.76 2.75

3 0.11 0.01 -0.23 0.34 1.29 0.08 -2.99 2.60

2 1 0.09 0.07 -0.16 0.25 1.34 1.14 -2.78 2.80

2 0.01 -0.02 -0.14 0.15 0.19 -0.35 -2.11 1.65

3 0.15 -0.15 -0.20 0.35 1.99 -2.46 -2.86 2.98

3 1 0.03 0.07 -0.07 0.10 0.39 1.12 -0.95 0.91

2 -0.07 -0.09 -0.09 0.02 -1.05 -1.48 -1.13 0.19

3 0.16 0.01 -0.00 0.16 2.47 0.21 -0.05 1.53

Avg. 0.08 -0.00 -0.13 0.21 2.31 -0.00 -3.83 3.66

post-formation bRMW t(bRMW )

1 1 -0.89 -0.19 0.04 -0.92 -22.35 -7.54 1.26 -18.84

2 -1.09 -0.22 0.11 -1.19 -22.17 -6.74 3.27 -19.98

3 -1.22 -0.40 0.11 -1.33 -28.38 -11.44 2.82 -20.48

2 1 0.00 0.30 0.33 -0.32 0.13 10.52 11.53 -7.53

2 -0.03 0.30 0.37 -0.41 -1.10 10.28 11.78 -9.19

3 -0.51 0.16 0.29 -0.80 -13.98 5.18 8.53 -14.10

3 1 0.02 0.43 0.41 -0.39 0.54 14.70 11.56 -7.30

2 0.29 0.54 0.62 -0.32 8.92 17.94 15.80 -6.50

3 -0.07 0.27 0.42 -0.49 -2.11 10.17 13.84 -9.40

Avg. -0.39 0.13 0.30 -0.69 -22.51 9.79 18.56 -24.80

107

Panel C: CMA

Char-Portfolio pre-formation bCMA-sorted portfolios

INV ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 0.13 0.14 0.02 0.12 2.05 2.74 0.19 1.16

2 0.05 -0.00 -0.22 0.27 0.78 -0.05 -2.57 2.45

3 0.17 -0.11 -0.23 0.40 2.22 -1.71 -3.13 3.36

2 1 0.10 0.07 0.14 -0.04 1.64 1.22 2.15 -0.44

2 0.25 0.08 -0.13 0.39 3.42 1.36 -1.93 3.83

3 0.13 -0.07 -0.13 0.26 1.71 -1.21 -2.16 2.30

3 1 -0.22 -0.04 -0.18 -0.04 -3.10 -0.80 -2.82 -0.45

2 -0.01 -0.09 -0.01 -0.00 -0.13 -1.39 -0.15 -0.01

3 0.34 -0.00 -0.16 0.50 3.86 -0.06 -2.21 3.65

Avg. 0.10 -0.00 -0.10 0.20 2.91 -0.14 -3.01 3.39

post-formation bCMA t(bCMA)

1 1 0.06 0.25 0.68 -0.61 1.33 6.35 11.47 -8.14

2 -0.09 0.55 0.93 -1.02 -1.77 11.23 14.70 -12.38

3 -0.11 0.56 1.22 -1.33 -1.96 11.28 22.24 -15.03

2 1 -0.23 0.16 0.35 -0.58 -5.03 3.78 6.98 -8.27

2 -0.36 0.12 0.42 -0.78 -6.54 2.92 8.08 -10.34

3 -0.45 0.17 0.64 -1.09 -8.11 3.78 13.79 -12.85

3 1 -0.54 -0.11 -0.00 -0.54 -10.05 -2.77 -0.05 -7.55

2 -0.92 -0.15 0.24 -1.16 -13.88 -3.25 4.81 -13.70

3 -1.16 -0.23 0.33 -1.49 -17.41 -5.15 6.32 -14.59

Avg. -0.42 0.15 0.53 -0.96 -15.71 7.11 21.44 -21.13

Panel D: MktRF (ME × BEME)

Char-Portfolio pre-formation bMktRF-sorted portfolios

BEME ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 -0.10 0.01 -0.16 0.06 -1.25 0.10 -1.76 0.48

2 0.01 -0.05 -0.11 0.13 0.23 -0.79 -1.34 1.13

3 0.18 0.02 -0.14 0.32 2.94 0.47 -1.83 2.69

2 1 0.02 0.06 0.04 -0.02 0.33 1.09 0.62 -0.20

2 -0.01 -0.05 -0.16 0.15 -0.15 -0.74 -2.14 1.48

3 -0.06 -0.06 -0.25 0.19 -0.70 -0.82 -2.86 1.51

3 1 0.31 0.11 -0.12 0.43 4.82 2.14 -1.90 4.53

2 0.13 -0.01 -0.17 0.30 1.89 -0.14 -1.79 2.32

3 -0.06 -0.06 -0.12 0.06 -0.63 -0.75 -1.14 0.38

Avg. 0.05 -0.00 -0.13 0.18 1.21 -0.10 -2.94 2.37

post-formation bMktRF t(bMktRF )

1 1 0.86 1.01 1.18 -0.32 43.85 54.94 53.11 -10.35

2 0.90 1.08 1.23 -0.33 54.27 69.79 57.47 -11.38

3 0.88 1.01 1.17 -0.29 56.32 76.61 62.49 -9.79

2 1 0.74 0.94 1.13 -0.39 49.73 69.68 70.65 -15.97

2 0.79 1.02 1.24 -0.45 48.71 64.83 66.52 -17.74

3 0.79 1.02 1.22 -0.43 39.43 57.74 56.69 -13.55

3 1 0.65 0.91 1.21 -0.55 40.98 71.92 74.80 -23.35

2 0.78 1.07 1.33 -0.55 45.74 65.85 56.57 -17.11

3 0.83 1.05 1.29 -0.47 34.39 52.29 47.94 -11.33

Avg. 0.80 1.01 1.22 -0.42 81.52 152.09 108.71 -22.12

108

Panel E: MktRF (ME × OP)


OP ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 -0.07 0.01 -0.03 -0.03 -1.00 0.25 -0.45 -0.33

2 0.03 0.15 -0.04 0.07 0.45 2.08 -0.37 0.52

3 0.00 0.06 -0.09 0.10 0.05 0.71 -0.99 0.66

2 1 0.16 0.05 -0.17 0.34 2.47 0.86 -2.84 3.60

2 0.05 -0.05 -0.14 0.20 0.82 -0.85 -1.94 1.92

3 0.00 -0.01 -0.18 0.18 0.03 -0.19 -2.22 1.44

3 1 0.21 0.02 -0.16 0.38 2.79 0.36 -2.00 3.17

2 0.04 -0.08 -0.21 0.25 0.66 -1.28 -2.56 2.36

3 0.23 -0.04 -0.02 0.26 3.66 -0.73 -0.28 2.10

Avg. 0.07 0.01 -0.12 0.19 1.92 0.49 -2.70 2.57


1 1 0.72 0.97 1.20 -0.48 42.47 75.49 62.94 -18.18

2 0.82 1.07 1.26 -0.44 44.25 59.05 52.21 -13.14

3 0.85 1.07 1.25 -0.41 40.39 54.58 54.08 -11.22

2 1 0.69 0.92 1.13 -0.45 41.23 63.28 74.10 -18.97

2 0.76 1.02 1.20 -0.44 47.92 69.03 64.49 -17.30

3 0.80 1.02 1.20 -0.40 43.36 66.72 59.18 -12.59

3 1 0.83 1.00 1.26 -0.43 43.66 66.31 61.01 -14.37

2 0.89 1.10 1.29 -0.39 57.68 67.83 61.62 -14.63

3 0.86 1.00 1.15 -0.29 53.33 74.98 58.93 -9.66

Avg. 0.80 1.02 1.22 -0.41 81.95 165.09 111.34 -22.10

Panel F: MktRF (ME × INV)


INV ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 0.08 0.14 0.03 0.05 1.15 2.79 0.34 0.44

2 0.03 -0.06 -0.13 0.16 0.49 -1.01 -1.42 1.31

3 -0.01 -0.08 -0.15 0.15 -0.10 -1.19 -1.81 1.17

2 1 0.17 0.10 0.05 0.12 2.44 1.67 0.70 1.16

2 0.13 0.03 0.00 0.13 2.19 0.50 0.04 1.38

3 -0.00 -0.07 -0.08 0.08 -0.05 -1.23 -1.20 0.70

3 1 0.02 -0.11 -0.30 0.32 0.26 -1.91 -4.61 2.97

2 -0.02 0.02 -0.14 0.11 -0.34 0.35 -1.55 0.96

3 0.31 0.05 -0.08 0.39 4.63 0.87 -0.96 3.03

Avg. 0.08 0.00 -0.09 0.17 2.08 0.15 -2.10 2.28


1 1 0.74 0.98 1.23 -0.49 44.22 75.73 59.52 -17.70

2 0.82 1.05 1.31 -0.49 48.60 65.89 58.36 -15.97

3 0.87 1.08 1.23 -0.36 49.05 63.82 57.94 -11.48

2 1 0.67 0.89 1.11 -0.43 38.84 58.08 64.31 -16.62

2 0.77 0.99 1.20 -0.43 50.46 56.54 70.93 -17.96

3 0.79 0.96 1.14 -0.35 44.42 69.44 64.45 -12.15

3 1 0.81 1.00 1.20 -0.38 43.74 68.69 72.79 -14.16

2 0.89 1.09 1.25 -0.36 54.77 66.53 56.81 -12.16

3 0.86 1.07 1.24 -0.38 50.45 69.71 58.97 -11.72

Avg. 0.80 1.01 1.21 -0.41 84.05 167.65 113.64 -22.10

109

Panel G: SMB (ME × BEME)

Char-Portfolio pre-formation bSMB-sorted portfolios

BEME ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 -0.16 -0.07 -0.03 -0.13 -2.23 -0.98 -0.29 -1.05

2 -0.08 -0.02 -0.01 -0.07 -1.33 -0.32 -0.15 -0.78

3 0.04 0.05 0.03 0.01 0.96 0.93 0.36 0.15

2 1 0.00 -0.06 0.12 -0.12 0.02 -1.12 1.59 -1.16

2 -0.09 -0.12 -0.01 -0.08 -1.33 -2.04 -0.13 -0.85

3 -0.12 -0.07 -0.13 0.01 -1.57 -1.08 -1.63 0.10

3 1 0.17 -0.00 0.02 0.16 2.77 -0.09 0.26 1.57

2 0.04 -0.03 -0.03 0.07 0.52 -0.46 -0.36 0.59

3 -0.23 -0.03 0.10 -0.33 -2.83 -0.36 1.00 -2.39

Avg. -0.05 -0.04 0.01 -0.05 -1.53 -1.51 0.17 -0.93

post-formation bSMB t(bSMB)

1 1 0.91 1.14 1.37 -0.46 36.44 44.08 39.55 -10.52

2 0.40 0.64 0.91 -0.51 18.44 28.22 34.45 -15.74

3 -0.25 0.04 0.28 -0.53 -17.31 2.30 11.38 -15.93

2 1 0.68 1.00 1.38 -0.70 32.70 53.10 51.63 -19.17

2 0.28 0.52 0.80 -0.52 12.04 24.09 33.24 -16.25

3 -0.31 0.01 0.28 -0.58 -11.96 0.48 10.19 -15.35

3 1 0.71 0.97 1.35 -0.64 31.94 51.88 53.94 -18.31

2 0.31 0.58 0.87 -0.56 12.69 24.53 31.23 -14.27

3 -0.20 -0.01 0.31 -0.51 -6.84 -0.55 9.25 -10.48

Avg. 0.28 0.54 0.84 -0.56 25.83 57.37 63.01 -27.26

Panel H: SMB (ME × OP)


OP ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 -0.07 -0.06 -0.03 -0.05 -1.19 -0.95 -0.30 -0.41

2 0.05 0.10 -0.00 0.05 0.67 1.40 -0.02 0.44

3 -0.14 0.03 0.23 -0.37 -2.07 0.38 2.50 -2.88

2 1 0.05 -0.01 -0.11 0.16 0.81 -0.22 -1.61 1.73

2 -0.06 -0.06 -0.03 -0.02 -0.95 -1.02 -0.48 -0.26

3 -0.09 0.06 -0.01 -0.08 -1.62 1.03 -0.10 -0.82

3 1 0.07 0.06 -0.15 0.22 1.03 0.87 -1.69 1.93

2 -0.09 -0.10 -0.07 -0.02 -1.41 -1.57 -0.90 -0.21

3 0.09 0.02 -0.03 0.12 2.04 0.28 -0.39 1.17

Avg. -0.02 0.00 -0.02 0.00 -0.70 0.14 -0.58 0.02


1 1 0.83 1.16 1.42 -0.59 39.16 56.16 45.04 -15.33

2 0.27 0.62 0.83 -0.57 10.44 24.82 27.10 -14.22

3 -0.34 -0.01 0.19 -0.52 -14.46 -0.63 5.69 -11.54

2 1 0.63 0.94 1.25 -0.62 28.71 45.45 50.49 -18.49

2 0.30 0.57 0.84 -0.54 13.70 28.85 33.40 -16.78

3 -0.31 -0.00 0.23 -0.54 -16.23 -0.03 9.44 -15.52

3 1 0.81 1.03 1.24 -0.43 34.79 45.60 40.33 -10.93

2 0.41 0.62 0.91 -0.50 19.15 27.18 34.89 -16.06

3 -0.24 0.03 0.35 -0.59 -15.84 1.32 13.14 -16.62

Avg. 0.26 0.55 0.81 -0.55 25.26 59.68 61.37 -27.85

110

Panel I: SMB (ME × INV)


INV ME 1 2 3 1-3 1 2 3 1-3

α t(α)

1 1 0.08 0.03 0.08 0.00 1.32 0.60 0.80 0.01

2 -0.10 -0.06 0.01 -0.11 -1.42 -0.96 0.18 -1.02

3 -0.12 -0.06 -0.03 -0.09 -1.96 -0.91 -0.45 -0.78

2 1 0.13 0.07 0.07 0.06 2.00 1.15 0.95 0.58

2 0.07 -0.01 0.12 -0.05 0.96 -0.15 1.86 -0.57

3 -0.10 0.01 0.07 -0.17 -2.01 0.21 1.01 -1.75

3 1 -0.14 -0.15 -0.21 0.07 -2.18 -2.58 -2.90 0.68

2 -0.04 -0.05 -0.04 0.00 -0.62 -0.74 -0.49 0.00

3 0.10 0.17 0.09 0.01 2.10 2.78 1.10 0.11

Avg. -0.01 -0.01 0.02 -0.03 -0.43 -0.21 0.50 -0.55


1 1 0.79 1.17 1.46 -0.67 36.11 59.36 41.59 -15.99

2 0.29 0.58 0.92 -0.64 11.74 25.18 32.60 -16.22

3 -0.26 -0.01 0.20 -0.46 -11.57 -0.33 7.38 -11.50

2 1 0.65 0.91 1.32 -0.67 28.13 44.90 49.78 -18.20

2 0.26 0.52 0.77 -0.51 10.93 25.60 33.92 -15.39

3 -0.28 0.00 0.26 -0.54 -16.71 0.20 10.35 -15.94

3 1 0.84 1.04 1.31 -0.46 36.75 50.52 51.44 -13.14

2 0.43 0.63 0.95 -0.52 20.29 28.37 34.83 -16.07

3 -0.28 0.09 0.35 -0.64 -16.25 4.25 12.18 -16.29

Avg. 0.27 0.55 0.84 -0.57 25.95 62.36 65.07 -27.97

The table shows alphas and loadings from time-series regressions of monthly excess returns of the loading-sortedportfolios on the five Fama and French characteristic portfolios from 1963/07 - 2019/12. The column labeled ‘1-3’shows the alphas/loadings of long low-loading short high-loading hedge-portfolios. The last row shows averages ofall 9 loading-portfolios.

111

Table 2.4: Results of time-series regressions of hedge-portfolios

Hedge-Portfolio Avg. α bMkt−RF bSMB bHML bRMW bCMA R2

rh,MktRF 0.10 -0.18 0.41 0.40 0.05 -0.17 -0.06 0.66

(0.80) (-2.44) (22.39) (15.18) (1.48) (-4.68) (-1.15)

rh,SMB 0.17 0.03 0.17 0.56 -0.01 -0.15 -0.16 0.72

(1.74) (0.50) (12.27) (28.28) (-0.33) (-5.57) (-3.95)

rh,HML 0.07 -0.11 0.03 -0.05 0.80 0.20 -0.54 0.61

(0.74) (-1.86) (1.80) (-2.34) (28.21) (6.68) (-12.03)

rh,RMW 0.08 -0.21 -0.05 0.04 0.31 0.69 0.11 0.65

(0.86) (-3.66) (-3.27) (1.96) (11.69) (24.80) (2.51)

rh,CMA -0.04 -0.20 0.04 0.02 -0.31 0.09 0.96 0.43

(-0.52) (-3.39) (2.60) (1.10) (-10.95) (2.90) (21.13)

EW3 0.04 -0.17 0.01 0.00 0.27 0.32 0.17 0.70

HML,RMW,CMA (0.64) (-5.45) (0.83) (0.39) (17.52) (20.56) (7.30)

EW4 0.05 -0.18 0.11 0.10 0.21 0.20 0.12 0.58

EW3+MktRF (1.17) (-5.92) (14.60) (9.75) (15.08) (13.71) (5.18)

EW5 0.07 -0.15 0.10 0.15 0.19 0.17 0.08 0.57

EW4+SMB (1.57) (-5.01) (14.08) (14.86) (13.60) (11.65) (3.83)

Stocks are first sorted based on size and one of book-to-market, profitability or investment into 3x3 portfolios. Con-ditional on those sorts, they are subsequently sorted into 3 portfolios based on the respective loading, i.e., on HML,RMW or CMA. For MktRF and SMB we use the average of three hedge portfolios, which are based on a 3x3 sort onsize and book-to-market, profitability or investment. The hedge portfolio then goes long the high loading and short thelow loading portfolios. On the bottom, we form combination-portfolios that put equal weight on three (HML, RMW,CMA), four (HML, RMW, CMA, MktRF) or five (HML, RMW, CMA, MktRF, SMB) hedge portfolios. Monthlyreturns of these portfolios are then regressed on the 5 Fama and French (2015) characteristic portfolios in the sampleperiod from 1963/07 - 2019/12.

112

Table 2.5: Sharpe Ratio improvement.

p-values

rc r∗c rc−ind (rc to r∗c ) (rc to rc−ind) (r∗c to rc−ind)

HML

Mean 3.64 2.51 2.59 0.21 0.30 0.91

Vol 9.64 5.87 5.18 <0.01 <0.01 <0.01

SR2 0.14 0.18 0.25 0.09 0.01 -

RMW

Mean 3.22 2.56 2.33 0.38 0.25 0.67

Vol 7.76 5.05 5.79 <0.01 <0.01 0.91

SR2 0.17 0.26 0.16 0.02 - 0.02

CMA

Mean 2.64 2.37 2.13 0.67 0.49 0.60

Vol 6.50 4.30 3.97 <0.01 <0.01 0.02

SR2 0.16 0.30 0.29 0.01 <0.01 0.06

SMB

Mean 2.81 1.92 2.84 0.29 0.97 0.22

Vol 10.24 6.48 8.26 <0.01 <0.01 <0.01

SR2 0.08 0.09 0.12 0.34 0.12 -

MktRF

Mean 6.62 6.14 - 0.77 - -

Vol 15.09 10.45 - <0.01 - -

SR2 0.19 0.35 - <0.01 - -

In-sample optimal combination

Mean 3.50 2.83 2.59 0.05 0.01 0.27

Vol 3.24 1.94 2.21 <0.01 <0.01 0.01

SR2 1.17 2.13 1.37 <0.01 <0.01 <0.01

In-sample optimal combination (without MktRF)

Mean 2.91 2.42 2.35 0.21 0.18 0.76

Vol 3.67 2.06 2.19 <0.01 <0.01 0.10

SR2 0.63 1.37 1.15 <0.01 <0.01 <0.01

We report the average return and return volatility (annualized, and in percent) and the corresponding annualizedsquared Sharpe-ratio for different versions of each of the five characteristic and characteristic efficient portfolios.rc are the returns of the characteristic portfolios. r∗c are the returns of the characteristic efficient portfolios calculatedas in equation (2.33). rindc are the returns of the industry-neutral characteristic portfolios, where, for the first fourcharacteristic portfolios, we ex-ante hedge out 12 FF industries exposure. As the industry portfolios explain almost100% of the market portfolio, we do not calculate an industry-neutral version of the market. The last three columnsdepict tests of differences between two portfolios, i.e., CPs to CEPs, CPs to industry-neutral CPs, and industry-neutralCPs to CEPs. For the mean, we report the p-value from a t-test of equal means. For the volatility, we report thep-value from Levene’s test of equal variances. In the Sharpe ratio row, we report the p-value of the α from regressingthe second on the first portfolio. If the α is negative, we report a “-” for the p-value. The second to last panel reportsthe statistics for the in-sample Markowitz optimal combination of the five original CPs, the CEPs, and the industry-neutral portfolios. The last panel repeats the exercise, excluding the MktRF portfolio. The sample period is 1963/07- 2019/12. 113

Table 2.6: Ex-post optimal Markowitz weights

rc r∗c

CMA 0.40 0.34

HML -0.02 0.08

MktRF 0.17 0.12

RMW 0.34 0.33

SMB 0.12 0.13

We report the weights on each of the five characteristic portfolios from a full-sample ex-post Markowitz optimization.The first column reports results for the original five characteristic portfolios, and the second column for the fivecharacteristic efficient portfolios. The sample period is 1963/07 - 2019/12.

Table 2.7: Spanning tests for HML

Portfolio HML HML HML∗ HML∗

α -0.02 (-0.28) 0.00 (0.03) 0.15 (2.61) 0.09 (1.78)

bMktRF 0.03 (1.56) -0.01 (-0.70)

bSMB 0.04 (1.37) 0.03 (1.52)

bRMW 0.26 (6.68) -0.14 (-5.21)

bCMA 1.05 (22.93) 0.44 (14.06)

bMktRF ∗ 0.10 (2.96) 0.02 (1.37)

bSMB∗ 0.10 (1.88) 0.05 (1.83)

bRMW ∗ 0.12 (1.51) -0.25 (-6.72)

bCMA∗ 1.04 (11.30) 0.79 (17.86)

R2 0.47 0.19 0.32 0.50

We regress the returns of the original HML characteristic portfolio, rc,HML, (first 2 columns) as well the returns on theHML characteristic efficient portfolio, r∗c,HML, (columns 3 and 4) on the returns of the remaining four characteristicand characteristic efficient portfolios. The sample period is 1963/07 - 2019/12.

114

Table 2.8: Correlations

Panel A: CPs

MktRF SMB HML RMW CMA

MktRF 1.00 0.26 -0.24 -0.24 -0.35

SMB 0.26 1.00 -0.04 -0.35 -0.04

HML -0.24 -0.04 1.00 0.09 0.66

RMW -0.24 -0.35 0.09 1.00 -0.13

CMA -0.35 -0.04 0.66 -0.13 1.00

Panel B: CEPs

MktRF ∗ SMB∗ HML∗ RMW ∗ CMA∗

MktRF ∗ 1.00 -0.27 -0.17 0.15 -0.28

SMB∗ -0.27 1.00 0.18 -0.26 0.15

HML∗ -0.17 0.18 1.00 -0.51 0.68

RMW ∗ 0.15 -0.26 -0.51 1.00 -0.49

CMA∗ -0.28 0.15 0.68 -0.49 1.00

The table shows the correlations of monthly excess returns among the CPs (Panel A) and CEPs (Panel B). The sampleperiod is 1963/07 - 2019/12.

115

Chapter 3

Should Information be Sold Separately? Evidence from MiFID II 1

3.1 Introduction

A key to efficient outcomes in financial markets is efficient information production. The pro-

duction of information, both in terms of quantity and quality, depends on how it is compensated.

We study how changes in the compensation schedules for information affect its production through

the lens of sell-side research.

The compensation schedule for sell-side research has recently changed dramatically. Sell-

side research has traditionally been bundled with transactions: rather than being sold separately,

it was cross-subsidized by trading commissions. However, a new regulation in Europe (MiFID

II, the second Markets in Financial Instrument Directive) unbundles research from transactions.

Implemented on January 3rd, 2018, MiFID II forces asset managers to separate payments for sell-

side research from trading commissions. This change allows us to study how the compensation

schedules for sell-side research affect its production, a question difficult for previous empirical

research to address due to the lack of economically significant variation in the business model of

the sell-side research industry.

Specifically, we ask: What happens to firms’ analyst research quantity and quality? What are

1This chapter is based on Guo and Mota (2020). We thank Charles Angelucci, Simona Abis, Olivier Darmouni,Xavier Giroud, Larry Glosten, Juan-Pedro Gómez (the discussant), Harrison Hong, Shiyang Huang (the discussant),Wei Jiang, Luca Mertens, Sophie Moinas, Tano Santos, Paul Tetlock, Tuomas Tomunen, Laura Veldkamp, KairongXiao, conference participants at EFA 2019, SFS Cavalcade Asia-Pacific 2019 and seminar participants at AQR CapitalManagement, City University of Hong Kong, Columbia Business School, Shanghai Advanced Institute of Finance,and Singapore Management University for their insightful and helpful suggestions. We acknowledge support by theChazen Institute for Global Business at Columbia Business School and Deming Center Doctoral Fellowship.

116

the underlying mechanisms driving the changes? Answers to these questions are crucial for deter-

mining the optimal regulation of financial analysts, and more broadly, for designing the optimal

way to pay for information.

The main goal of MiFID II is to protect end investors by improving market transparency. Before

MiFID II, trading commissions were opaque and it was difficult for end clients to fully assess the

services they were paying for. This generates incentives to assets manager to over charge clients for

transaction fees, potentially under-reporting their own management fees (Di Maggio et al. (2019)).

Therefore, one provision of MiFID II is to require asset managers to bill research services clearly,

which requires unbundling the payments for research from trading.2

As a result, unbundling fundamentally changes how research is compensated and, a priori, its

consequences are unclear. On the one hand, unbundling potentially results in less analyst research

covering fewer companies.3 This is due to the nonrival nature of information. In general, produc-

ing research is costly, but once it is produced and revealed to one party, it is difficult to exclude

others from using the same information. Therefore, directly selling research could result in market

failures (see e.g. Grossman and Stiglitz (1980), Admati and Pfleiderer (1986) and Romer (1990)).

Traditionally, brokerage houses overcome this issue by cross-subsidizing the high fixed cost of

research with transactions fee and marketing to investor that research was given for “free". Un-

bundling prevents cross-subsidizing, potentially causes a marked decrease in research production.

Furthermore, a reduction in the coverage quantity could then cause a decrease in the analyst peer

pressure, which impairs coverage quality (e.g., Hong and Kacperczyk (2010)).

On the other hand, unbundling could improve analyst incentives to produce better research.

Previous studies have shown that when research is bundled with transactions, research analysts

may strategically devote more effort to researching firms that generate more trading commissions

(Harford et al. (2018)). Analysts may even have incentives to produce inaccurate and biased fore-

casts to lure trading business in house (e.g., Hong and Kubik (2003), Fang and Yasuda (2009),

Hong and Kacperczyk (2010) and Karmaziene (2019)). By making research a standalone product,

2We discuss in detail why EU regulators enforce MiFID II in the next section.3See “Mifid II leads to exodus of sellside analysts" Financial Times, June 23rd, 2018.

117

unbundling could restore analyst incentives and improve research quality.

We find that the number of unique sell-side analysts covering a firm decreases after unbundling.

We perform our analyses in a difference-in-difference setting, exploring the different exposures to

unbundling between EU public firms and US public firms. We define EU public firms as the treat-

ment group and US public firms as the control group. In our main specification, analyst coverage

of EU firms changes by −0.651 analysts, which translates into a decrease of 7.45% relative to

the average coverage of these firms prior to the regulation. Contrary to the media and industry

concerns, this average drop does not come from small- or mid-cap firms but is concentrated in

large firms.4 Small firms’ coverage remains almost unchanged and large firms’ coverage drops by

10.53% on average.

Strikingly, we find that the unbundling causes coverage quality to improve. For example, mea-

sured by forecast error, coverage quality of affected firms on average increases by 19.19% relative

to the average coverage quality of these firms prior to the regulation. This result differs from

previous literature (e.g., Hong and Kacperczyk (2010) and Merkley et al. (2017)), which shows

a decrease in the coverage quantity implies a reduction in the coverage quality.5 One plausible

explanation for the difference is that unbundling strengthens analyst competition. Analysts who

sell research as a stand-alone product are likely to compete more directly in the quality domain.

Inferior analysts being competed out of business could account for the simultaneous decrease in

the coverage quantity and increase in the coverage quality.

To be more specific, at least two economic forces foster competition. First, unbundling forbids

asset managers to pass research costs to end investors through trading commissions. Most asset

managers have decided to charge research costs against their own profit and loss. Internalizing

research cost makes asset managers be much more selective and opt for better research. Sec-

ond, unbundling puts an explicit price on research. The evaluation scheme prior to the regulation

4See “French watchdogs calls for rethink of research rules". Financial Times, November 27th, 2018.5In their setting (i.e., under bundling), many analysts have other objectives (e.g., generating trading commissions)

in addition to quality. The quality of research relies on a few independent and impartial analysts who put quality inthe first place. The decrease in the quantity implies both a reduction in the peer pressure and a high propensity to losethese impartial analysts. As a result, research quality worsens.

118

presents a problem similar to moral hazard in teams. An individual’s evaluation is not directly tied

to an individual’s output, but to a bundled output consisting of research and transactions. How-

ever, under unbundling, analysts are evaluated directly by the services they provide. They have

stronger incentives to provide better research. As a result of those two forces, analysts compete in

the quality domain to attract and maintain clients.

We find evidence of two specific predictions of the competition channel: 1) analysts who pro-

duce worse research are more likely to be forced out of the analyst market (extensive margin) and

2) analysts who stay produce better research (intensive margin). For example, using ranks gener-

ated by analyst average forecast error (Hong and Kubik (2003)), we show that analysts who provide

worse forecasts (low-ranked analysts) are more likely to stop working. Conditional on staying, af-

fected analysts’ average forecast error also decreases. Although in a different setting, the beneficial

effects of unbundling in our paper could be related to Edelen et al. (2012). They study the effects

of unbundling distribution fees from commissions and find that unbundling imposes transparency,

mitigates agency conflicts and improves fund performance.

As extensions, we show that individual analyst forecast revision generates a more substan-

tial absolute market-adjusted abnormal return. This evidence suggests that individual forecasts

become more informative, which echos our findings on the coverage quality improvement. Inter-

estingly, when we turn to the sum of abnormal returns over all the individual forecast revisions,

we find that aggregate abnormal returns decrease after unbundling. This result suggests that the

total amount of new information in all the analyst forecast revisions decreases. We interpret these

results via the trade-off between forecast quantity and quality. While individual forecasts become

more informative, the total amount of valuable forecasts decreases.

We also find that the effects of unbundling may not be homogeneous across brokerage houses.

We document that large brokerage houses are likely to be more affected and lay off more analysts.

To the extent that brokerage houses affiliated with large banks are larger in size, the result implies

that these types of houses, whose research featured substantial bundling before the regulation, are

more likely to be affected. This evidence suggests that certain types of brokerage houses could find

119

it more difficult to survive under unbundling and that the market structure of the sell-side industry

may begin to change.

On the aggregate level, we find no evidence that firms’ earnings announcements convey more

information, nor do we find evidence on the widening of firms’ bid-ask spread. Although these

results do not contradict our findings on coverage quality improving, the aggregate market effect

of MiFID II does not seem to manifest itself immediately. Since the reform is relatively recent, we

expect that more time is needed to assess its overall impact on the capital markets. As a final note,

due to data availability, systematic analyses on the welfare impact of unbundling falls out of the

scope of our paper and is left for future studies.6

Our paper makes three contributions. To begin with, we are the first to investigate both the

impact and the underlying mechanism of unbundling on analyst research production. The output

of this project is crucial to resolve the vivid debate among academics, policymakers and practi-

tioners about the consequences of unbundling imposed by MiFID II. Furthermore, investigating

unbundling provides valuable information to policy makers and helps to shape future policy design

in other jurisdictions in addition to Europe. As US Security Exchange Commission chairman Jay

Clayton pointed out:7

“It is important to have data and other information about how MiFID II’s research

provisions are affecting broker-dealers, investors and small, medium, and large issuers,

including whether research availability has been adversely affected.”

Second, our paper contributes to the literature on analyst behavior. It has been long established

that analyst forecasts are on average optimistically biased due to incentive problems (e.g., Dre-

man and Berry (1995), McNichols and O’Brien (1997), and Chopra (1998) and Hong and Kubik

(2003)). Fang and Yasuda (2009), Hong and Kacperczyk (2010) and Kempf (2019) convincingly

show that implicit mechanisms such as career concerns and peer competition discipline analyst be-6As an example, unbundling may also change the pricing of transactions, which plays an important role in analyz-

ing investors’ welfare. However, systematic information on commission fees is hard to obtain, especially in Europe.Though some surveys suggest commission fees are dropping, a comprehensive analysis of commissions and investor’swelfare falls out of the scope of our paper.

7See SEC Press Release: https://www.sec.gov/news/press-release/2018-301.

120

https://www.sec.gov/news/press-release/2018-301

havior. Our analyses depart from, but also complement, these prior studies by showing that direct

market competition helps to discipline analysts even further. A unique finding in our paper is that

a decrease in the coverage quantity can be accompanied by an increase in the coverage quality due

to the strengthening of analyst competition.

Third, unbundling offers a unique setting through which we can study the optimal way to pay

for information (e.g., Veldkamp (2006b), [2006a], Van Nieuwerburgh and Veldkamp (2010)). In-

formation generally has positive externalities, and its production features high fixed costs. Bundling,

though creating incentive problems, covers the high fixed cost and allows for more information

production. It potentially enables easier and wider access to information. Unbundling, on the

other hand, restores the incentive problems but may result in the market unraveling and the under-

provision of certain information. Our results suggest that the improvement in the information

quality by selling information separately could come at a cost of reducing information quantity.

In the accounting literature, more recent works study MiFID II, mainly focusing on its effects

at the analyst level or the firm level (Fang et al. (2019), Lang et al. (2019)). What differs in our

work is that we construct a sample based on analyst-firm pairs, which allows us to identify the

mechanism for the drop in coverage quantity and the improvement in coverage quality. Our unique

contribution is to show that after the regulation, analyst competition is enhanced. We present clear

evidence that inaccurate analysts drop out, and that the ones who stay produce better research.8

Di Maggio et al. (2019) also studies unbundling using detailed transaction-level data in the US.

They emphasize the conflicts of interests between asset managers and end clients. They show that

bundling allows asset managers to under-report management fees. Our paper has a different focus.

We study how unbundling affects sell-side research provisions. Via investigating unbundling, we

aim to understand the effects of changing compensation schedule on information provision.

The remaining of the paper proceeds as follows. In Section 3.2, we provide regulation back-

ground. In Section 3.3, we develop our testable hypotheses. In Section 3.4, we describe our

8Another difference from Lang et al. (2019) is that we use annual data instead of quarterly data. Since we empha-size causality and mechanism, annual data allows us to mitigate concerns specific to the use of quarterly data that mayconfound our identification. These include: forecast seasonality, poor data coverage, selection bias, and staleness. De-tailed explanation can be found in Appendix C.1.2.

121

empirical design and the data we use. We present the firm level analyses in Section 3.5, analyst

analyses in Section 3.6, conduct complementary analyses Section 3.7 and conclude in Section 3.8.

3.2 Regulation background

MiFID II attempts to ensure a high degree of harmonized protection for investors in financial

instruments by improving market transparency and competitiveness. It is considered as one of the

most influential regulatory changes in the EU in response to 2008 financial crisis.

The unbundling provision in MiFID II prohibits asset managers from accepting “fees, com-

missions or any monetary or non-monetary benefits paid or provided by a third party”.9 However,

it provides an exception for research services provided by third parties as long as the asset man-

agement firm pays for it directly. Asset managers can either pay for research directly (against the

firm’s profit and loss) or pass costs onto clients by setting up a research payment account (“RPA”)

under the consent of clients.10 Whichever method asset managers choose, payments for research

needs to be separated, or unbundled, from payments for transactions (Figure 3.1b). According to

a survey by CFA Institute, the majority of asset managers pay for research directly, mainly due to

competition pressure from the whole asset management industry.11 With management fees declin-

ing and asset managers absorbing research costs internally, CFA institute expects a greater focus

on profitability and efficiency concerning research procurement.

Before MiFID II, analyst research was provided “for free" in exchange for trading or investment

banking business. The cost of research was cross-subsidized by trading commissions. Trading

commissions were usually passed onto asset managers’ clients who were unaware of the embedded

research cost (Figure 3.1a). This practice is known as bundling or payment via “soft dollars",

Bundling traces its origin to 1950’s. At that time, commission fees used to be fixed at a high

level (around 1% per share in the US before the May Day 1975) and brokerage houses compete

9Article 24(7) of MiFID II Directive and Article 11 of Commission Delegated Directive (EU) 2017/59310Even with RPA, clients are fully aware of the research cost. They are able to opt out purchasing research if they

choose to.11“MiFID II, One Year On" CAF Institute, 2019.

122

for clients by providing “free" research services (Jones (2002)). Gradually, bundling also becomes

demand-driven (Blume (1993) and Egan (2018)): Asset managers benefit from bundling.12 “Soft

dollars" are borne ultimately by end-investors. Payments for research in the form of trading com-

missions are not reported in the expense ratio and are not disclosed in the management fees. Hence,

“soft dollars" are hidden cost for end clients. On top of that, the provision of “free" supplementary

research services by the executing brokers – such as analyst research reports, corporate access,

or other non-monetary benefits – creates incentives for asset managers to route the trades to that

broker and the potential to trade more often than is appropriate for the clients. It may also preclude

the use of other brokers who may provide better execution services.

Although the practice could hurt end investors, in 1975, the soft-dollar industry lobbied the

US Congress to amend the 1934 Securities Exchange Act by adding Section 28 (e). This section

allows asset managers in the US to pay for research with soft dollars from client commissions

without breaking their fiduciary duty. Member states of the EU essentially followed the same

practice before the onslaught of MiFID II.

EU regulators are increasingly concerned that bundling practice hurts end investors and overly

favors asset managers. To protect end investors and promote transparency in the fee structure, they

mandate unbundling as part of MiFID II. A subsequent effect of unbundling becomes manifested

in the sell-side research industry. Unbundling completely changed the way sell-side research is

compensated. It provides us a valid and timely setting where we can study how the compensation

schedules for sell-side research affect its production.

The MiFID II regime, on its face, applies only to asset managers with a physical presence or

domicile in the European Economic Area (EEA). This includes US asset managers with an au-

thorized European subsidiary providing investment services to clients in the EEA. But given the

regime’s complexity and wide-ranging reach, the key provision extends its influences globally. For

example, US broker-dealers are indirectly affected by MiFID II if they provide investment research

12This is different from bundling in the traditional Industrial Organization literature. In many IO studies, bundlingis a way for the supply side to price-discriminate consumers and extract consumer surplus. See a summary in Tirole(1988).

123

services to EEA firms or EEA clients, or to US asset managers that provide services to EEA clients.

Receiving payments for research directly forces them to be registered as investment advisers sub-

ject to more strict regulation. Foreseeing the important consequences to US asset management

business, in October 2017, the SEC issued a temporary no-action relief to “reduce confusion and

operational difficulties that might arise in the transition to MiFID II’s research provisions" valid

until 2020.13 The relief allows US broker-dealers to receive direct payments from EEA clients

without being registered as financial advisers.

European Parliament began to discuss MiFID II in May 2014 and plan to apply it on January

3rd, 2017. Due to widespread concerns that the infrastructure would not be ready, the EU parlia-

ment decided to allow for a one-year delay for the launch of the regulation. On June 30, 2016, the

Official Journal of EU announced that MiFID II was supposed to be translated into national law by

July 3rd, 2017 and become fully applicable as from January 3rd, 2018.

It is worth noting that unbundling is only one of many important provisions of MiFID II.

For example, under MiFID II, post-trade transparency is extended to non-equity instruments and

post-trade information needs to be made public close to real time. High frequency trading firms

are required to set order limits and use circuit breakers to limit or temporarily halt trading to avoid

erroneous orders.14 However, unbundling is the most relevant and direct provision affecting analyst

research. It is possible that other provisions in MiFID II also affect some part of analyst research,

but as will be clear in the latter part of the paper, it would very difficult for other provisions to

account for all the findings in our paper both at the firm level and at the analyst level. The impact

of MiFID II on analyst research production arguably comes first and mostly from unbundling.

13See SEC no-action relief, October 26th, 2017.14See a summary by Bank of America Merrill Lynch: https://www.bofaml.com/en-us/content/mifid-ii-regulation-

summary-requirements.html

124

3.3 Hypotheses development

We begin by examining the effects of unbundling on research quantity. Research is a cost center

and is difficult to price (e.g., Romer (1990)). Being unable to cross-subsidize the cost of research

from transactions could force brokerage houses to scale back the amount of analyst research they

produce. More importantly, if investors, who attach little value to research (e.g., passive investors),

opt out from purchasing it, the total amount of wealth that can be allocated to research payments

will further decrease. Consequently, using analyst coverage as a measure for research quantity, we

anticipate that:

H1: Sell-side analyst coverage of firms in affected regions will decrease compared with the

coverage of firms in unaffected regions.

We further test whether the coverage of small firms and large firms are differentially affected,

which is the main concern of unbundling raised by market participants. We argue that large firms

are more likely to be affected:

H2: In affected regions, coverage of large firms will decrease MORE than of that of small

firms.

To understand the logic, notice that large firms usually have a larger amount of coverage to

begin with. The large number of research analysts covering a certain firm probably implies that

a lot of work was of low quality. Under unbundling, asset managers, who internalize research

costs in most cases, will have to be much more selective to purchase research. Under such cir-

cumstances, we expect that asset managers opt for better research and discard low quality services.

This indicates that large firms will lose more coverage.

The above hypothesis goes directly against a widespread concern in the market. Practition-

ers believe that small- and medium-sized firms will suffer more because they benefit the most

from cross-subsidization. Trading commissions collected from trading firms with larger demand

(large firms) are used to cover the research cost of small firms. This is no longer allowed under

15See “The EU’s Unbundling Directive is Reinforcing the Power of Scale". The Economist, January 5th, 2019.

125

unbundling.15 Robert Ophèle, chairman of the Autorité des Marchés Financiers of France, com-

mented that MiFID II has very detrimental effects on research, especially for mid-caps and that

Europe was engaged in a “dangerous game" as research capacity was being pared back for many

smaller companies.16 The answer to this apparent conflict in the priors should be an empirical one

and is worth investigating.

We then move to study the impact of unbundling on research quality. We argue that one of

the main drivers for the change in the quality is analyst competition. Unbundling makes analyst

compete more directly in the quality domain, and the overall research quality will improve:

H3: Coverage quality for firms in affected regions will increase compared with unaffected

regions.

There are at least two economic forces fostering competition. To begin with, asset managers

can no longer blindly pass research costs to end clients. Paying for research out of their own pock-

ets in most cases, they will be selective and opt for the best research.17 On top of that, unbundling

changes the evaluation schemes for research analysts. Before the regulation, research was bun-

dled with transactions. Research analysts used to have other incentives in addition to accuracy and

quality (e.g., generating fees). Under unbundling, analysts are evaluated and compensated directly

by the services they provide. Their incentives to provide high-quality research are strengthened.

The two forces imply that analysts will compete more directly in the quality domain as they try

to attract and maintain clients. It is then natural to anticipate that the overall research quality will

improve.

It is worth noting that H1 and H3 altogether deviate from previous literature. It has been shown

that coverage quality decreases together with coverage quantity (e.g., Hong and Kacperczyk (2010)

and Merkley et al. (2017)). According to this view, the potential drop in the analyst coverage after

MiFID II will imply the deterioration of the analyst coverage quality. We believe that the difference

lies precisely in whether research is bundled or not. Under bundling, analysts are disciplined by

16“French watchdog calls for rethink of research rules" Financial Times, November 27th, 2018.17Even with RPA, asset managers will still be careful about quality because end clients can veto the purchase of

unnecessary research.

126

their independent and impartial peers. A decrease in the coverage quantity implies not only a

reduction in the peer competition but also an increase in the propensity to lose these impartial

analysts. Consequently, coverage quality deteriorates. The unbundling requirement of MiFID II

fundamentally changes how sell-side research is evaluated. For the first time, research analysts

have to work out the value of their research, which ties closely to the quality. Fighting for the

limited resources from the buy-side, they compete more directly in the quality domain and their

incentives to provide better research are enhanced, regardless of the total number of analysts. The

urge for independent and impartial peers as a discipline device is weakened.

If analysts compete more directly in the quality domain, we expect that the improvement in

the coverage quality comes from two margins: 1) conditional on not exiting the analyst market,

analysts produce better research (intensive margin); 2) everything else being equal, inaccurate

analysts are more likely to exist the analyst market (extensive margin). “The analyst community

will [split] into those really good analysts — who will be able to earn more than they currently do

— and the average ones who will lose out.”18 These arguments lead to our fourth hypothesis:

H4: Everything else being equal, analysts who stay provide better research than before and

analysts who produce worse research are more likely to drop out.

Importantly, we are not trying to argue that the competition among research analysts was non-

existent before the regulation. In fact, competition has always existed. For example, the ranking

provided by Institutional Investors on an annual poll is a critical metric to access analysts’ prestige.

Career concerns and peer pressure have also served to discipline analysts (Hong and Kubik (2003),

Hong and Kacperczyk (2010)). But compared with unbundling, these are all indirect mechanisms

disciplining analysts. A direct mechanism such as unbundling, which puts an explicit price on the

research services, strengthens the competition and enhances analyst incentives to produce better

research.

18Laurence Hollingworth, former vice-chairman of EMEA investment banking at JPMorgan. “Mifid II and thereturn of the ‘star’ analysts" Financial Times, February 26th, 2018,.

127

3.4 Empirical design and data

3.4.1 Empirical design

We employ a difference-in-difference strategy. The strategy calls for proper treatment and

control groups, which is not straightforward in the case of MiFID II. Unbundling in MiFID II

puts constraints on asset managers, not on analysts nor publicly traded firms. It affects the trading

relationship between asset managers and brokerage houses. However, systematic information on

the trading relationship between the two parties is not available to researchers, especially in Europe.

We have instead systematic information on analysts and publicly traded firms they cover.

To investigate the impact of unbundling on analyst research and test the hypotheses with the

data we have, we leverage a well-established fact: local investors portfolios mostly consist of local

securities (Home Bias).19 This implies that analysts covering EU firms are more likely to serve EU

clients who are subject to MiFID II. In other words, these analysts are more likely to be affected.

This further implies that coverage outcomes of EU firms domiciled and listed in the EU, the main

targets of these analysts, are more likely to be affected as well.

We define the treatment group to be analysts covering mostly EU firms and firms domiciled and

listed in Europe, and the control group to be analysts covering mostly US firms and firms domiciled

and listed in the US. In our sample, analysts’ portfolios are extremely concentrated: in most cases,

they focus either exclusively on EU firms or US firms (See Figure C.1 in C.2). We then define EU

analysts to be the analysts whose portfolios consist of at least 70% of EU firms and US analysts to

be analysts whose portfolios consist of at most 30% of EU firms. The heterogeneous impact of the

regulation on analysts and firms between the two regions is the main variation through which we

identify the impact of unbundling.20

19See French and Poterba (1991), Coval and Moskowitz (2001), Ahearne et al. (2004) and Van Nieuwerburgh andVeldkamp (2009).

20In theory, if an EU fund manager trades a US stock through a brokerage house, the US stock would be indirectlyaffected by the MiFID II. This leads to the absence of a perfect control group. What we explore here is the difference inthe exposure of the treatment: EU analysts and EU firms are more exposed to the treatment than their US counterparts.

128

The identifying assumption is that, in the absence of MiFID II, coverage outcomes of the

treatment group and the control group would have maintained parallel trend. The good news is that

unbundling aims to impose transparency and protect investors. It is not a response to prior events

regarding analyst research. This lends us plausible exogenous variations. Nevertheless, a cross-

country comparison still calls for careful examinations of endogeneity issues that may violate the

identification assumption. We consider a myriad of robustness checks to allow for the possibility

of time-varying heterogeneity across differentially treated groups, including splitting post dummy

into year dummies, adding timing-varying fixed effects and conducting several placebo tests. All

the evidence suggests that the parallel-trend assumption holds and our empirical results are causal.

3.4.2 Data

Our data on analyst forecasts comes from I/B/E/S detail files. We focus on forecasts of annual

earnings per share (EPS) in the current fiscal year since it is the most commonly issued forecasts

in the I/B/E/S dataset and has the widest coverage. For each firm, we take the most recent forecast

by each analyst to account for staleness issues. As a result, we have one forecast issued by one

analyst for one firm in a given fiscal year. The sample selection is consistent with a large number

of papers in the literature (e.g., Hong and Kacperczyk (2010) and Giroud and Mueller (2011)).

Annual balance sheet information comes from Worldscope. Stock exchange and price infor-

mation come from Datastream. Since all three datasets are provided by Thomson Reuters, we are

able to merge them using the unique I/B/E/S identifier. MiFID II went into formal discussion in

the European Parliament in May 2014. To assess its impact, we choose our sample period from

fiscal year 2014 to fiscal year 2018.21 Details of the construction of firm level observations can be

found in Appendix C.1.1.

To account for IPOs and delistings which mechanically affect analyst coverage, we construct a

sample in which all firms have valid accounting and price information from 2014 to 2018. In other

21Firms with fiscal-year end in December 2018 usually have valid financial information in early 2019. Our dataruns until the calendar year May 2019.

129

words, all firms in our sample survived during the entire period. Moreover, we only consider firms

that appeared at least once in the I/B/E/S dataset during the relevant time period. We put zero’s

for analyst coverage when a firm is not covered by any analysts in a given year, hence not present

in the I/B/E/S data set. Our final sample has 21, 960 firm × year observations with 4, 392 firms

in each year. Out of all the firm × year observations, 486 of them do not have return on assets

(ROA). Since all the missing values spread out evenly across the years and most of them appear

only once for one firm, we fill in these missing ROAs using the industry level (Worldscope Item

06010) cross-sectional median in a given year.22

We construct three measures for firm level analyses: analyst coverage (Coveragejt), forecast

error (ForecastErrorjt) and forecast dispersion (ForecastDispersionjt). Coveragejt is defined

as the number of unique analysts covering firm j in fiscal year t. ForecastErrorjt is defined

as the absolute difference between the firm’s actual EPS and the mean of the analyst forecasts.

ForecastDispersionjt is defined as the standard deviation of all the forecasts across all the ana-

lysts following the same firm in a given fiscal year. Following previous literature, we scale forecast

error and forecast dispersion by the firm’s previous year-end stock price to mitigate heteroskedas-

ticity concerns.23 To ensure the reliability of these two measures, we require at least 2 different

analysts providing forecasts for the firm during the fiscal year.24

Since forecast error and dispersion are not defined for firms with coverage less than 2, we

construct two samples for firm level analyses. The first one is the sample for coverage quantity,

which contains all the firms satisfying our selection criterion (21, 960 firm × year observations).

This sample captures the change in the analyst coverage quantity even when firms have coverage

less than 2, possibly zero (not being covered at all). The second one is the sample for coverage

quality in which forecast error and forecast dispersion are properly defined. To limit the effects

of outliers on our results, we remove observations in the sample for coverage quality for which

22We obtain very similar results if we simply delete these observations. We also construct a balanced sample bydeleting all the firm-year observations if one ROAs in one year is missing. We lose about 9% of the total sample butthe results go through.

23See Gu and Hackbarth (2013), Zhang (2006b), Thomas (2002), Zhang (2006b) 2006a.24See Gu and Hackbarth (2013) and Giroud and Mueller (2011).

130

forecast error and forecast dispersion are larger than 10% of the firm’s previous year-end price

(around 2% of the sample).25 Finally, to alleviate the concerns for the composition effect, we

remove all observations of a firm from this sample if that firm’s forecast error or dispersion is not

defined in any given year.26

We present summary statistics for both samples in Table 3.1 and Table 3.2. In general, US

firms are larger, have lower book-to-market ratios and lower ROAs.27 Looking at the sample for

coverage quantity, we observe that on average US firms have more coverage than EU firms. While

US firms have on average 11.145 analysts producing earning forecasts each year, EU firms have

on average 8.558. Looking at the sample for coverage quality, we observe that both forecast error

and dispersion are larger in the EU. The average forecast error is 0.724% for EU firms and 0.442%

for US firms. Similarly, the average forecast dispersion is 0.848% for EU firms and 0.539% for

US firms. The distribution of analyst coverage is quite left skewed in both regions, and more so in

the EU (see Figure C.2 in Appendix C.2). Notice that for coverage less than 2, quality measures

are not defined and those firms are excluded from the sample for quality. Even after the sample

selection, we remain with a large number of unique firms in both regions, 1, 111 for the EU and

1, 693 to the US.

For analyst level analyses in the latter part of the paper, we focus on analyst-firm pairs. For

each pair, we can calculate the forecast error by taking the absolute distance between the actual

annual earnings and the analyst forecast, scaled by the firm’s previous year-end price. Similar to

firm level data, we remove analyst-firm pairs for which the forecast error is larger than 10% of

the firm’s previous year-end price (around 3% of the sample). Details of the construction of other

important variables can be found in the Appendix C.1.3.

25See Gu and Hackbarth (2013) and Giroud and Mueller (2011).26For example, if firm A does not have a valid forecast error in one year, we remove all the observations of firm A

from the sample. Our results hold if we only remove observations of A in that specific year.27In Appendix C.5, we check that all our results are robust to a propensity score matching procedure in which we

discard observations that are too different.

131

3.5 Investigating the impact of unbundling: firm level analyses

We first conduct firm level analyses and examine the impact of unbundling on firms’ coverage

outcomes. We investigate the following empirical model:

Yjt = β1Treatj × Postt + β2Xjt + αj + αt + εjt (3.1)

Yjt measures either analyst coverage Coveragejt, defined as the number of unique analysts

covering firm j in fiscal year t or coverage quality ForecastErrorjt, defined as the absolute dis-

tance between the firm’s actual EPS and the mean of the analyst forecasts, scaled by the firm’s

previous year-end price.28 Treatj is a dummy equal to 1 if a firm is domiciled and listed in Eu-

rope. Postt is a dummy equal to 1 if the fiscal year is equal to 2018. Xjt is a set of control variables

including: log of market capitalization (LN SIZE), log of book to market ratio (LN BM), return

on equity (ROA), total investment return in the current year (RET), return volatility (RETVOL),

GDP growth rate (GDP GROWTH) and unemployment rate (UNEMPLOYMENT RATE) in the

country where the firm is domiciled. When using ForecastErrorjt as the dependent variable, we

also control for the log of the average of the days between the analyst forecast date and the actual

earnings report date (LN DISTANCE). This control is constructed from the analyst forecasts by

taking the average over all the analysts following the same firm. It is likely to be an outcome of un-

bundling and suffers from the bad control problems (Angrist and Pischke (2009b)) if we measure

it year by year. Hence, we measure this variable in 2014 and interact it with Post (see for example

Barrot (2016)).29 Since unbundling does not affect public firms directly, other firm level controls

are not likely to be bad controls. Hence, we measure them year by year. We obtain very similar

results if we measure them in 2014 and interact them with Post. Finally, we also include firm and

time fixed effects. H1 and H3 both imply that β1 < 0.

To explore the heterogeneous response, we split firms into small firms and large firms. We

28In the paper, analyst coverage will sometimes be referred as coverage quantity.29We obtain very similar results if we exclude this control.

132

first calculate the average market capitalization in the pre-regulation years for each firm. Small

firms are defined as firms whose average market capitalization falls below the median. To maintain

the proportion of EU and US firms fixed, we calculate the median cut-offs separately in both

regions.30 We first run a difference-in-difference regression within the EU between small firms

and large firms:

Yjt = β3Smallj × Postt + β4Xjt + αj + αt + εjt (3.2)

We then perform the following triple-difference regression:

Yjt = β5Treatj × Postt + β6Treatj × Postt × Smallj +

+β7Smallj × Postt + β8Xjt + αj + αt + εjt (3.3)

When Yjt measures Coveragejt, H2 implies both that β3 > 0 and β6 > 0.

To assess the validity of the parallel trends assumption and provide evidence on the dynamic

impact of unbundling, we set up a Granger causality test, as suggested in Angrist and Pischke

(2009b). The goal is to see whether causes happen before consequences and not the other way

around. To do this, we split the Postt dummy into Dt year dummies, where t ∈ {2015, ..., 2018}

and run the following specification:

Yjt =∑t

ηt(Dt × Treatj) + β9Xjt + αt + αj + εjt (3.4)

We choose 2014, the year in which MiFID II went into discussion as our reference year. Standard

errors in all the analyzes are clustered at the country level. Although we already control for the

firm level characteristics, we repeat the analyses using a propensity score matching procedure to

mitigate the concerns that our results may be driven by the observable differences at the firm level.

All the results are robust to the matching procedure and details can be found in Appendix C.5.

30To account for the possibility that firm size changes over the years and large firms may become small firms afterthe regulation, we also define the Small dummy using the market capitalization in each year. This way, the variableSmall varies over the years. We obtain virtually the same results.

133

3.5.1 The impact of unbundling on analyst coverage

We first look at the overall effect of unbundling on analyst coverage. Figure 3.2 shows the time

series trends in the average analyst coverage of EU and US firms. A sharp decrease in the analyst

coverage of EU firms can be observed after unbundling. In fact, Panel A of Table 3.5 shows that

the average analyst coverage of EU firms changes from 8.74 in the pre-period to 7.831 in the post-

period. Even if there seems to be a small decrease in the average coverage of US firms as well, the

magnitude is much smaller and is not statistically significant.

We formally test H1 by estimating Model (3.1). Column (1) of Table 3.6 reports the result.

After unbundling, analyst coverage of EU firms changes by−0.651. This translates into a decrease

of 7.45% relative to the average analyst coverage of these firms prior to the regulation.31

Model (3.1) is suitable to recover the parameter of interest β1 only if the parallel trend assump-

tion holds. We test this assumption by estimating the dynamic coefficients described in Model

(3.4). Figure 3.5a shows the plot of dynamic coefficients with a 95% confidence interval.32 The

dynamic coefficients are significant only for 2018. This is strong support for the parallel trend

assumption. We further conduct a placebo test in which we focus on the pre-regulation years and

define the Post dummies as if the regulation occurred in one of these pre-regulation years. If the

parallel trend assumption holds, the coefficient in front of the interaction term Treatj × Postt

should not be statistically significant and if it is, the magnitude should be much smaller. Column

(1) to (3) in Panel A of Table 3.10 shows the results. None of the coefficients are statistically

significant. We take these results as evidence that parallel trend assumption is very likely to hold

in our setting. It is reasonable to believe that our control group establishes a valid counterfactual

of what would have happened to the treatment group in the absence of the reform.

We conclude that unbundling has had a negative effect on analyst coverage. Analyst coverage

has been widely used as a measure of research production quantity. We interpret the decrease

31We perform the same analyzes in the sample for coverage quality. Results are qualitatively the same and can befound in Table C.6 of Appendix C.4.2.

32Point estimates are reported in Table C.4 in the Appendix C.4.1. In the Appendix the readers can find pointestimates for all other dynamic coefficient plots presented in the following sections.

134

in coverage as evidence that there is a causal impact of unbundling on the quantity of research

produced by analysts.

We now turn to explore the heterogeneous effects of the regulation on analyst coverage (H2).

Both Figure 3.3 and Panel A of Table 3.5 show clear evidence that the analyst coverage of small

firms remains relatively unchanged and the analyst coverage of large firms decreases significantly.

These results are also confirmed in the formal difference-in-difference estimation. Column (2) of

Table 3.6 shows that the analyst coverage of small firms in the EU decreases much less than that of

large firms in the EU. The result is further strengthened by the triple-difference analyses in column

(3). The coefficient in front of the triple-difference term is 1.891 and statically significant. For

large firms, the analyst coverage change by −1.594, which translates in to a 10.53% decrease in

the average analyst coverage of large firms prior to the regulation. In Table C.7 of Appendix C.4.2,

we present the regression results in logs. All the results are qualitatively the same. These results

support H2: analyst coverage of large firms decreases more than that of small firms.33

In summary, we have shown that unbundling causes an aggregate decrease in the analyst cov-

erage of EU firms compared with US ones and that this decrease is more pronounced for large

firms. These are evidences in support for H1 and H2. The results are surprising: it contradicts

the common view in the media that small firms would be most affected. We interpret these results

through lenses of competition in the market for analyst research. When it becomes mandatory for

investors to pay separately for research, they are likely to stop buying low-quality research. Large

firms on average have more coverage than small firms and the probability that low-quality research

is produced is higher. If investors opt out of inferior research, large firms are more affected.

3.5.2 The impact of unbundling on analysts forecast quality

After establishing the negative effect of unbundling on analyst coverage (coverage quantity),

we turn to explore the impact on coverage quality. Panel B of Table 3.5 presents the results of a

33In Table C.8 of Appendix C.4.2, we also perform difference-in-difference analyses between EU firms and USfirms but separately for small and large firms (e.g., small EU firms vs. small US firms; large EU firms vs. large USfirms). We obtain very similar results: the decrease in the coverage is concentrated in large firms.

135

simple one-difference analysis. It shows that the forecast error decreases for EU firms. In Figure

3.4, we plot the empirical cumulative distribution of the forecast errors. We can see that for EU

firms, the distribution of the post-year is “smaller than" the distribution of the pre-years since the

post-year distribution lies mostly above that of the pre-years. We do not observe a similar trend

among US firms. We also plot the histogram of the forecast errors in the pre- and post-regulation

years in Figure C.3 of Appendix C.2 and observe a similar pattern: the histogram in the post-year

shifts towards 0 for EU firms, indicating an overall decrease in the forecast error. Again, we do not

observe a clear similar pattern for US firms. All the evidence suggests that the forecast error of EU

firms decreases after unbundling.

Table 3.6 reports the formal difference-in-difference results. Notice that the number of obser-

vations changed from the previous analyses on coverage quantity, because we now focus on the

sample for coverage quality. Column (4) of Table 3.6 shows that overall, the change in the forecast

error of EU firms is −0.142%. This translates into a decrease of 19.19% relative to the average

forecast errors of EU firms prior to the regulation. The statistically and economically significant

decrease shows that unbundling has caused analyst research to be more precise, an indication of

quality improvement. Column (2) shows that the change in the forecast errors is comparable among

both small and large firms. In fact, by checking the triple interaction term in column (3) of the same

table, we find no evidence that the coverage quality of small and large firms are affected differen-

tially. Finally, we perform the dynamic coefficient test described in Model (3.4). The results can be

found in Figure 3.5b. We do not find evident violations of the parallel trend assumption. Placebo

tests in column (4) to (6) of Table (3.10) show reassuring evidence. Hence, we are confident in

claiming that unbundling causes an increase in analyst research quality in the EU.

The results presented in this subsection are in line with hypothesis H3. Even though we doc-

ument a decrease in the coverage quantity due to unbundling, we find that the coverage quality

improves. This provides supportive evidence that unbundling disciplines analyst forecast behavior.

Being evaluated directly and fighting for limited resources from the buy-side, analysts compete

more intensively among the quality domain and produce better forecasts.

136

3.6 Investigating the channel: analyst level analyses

To reconcile the simultaneous decrease in the coverage quantity and the increase in the coverage

quality, we argue that analyst competition is enhanced. Quality becomes the main concern of

analysts due to unbundling. The enhancement of competition 1) motivates existing analysts to

produce better forecasts (intensive margin) and 2) makes analysts who produce worse forecasts

more likely to drop out of the analyst market (extensive margin). To find evidence of the two

margins, we now zoom in to the analyst level and test (H4).

3.6.1 Intensive margin

We begin with the first part of H4. To compare forecast accuracy measured by forecast error

within the same analyst over different years, we focus on analyst-firm pairs that exist throughout

the whole sample period (from 2014 to 2018). We conduct analyses similar to the ones done at the

firm level:

Yijbt = γ1Treati × Postt + γ2Xi × Postt + γ3Zijt + αi + αj + αb + αt + εijbt (3.5)

Yijbt is the forecast error analyst i working in brokerage house b incurs on firm j in fiscal year

t (the absolute difference between firm j’s actual earnings and the analyst i’s forecast, scaled by

firm j’s previous year-end price). Treati is a dummy equal to 1 if the analyst is an EU analyst.

Postt is a dummy equal to 1 if the fiscal year equal to 2018. Analyst fixed effect αi ensures that we

compare forecast error within the same analyst across different years. Ideally, we should include

firm × year fixed effect (αjt) to subsume any firm-level, time-varying heterogeneity that may

confound our causal interpretation. However, the geographic concentration of analyst portfolios as

shown in Figure C.1 implies that αjt will almost subsume Treati × Postt. Instead, we can only

include firm fixed effect and year fixed effect. We also include brokerage fixed effect αb to control

for brokerage firms the analysts work for.

137

To control for time-varying heterogeneity, we include both analyst-level controls and firm-level

controls. Xi is a vector of standard analyst-level controls that are shown to affect analyst forecast

errors, including the log of the number of firms the analyst follows (LN FIRMS COVERED),

the log of the average coverage of the portfolio firms that the analyst follows (LN AVERAGE

COVERAGE), the log of the number of years one analyst worked (LN TENURE) and the log of

the days between the forecast date and the earnings report date (LN DISTANCE). These variables

are likely to be the outcomes of unbundling and serve as bad controls if we measure them year by

year. Hence, we measure them in 2014 and interact them with Post. We obtain similar results if

we simply exclude these variables. Zijt include firm level controls as defined in Section 3.5. Since

unbundling does not affect public firms directly, these controls are not likely to be bad controls and

we measure them year by year. We obtain similar results if we measure them in 2014 and interact

them with Post. Standard errors are clustered at the firm level.34 H4 implies that γ1 < 0.

To mitigate the concern for idiosyncratic noise in analyst-firm level analyses, we aggregate our

measures at the analyst level. To be more specific, we estimate the following model:

Yibt = γ4Treati × Postt + γ5Xi × Postt + γ6Zit + αi + αb + αt + εibt (3.6)

Yibt is the average of all the forecast errors analyst iworking in brokerage house b incurs in year

t. Treati and Postt are defined as before. In the baseline regression, we include brokerage house

+ year fixed effect (αb + αt). We also investigate the specification where we include brokerage

house× year fixed effect (αbt), which kills all the time-varying brokerage house level variation that

may confound our causal interpretation. This strong fixed effect may also kill some meaningful

variation. For example, if unbundling disciplines analysts through the brokerage house they work

for (e.g., brokerage houses decide to put more pressure on analysts to produce better research after

MiFID II), αbt will prevent us from capturing this channel. Hence, we take this specification as a

robustness check.

Similar to firm level analyses, we estimate a triple-difference model with respect to analysts

34This is a conservative way of clustering. Clustering at analyst level leads to smaller standard errors.

138

mostly covering relatively small firms (small analysts) and analysts mostly covering relatively

large firms (large analysts). To achieve this, we first calculate the average market capitalization

of all the firms within an analyst portfolio in a given year. We then average the average market

capitalization across all the pre-regulation years. Small analysts are defined as analysts whose

average of the portfolio average market capitalization falls below the median.35 As before, we

calculate the median cutoff separately in both regions. To aggregate firm level controls to the

analyst level, we simply average the firm level variables within an analyst’s portfolio. Standard

errors are clustered at the analyst level. H4 implies that γ4 < 0. Table 3.3 reports some summary

statistics of the sample we use. EU analysts, in general, follow fewer firms and have larger forecast

errors.

Table 3.7 reports the regression results. Column (1) and Column (2) present results at the

analyst-firm pair level. Column (3) and (4) present the results at the analyst level with brokerage

house + year fixed effects. Column (5) and (6) present the results at the analyst level with broker-

age house× year fixed effect. The results are consistent over all these specifications: analysts who

stay after the regulation produce lower forecast errors. Using column (2) as the main specification,

the average forecast error of EU analysts changes by −0.131%, which translate into a 16.01% de-

crease in the average forecast errors of EU analysts prior to the regulation. The triple interaction

term further shows that the magnitude in the decrease is comparable among analysts who focus on

small firms and analysts who focus on large firms. Notice that the total number of observations

drops a lot in the brokerage house × year fixed effect specification. If we allow for brokerage

house × year fixed effect, all the meaningful variations come from brokerage houses that covering

both EU firms and US firms. Hence we focus on analysts who work for multi-continent brokerage

houses in this specification. Contrary to the analyst level, we do have a sufficient number of bro-

kerage houses that cover firms in both continents. Even with a smaller number of observations, we

still observe a statistically significant decrease in the average forecasting error.

35In principle, one can define small analysts to be analysts focusing on small firms. But analyst portfolio is a mixof both small and large firms. In most cases, large firms are the majority. Focusing on analysts covering only smallfirms leads to limited observations.

139

Figure 3.6 presents the dynamic coefficients plots for all three specifications with a 95% confi-

dence interval. Unbundling takes effect only after the implementation date. Placebo tests in Panel

B of Table 3.10 show similar evidence: none of the coefficients are statistically significant in the

pre-regulation years. The results all suggest that the parallel trend assumption is very likely to hold

in our setting.

The results presented in this subsection are in line with the first part of hypothesis H4: affected

analysts who continue to work after the regulation produces better forecasts.

3.6.2 Extensive margin

Baseline comparison

We now turn to the second part of H4. We want to compare forecast quality between analysts

who drop out due to unbundling and analysts who do not. To achieve this, we need a quality

measure that is comparable across different analysts. The simplest measure is the average forecast

errors incurred by the analyst as defined above. However, different analysts tend to cover different

firms. Some firms are more difficult to forecast than others. An analyst with a high average forecast

error may either be 1) analyst who does not perform well or 2) analyst who follows firms more

difficult to analyze. Therefore, the average forecast error is problematic when we want to compare

across different analysts.

To account for all these issues, we follow Hong and Kubik (2003) to construct a relative accu-

racy measure comparable across different analysts. First, we sort the analysts covering a specific

firm in a year based on their forecast errors. Then, we assign a rank based on the sorting: the best

analyst receives the first rank. In case of a tie, we assign all those analysts the midpoint value of the

ranks they take up. Notice that the maximum rank an analyst can get depends on the total number

of analysts covering the firm. Analysts covering thinly followed firms will on average have lower

ranks and are deemed to be better. To take this into account, we scale an analyst’s rank for a firm

140

by the number of analysts that cover that firm. Formally, we use the following score measure:

Scoreijt = 100− Rankijt − 1

Number of Analystsjt − 1× 100 (3.7)

where Number of Analystsjt is the number of analysts who cover firm j in year t. The more

accurate the forecast is, the lower the rank and the higher the score. We compute the scores for

every firm j in year t. To reduce idiosyncratic noise in the specific analyst-firm pairs and get the

analyst level score in the pre-regulation years, we follow Hong and Kubik (2003) and define the

relative accuracy measure (RelativeAccuracyi) to be:

RelativeAccuracyi =1

T

T∑t

(1

J

J∑j

Scoreijt)

where J is the set of firms analyst i covers in year t and T includes all the years prior to 2018. To

increase the likelihood that the analyst is affected by MiFID II, we focus on analyst-firm pairs that

exist consecutively in 2015, 2016, 2017. If the analyst covers the stock for three consecutive years

and suddenly stops covering it after the regulation, it is more likely that the stop is due to MiFID

II.

Table 3.4 presents summary statistics for the sample we use for the extensive margin analyses in

this subsection. By construction, the mean and median of the relative accuracy measure is around

50. The analysts we study covers on average 10 firms and have worked for about 10 years.

We focus on EU analyst and run the following regression at the analyst-firm level:36

Stopij = π1RelativeAccuracyi + π2Xij + αj + εij (3.8)

36We also use RelativeAccuracyij , relatively accuracy defined on analyst-firm pair by averaging across the years,for robustness checks and results are qualitatively the same.

141

and the following regression at the analyst level:37

DropOuti = π3RelativeAccuracyi + π4Xi + εi (3.9)

On the analyst-firm level, we define a dummy variable (Stopij) equal to 1 if the analyst i stops cov-

ering the firm j after the regulation. At the analyst level, we define a dummy variable (DropOuti)

equal to 1 if the analyst stops covering all the firms he/she used to cover before the regulation. The

firm fixed effect αj ensures that we compare different analysts within the same firm. Controls Xij

in Model (3.8) include the log of the number of unique firms the analyst follows prior to 2018 (LN

FIRMS COVERED), the log of the average of the average coverage of the portfolio firms that the

analyst follows across all the years prior to 2018 (LN AVERAGE COVERAGE) and the log of the

number of years one analyst has worked prior to 2018 (LN TENURE).38 We do not have firm level

controls due to the fixed effect αj . In addition to these variables, controls Xi in Model (3.9) fur-

ther include standard firm level controls aggregated at the analyst level as defined in the previous

sections. Standard errors are clustered at the analyst level for Model (3.8) and the brokerage house

level for Model (3.9). H4 implies that π1 < 0 and π3 < 0.

Column (1) in Table 3.8 shows the result of Model (3.8) at the analyst-firm level with firm fixed

effect. The result confirms our hypothesis: higher relative accuracy (better forecast quality) makes

it less likely for an analyst to stop covering a firm. Within the same firm, if the analyst’s relative

accuracy improves by 1 point, he/she is 83.9 basis points less likely to stop covering a firm. This

translates into a 3.93% drop in the unconditional probability of stopping covering a firm after the

regulation.

Column (3) shows the result of Model (3.9) at the analyst level with the brokerage house fixed

effect. The dependent variable is a dummy equal to 1 if the analyst stops covering all firms he/she

used to cover prior to the regulation. As we can see, a one-point increase in the relative accuracy

37We also conduct logit regression and results are very close. It is more econometrically appealing to clusterstandard errors and to include fixed effects in the linear model.

38For LN AVERAGE COVERAGE, we first take the average over the firms the analyst covers and then take theaverage over all the years prior to 2018. All the variables are measured prior to 2018 and are not likely to be affected bythe regulation. Our results are robust when we measure them in 2015 or just simply exclude them from the regression.

142

results in a 62.2 basis decrease in the probability of dropping out of the sample. This translates

into a 2.58% decrease in the unconditional probability of dropping out. Suppose that there are

8 analysts (the average number of analysts covering an EU firm prior to the regulation) covering

a firm. If one of them improves his/her ranking from 4th to 3rd, his/her relative accuracy will

improve by 14.3 points. This makes him/her 8.89% less likely to drop out and potentially be laid

off (14.3× 0.00622). Moreover, the coefficients in front of analyst experience (LNTENURE) is

positive and highly significant. This shows that experienced analysts are less likely to drop out. To

the extent that analysts with longer experiences tend to produce better forecasts, this result conveys

further supportive evidence that more accurate analysts are more likely to stay.

Pre and post comparison

To sharpen our analyses, we perform similar regressions year by year and investigate whether

analyst accuracy matters more after unbundling. To achieve this, we define a dummy variable

Stopijt+1 equal to 1 if analyst i covering firm j at year t stops covering the same firm at year

t+1. At the analyst level, we define a dummy variable DropOutit+1 equal to 1 if the analyst stops

covering all the firms he/she used to cover at year t. We then turn (3.8) and (3.9) into:

Stopijt+1 = π5RelativeAccuracyit ∗ Postt + π6RelativeAccuracyit

+ π7Xijt + αj + αb + αt + εijt (3.10)

DropOutit+1 = π8RelativeAccuracyit ∗ Postt + π9RelativeAccuracyit

+ π10Xit + αb + αt + εit (3.11)

RelativeAccuracyit is the average of Scoreijt over all firm j analyst i covers at year t. Postt

is a dummy equal to 1 for year equal to 2017.39 Controls Xijt and Xit are defined in a similar

fashion. Standard errors are clustered at the firm level for (3.10) and at the brokerage house level

for (3.11).39Following Hong and Kubik (2003), we assume that the employment decision depends on the analyst performance

in the previous year.

143

The question we seek to answer here is that among the analyst population, whether more in-

accurate analysts are on average more likely to stop/drop out after unbundling. If the answer is

yes, we anticipate π5 < 0 and π8 < 0 in the EU analyst sample but not in the US analyst sample.

Notice that (3.10) and (3.11) are not standard panel regressions. They are different cross-section

regressions stacked over time. The way we define Stopijt+1 and DropOutit+1 conditions on an-

alyst covering the firm at t. If analyst i stops or drops out at t + 1, he/she will not be included

when we define dummies for year t + 2. In fact, it is common that analysts who drop out in one

year never reappear in the later years, i.e., many analysts appear only once in the analyses. For this

reason, we do not include the analyst fixed effect. We include other important controls as defined

in the above subsection. Instead of taking the average over all the pre-regulation years, we need

to measure them year by year. Since the latest control variables are measured in 2017, they are

determined before the regulation and are not likely to be the outcomes of the regulation.

Table 3.9 present the results. Column (1) shows the result of the analyst-firm level focusing

on EU analysts. As predicted, the coefficient in front of RelativeAccuracyit ∗ Postt is negative

and statistically significant, which implies that analysts producing better forecasts are more likely

to keep covering the same firm after the regulation. Column (2) presents the same analysis on US

analysts and we do not observe a similar pattern. This provides further evidence that analysts with

worse forecast records are more likely to stop covering a firm after unbundling only when they are

affected. We perform a triple-difference-type analysis in column (3), and the coefficient in front

of the triple interaction terms again confirms our hypothesis: forecast accuracy matters more after

unbundling for affected analysts. The results using DropOutit+1 as the dependent variable can

be found in column (4) to (6). The magnitude of all the coefficients is comparable with the ones

obtained at the analyst-firm level. Notice that when an analyst drops out, he/she stops covering

all the firms. This is a strong requirement because many analysts stop covering a set of firms

without completely dropping out. Thus, the variation at the analyst level may not be enough to

allow for a triple-difference-type analysis. This could explain why the coefficient in front of the

triple interaction term is not statistically significant.

144

In summary, the results presented in Table (3.8) and (3.9) are all in line with the second part

of H4. In other words, analysts who have lower relatively accuracy (analysts who produce worse

forecasts) are more likely to stop covering a firm and even drop out. Forecast accuracy matters

more after unbundling only among affected analysts. This section provides additional evidence on

the competition channel: analysts compete for research quality due to unbundling. The ones with

worse forecast records are more likely to be cast out.

3.7 Robustness check and extensions

Previous sections focus on analyzing the impact of unbundling on sell-side research production

and the underlying channel driving the results. We now conduct a few additional analyses to show

that our findings are robust. We also study how unbundling affects brokerage house employment

and what the capital market effects of MiFID II are.

3.7.1 Firm’s brokerage house coverage

In the previous analyses, we show that firms’ analyst coverage drops. One concern is that

this drop comes from brokerage houses laying off analysts providing duplicated research services.

For example, if two analysts cover the same firm in the same brokerage house, the one providing

duplicated or worse research may be laid off. If this is the case, a drop in the analyst coverage does

not imply a drop in the brokerage house coverage. Investors are still able to get access to the same

amount research from these brokerage houses and potentially from better analysts. In other words,

the quantity of research does not decline.

Although one brokerage house rarely hires two analysts covering the same public firm due

to redundancy, we check firms’ brokerage house overage directly.40 We perform difference-in-

difference analyses similar to Model (3.1) but replace the dependent variable with the number of

unique brokerage houses covering the firm. Under the condition that the relationship between

40This is a well-established empirical fact (e.g., Hong and Kacperczyk (2010)). In our sample, among all thebrokerage house-firm pairs, 86% of them has only one analyst coverage.

145

investors and brokerage houses are sticky (as shown, for example, in Di Maggio et al. (2019)), a

reduction in firms’ brokerage house coverage implies that investors lose access to research on these

firms from these brokerage houses, which further suggest a decrease in the quantity of research

supplied to these investors.

Table 3.11 shows the results. As we can see, firms’ brokerage house coverage drops after

unbundling. Similar to previous analyses, this decrease is driven by large firms. Large firms lose

more than 1 brokerage house after this regulation. The results here suggest that investors’ access

to firms’ research through brokerage houses may reduce. From these investors’ point of view, the

quantity of research decreases.41

3.7.2 Other quality measures

Analyst forecast quality is hard to measure. To show that our results are robust, we repeat our

analyzes using other commonly used measures of forecast quality.

Non-market-response based measures

First, we follow previous research and use forecast dispersion, defined as the standard deviation

of all the forecasts across all the analysts following the same firm in a given fiscal year, as a proxy

for firm level forecast quality (e.g., Gu and Hackbarth (2013), Behn et al. (2008)). One intuition

behind this measure is the following: if accuracy becomes analysts’ main concern, dispersion will

decrease when all of their forecast quality improves and the forecast values converge to the true

value.

One-difference analyses in Panel C of Table 3.5 reveal that the dispersion of EU firms decreases

after the regulation. Difference-in-difference analyses in Panel A of Table 3.12 further confirm the

result: unbundling causes a decrease in the forecast dispersion of the EU firms compared with

the US firms. The magnitude is comparable between small firms and large firms. These results

are consistent with the analyses using forecast error. Dynamic coefficient plot in Figure 3.5c and

41In untabulated test, we also find that the number of unique firms EU brokerage houses cover decreases.

146

placebo tests in column (7) to (9) of Table 3.10 both show no evident violations of the parallel trend

assumption. We conclude that the coverage quality of EU firms, measured by forecast dispersion,

indeed improves after the regulation. The analyses also show that the beliefs of different analysts

are converging after this regulation.

Another measure for quality is the number of firms each analyst covers. It is reasonable to

believe that if one analyst covers fewer firms, due to specialization, he/she will devote more time

to each firm and provide better forecasts. Panel B of Table 3.12 shows the results of the difference-

in-difference analyses similar to Model (3.6). The dependent variable is the number of firms each

analyst covers. We focus on analysts who cover at least one firm throughout the sample in Column

(1). As we can see, EU analysts cover less firms after the regulation and the decrease is statically

significant. We find similar evidence in column (2) in which we focus on all the analysts in our

sample. These results further support that the quality of analysts research in EU improves after

unbundling.

Market-response based measures

We can also assess analyst forecast quality using market responses to analyst forecasts. In-

tuitively, if analyst forecast quality improves and contains more information, stock prices should

move more substantially on the analyst forecast revision dates. Hence, we focus on the absolute

market-adjusted abnormal returns on the analyst forecast revision dates. Improvement in the fore-

cast quality suggests more significant absolute market-adjusted abnormal returns on the revision

dates.

We define three measures to capture this effect. At the analyst-firm level, we calculate the

absolute market-adjusted abnormal return on the forecast revision date:

ABRetj,i,d = |Retj,i,d −Retm,d| (3.12)

42In this section, we focus on the revision dates, i.e., dates on which analysts revise his/her forecasts. Our results,both at the analyst-firm level and at the firm level, are robust if we focus on all dates on which analysts issue forecasts.

147

Retj,i,d denotes the daily return of firm j when analyst i revises forecast for firm j at date d.42

To mitigate data errors in Datastream, we winsorize Retj,i,d at the 1% level. Retm,d denotes the

daily return of the stock market. Here, we define the market return for EU and US separately by

calculating the value-weighted stock returns of all the firms in each continent. If multiple revisions

occur on the same day for the same firm, we assign each analyst covering that firm the same

abnormal return. In an extreme case, we could attribute the abnormal returns generated by more

informative analysts to analysts who simply herds with the others. To mitigate this concern, we

keep only the first three analyst revisions by the exact revision time. ABRetj,i,d then helps us to

capture the new information in the forecast revisions at the analyst-firm level.

At the firm level, we aggregate analyst revisions. We first compute a yearly measure of aggre-

gate analyst informativeness (AGAI). Following previous literature (e.g., Frankel et al. (2006) and

Lehavy et al. (2011)), we define AGAI as follows: For a given firm in a given fiscal year, we first

sum the absolute market-adjusted abnormal returns on all the forecast revisions dates across all the

analysts. We then divide it by the sum of absolute market-adjusted abnormal returns of all trading

days for the firm in the given fiscal period. Specifically,

AGAIjt =

∑NREV Sd=1 |Retj,d −Retm,d|∑TDAY Sd=1 |Retj,d −Retm,d|

(3.13)

where Retj,d denotes the daily return of firm j and Retm,d denotes the daily return of the stock

market. NREV S is the set of unique forecast revision dates over all the analysts for the given firm

in the given fiscal year. We exclude forecast revision dates that coincide with earnings announce-

ments. This mitigates the concern that analyst forecast revisions respond to publicly available

earnings information. Intuitively, AGAI measures the aggregate abnormal returns on all forecast

revision dates. It allows us to capture the total amount of new information in the analyst forecast

revisions.43 Furthermore, AGAI incorporates changes in the total number of forecasts (the quantity

of information) caused by both changes in the number of analysts covering a firm and the number

43AGAI treats multiple revisions by different analysts on the same firm as one aggregate revision. It measures theaggregate information generated by all analysts on a given revision date.

148

of forecasts issued by one analyst. All else being equal, a decrease in the total number of forecast

revisions causes a decrease in AGAI.

To account for the changes in the total number of forecast revisions, we normalize AGAI by

the number of forecast revision dates of a given firm in the given fiscal year. Formally, we define

the average analyst informativeness as:

AV GAIjt =

∑NREV Sd=1 |Retj,d −Retm,d|∑TDAY Sd=1 |Retj,d −Retm,d|

× 1

NREV Sj(3.14)

where NREV Sj is the number of unique forecast revision dates. AVGAI then measures the aver-

age informativeness of one analyst forecast revision date.

We perform similar difference-in-difference analyses (Model 3.1) using the three new mea-

sures. Standard firm level controls are included. At the analyst-firm level, we include the log of the

number of days between two revision dates as an additional control. Standard errors are clustered

at the firm level.

We report the results in Table 3.13. Column (1) shows the results at the analyst-firm level.

We include analyst × firm fixed effect to capture changes in the abnormal return within the same

analyst-firm pair across different times. As we can see, analyst forecasts generate larger abnormal

returns after unbundling. This is evidence that individual analyst forecasts become more informa-

tive. Column (2) aggregates different analyst revisions at the firm level. It shows that the average

informativeness of one revision date improves after the regulation. These results provide further

evidence that individual analyst quality increases. In column (3), we show the results of aggregate

informativeness without normalizing the number of forecast revision dates. Interestingly, the coef-

ficient in front of TREAT × POST becomes statistically negative, suggesting a likely decrease

in the aggregate informativeness of analyst forecasts.

We interpret the results as a trade-off between quantity and quality. While individual forecasts

become more informative, the total number of valuable forecasts decreases after the regulation.

Hence, even though average analyst informativeness increases, the aggregate informativeness of

44Results of small firms and large firms can be found in the Appendix C.4.3.

149

analyst forecasts decreases.44

As a final note, we are aware of potential concerns with abnormal returns on the revision dates.

For example, analysts could respond to prior public disclosure events. In this case, their revisions

could appear to provide more information than they actually do. We try to address these concerns

by focusing on a short window (1-day) and by excluding revision dates coinciding with earnings

announcement dates. But still, abnormal returns on the forecast revision dates are not the cleanest

measure. Hence, we view our results here suggestive rather than conclusive.

3.7.3 Alternatives

In our paper, we argue that the enhancement of competition among analysts in the quality do-

main is one driver of our results. One assumption we rely on is that asset managers internalize

research costs and put more emphasis on research quality. What if these asset managers just claim

to internalize the cost but in effect still make hidden research payments through inflated commis-

sions? If that is the case, they probably would not focus a lot on research quality because end

investors bear the costs. Although it is hard to preclude this possibility based on the data we cur-

rently observe, for this argument to hold, commissions fees should at least stay more or less the

same, if not increase. However, recent surveys suggest that commissions fees drop by around 30%

due to unbundling.45 A study by Financial Conduct Authority (FCA) also finds no evidence of

asset managers passing research cost through trading commissions and break the new rule. Ac-

cording to this study, expensing research costs out of asset managers’ own profits already saved

end investors in the UK around £180 million in 2018. The FCA expects that the overall savings

from MiFID II rules could reach £1 billion over the next five years.46

Moreover, unbundling makes many end investors become aware of the hidden cost. End in-

vestors such as pensions, insurance companies, and sovereign funds have always been proponents

of unbundling due to opaque fee structures and inflated trading costs. As a result, asset managers

45"Bank and brokers suffer ‘dramatic’ fall in commissions" Financial Times, June 2nd, 2018.46Andrew Bailey, Chief Executive of the FCA, keynote speech on MiFID II at the European Independent Research

Providers Association.

150

face greater pressure to reduce commission fees after unbundling. Hiding research cost by inflating

fees not only exposes asset managers to severe legal risks but put them at a disadvantage amid the

increased market competition in the asset management industry. All these arguments suggest that

it is unlikely for asset managers to hide research costs and continue the “soft dollar" practice as

under bundling.

Another common worry is that other provisions in MiFID II may also affect sell-side research.

For example, post-trading transparency may make more information available to the general pub-

lic and less analysts research is needed. That is why we observe a drop in the analyst coverage.

However, post-trading transparency reveals mainly order flow information. Translating order flow

information into firms’ fundamental information which analysts specialize in is difficult. In this

case, analyst research should still be needed. Even if we assume that post-trading transparency

reveals fundamental information, the transparency story implies that small firms will be more af-

fected: the trading process of thinly traded stocks, which in most cases are small stocks, is more

likely to feature opaqueness. The improvement in transparency should benefit small firms more

than large firms. This further implies that fewer analysts on small firms are needed because more

information on small firms is now available. But we observe the opposite: small firms’ analyst

coverage remains unchanged and large firms’ coverage drops. To sum up, it could be the case that

other provisions may confound our results but it is unlikely for a provision other than unbundling

to explain all our findings.

An alternative story that can explain our findings is learning. Maybe EU analysts are be-

coming better at learning over the years and begin to produce better forecast results regardless of

unbundling.47 While this alternative is plausible and harder to preclude entirely, it does not fully

justify the sudden improvement in the overall forecast quality right after unbundling. First, if it

were not for unbundling, the learning process is likely to take more time and the effect should

manifest itself gradually over the years. If that were the case, our placebo test should have picked

up the learning effect, but we do not observe changes in the forecast quality in our placebo test.

47If analysts become better at learning due to unbundling, the story does not contradict our argument: unbundlingincentivizes analysts to produce better forecasts. One way to improve their forecasts is to become better at learning.

151

Second, the change in the cumulative distributions of forecast errors (i.e., Figure 3.4) shows that

quality improvement is a general phenomenon. It could be the case that a few analysts are better

at learning right after the regulation for reasons unrelated to unbundling, but it is be difficult to

argue that most of the analysts happen to become better at learning at the same time, right after the

regulation, and for reasons unrelated to unbundling.

We further investigate the learning story in Table 3.14. We introduce the lagged forecast error

in our regressions (similar to Model (3.5) and Model (3.6)). The intuition behind is the following:

analysts producing larger forecast errors in the previous period could face greater career pressure

and become better at learning to improve their performance in the current period, regardless of

unbundling. Thus, controlling for lagged forecast error help us to capture the change in the analyst

learning process unrelated to unbundling. We then check whether our results are robust to the

inclusion of the new control. Column (1) presents the results at the analyst-firm pair level. Column

(2) presents the results at the analyst level with brokerage house + year fixed effects. Column (3)

presents the results at the analyst level with brokerage house × year fixed effects. After adding the

new control, our results still hold and the coefficient of interest remains statistically significant and

economically meaningful.48 The results also show that larger previous forecast errors do predict

smaller current forecast errors, which explains why the magnitudes of the coefficients of EU ×

POST generally become slightly smaller. We conclude that learning may contribute but does not

explain the whole story: on top of learning, the competition introduced by unbundling still plays

an important role in driving the results.

3.7.4 Brokerage house level employment

In addition to research quantity and quality, unbundling may also affect different types of bro-

kerage houses differentially. The differential impact has important implications for the market

structure of the sell-side research industry (e.g., maybe the industry will begin to consolidate be-

48For example, in column (2), 0.122% translates into a 14.91% decrease in the average forecast error of EU analystsprior to the regulation.

152

cause certain types of brokerage houses are more affected and could not survive). To explore this

heterogeneity, we attempt to compare the employment results of independent research firms (we

will call them boutique houses) with the results of brokerage houses affiliated with large invest-

ment banks. Boutique houses normally do not provide trading services, while the business model

of research in brokerage houses affiliated with large banks features heavy bundling. It is likely

that the latter will be more affected by the regulation. However, our data only provides us with an

anonymous ID for the brokerage house analysts work for. We are unable to recover the real name

of these brokerage houses. To overcome this data limitation, we focus on the size of the brokerage

houses, i.e., the number of analysts each brokerage house employs. It is reasonable to assume that

boutique houses are smaller in size than houses affiliated with large banks.

To understand which type of houses are more affected by the regulation, we then perform the

following analysis within EU brokerage houses:

Ybt = φ1Smallb × Postt + φ2Xbt + αb + αt + εbt (3.15)

Ybt is the number of unique analysts working in brokerage house b in year t. EU brokerage

houses are defined to be the brokerage houses that only hire EU analysts. Smallb is a dummy

equal to 1 if the average number of analysts a brokerage house hires in the pre-regulation years falls

below the median. Postt is a dummy equal to 1 if the fiscal year t is equal to 2018. The control

Xbt includes standard firm level controls as defined previously. We average the firm level variables

within the brokerage house portfolio to get the brokerage house level firm controls. Standard errors

are clustered at the brokerage house level.

The results are presented in Table 3.15. Column (1) shows the results within EU brokerage

houses: after the regulation, large brokerage houses on average lose around 3 more analysts than

small houses. This can also be seen from the plot in Figure C.4 of Appendix C.2. Column (2)

shows the results of a triple-difference specification similar to Model (3.3). The coefficient in

front of the triple interaction term is not significantly different from 0: it seems that US large

brokerage houses also lose more analyst during this period.

153

The results in this section seem to be inconclusive. It is true that within the EU, large brokerage

houses are more affected. However, within the US, a region unaffected by the regulation directly,

large brokerage houses are also more affected. It is possible that small brokerage houses may not be

a good proxy for independent research firms and for this reason, brokerage houses in the US are not

valid counterfactuals in the triple-difference analyses. In addition, the number of observations at

the brokerage house level is small and the results thus may suffer from a low-power issue. With all

these limitations in mind, we leave careful analyses on how MiFID II have changed the landscape

of brokerage houses for future research.

3.7.5 Capital market effects

So far, our analyses focus on sell-side research. We now explore the capital market effects

of unbundling, or more generally, the capital market effects of MiFID II. Since capital market

outcomes are potentially affected by many other factors, the goal in this section is to show what

has happened, rather than to apply tight identification strategies and claim causality. To be coherent

with our previous analyses, we only focus on firms with valid I/B/E/S coverage in our sample.

Following the previous literature, we focus on two important variables at the firm level: earn-

ings announcement information content (EAinfojd) and bid-ask spread (Bid Ask Spreadjt)

(e.g., Cumming et al. (2011), Christensen et al. (2016) and Merkley et al. (2017) ). We per-

form exercises similar to Model (3.1) but replace the dependent variables to be either EAinfojt

or Bid Ask Spreadjt.

EAinfojt is defined as:

EAinfojt =1∑

d=−1

|Retj,d −Retm,d| (3.16)

where d denotes days around a firm’s earnings announcement date t, j denotes the firm, Retj,d49Here, we define the market return for EU and US separately by calculating the value-weighted returns of all the

firms in a given continent. Our results are qualitatively the same if we use all stocks in each country to calculate thestock market return.

154

denotes the daily return of firm j and Retm,d denotes the daily return of the stock market.49 To

mitigate data errors in Datastream, we winsorize Retj,d at the 1% level. EAinfo captures the

informativeness earnings announcements: its value should be close to 0 if analyst forecasts are

informative enough and no additional information is conveyed at the actual earnings report date.

The daily bid-ask spread is computed as the difference between the two prices divided by the

midpoint. We winsorize the spread at the 1% level. We then take the mean of the daily spread over

the year for a given firm and obtain Bid Ask Spreadjt. Bid-ask spread is a common measure of

market liquidity. If the overall informativeness of analyst forecast and market liquidity situation

do not deteriorate, we anticipate β ≤ 0. Results are shown in Table 3.16.

Column (1) and (2) are results for earnings announcement information content. Column (1)

shows the standard difference-in-difference result, while column (2) presents the dynamic coeffi-

cient estimates. Although the coefficient of interest is positive in column (1), it is not statistically

significant. In fact, column (2) reveals that all the coefficients are negative, and the parallel trend

assumption may not hold in this regression. The evidence suggests that firms’ earnings announce-

ments do not convey more information after unbundling. Similarly, for bid-ask spread, the overall

impact is not statistically different from zero. It seems that bid-ask spread decreased in 2017, one

year prior to the regulation. Overall, the results in this table imply that capital market situations do

not deteriorate after MiFID II.

These findings do not contradict with the previous studies by Christensen et al. (2016) and

Cumming et al. (2011) who show that capital market regulation aiming to improve transparency

and competition has overall positive effects on capital market outcomes. The findings are also con-

sistent with our story. Competition among analysts is strengthened after the regulation, resulting

in the improvement of forecast quality. This further improves the informativeness of analyst re-

ports and has non-negative effects on the liquidity of the market. However, we are aware that both

EAinfojd andBid Ask Spreadjt are noisy aggregate outcomes that can be affected by many other

factors in addition to analyst forecasts. Moreover, the reform is fairly recent, and its aggregate im-

pact may take some more time to manifest. The aggregate impact of MiFID II on capital markets

155

is an important question and is worth investigating more carefully in future research projects.

3.8 Conclusion

In this paper, we investigate the impact of unbundling on analyst research production. Firms

affected by unbundling experienced a more substantial drop in analyst coverage. Such a drop

does not come from small- or mid-cap firms but is concentrated in large firms. Furthermore,

firms’ coverage quality improves. We argue that the enhancement of analyst competition could

explain the results. Unbundling puts an explicit price on analyst research. On the one hand, asset

managers become more selective in purchasing research because they need to internalize research

costs. On the other hand, analysts are evaluated directly by the services they provide and have

stronger incentives to produce better research. As a result, analysts compete more directly in the

quality domain to maintain and attract clients. We find evidence of the two specific predictions

of the competition channel: 1) analysts who stay produce better research (intensive margin); 2)

analysts who produce worse research are more likely to stop working (extensive margin). Our

results highlight a potential trade-off between quantity and quality: improvement in the quality of

research by unbundling comes at a cost of reducing the quantity of research. Overall, our paper

provides a timely investigation of an important and controversial regulation. It helps to shed light

on key questions and debates, including how research should be paid for and how the market for

analysts should be regulated.

Our paper focuses on one important aspect of information production – the sell-side research

production but is silent on other dimensions of information production. For example, the rising

cost of sell-side research may encourage more buy-side players to establish their in-house research

team. A larger proportion of information production may then migrate from the sell-side to the

buy-side. Instead of offering conclusive answers, we view our paper as an important starting

point to understand how the compensation schedule for information affects the overall information

production. There are many interesting follow-up questions worth exploring: Will the sell-side

156

research markets begin to consolidate? Will research migrate from the sell-side to the buy-side?

What are the welfare implications for investors? We leave these questions for future research.

3.9 Figures

Figure 3.1: Illustration of Bundling and Unbundling

(a) Bundling

(b) Unbundling

This figure illustrates heuristically the two regimes in which sell-side research is compensated. Under bundling,research is paid through trading commissions and the commissions are passed on to end clients. Under unbundling,research is required to be paid separately. Asset managers can either pay for research directly (against the firm’s profitand loss) or pass the costs to clients by setting up a research payment account (“RPA”) under the consent of clients.Most asset managers opt for purchasing researching directly. The figure also shows that data we use for this projectcomes from the sell-side and the public firms it covers.

157

Figure 3.2: Average Analyst Coverage Over Time (EU vs. US)

●●

●

●●

−1

0

1

2014 2015 2016 2017 2018

Adj

uste

d A

vera

ge N

umbe

r of

Ana

lyst

s pe

r F

irm

● US EU

This figure shows the average analyst coverage of EU firms and US firms over time. Analyst coverage is the numberof unique analysts covering a specific firm in fiscal year t. We report the averages of the period from 2014 to 2018 andadjust the coverage value so that both lines start at 0 in 2014.

Figure 3.3: Average Analyst Coverage Over Time between Small Firms and Large Firms in the EU

● ● ● ● ●

−5.0

−2.5

0.0

2.5

5.0

2014 2015 2016 2017 2018

Adj

uste

d A

vera

ge N

umbe

r of

Ana

lyst

s pe

r F

irm

● Small Large

This figure shows the average analyst coverage of small firms and large firms in the EU. Analyst coverage is thenumber of unique analysts covering a specific firm in fiscal year t. Small firms are firms whose average fiscal year-endmarket capitalization before 2018 falls below the median. We report the averages of the period during 2014 to 2018and adjust the coverage value so that both lines start at 0 in 2014.

158

Figure 3.4: Empirical Cumulative Distribution of Forecast Error at the Firm Level

0.00

0.25

0.50

0.75

1.00

0.0 2.5 5.0 7.5 10.0Forecast Error (%) of EU Firms

Fra

ctio

n of

the

Tota

l Num

ber

of F

irms

Pre Post

(a)

0.00

0.25

0.50

0.75

1.00

0.0 2.5 5.0 7.5 10.0Forecast Error (%) of US Firms

Fra

ctio

n of

the

Tota

l Num

ber

of F

irms

Pre Post

(b)

This figure plots the empirical cumulative distribution of the forecast errors in the pre- and post-regulation years. Toget the forecast errors in the pre-regulation years, we simply average firms’ forecast errors overall all the pre-years.

159

Figure 3.5: Firm Level Outcomes (Dynamic Coefficients)

●●

●●

●

−3

−2

−1

0

1

2

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(a) Coverage

●● ●

●

●

−1.0

−0.5

0.0

0.5

1.0

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(b) Forecast Error

●●

●

●

●

−1.0

−0.5

0.0

0.5

1.0

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(c) Forecast Dispersion

This figure shows the dynamic effect of unbundling. We split the post dummies into year dummies (Model (3.4)) andplot the coefficient estimates along with a 95% confidence interval. We choose 2014, the year in which MiFID II wentinto discussion, as our reference year. Standard errors are clustered at the country level.

160

Figure 3.6: Analyst Level Forecast Error (Dynamic Coefficients)

●●

●

●

●

−1.0

−0.5

0.0

0.5

1.0

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(a) Analyst Firm Pair Level

● ●

●

●

●

−1.0

−0.5

0.0

0.5

1.0

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(b) Analyst Level (B House + Year Fixed Effects)

●●

●●

●

−1.0

−0.5

0.0

0.5

1.0

2014 2015 2016 2017 2018

Dyn

amic

Coe

ffici

ents

with

95%

Con

fiden

ce In

terv

al

(c) Analyst Level (B House × Year Fixed Effects)

This figure shows the dynamic effect of unbundling in the intensive margin analyses at the analyst level. We splitthe post dummies into year dummies and plot coefficient estimates along with a 95% confidence interval. We choose2014, the year in which MiFID II went into discussion, as our reference year. Standard errors are clustered at theanalyst level.

161

3.10 Tables

Table 3.1: Firm Level Summary Statistics (Sample for Coverage Quantity)

EU Firms: 2133 firms in total

Statistic Mean Median Pctl(25) Pctl(75) St. Dev.

Coverage 8.558 4 1 13 9.761Book to Market 2.448 1.046 0.595 1.855 5.799Investment Return (%) 11.597 7.429 −13.127 28.959 43.462Market Capitalization (Bil $) 5.324 0.547 0.114 2.751 17.385Return on Assets (%) 2.919 4.574 0.992 8.020 17.484Return Volatility (%) 33.330 28.996 22.628 38.431 17.041

US Firms: 2259 firms in total



This table provides the summary statistics of important firm level variables in the sample for coverage quantity. Cov-erage is the number of unique analysts covering a certain firm. Book to Market is defined as total asset (Worldscopeitem 02999) minus long-term debt (Worldscope item 03251) over market value (Worldscope item 08002). Investmentreturn is Worldscope item 08801, measured as (market price at year t+ dividends−market price at year t− 1)/marketprice at year t− 1). Market capitalization is Worldscope Item 08002 measured in billion dollars. Return on Assets isWorldscope Item 08326. Return volatility is the annualized standard deviation of daily returns over a year for a givenfirm.

162

Table 3.2: Firm Level Summary Statistics (Sample for Coverage Quality)

EU Firms: 1111 firms in total


Coverage 13.887 11 5 21 10.163Forecast Error (%) 0.724 0.340 0.126 0.848 1.085Dispersion (%) 0.848 0.456 0.224 0.960 1.163Book to Market 2.415 0.922 0.537 1.564 5.701Distance 120.245 115.029 94.026 141.500 43.478Investment Return (%) 11.775 9.067 −8.107 27.482 33.974Market Capitalization (Bil $) 9.595 2.174 0.692 7.529 23.104Return on Assets (%) 6.046 5.374 2.661 8.866 12.240Return Volatility (%) 27.868 26.029 21.319 32.267 10.164

US Firms: 1693 firms in total



This table provides the summary statistics of important firm level variables in the sample for coverage quality. Tobe included in this sample, firms need to be covered by at least 2 analysts each year. Coverage is the number ofunique analysts covering a certain firm. Forecast error is defined as the absolute distance between the firm’s actualannual earnings per share and the mean of the analyst forecasts, scaled by the firm’s previous year-end price. Forecastdispersion is defined as the standard deviation of all the forecasts across all the analysts following the same firm inthe same year, scaled by the firm’s previous year-end price. Book to Market is defined as total asset (Worldscopeitem 02999) minus long-term debt (Worldscope item 03251) over market value (Worldscope item 08002). Distance isthe average of the days between the analyst forecast date and the firm’s actual EPS report date. We take the averageover all the analysts following the same firm and measure this variable in 2014. Investment return is Worldscopeitem 08801, measured as (market price at year t+ dividends−market price at year t − 1)/market price at year t − 1).Market capitalization is Worldscope Item 08002 measured in billion dollars. Return on Assets is Worldscope Item08326. Return volatility is the annualized standard deviation of daily returns over a year for a given firm. We removeobservations for which forecast error and forecast dispersion is larger than 10% of the firm’s share price at the end ofthe previous year.

163

Table 3.3: Analyst Level Summary Statistics (Sample for Intensive Margin)

EU Analysts: 1211 in total


Average Forecast Error (%) 0.812 0.656 0.376 1.091 0.623Average Distance 78.751 70.500 41.000 105.000 52.712Average Coverage of Portfolio Firms 21.658 22.286 14.167 28.750 9.587Number of Firms Follows 7.557 7 5 10 3.817Tenure (years) 8.915 8 4 14 5.797

US Analysts: 1387 in total


Average Forecast Error (%) 0.403 0.272 0.156 0.504 0.396Average Distance 81.877 84.235 56.845 101.500 38.765Average Coverage of Portfolio Firms 21.367 20.250 14.229 27.279 10.047Number of Firms Follows 10.547 10 7 14 5.972Tenure (years) 8.740 8 4 13 5.432

This table provides the summary statistics of important variables used in the intensive margin analyses at the analystlevel. Average Forecast Error is the average of forecast errors (scaled by the firm’s previous year price) across all thefirms the analyst covers in a given year. Average Distance is the average of the days between the analyst forecast dateand the firm’s actual EPS report date. We take the average across all the firms within an analyst portfolio and measurethis variable in 2014. Average Coverage of Portfolio Firms is the average of the analyst coverage of all the firms theanalyst follows in 2014. We take the average across all the firms within an analyst portfolio. N of Firms Follows isthe number of firms the analyst follows in 2014. Tenure is the total number of years the analyst appeared in I/B/E/S,from 1995 to 2014. EU analysts are the analysts whose portfolios consist of at least 70% of EU stock and US analystsare the analysts whose portfolios consist of at most 30% of EU stocks. We remove analyst-firm pairs for which theforecast error incurred by the analyst are larger than 10% of the firm’s previous year-end price.

164

Table 3.4: Analyst Level Summary Statistics (Sample for Extensive Margin)

EU Analysts: 1841 in total


Average Coverage of Portfolio Firms 20.069 21.000 13.562 26.781 8.509Number of Firms Follows 9.402 9 6 12 4.799Relative Accuracy 51.141 51.619 45.160 57.690 10.616Tenure (years) 10.299 9 5 14 5.938

Unconditional Probability

Stop Covering a Firm Drop Out

Unconditional Probability (%) 24.156 21.347

This table shows the summary statistics of important variables in the extensive margin analyses at the analyst level.Relative accuracy is a measure capturing the analyst forecast quality and is comparable across different analysts. Thehigher the number, the more accurate the analyst forecast is. N of Firms Follows is the number of unique firms theanalyst follows prior to 2018. Average Coverage of Portfolio Firms is the average of the analyst coverage of all theportfolio firms which the analyst follows. We take the average across all the years prior to 2018. Stop is a dummyvariable equal to 1 if the analyst stops covering a firm after the regulation. Drop Out is a dummy equal to 1 if theanalyst stops covering all the firms he/she used to cover prior the regulation. Unconditional probability is the simpleaverage of the two dummy variables.

165

Table 3.5: One Difference (Pre and Post)

Panel A: Coverage (Sample for Quantity)

Pre Post Diff P-value

EU Full 8.740 7.831 -0.909 0

EU Small 2.350 2.292 -0.058 0.436

EU Large 15.136 13.375 -1.761 0

US Full 11.219 10.846 -0.373 0.117

US Small 5.003 4.447 -0.556 0

US Large 17.441 17.252 -0.189 0.588

Panel B: Forecast Error (%)


EU Full 0.742 0.652 -0.090 0.010

EU Small 0.880 0.800 -0.079 0.157

EU Large 0.604 0.502 -0.102 0.013

US Full 0.432 0.482 0.050 0.034

US Small 0.578 0.656 0.078 0.052

US Large 0.285 0.307 0.022 0.335

Panel C: Forecast Dispersion (%)


EU Full 0.870 0.762 -0.108 0.003

EU Small 0.920 0.831 -0.089 0.121

EU Large 0.819 0.693 -0.126 0.004

US Full 0.527 0.585 0.057 0.034

US Small 0.677 0.740 0.063 0.157

US Large 0.378 0.429 0.052 0.084

Panel D: Analyst Forecast Error (%)


EU Full 0.818 0.790 -0.028 0.320

EU Small 0.869 0.878 0.009 0.848

EU Large 0.767 0.701 -0.065 0.028

US Full 0.383 0.485 0.102 0

US Small 0.477 0.627 0.149 0

US Large 0.288 0.343 0.055 0.001

This table provides the average values of Coverage, Forecast Error and Forecast Dispersion and Average Forecast Error(Analyst Level) in the pre- and post-years (one difference). Pre-years include year 2014, 2015, 2016, 2017 while post-year is 2018. Panel (a) to (c) show the firm level outcomes. “Full" denotes the results for all the firms. “Small" denotesthe results for small firms. “Large" denotes the results for large firms. Small firms are firms whose average fiscalyear-end market capitalization over the pre-regulation years falls below the median. To maintain the proportion of EUand US firms fixed, we calculate the cutoff separately in both regions. Panel (d) shows the analyst level outcomes inthe sample for intensive margin analyses. “Full" denotes the results for all the analysts. “Small" denotes the results forsmall analysts. “Large" denotes the results for large analysts. To define small analysts, we first calculate the averagemarket capitalization of all the firms within an analyst portfolio in a given year. We then average the average marketcapitalization across all the pre-regulation years. Small analysts are defined as analysts whose average of the portfolioaverage market capitalization over the pre-regulation years falls below the median. To maintain the proposition ofanalysts fixed, we calculate the median cutoff separately for EU analysts and US analysts.

166

Table 3.6: Firm Level Outcomes

Dependent variable:

Coverage Forecast Error (%)

Full Small vs Large Triple Diff Full Small vs Large Triple Diff

(1) (2) (3) (4) (5) (6)

EU × POST −0.651∗∗∗ −1.594∗∗∗ −0.142∗∗∗ −0.143∗∗∗

(0.196) (0.268) (0.035) (0.048)

EU × POST × SMALL 1.891∗∗∗ 0.002(0.200) (0.074)

SMALL × POST 1.678∗∗∗ −0.228∗∗∗ 0.043 0.029∗

(0.204) (0.018) (0.064) (0.016)

LN SIZE 1.797∗∗∗ 1.542∗∗∗ 1.796∗∗∗ −0.287∗∗∗ −0.288∗∗∗ −0.288∗∗∗

(0.212) (0.228) (0.204) (0.030) (0.097) (0.031)

LN BM 0.781∗∗∗ 0.841∗∗∗ 0.812∗∗∗ 0.279∗∗∗ 0.470∗∗∗ 0.278∗∗∗

(0.072) (0.146) (0.071) (0.073) (0.109) (0.073)

ROA −0.003∗∗∗ −0.004∗ −0.003∗∗∗ −0.003∗∗ −0.004 −0.003∗∗

(0.001) (0.002) (0.001) (0.001) (0.004) (0.001)

RET −0.007∗∗∗ −0.006∗∗∗ −0.007∗∗∗ 0.004∗∗∗ 0.005∗∗∗ 0.004∗∗∗

(0.001) (0.001) (0.001) (0.001) (0.001) (0.001)

RETVOL 0.531∗∗∗ 0.594∗∗ 0.529∗∗∗ 0.751∗∗ 0.111 0.750∗∗

(0.169) (0.300) (0.165) (0.317) (0.458) (0.323)

LN DISTANCE × POST 0.039 0.116 0.034(0.072) (0.082) (0.074)

GDP GROWTH −0.062∗ −0.049∗ −0.061∗∗ −0.003 −0.002 −0.003(0.033) (0.027) (0.030) (0.009) (0.009) (0.009)

UNEMPLOYMENT RATE 0.142 0.124 0.118 0.008 0.016 0.008(0.108) (0.092) (0.097) (0.013) (0.012) (0.013)

Firm FE Yes Yes Yes Yes Yes YesYear FE Yes Yes Yes Yes Yes Yes

Observations 21,960 10,665 21,960 14,020 5,555 14,020R2 0.967 0.971 0.967 0.489 0.497 0.489Adjusted R2 0.958 0.964 0.959 0.361 0.369 0.361

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.1) and Model (3.3). The dependent variablein column (1) to (3) is Coverage, the number of unique analysts covering a specific firm j in fiscal year t. Thedependent variable in column (4) to (6) is Forecast Error, defined as the absolute distance between the firm’s actualEPS and the mean of the analyst forecasts, scaled by the firm’s previous year-end price. EU is a dummy variable equalto one if the firm is domiciled and listed in Europe. POST is a dummy variable equal to one if the fiscal year t isequal to 2018. SMALL is a dummy variable equal to one if firms’ average fiscal year-end market capitalization overthe pre-regulation years falls below the median. To maintain the proportion of EU and US firms fixed, we calculate thecutoff separately in both regions. Column (1) and column (4) are the results for the difference-in-difference regressionbetween EU firms and US firms. Column (2) and column (5) are the results for the difference-in-difference regressionbetween small firms and larger firms within the EU. Column (3) and Column (6) are the results for the triple-differenceregression. Standard errors are clustered at the country level.

167

Table 3.7: Analyst Level Outcomes (Intensive Margin)

Dependent variable:

Forecast Error (Analyst) (%) Average Forecast Error (%)

Full Triple Diff Full Triple Diff Full Triple Diff

(1) (2) (3) (4) (5) (6)

EU × POST −0.129∗∗∗ −0.122∗∗∗ −0.131∗∗∗ −0.113∗∗∗ −0.149∗∗ −0.132∗∗

(0.032) (0.040) (0.033) (0.037) (0.058) (0.057)

EU × POST × SMALL −0.017 −0.035 −0.067(0.049) (0.062) (0.118)

SMALL × POST 0.031 0.048 0.118(0.025) (0.055) (0.091)

LN FIRMS COVERED × POST −0.014 −0.013 −0.030 −0.028 0.045 0.052(0.020) (0.020) (0.029) (0.030) (0.042) (0.044)

LN COVERAGE × POST −0.023 −0.015 −0.093∗ −0.078 −0.019 0.039(0.019) (0.021) (0.051) (0.069) (0.078) (0.106)

LN TENURE × POST −0.005 −0.004 0.005 0.006 −0.035 −0.032(0.012) (0.012) (0.019) (0.019) (0.034) (0.034)

LN DISTANCE × POST −0.018∗∗ −0.019∗∗ −0.043 −0.043 0.019 0.016(0.008) (0.008) (0.029) (0.029) (0.036) (0.037)

Firm Level Controls Yes Yes Yes Yes Yes YesAnalyst FE Yes Yes Yes Yes Yes YesFirm FE Yes Yes No No No NoB House FE Yes Yes Yes Yes No NoYear FE Yes Yes Yes Yes No NoB House FE × Year FE No No No No Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.5) and Model (3.6). We focus on analyst-firm pairs that survive throughout the sample period (2014 to 2018). The dependent variable in column (1) and (2) isForecast Error, defined as the absolute distance between the firm’s actual EPS and the analyst’s forecast, scaled bythe firm’s previous year-end price. The dependent variable in column (3) to (6) is Average Forecast Error, definedas the average of forecast errors across all the firms the analyst covers in a given year. EU is a dummy variable equalto one if the analyst’s portfolio consists of at least 70% of EU stock and zero if the analyst’s portfolio consists of atmost 30% of EU stocks. POST is a dummy variable equal to one if the fiscal year t is equal to 2018. SMALL isa dummy variable equal to one for the analyst whose average of the portfolio average market capitalization over thepre-regulation years falls below the median. We take the average first over all the firms the analyst covers each yearand then over all the pre-regulation years. To maintain the proposition of analysts fixed, we calculate the median cutoffseparately for EU analysts and US analysts. Column (1) and column (2) are the results at the analyst-firm pair level.Column (3) to column (6) are the results at the analyst level. Standard errors are clustered at the firm level in column(1) and column (2), and at the analyst level column (3) to column (6).

168

Table 3.8: Analyst Level Outcomes (Extensive Margin). Probability of Stop or Dropping Out after Unbundling

Dependent variable:


(1) (2) (3) (4)

RELATIVE ACCURACY −0.008∗∗∗ −0.007∗∗∗ −0.006∗∗∗ −0.006∗∗∗

(0.001) (0.002) (0.001) (0.001)

LN TENURE −0.068∗∗∗ −0.078∗∗∗ −0.081∗∗∗ −0.100∗∗∗

(0.009) (0.014) (0.018) (0.017)

INTERCEPT 1.150∗∗∗ 0.854∗∗∗

(0.169) (0.311)

Controls Yes Yes Yes YesFirm FE Yes No No NoB House FE No No Yes No

Observations 9,595 9,595 1,841 1,841R2 0.216 0.039 0.285 0.088Adjusted R2 0.064 0.038 0.217 0.084

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the regressions specified in Model (3.8) and Model (3.9). The dependent variable incolumn (1) and (2) is Stop, a dummy variable equal to 1 if the analyst stops covering a firm after the regulation. Thedependent variable in column (3) and (4) is DropOut, a dummy variable equal to 1 if the analyst stops covering allthe firms he used to cover prior to the regulation. RELATIV E ACCURACY = 1

T

∑Tt ( 1

J

∑Jj Scoreijt), where

J is the set of firms analyst i covers in year t and T includes all the years prior to 2018. It is a measure capturingthe analyst forecast quality and is comparable across different analysts. The higher the number, the more accurate theanalyst forecast is. Column (1) and (2) shows the results for the analyst-firm level analyses and column (3) and (4)the analyst level analyses. Standard errors are clustered at the firm level in column (1) and (2) , and at the brokeragehouse level in column (3) and (4).

169

Table 3.9: Analyst Level Outcomes (Extensive Margin). Probability of Stop or Dropping Out Before and AfterUnbundling

Dependent variable:


EU US Pooled EU US Pooled

(1) (2) (3) (4) (5) (6)

RELATIVE ACCURACY × POST −0.001∗∗∗ 0.0001 0.0001 −0.001∗∗ −0.0005 −0.0005(0.0003) (0.0002) (0.0002) (0.0005) (0.001) (0.001)

RELATIVE ACCURACY × POST × EU −0.001∗∗∗ −0.001(0.0004) (0.001)

POST × EU 0.058∗∗∗ 0.021(0.020) (0.046)

EU × RELATIVE ACCURACY 0.006∗∗∗ 0.004∗∗∗

(0.0002) (0.0005)

RELATIVE ACCURACY −0.005∗∗∗ −0.011∗∗∗ −0.011∗∗∗ −0.003∗∗∗ −0.007∗∗∗ −0.007∗∗∗

(0.0002) (0.0001) (0.0001) (0.0003) (0.0004) (0.0003)

EU −0.293∗∗∗ −0.198∗∗∗

(0.016) (0.037)

LN TENURE −0.023∗∗∗ 0.001 −0.011∗∗∗ 0.018∗∗∗ 0.007 0.013∗∗∗

(0.002) (0.002) (0.001) (0.005) (0.005) (0.004)

Controls Yes Yes Yes Yes Yes YesFirm FE Yes Yes Yes No No NoB House FE No No No Yes Yes YesYear FE Yes Yes Yes Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the regressions specified in Model (3.10) and Model (3.11). The dependent variable incolumn (1) to (3) is Stopijt+1, a dummy variable equal to 1 if the analyst i covering firm j in year t stops coveringthe same firm in year t+ 1. The dependent variable in column (3) to (6) is DropOutit+1, a dummy variable equal to1 if the analyst stops covering all the firms in year t + 1. RELATIV E ACCURACY = 1

J

∑Jj Scoreijt, where J

is the set of firms analyst i covers in year t. It is a measure capturing the analyst forecast quality and is comparableacross different analysts. The higher the number, the more accurate the analyst forecast is. POST is a dummy equalto 1 for the year 2017. Column (1) to (3) show the results for the analyst-firm level analyses. Column (1) presents theresults for EU analysts; column (2) for US analysts and column (3) for a triple-difference analysis. Column (4) to (6)show similar results for the analyst level analyses. Column (4) presents the results for EU analysts; column (5) for theUS analysts and column (6) for a triple-difference analysis. Standard errors are clustered at the firm level in column(1) to (3) , and at the brokerage house level in column (4) to (6).

170

Table 3.10: Placebo Test

Panel A: Firm Level

Dependent variable:

Coverage Forecast Error (%) Dispersion (%)

2015 2016 2017 2015 2016 2017 2015 2016 2017

(1) (2) (3) (4) (5) (6) (7) (8) (9)

EU × POST −0.071 −0.177 −0.160 −0.006 0.013 0.043 −0.050 −0.046 −0.003(0.113) (0.138) (0.152) (0.044) (0.027) (0.049) (0.031) (0.045) (0.071)

Controls Yes Yes Yes Yes Yes Yes Yes Yes YesFirm FE Yes Yes Yes Yes Yes Yes Yes Yes YesYear FE Yes Yes Yes Yes Yes Yes Yes Yes Yes

Observations 17,568 17,568 17,568 11,216 11,216 11,216 11,216 11,216 11,216R2 0.974 0.974 0.974 0.534 0.533 0.533 0.597 0.597 0.597Adjusted R2 0.965 0.965 0.965 0.377 0.377 0.377 0.463 0.462 0.462

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel B: Analyst Level

Dependent variable:

Average Forecast Error (%)

2015 2016 2017

(1) (2) (3)

EU × POST −0.029 −0.056∗ 0.021(0.032) (0.034) (0.035)

Firm Level Controls Yes Yes YesAnalyst Level Controls × Post Yes Yes YesAnalyst FE Yes Yes YesB House FE Yes Yes YesYear FE Yes Yes Yes

Observations 10,392 10,392 10,392R2 0.601 0.601 0.601Adjusted R2 0.446 0.446 0.446

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel A of this table shows the results of the panel regressions similar to Model (3.1) but only covers the pre-regulationyears. The dependent variable in column (1) to (3) is Coverage, the number of unique analysts covering a specificfirm j in fiscal year t. The dependent variable in column (4) to (6) is Forecast Error, defined as the absolute distancebetween the firm’s actual EPS and the mean of the analyst forecasts, scaled by the firm’s previous year-end price. Thedependent variable in column (4) to (6) is Forecast Dispersion, defined as the standard deviation of all the forecastsacross all the analysts following the same firm in the same year, scaled by the firm’s previous year-end price. EU isa dummy variable equal to one if the firm is domiciled and listed in Europe. From column (1) to column (3), POSTis a dummy variable defined as if the regulation occurred in 2015, 2016, 2017, respectively. Similar definition appliesto column (4) to (6) and column (7) to (9) . Column (1) to (3) are the results for Coverage. Column (4) to (6) arethe results for Forecast Error. Column (7) to (9) are the results for Forecast Dispersion. Standard errors areclustered at the country level. Panel B of this table shows the results of the panel regressions similar to Model (3.6).The dependent variable in column (3) to (6) is Average Forecast Error, defined as the average of forecast errorsover all the firms the analyst covers in a given year. EU is a dummy variable equal to one if the analyst’s portfolioconsists of at least 70% of EU stock and zero if the analyst’s portfolio consists of at most 30% of EU stocks. Fromcolumn (1) to column (3), POST is a dummy variable defined as if the regulation occurred in 2015, 2016, 2017,respectively. Standard errors are clustered at the analyst level.

171

Table 3.11: Firms’ Brokerage House Coverage

Dependent variable:

Brokerage Houses per Firm

Full Small vs Large Triple Diff

(1) (2) (3)

EU × POST −0.403∗∗∗ −1.081∗∗∗

(0.151) (0.207)

EU × POST × SMALL 1.359∗∗∗

(0.154)

SMALL × POST 1.275∗∗∗ −0.099∗∗∗

(0.157) (0.019)

Firm FE Yes Yes YesYear FE Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions similar Model (3.1) on the sample for coverage quality. Thedependent variable is Brokerage Houses per F irm, the number of unique brokerage houses covering a specificfirm in fiscal year t. EU is a dummy variable equal to one if the firm is domiciled and listed in Europe. POST isa dummy variable equal to one if the fiscal year t is equal to 2018. SMALL is a dummy variable equal to one iffirms’ average fiscal year-end market capitalization over the pre-regulation years falls below the median. To maintainthe proportion of EU and Column (1) is the result of the difference-in-difference regression between EU firms and USfirms. Column (2)is the result of the difference-in-difference regression between small firms and larger firms withinthe EU. Column (3) is the result of the triple-difference regression. Standard errors are clustered at the country level.

172

Table 3.12: Other Measures of Quality

Panel A: Firm Level Dispersion

Dependent variable:

Dispersion (%)


(1) (2) (3)

EU × POST −0.166∗∗∗ −0.195∗∗∗

(0.024) (0.042)

EU × POST × SMALL 0.058(0.063)

SMALL × POST 0.048 −0.015∗∗

(0.066) (0.007)

Controls Yes Yes YesFirm FE Yes Yes YesYear FE Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel B: Number of Firms per Analyst

Dependent variable:

Firms Covered

(1) (2)

EU × POST −0.791∗∗∗ −0.705∗∗∗

(0.130) (0.110)

Firm Level Controls Yes YesAnalyst Level Controls × Post Yes YesAnalyst FE Yes YesBrokerage House FE Yes YesYear FE Yes Yes

Observations 14,435 26,555R2 0.875 0.865Adjusted R2 0.839 0.805

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

Panel A of this table shows the results of the panel regressions specified in Model (3.1) and Model (3.3). The dependentvariable is Forecast Dispersion, defined as the standard deviation of all the forecasts across all the analysts followingthe same firm in the same year, scaled by the firm’s previous year-end price. EU is a dummy variable equal to oneif the firm is domiciled and listed in Europe. POST is a dummy variable equal to one if the fiscal year t is equalto 2018. SMALL is a dummy variable equal to one if firms’ average fiscal year-end market capitalization over thepre-regulation years falls below the median. To maintain the proportion of EU and US firms fixed, we calculate thecutoff separately in both regions. Column (1) is the result for the difference-in-difference regression between EU firmsand US firms. Column (2) is the result for the difference-in-difference regression between small firms and larger firmswithin the EU. Column (3) is the result for the triple-difference regression. Standard errors are clustered at the countrylevel. Panel B of this table shows the results of the panel regressions specified in Model (3.6). The dependent variableis Firms Covered, the number of firms one analyst covers in fiscal year t. EU is a dummy variable equal to one ifthe analyst’s portfolio consists of at least 70% of EU stock and zero if the analyst’s portfolio consists of at most 30%of EU stocks. POST is a dummy variable equal to one if the fiscal year t is equal to 2018. Column (1) is results foranalysts who cover at least one firm throughout the sample period. Column (2) is the result of all the analysts in oursample. Standard errors are clustered at the analyst level.

173

Table 3.13: Abnormal Return and Analyst Informativeness

Dependent variable:

Absolute Abnormal Return Average Analyst Informativeness Aggregate Analyst Informativeness

(1) (2) (3)

I(EU *post) 0.001∗∗∗ 0.0003∗∗∗ −0.009∗∗∗

(0.0002) (0.0001) (0.001)

Controls Yes Yes YesFirm FE No Yes YesAnalyst × Firm FE Yes No NoYear FE Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions similar to Model (3.1). The dependent variable in column (1)is ABRet ( Absolute Abnormal Return), the absolute market-adjusted abnormal return on the forecast revision datefor each analyst-firm pair. The dependent variable in column (2) is AV GAI (Average Analyst Informativeness). Itcaptures the average informativeness of one forecast revision date. The dependent variable in column (3) is AGAI(Aggregate Analyst Informativeness). It captures the aggregate informativeness of all revision dates. EU is a dummyvariable equal to one if the firm is domiciled and listed in Europe. POST is a dummy variable equal to one if thefiscal year t is equal to 2018. Standard errors are clustered at the firm level.

174

Table 3.14: Learning Effect (Lagged Forecast Error as Control)

Dependent variable:


(1) (2) (3)

EU × POST −0.134∗∗∗ −0.122∗∗∗ −0.158∗∗∗

(0.033) (0.035) (0.059)

LAGGED ERROR (PAIR) −0.071∗∗∗

(0.013)

LAGGED ERROR −0.142∗∗∗ −0.085(0.039) (0.097)

Firm Level Controls Yes Yes YesAnalyst Level Controls × Post Yes Yes YesFirm FE Yes No NoAnalyst FE Yes Yes YesB House FE Yes Yes NoYear FE Yes Yes NoB House FE × Year FE No No Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.5) and Model (3.6). The dependent vari-able in column (1) is Forecast Error, defined as the absolute distance between the firm’s actual EPS and theanalyst’s forecast, scaled by the firm’s previous year-end price. The dependent variable in column (2) to (3) isAverage Forecast Error, defined as the average of forecast errors across all the firms the analyst covers in agiven year. EU is a dummy variable equal to one if the analyst’s portfolio consists of at least 70% of EU stockand zero if the analyst’s portfolio consists of at most 30% of EU stocks. POST is a dummy variable equal to oneif the fiscal year t is equal to 2018. We include lagged forecast error as a control for the analyst learning ability.LAGGED ERROR (PAIR) is the lagged forecast error at the analyst-firm pair level. LAGGED ERROR is theaverage lagged forecast error at the analyst level. We take the average over all the firms the analyst covers. Column (1)presents the results on the analyst-firm pair level. Column (2) presents the results on the analyst level with brokeragehouse plus year fixed effects. Column (3) presents the results on the analyst level with brokerage house times yearfixed effects. Standard errors are clustered at the firm level in column (1) and at the analyst level in column (2) and(3).

175

Table 3.15: Brokerage House Employment

Dependent variable:

Number of Analysts

Small vs Large Triple Diff

(1) (2)

SMALL × POST 3.490∗∗ 3.943∗∗∗

(1.467) (0.863)

EU × POST × SMALL −0.543(1.587)

EU × POST −0.135(1.529)

Controls Yes YesB House FE Yes YesYear FE Yes Yes

Observations 545 1,185R2 0.953 0.953Adjusted R2 0.940 0.941

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.15). The dependent variable isNumber of Analysts Hired, defined as the number of unique analysts each brokerage house hires in each year.EU is a dummy variable equal to one if the brokerage house only hires EU analysts in the years prior to the regulation.POST is a dummy variable equal to one if the fiscal year t is equal to 2018. SMALL is a dummy variable equalto one if the average of the number of analysts a brokerage house hires over the pre-regulation years falls below themedian. To maintain the proportion of EU and US brokerage houses fixed, we calculate the cutoff separately in bothregions. Column (1) is the results for a difference-in-difference regression within the EU between small brokeragehouses and large brokerage houses. Column (2) is the result of the triple-difference regression. Standard errors areclustered at the brokerage house level.

176

Table 3.16: Capital Market Effects

Dependent variable:

EAinfo Bid Ask Spread

(1) (2) (3) (4)

EU × POST 0.002 0.0001(0.001) (0.001)

EU × Y15 −0.005∗∗∗ 0.0005(0.001) (0.001)

EU × Y16 −0.004∗∗ 0.0002(0.002) (0.0003)

EU × Y17 −0.004∗∗ −0.001∗∗

(0.002) (0.0005)

EU × Y18 −0.001 −0.00005(0.001) (0.001)

Controls Yes Yes Yes YesFirm FE Yes Yes Yes YesYear FE Yes Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.1). The dependent variable in column (1)and column (2) is EAinfojt. EAinfojt is defined as EAinfojt =

∑1d=−1 |Retj,d − Retm,d|, where d denotes

days around a firm’s earnings announcement date t, j denotes the firm, Retj,d denotes the daily return of firm j andRetm,d denotes the daily return of the stock market. To mitigate data errors in Datastream, we winsorize Retj,d at the1% level. The dependent variable in column (3) and column (4) is Bid Ask Spreadjt. The daily bid-ask spread iscomputed as the difference between the two prices divided by the midpoint. We winsorize the spread at the 1% level.We then take the mean of the daily spread over the year for a given firm and obtain Bid Ask Spreadjt. EU is adummy variable equal to one if the firm is domiciled and listed in Europe. POST is a dummy variable equal to one ifthe calendar year t is equal to 2018. Y 15, Y 16, Y 17, Y 18 are year dummy variables that are ones if the calendar yeart is equal to 2015, 2016, 2017, 2018. Column (1) to column (2) are results for EAinfojt. Column (3) to column (4)are results for Bid Ask Spreadjt. Standard errors are clustered at the country level.

177

Bibliography

Admati, A. R., P. M. Demarzo, M. F. Hellwig, and P. Pfleiderer (2018). The Leverage RatchetEffect. Journal of Finance 73(1), 145–198.

Admati, A. R. and P. Pfleiderer (1986). A monopolistic market for information. Journal of Eco-nomic Theory 39(2), 400–438.

Ahearne, A. G., W. L. Griever, and F. E. Warnock (2004). Information costs and home bias: ananalysis of us holdings of foreign equities. Journal of international economics 62(2), 313–336.

Andersen, L., D. Duffie, and Y. Song (2019). Funding Value Adjustments. Journal of Fi-nance 74(1), 145–192.

Angrist, J. D. and J.-S. Pischke (2009a). Mostly harmless econometrics: An empiricist’s compan-ion. Princeton University Press.

Angrist, J. D. and J.-S. Pischke (2009b, November). Mostly Harmless Econometrics: An Empiri-cist’s Companion. Number 8769 in Economics Books. Princeton University Press.

Arora, N., P. Gandhi, and F. A. Longstaff (2012). Counterparty credit risk and the credit defaultswap market. Journal of Financial Economics 103(2), 280–293.

Asness, C. S., A. Frazzini, and L. H. Pedersen (2019). Quality minus junk. Review of AccountingStudies 24(1), 34–112.

Asness, C. S., R. B. Porter, and R. Stevens (2000, February). Predicting stock returns usingindustry-relative firm characteristics. SSRN working paper # 213872.

Badoer, D. C. and C. M. James (2016). The Determinants of Long-Term Corporate Debt Issuances.Journal of Finance 71(1), 457–492.

Bai, J. and P. Collin-Dufresne (2019). The CDS-bond basis. Financial Management 48(2), 417–439.

Baker, M., R. Greenwood, and J. Wurgler (2003). The maturity of debt issues and predictablevariation in bond returns. Journal of Financial Economics 70(2), 261–291.

Baker, M. and J. Wurgler (2002). Market timing and capital structure. Journal of Finance 57(1),1–32.

Barrot, J.-N. (2016). Trade credit and industry dynamics: Evidence from trucking firms. TheJournal of Finance 71(5), 1975–2016.

Begenau, J. and J. Salomao (2019). Firm financing over the business cycle. Review of FinancialStudies 32(4), 1235–1274.

178

Behn, B. K., J.-H. Choi, and T. Kang (2008). Audit quality and properties of analyst earningsforecasts. The Accounting Review 83(2), 327–349.

Berndt, A., R. A. Jarrow, and C. O. Kang (2007). Restructuring risk in credit default swaps: Anempirical analysis. Stochastic Processes and their Applications 117(11), 1724–1749.

Binsbergen, J. H. V., W. F. Diamond, and M. Grotteria (2019). Risk-Free Interest Rates. SSRNElectronic Journal.

Black, F. and R. Litterman (1991, September-October). Global portfolio optimization. FinancialAnalysts’ Journal 48(5), 28–43.

Blume, M. E. (1993). Soft dollars and the brokerage industry. Financial Analysts Journal 49(2),36–44.

Bolandnazar, M. (2020). The Market Structure of Dealer-to-Client Interest Rate Swaps. (Decem-ber), 1–47.

Bolton, P., H. Chen, and N. Wang (2013). Market timing, investment, and risk management.Journal of Financial Economics 109(1), 40–62.

Boyarchenko, N., A. M. Costello, and O. Shachar (2019). The Long and Short of It: The Post-Crisis Corporate CDS Market. SSRN Electronic Journal (August).

Boyarchenko, N., P. Gupta, N. Steele, and J. Yen (2018). Trends in Credit Basis Spreads. FederalReserve Bank of New York Economic Policy Review 24(2), 15.

Brennan, M. J. and E. S. Schwartz (1984). Optimal Financial Policy and Firm Valuation. Journalof Finance 39(3), 593–607.

Caballero, R. J. and E. Farhi (2018). The safety trap. Review of Economic Studies 85(1), 223–274.

Caballero, R. J., E. Farhi, and P. O. Gourinchas (2016). Safe asset scarcity and aggregate demand.American Economic Review 106(5), 513–518.

Caballero, R. J., E. Farhi, and P. O. Gourinchas (2017). The safe assets shortage conundrum.Journal of Economic Perspectives 31(3), 29–46.

Carhart, M. M. (1997, March). On persistence in mutual fund performance. Journal of Finance 52,57–82.

Chopra, V. K. (1998). Why so much error in analysts’ earnings forecasts? Financial AnalystsJournal 54(6), 35–42.

Christensen, H. B., L. Hail, and C. Leuz (2016). Capital-market effects of securities regulation:Prior conditions, implementation, and enforcement. The Review of Financial Studies 29(11),2885–2924.

Cochrane, J. H. (2005). Asset pricing: Revised edition. Princeton university press.

179

Cohen, R. B., C. Polk, and T. Vuolteenaho (2003, April). The value spread. Journal of Fi-nance 58(2), 609–642.

Cohen, R. B. and C. K. Polk (1995, October). An investigation of the impact of industry factors inasset-pricing tests. University of Chicago working paper.

Coval, J. D. and T. J. Moskowitz (2001). The geography of investment: Informed trading and assetprices. Journal of political Economy 109(4), 811–841.

Crouzet, N. (2018). Aggregate implications of corporate debt choices. Review of Economic Stud-ies 85(3), 1635–1682.

Crouzet, N. and J. Eberly (2020). Rents and Intangible Capital: A Q+ Framework. Working Paper.

Crouzet, N. and J. C. Eberly (2019). Understanding Weak Capital Investment: the Role of MarketConcentration and Intangibles. NBER Working Paper Series 53(9), 1–10.

Cumming, D., S. Johan, and D. Li (2011). Exchange trading rules and stock market liquidity.Journal of Financial Economics 99(3), 651–671.

Dang, T., G. Gorton, and B. Holmström (2015). Ignorance, Debt and Financial Crises. WorkingPaper, 1–40.

Dang, T. V., G. B. Gorton, and B. R. Holmström (2019). The Information View of Financial Crises.SSRN Electronic Journal (i).

Daniel, K., L. Mota, S. Rottke, and T. Santos (2020). The Cross-Section of Risk and Returns.Review of Financial Studies 33(5), 1927–1979.

Daniel, K. D., M. Grinblatt, S. Titman, and R. Wermers (1997, July). Measuring mutual fundperformance with characteristic-based benchmarks. Journal of Finance 52(3), 1035–1058.

Daniel, K. D. and S. Titman (1997, March). Evidence on the characteristics of cross-sectionalvariation in common stock returns. Journal of Finance 52(1), 1–33.

Daniel, K. D. and S. Titman (2006, August). Market reactions to tangible and intangible informa-tion. Journal of Finance 61(4), 1605–1643.

Daniel, K. D. and S. Titman (2012, September). Testing factor-model explanations of marketanomalies. Critical Finance Review 1(1), 103–139.

Darmouni, O. and L. Mota (2020). The Financial Assets of Non-Financial Firms. SSRN ElectronicJournal.

Davis, J., E. F. Fama, and K. R. French (2000, February). Characteristics, covariances, and averagereturns: 1929-1997. Journal of Finance 55(1), 389–406.

De Santis, G. and B. Gerard (1997). International asset pricing and portfolio diversification withtime-varying risk. Journal of Finance 52(5), 1881–1912.

180

Department of the Treasury (2014). Liquidity Coverage Ratio: Liquidity Risk Measurement Stan-dards; Final Rule.

Di Maggio, M., M. Egan, and F. A. Franzoni (2019). The value of intermediation in the stockmarket. Available at SSRN 3430405.

Diamond, W. (2020). Safety Transformation and the Structure of the Financial System. Journal ofFinance (00), 1–40.

Doidge, C., G. A. Karolyi, and R. M. Stulz (2017, mar). The U.S. listing gap. Journal of FinancialEconomics 123(3), 464–487.

Dreman, D. N. and M. A. Berry (1995). Analyst forecasting errors and their implications forsecurity analysis. Financial Analysts Journal 51(3), 30–41.

Du, W., S. Gadgil, M. B. Gordy, and C. Vega (2017). Counterparty Risk and Counterparty Choicein the Credit Default Swap Market. SSRN Electronic Journal.

Du, W., J. Im, and J. Schreger (2018). The U.S. Treasury Premium. Journal of InternationalEconomics 112, 167–181.

Du, W., A. Tepper, and A. Verdelhan (2018). Deviations from Covered Interest Rate Parity. Journalof Finance 73(3), 915–957.

Duffie, D. (1999). Credit Default Swap Valuation. Financial Analysts Journal.

Duffie, D. (2018). Post-Crisis Bank Regulations and Financial Market Liquidity. Baffi Lecture.

Edelen, R. M., R. B. Evans, and G. B. Kadlec (2012). Disclosure and agency conflict: Evidencefrom mutual fund commission bundling. Journal of Financial Economics 103(2), 308–326.

Egan, M. (2018). Brokers vs. retail investors: Conflicting interests and dominated products. Jour-nal of Finance, Forthcoming.

Eisfeldt, A. L. and T. Muir (2016). Aggregate external financing and savings waves. Journal ofMonetary Economics 84, 116–133.

Elenev, V., T. Landvoigt, and S. Van Nieuwerburgh (2020). A Macroeconomic Model with Finan-cially Constrained Producers and Intermediaries. SSRN Electronic Journal.

Elizalde, A., S. Doctor, and Y. Saltuk (2009). Bond-CDS Basis Handbook Measuring. Technicalreport.

Fama, E. F. and K. R. French (1993). Common risk factors in the returns on stocks and bonds.Journal of Financial Economics 33, 3–56.

Fama, E. F. and K. R. French (1995). Size and book-to-market factors in earnings and returns. Thejournal of finance 50(1), 131–155.

Fama, E. F. and K. R. French (1996, March). Multifactor explanations of asset pricing anomalies.Journal of Finance 51(1), 55–84.

181

Fama, E. F. and K. R. French (1997, February). Industry costs of equity. Journal of FinancialEconomics 43(2), 153–193.

Fama, E. F. and K. R. French (2010). Luck versus skill in the cross-section of mutual fund returns.Journal of Finance 65(5), 1915–1947.

Fama, E. F. and K. R. French (2015, April). A five-factor asset pricing model. Journal of FinancialEconomics 116(1), 1–22.

Fang, B., O.-K. Hope, Z. Huang, and R. Moldovan (2019). The effects of mifid ii on sell-sideanalysts, buy-side analysts, and firms. Buy-Side Analysts, and Firms (May 20, 2019).

Fang, L. and A. Yasuda (2009). The effectiveness of reputation as a disciplinary mechanism insell-side research. The Review of Financial Studies 22(9), 3735–3777.

Farhi, E. and M. Maggiori (2018). A model of the international monetary system. QuarterlyJournal of Economics 133(1), 295–355.

Farre-Mensa, J., R. Michaely, and M. C. Schmalz (2018). Financing Payouts.

Fleckenstein, M. and F. A. Longstaff (2020). Renting Balance Sheet Space: Intermediary BalanceSheet Rental Costs and the Valuation of Derivatives. The Review of Financial Studies 33(11),5051–5091.

Frankel, R., S. Kothari, and J. Weber (2006). Determinants of the informativeness of analystresearch. Journal of Accounting and Economics 41(1-2), 29–54.

Frazzini, A. and L. H. Pedersen (2014, December). Betting against beta. Journal of FinancialEconomics 111(1), 1–25.

French, K. R. and J. M. Poterba (1991). Investor diversification and international equity markets.Technical Report 2.

Freyberger, J., A. Neuhierl, and M. Weber (2019). Dissecting Characteristics Nonparametrically.Review of Financial Studies forthcoming.

Gârleanu, N. and L. H. Pedersen (2011). Margin-based asset pricing and deviations from the lawof one price. Review of Financial Studies 24(6), 1980–2022.

Gerakos, J. and J. T. Linnainmaa (2018). Decomposing value. Review of Financial Studies 31(5),1825–1854.

Giambona, E., R. Matta, J.-L. Peydro, and Y. Wang (2020). Quantitative Easing, Investment, andSafe Assets: The Corporate-Bond Lending Channel, Volume 00.

Giglio, S. (2014). Credit default swap spreads and systemic financial risk. SSRN Electronic Jour-nal (April).

Giroud, X. and H. M. Mueller (2011). Corporate governance, product market competition, andequity prices. The Journal of Finance 66(2), 563–600.

182

Golubov, A. and T. Konstantinidi (2019). Where is the risk in value? evidence from a market-to-book decomposition. Journal of Finance forthcoming.

Gorton, B. G., S. Lewellen, and A. Metrick (2012). The Safe-Asset Share. American EconomicReview: Papers & Proceedings 102(3), 101–106.

Gorton, G. (2017). The history and economics of safe assets. Annual Review of Economics 9,547–586.

Gorton, G. and G. Pennacchi (1990). Financial Intermediaries and Liquidity Creation. The Journalof Finance 45(1), 49–71.

Goyal, V. K. and M. Z. Frank (2009). Capital Structure Decisions: Which Factors Are ReliablyImportant? Financial Management, 1–37.

Graham, J. R., M. T. Leary, and M. R. Roberts (2014). How Does Government Borrowing AffectCorporate Financing and Investment? SSRN Electronic Journal.

Greenwood, R., S. Hanson, and J. C. Stein (2010). A gap-filling theory of corporate debt maturitychoice. Journal of Finance 65(3), 993–1028.

Grossman, S. J. and J. E. Stiglitz (1980). On the impossibility of informationally efficient markets.The American economic review 70(3), 393–408.

Gu, L. and D. Hackbarth (2013). Governance and equity prices: Does transparency matter? Reviewof finance 17(6), 1989–2033.

Gu, S., B. T. Kelly, and D. Xiu (2018). Empirical Asset Pricing via Machine Learning. Yale SOMworking paper.

Guo, Y. and L. Mota (2020). Should information be sold separately? evidence from mifid ii.Journal of Financial Economics (Forthcoming).

Gutiérrez, G. and T. Philippon (2017). Declining Competition and Investment in the U.S.Mimeo (November), 1–77.

Hansen, L. P. and R. Jagannathan (1991, April). Implications of security market data for modelsof dynamic economies. Journal of Political Economy 99(2), 225–262.

Harford, J., F. Jiang, R. Wang, and F. Xie (2018). Analyst career concerns, effort allocation, andfirms’ information environment. The Review of Financial Studies 32(6), 2179–2224.

He, Z., A. Krishnamurthy, and K. Milbradt (2019). A model of safe asset determination. AmericanEconomic Review 109(4), 1230–1262.

Heider, F. and A. Ljungqvist (2015). As certain as debt and taxes: Estimating the tax sensitivity ofleverage from state tax changes. Journal of Financial Economics 118(3), 684–712.

Hennessy, C. A. and T. M. Whited (2005). Debt dynamics. Journal of Finance 60(3), 1129–1165.

183

Holmstrom, B. and J. Tirole (2001). LAPM : A Liquidity-Based Asset Pricing Model. The Journalof Finance LVI(5), 1837–1867.

Hong, H. and M. Kacperczyk (2010). Competition and bias. The Quarterly Journal of Eco-nomics 125(4), 1683–1725.

Hong, H. and J. D. Kubik (2003). Analyzing the analysts: Career concerns and biased earningsforecasts. The Journal of Finance 58(1), 313–351.

Huang, D., J. Li, and G. Zhou (2018, October). Shrinking factor dimension: A reduced-rankapproach. Washington University-Olin School working paper.

Jensen, M. C. (1968, June). The performance of mutual funds in the period 1945-1964. Journal ofFinance 23(2), 389–416.

Jiang, Z., A. Krishnamurthy, and H. Lustig (2020). Foreign Safe Asset Demand and the DollarExchange Rate. SSRN Electronic Journal.

Jones, C. M. (2002). A century of stock market liquidity and trading costs. Available at SSRN313681.

Kacperczyk, M., C. Pérignon, and G. Vuillemey (2020). The Private Production of Safe Assets.Journal of Finance.

Karmaziene, E. (2019). Incentives of financial analysts: trading turnover and compensation.Swedish House of Finance Research Paper (16-15).

Kelly, B. T., S. Pruitt, and Y. Su (2018, January 22). Characteristics are covariances: A unifiedmodel of risk and return. Chicago Booth Research Paper.

Kempf, E. (2019). The job rating game: Revolving doors and analyst incentives. Journal ofFinancial Economics.

Kim, G. H., H. Li, and W. Zhang (2016, sep). CDS-bond basis and bond return predictability.Journal of Empirical Finance 38, 307–337.

Kopecky, K. A. and R. M. H. Suen (2010). Finite state Markov-chain approximations to highlypersistent processes. Review of Economic Dynamics 13(3), 701–714.

Kozak, S., S. Nagel, and S. Santosh (2019). Shrinking the Cross Section. Journal of Finance forth-coming.

Krishnamurthy, A. and A. Vissing-Jørgensen (2012a). The aggregate demand for Treasury debt.Journal of Political Economy 120(2), 233–267.

Krishnamurthy, A. and A. Vissing-Jørgensen (2012b). Why an MBS-Treasury swap is better policythan the Treasury twist. Working Paper, 1–5.

Krishnamurthy, A. and A. Vissing-Jorgensen (2015). The impact of Treasury supply on financialsector lending and stability. Journal of Financial Economics 118(3), 571–600.

184

Kuehn, L. A. and L. Schmid (2014). Investment-Based Corporate Bond Pricing. Journal of Fi-nance 69(6), 2741–2776.

Laeven, M. L. and H. Huizinga (2009). Accounting discretion of banks during a financial crisis.IMF working paper 9-207.

Lang, M. H., J. Pinto, and E. Sul (2019). Mifid ii unbundling and sell side analyst research.Available at SSRN 3408198.

Ledoit, O. and M. Wolf (2003). Improved estimation of the covariance matrix of stock returns withan application to portfolio selection. Journal of empirical finance 10(5), 603–621.

Ledoit, O. and M. Wolf (2004). Honey, i shrunk the sample covariance matrix. The Journal ofPortfolio Management 30(4), 110–119.

Ledoit, O. and M. Wolf (2012). Nonlinear shrinkage estimation of large-dimensional covariancematrices. The Annals of Statistics 40(2), 1024–1060.

Ledoit, O. and M. Wolf (2017). Nonlinear shrinkage of the covariance matrix for portfolio selec-tion: Markowitz meets goldilocks. Review of Financial Studies 30(12), 4349–4388.

Lehavy, R., F. Li, and K. Merkley (2011). The effect of annual report readability on analystfollowing and the properties of their earnings forecasts. The Accounting Review 86(3), 1087–1115.

Lenel, M. (2020). Safe Assets, Collateralized Lending and Monetary Policy. SSRN ElectronicJournal (January).

Levene, H. (1961). Robust tests for equality of variances. Contributions to probability and statis-tics. Essays in honor of Harold Hotelling, 279–292.

Lewellen, J. (1999, October). The time-series relations among expected return, risk, and book-to-market. Journal of Financial Economics 54(1).

Lewellen, J., S. Nagel, and J. Shanken (2010). A skeptical appraisal of asset pricing tests. Journalof Financial Economics 96(2), 175–194.

Liu, Y., L. Schmid, and A. Yaron (2020). The Risks of Safe Assets. Working Paper, 1–39.

Liu, Y., A. Tsyvinski, and X. Wu (2019, jun). Common Risk Factors in Cryptocurrency. YaleUniversity Working Paper.

Longstaff, F. A., S. Mithal, and E. Neis (2005). Corporate yield spreads: Default risk or liquidity?New evidence from the credit default swap market. Journal of Finance 60(5), 2213–2253.

Lustig, H. N., N. L. Roussanov, and A. Verdelhan (2011). Common risk factors in currency mar-kets. Review of Financial Studies 24(11), 3731–3777.

Ma, Y. (2019). Nonfinancial Firms as Cross-Market Arbitrageurs. Journal of Finance 74(6),3041–3087.

185

Markowitz, H. M. (1952, March). Portfolio selection. Journal of Finance 7(1), 77–91.

McNichols, M. and P. C. O’Brien (1997). Self-selection and analyst coverage. Journal of Account-ing Research 35, 167–199.

Merkley, K., R. Michaely, and J. Pacelli (2017). Does the scope of the sell-side analyst industrymatter? an examination of bias, accuracy, and information content of analyst reports. TheJournal of Finance 72(3), 1285–1334.

Merton, R. C. (1973, September). An intertemporal capital asset pricing model. Economet-rica 41(5), 867–887.

Mota, L. (2021). The corporate supply of (quasi) safe assets. Available at SSRN.

Myers, S. C. (1984). The Capital Structure Puzzle. The Journal of Finance 39(3), 574–592.

Myers, S. C. and N. S. Majluf (1984). Corporate financing and investment decisions when firmshave information that investors do not have. Journal of Financial Economics 13(2), 187–221.

Nagel, S. (2016). The Liquidity Premium of Near-Money Assets. The Quarterly Journal of Eco-nomics 131(4), 1927–1971.

Novy-Marx, R. (2013). The other side of value: The gross profitability premium. Journal ofFinancial Economics 108(1), 1–28.

Oehmke, M. and A. Zawadowski (2015). Synthetic or Real? the Equilibrium Effects of CreditDefault Swaps on Bond Markets. Review of Financial Studies 28(12), 3303–3337.

Oehmke, M. and A. Zawadowski (2017). The anatomy of the CDS market. Review of FinancialStudies 30(1), 80–119.

Pastor, L. and R. F. Stambaugh (2003). Liquidity risk and expected stock returns. Journal ofPolitical Economy 111, 642–685.

Peters, R. H. and L. A. Taylor (2017). Intangible capital and the investment-q relation. Journal ofFinancial Economics 123(2), 251–272.

Roll, R. W. (1977). A critique of the asset pricing theory’s tests. Journal of Financial Economics 4,129–176.

Romer, P. M. (1990). Endogenous technological change. Journal of political Economy 98(5, Part2), S71–S102.

Rosenbaum, P. R. and D. B. Rubin (1985). Constructing a control group using multivariate matchedsampling methods that incorporate the propensity score. The American Statistician 39(1), 33–38.

Ross, S. A. (1976, December). The arbitrage theory of capital asset pricing. Journal of EconomicTheory 13, 341–360.

186

Rouwenhorst, G. (1995). Asset Pricing Implications of Equilibrium Business Cycle Models.

Stein, J. C. (1996). Rational capital budgeting in an irrational world. Journal of Business 69(4),429–455.

Stokey, N. L. (1989). Recursive methods in economic dynamics. Harvard University Press.

Strebulaev, I. A. and T. M. Whited (2012). Dynamic models and structural estimation in corporatefinance. Foundations and Trends in Finance 6(1-2), 1–163.

Sunderam, A. (2015). Money creation and the shadow banking system. Review of FinancialStudies 28(4), 939–977.

Tauchen, G. (1986). Finite state markov-chain approximations to univariate and vector autoregres-sions. Economics letters 20(2), 177–181.

Tauchen, G. and R. Hussey (1991). Quadrature-based methods for obtaining approximate solutionsto nonlinear asset pricing models. Econometrica: Journal of the Econometric Society, 371–396.

Thomas, S. (2002). Firm diversification and asymmetric information: evidence from analysts’forecasts and earnings announcements. Journal of Financial Economics 64(3), 373–396.

Tirole, J. (1988). The theory of industrial organization. MIT Press Books.

Van Nieuwerburgh, S. and L. Veldkamp (2009). Information immobility and the home bias puzzle.The Journal of Finance 64(3), 1187–1215.

Van Nieuwerburgh, S. and L. Veldkamp (2010). Information acquisition and under-diversification.The Review of Economic Studies 77(2), 779–805.

Veldkamp, L. L. (2006a). Information markets and the comovement of asset prices. The Review ofEconomic Studies 73(3), 823–845.

Veldkamp, L. L. (2006b). Media frenzies in markets for financial information. American EconomicReview 96(3), 577–601.

Warusawitharana, M. and T. M. Whited (2016). Equity market misvaluation, financing, and invest-ment. Review of Financial Studies 29(3), 603–654.

Zhang, X. F. (2006a). Information uncertainty and analyst forecast behavior. Contemporary Ac-counting Research 23(2), 565–590.

Zhang, X. F. (2006b). Information uncertainty and stock returns. The Journal of Finance 61(1),105–137.

187

Appendix A: Chapter 1

A.1 Data Appendix

A.1.1 Variable definitions

For corporate bond pricing data, I use WRDS bond returns which is based on TRACE and

FISD. For CDS-spreads data, I use Markit single name CDS-spread composites. All data is down-

loaded directly from the WRDS data service. The data set is bond returns centered. Details on the

merging algorithm can be found in A.1.4.

A.1.2 Measuring SG&A

I follow Peters and Taylor (2017) in measuring SG&A. Specifically, SG&A is the Compustat

variable XSGA minus XRD minus RDIP. There are two exceptions to this rule: (1) When XRD

exceeds XSGA but is less than cogs, SG&A is measured as XSGA with no further adjustments or

(2) When XSGA is missing, SG&A is set to zero. I also set XRD and RDIP to zero when missing.

As explained by Peters and Taylor (2017), the logic behind this formula is as follows:

According to the Compustat manual, XSGA includes R&D expense unless the

company allocates R&D expense to cost of goods sold (COGS). For example, XSGA

often equals the sum of Selling, General and Administrative and Research and De-

velopment on the Statement of Operations from firms’ 10-K filings. To isolate (non-

R&D) SG&A, we must subtract R&D from XSGA when Compustat adds R&D to

XSGA. There is a catch: When a firm externally purchases R&D on products not yet

being sold, this R&D is expensed as In-Process R&D and does not appear on the bal-

ance sheet. Compustat adds to XSGA only the part of R&D not representing acquired

In-Process R&D, so our formula subtracts RDIP (In-Process R&D Expense), which

188

Compustat codes as negative. We find that Compustat almost always adds R&D to

XSGA, which motivates our formula above. Standard & Poor’s explained in private

communication that, “in most cases, when there is a separately reported XRD, this is

included in XSGA.” As a further check, we compare the Compustat records and 10-K

filings for a random sample of one hundred firm-year observations with non-missing

XRD. We find that Compustat includes R&D in XSGA in 90 out of one hundred cases,

partially includes it in XSGA in one case, and includes it in COGS in seven cases. Two

cases remain unclear even after asking the Compustat support team. The screen above

lets us identify obvious cases in which xrd is part of COGS. This screen catches six of

the seven cases in which XRD is part of COGS. Unfortunately, identifying the remain-

ing cases is impossible without reading SEC filings. We thank the Compustat support

team from Standard & Poors for their help in this exercise. (Appendix B.2)

A.1.3 FISD Cleaning

FISD provides 4 databases of interest: “fisd_issue" that gives information at bond issue level,“fisd_mergedissuer"

that gives information about the bond issuer , “fisd_ratings_hist" to access the historical rating at

each moment in time, and finally “fisd_amt_out_hist" to be able to access each bonds amount

outstanding.

To calculate amount outstanding by firm, we consider the following bond types: US Corporate

Convertible (CCOV), US Corporate Inflation Indexed (CCPI), US Corporate Debentures (CDEB),

US Corporate LOC Backed (CLOC), US Corporate MTN (CMTN), US Corporate MTN Zero

(CMTZ), US Corporate Paper (CP), US Corporate Pass Thru Trust (CPAS), US Corporate PIK

Bond (CPIK), US Corporate Strip (CS), US Corporate UIT (CUIT), US Corporate Zero (CZ), US

Corporate Bank Note (USBN), US Corporate Insured Debenture (UCID).

To calculate firms bond net issuance we use variation in amount outstanding from Mergent

FIDS.

189

A.1.4 Merge FISD to Compustat

We use Merget FISD data set to retrieve bond characteristics. The bond identifier in FISD is

ISSUE ID, that maps one to one to CUSIP 9-digits. We use Compustat data set to retrieve firm-

level characteristics. Firm identifier in Compustat is GVKEY. The goal is to merge firm with the

bonds they issued.

CUSIP 6-digit is a firm identifier. The main challenge is though is that firms can issue bonds

through their subsidiaries and the CUSIP 6-digit does not reflect the parent company identity.

Our first task is to go from CUSIP 9-digit to a parent 6-digit CUSIP. I do this in two steps.

First, I merge FISD-issue data with FISD-issuer to the PARENT ID. I then get PARENT CUSIP,

the CUSIP 6-digit to the PARENT ID from FISD-issue. Second, I use SDC to get the UPARENT

CUSIP, the Ultimate Parent CUSIP 6-digit.

The figure below shows for all corporate bonds in FISD the successful merges with Compustat

(Panel a) and the successful merge with Compustat and Markit CDS data (Panel b).

Figure A.1: Successful Mergers

0

2000

4000

6000

2003 2005 2007 2009 2011 2013 2015 2017 2019 2021

Am

ount

Out

stan

ding

(Fa

ce V

alue

Bi $

)

Merged Not Merged

(a) FISD + Compustat

0

2000

4000

6000

2003 2005 2007 2009 2011 2013 2015 2017 2019 2021

Am

ount

Out

stan

ding

(Fa

ce V

alue

Bi $

)

Merged Not Merged

(b) FISD + Compustat + CDS

This figure shows the amount outstanding of all corporate bonds in FISD dataset. In green it is the amount outstandingwith successful merge with Compustat (Panel a) or Compustat and CDS data from Markit (Panel b). In red is theamount outstanding not merged.

190

A.2 Measures of Aggregate Safety Premium

AAA credit spread: It is the face-value weighted credit spread of AAA rated firms. I consider all

bonds in the sample. This measure was first introduced by Krishnamurthy and Vissing-Jørgensen

(2012a).

BBB − (AAA and AA) basis spread: It is the face-value weighted CDS-bond basis spread be-

tween “BBB” and “AAA and AA.” The difference in credit risk is controlled by the CDS correction

in the basis calculation. This measure is inspired by Krishnamurthy and Vissing-Jørgensen (2012a)

safety premium, who suggest to use the credit spread between BBB and AAA, but control for dif-

ferences in credit risk. The reason I use “AAA and AA" instead only “AAA" is because, in recent

years, there are only two US corporations that are rated AAA, Johnson & Johnson and Microsoft.

In order to minimize the influence of company-specific shocks I use the set of AAA and AA bonds.

Note that a large component of the BBB−AAA spread is due to credit risk, this is the reason I use

the basis spread instead of credit spread.

Box trade: Difference between a benchmark rates from the put-call parity relationship for European-

style options and UST yields. I used the 18-months maturity. Data is downloaded from Binsbergen’

personal website (here).

Structurally estimated: Based on the the model presented in Section 1.3, we can write the basis

as

basisi,t = (αi,t+1 − 1)ϕt (A.17)

=

[αµi,t exp

(−ψ bi,t+1

Et (Πt+1)

)− 1

]ϕt. (A.18)

Substituting all αi,t, we can re-write this relation as

basisi,t+1 =

[exp

(−ψ

∑k≥0

µkIt−k

)− 1

]ϕt where It =

bt+1

EtΠt+1

(A.19)

From this relationship we would like to estimate ψ, µ and ϕϕϕ = [ϕ1, ..., ϕT ]. I use a non-linear

191

https://sites.google.com/view/jules-van-binsbergen/data?authuser=0

least square estimation.

Consider the following estimation equation

basisi,t = (αi,t+1 − 1)ϕt + εi,t (A.20)

Let

{ψ, µ, ϕϕϕ} = argminψ,µ,ϕϕϕ∑t

∑i

ε2i,t (A.21)

be the unbiased estimator of interest.

Notice that given ψ and µ, each ϕt necessary is

ϕt = (xxxTt xxxt)−1xxxTt bbbt, (A.22)

where bbbt = [basis1,t, ..., basisN,t], xxxt = [x1,t, ..., xN,t] and

xi,t(ψ, µ) := (αi,t+1 − 1) (A.23)

Hence, we can replace (A.23) in (A.24) and get that

{ψ, µ} = argminψ,µ∑t

[bbbTt bbbt −

xxxTt bbbtxxxTt xxxt

]. (A.24)

Once we have {ψ, µ}, we can simply replace it in (A.22) to have an estimation of the UST

safety premium, ϕt.

A.3 Model

A.3.1 Price of the Bond

The main advantage of considering a model with liquidity default is that we can calculate the

price of the debt for each (Kt+1, Bt+1) in closed form. The price of the debt is given by

192

Pt+1 = a1 (1 + rc0p1) + a2rc1p2 + ϕ (Kt+1, Bt+1;xt, zt) (A.25)

where rc0 = (1− ξ)(1− δ)Kt+1 − fKt+1 −Bt+1

Bt+1

(A.26)

rc1 = (1− ξ)Kζt+1

Bt+1

(A.27)

mt =β

exp(xt)γ(A.28)

µ = µx + µz + σ2xγ (A.29)

σ =√σ2x + σ2

z (A.30)

a1 = mt exp

(µxγ +

1

2σ2xγ

2

)(A.31)

a2 = a1 exp

(µ+

1

2σ2

)(A.32)

p1 = Φ

(ln d− µ

σ

)(A.33)

p2 = Φ

(ln d− µ− σ2

σ

)(A.34)

Φ(·) is the normal distribution cdf. (A.35)

To see that, start with the bond price equation:

P (Kt+1, Bt+1;xt, zt) = E[Mt+1

((1− I{Vt+1<0}

)+ I{Vt+1<0}

Lt+1

Bt+1

)]+ s(Kt+1, Bt+1;xt, zt)

(A.36)

Or we can rewrite it as

P (Kt+1, Bt+1;xt, zt) =

[E[Mt+1]− E

[Mt+1I{Vt+1<0}

Bt+1 − Lt+1

Bt+1

]]+ s(Kt+1, Bt+1;xt, zt)

(A.37)

where s(Kt+1, Bt+1;xt, zt) is the safety premium for the bond.

193

But,

Bt+1 − Lt+1

Bt+1

=Bt+1 + (1− ξ) (f − (1− δ))Kt+1

Bt+1

− (1− ξ)Xt+1Zt+1Kζ

Bt+1

(A.38)

= rct − rct+1Xt+1Zt+1 (A.39)

We can find a close form solution to Pt, by solving the expectation in equation (A.37).

Lemma A.1 Let X = exp(x) and Z = exp(z), such that x ∼ N(µx, σx) and z ∼ N(µz, σz).

Then,

E[XαZβI{XZ<d}

]= exp(µxα + µyβ +

1

2

(σ2xα

2 + σ2zβ

2))Φ

(ln d− µx − µy − ασ2

x − βσ2z√

σ2x + σ2

z

)(A.40)

Hence, the price of the bond is:

Pt+1 = mtE[Xγt+1 + rctX

γt+1I{XZ<d} + rct+1X

γ+1ZI{XZ<d}]

+ s (Kt+1, Bt+1;xt, zt)(A.41)

= [at+1 (1 + rctpt+1) + a2rct+1p2] + s (Kt+1, Bt+1;xt, zt) (A.42)

(A.43)

where at+1 = exp

(µxγ +

1

2σ2xγ

2

), (A.44)

pt+1 = Φ

(ln d− µx − µz − γσ2

z√σ2x + σ2

s

)(A.45)

a2 = mt exp

(µx(γ + 1) + µz +

1

2(σ2

x(γ + 1)2 + σ2z

)(A.46)

p2 = Φ

(ln d− µx − µz − (γ + 1)σ2

z − σ2z√

σ2x + σ2

s

)(A.47)

194

A.3.2 Optimization

Figure A.2: Firm’s Total Value as Function of Capital and Debt Issuance

(a) Unconstrained optimal policy(b) Constrained optimal policy

This figure shows how, the total value of the firm varies with the amount capital invested and bond issued. The bluedot is the optimal policy {k∗t+1, b

∗t+1} that maximizes the value of the firm. In Panel (a) the firm’s investment is

unconstrained and Panel (b) firm’s investment is constrained. The parameter choices are: ϕt = 1%.

A.4 Infinite Horizon Dynamic Model

In this section, I present a numerical implementation of the infinite horizon version of the

model introduced in Section 1.3. The results are qualitatively equivalent to those obtained in the

two period model.

A.4.1 Simulation Strategy

For easiness of notation, in this section I omit the i subscript. Let

∫t = (kt, bt, αt, xt, zt, ϕt) (A.48)

be the vector state space representation of the firm at time t, where kt and bt are respectively the

level of capital and the debt issuance of the firm, i.e. the action, taken by the firm at time t− 1, αt

is the firm’s perceived safety, while xt, zt and ϕt are respectively the aggregate and idiosyncratic

195

productivity shocks and the aggregate safety premium. Note that the description of the state space

given here is slightly different than the one given in Section 1.3, as I replaced the net worth wt

with the action variables kt and bt. Recall that the net worth of the company is determined by the

formula

wt := exp(xt + zt)kζt + lv(1− δ)kt − bt, (A.49)

hence, given the action and the realization of productivity shocks, the net worth is determined. The

advantage of the parametrization proposed here is that it allows for a joint discretization of the

state space and the action space where each possible level of net worth generated by the choice of

an action and a realization of productivity shocks is faithfully represented.

As observed in Section 1.3, the value of the firm is given by the solution of the Bellman equa-

tion

V (∫t) = maxkt+1,bt+1

d(kt+1, bt+1; ∫t) + Et [Mt,t+1Vt+1(∫t+1)] . (A.50)

To find the optimal policy, first I discretize the state space, and then I apply the standard value

function iteration method (see for e.g. Stokey (1989)).

A.4.2 State Space Discretization

I discretize the state space by defining discrete spaces Sk, Sb, Sα, Sx, Sz and Sϕ for each

component of the state space vector and taking the Cartesian product of these sets. For Sk, Sb

and Sα I take a uniformly spaced grid of 41 points ranging from 1 and 1200 for Sk, 0 and 90 for

Sb, and 0 and 1 for Sα. For Sx, Sz and Sϕ, I follow a technique for approximating AR(1) models

with Markov chains originally introduced by Rouwenhorst (1995), generating three points for each

space. As noted in Kopecky and Suen (2010), when simulating highly persistent processes, this

approximation scheme is more reliable than alternative approaches commonly used in the literature

such as Tauchen (1986) or Tauchen and Hussey (1991).

196

Figure A.3: Issuance Response to Perceived Safety

This figure shows the optimal debt issuance as a share of kt as function of perceived safety, αt.

A.4.3 Main Results

This paper is mainly interested in the sensitivity of optimal bond issuance as a function of

perceived safety. In Figure A.3 I plot the numerical simulation of this function for the set of

approximation scheme described above. As one can see, the full dynamic model preserves the

qualitative behavior observed in the two period model.

197

A.5 Additional Empirical Results

Table A.1: Impact of Cross-basis on Firm’s Decisions Conditional on Positive Debt Issuance Including Ratings FE

Dependent variable:

Net PayoutCapital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8)

Cross-basis × DebtIssuance 0.041∗∗ 0.017 −0.002 0.003 0.001 −0.082 0.014 −0.051

(0.016) (0.016) (0.004) (0.004) (0.006) (0.056) (0.021) (0.052)

Cross-basis 0.003 0.017 0.0004 0.001 0.001 0.030 0.005 −0.019

(0.011) (0.011) (0.003) (0.002) (0.003) (0.023) (0.008) (0.023)

DebtIssuance 0.278∗∗∗ 0.183∗∗∗ 0.001 0.007 0.008 0.887∗∗∗ 0.115∗∗∗ 0.879∗∗∗

(0.028) (0.012) (0.007) (0.005) (0.008) (0.082) (0.024) (0.091)

CDS-spread −0.005 −0.050∗∗∗ −0.0002 0.006∗∗∗ 0.005∗∗ −0.024∗ −0.005 0.009

(0.012) (0.013) (0.002) (0.002) (0.002) (0.013) (0.006) (0.017)

log(Total Assets) −0.183∗∗∗ −0.099∗∗∗ −0.102∗∗∗ −0.330∗∗∗ −0.432∗∗∗ −0.272∗∗∗ −0.015 −0.604∗∗∗

(0.042) (0.026) (0.010) (0.007) (0.013) (0.075) (0.033) (0.096)

CHE (% assets) 0.023∗∗∗ −0.0001 −0.0005 −0.005∗∗∗ −0.005∗∗∗ 0.083∗∗∗ −0.004 −0.181∗∗∗

(0.003) (0.001) (0.001) (0.0005) (0.001) (0.010) (0.003) (0.009)

Tobin’s Q 0.023∗∗∗ 0.010∗∗∗ 0.001 0.003∗∗∗ 0.004∗∗∗ −0.001 −0.003∗ 0.017∗∗

(0.005) (0.002) (0.001) (0.001) (0.001) (0.004) (0.002) (0.007)

Leverage Ratio (%) −0.021∗∗∗ −0.010∗∗∗ −0.001∗∗∗ −0.005∗∗∗ −0.006∗∗∗ 0.002 0.001 −0.019∗∗∗

(0.002) (0.001) (0.0003) (0.0005) (0.001) (0.004) (0.002) (0.004)

ROA (%) 0.087∗∗∗ 0.044∗∗∗ 0.008∗∗∗ 0.008∗∗∗ 0.015∗∗∗ 0.026∗∗∗ 0.019∗∗∗ 0.013

(0.014) (0.013) (0.002) (0.002) (0.004) (0.008) (0.007) (0.020)

Firms FE Yes Yes Yes Yes Yes Yes Yes Yes

Time FE Yes Yes Yes Yes Yes Yes Yes Yes

Ratings FE Yes Yes Yes Yes Yes Yes Yes Yes

Observations 19,227 19,227 19,227 19,227 19,227 19,227 19,227 19,227

R2 0.418 0.688 0.766 0.940 0.902 0.116 0.059 0.139

Adjusted R2 0.396 0.676 0.757 0.938 0.898 0.083 0.024 0.106

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows regression results from estimating R3. The y-variable are (2) net payout, which is dividends plus netequity repurchase, (2) capital investment, which is CAPEX plus net PPE bought, (3) R&D expenses, (4) 30% of SG&Aexpenses, (5) intangible investment, which is R&D plus 30% of SG&A expenses, (6) acquisitions. All independentvariables are measured in time t + 1 and normalized by lag total assets. For the x-variables, cross-basis is the firm-level CDS-bond basis minus the basis index, basis index is the face-value weighted average of CDS-bond basis in thecorporate bond market and DebtIssuance is an indicator variable equal to 1 if net debt issuance is strictly positive and0 otherwise. The controls are CDS spread at the firms level, log of total assets, Tobin’s Q, CHE as percentage of totalassets, book leverage ratio and return on assets (ROA). All columns include time, firm and rating buckets fixed effects.All independent variables are measured in quarter t. Standard errors are reported in parenthesis and are clustered bytime. Data is quarterly from 2003Q1 to 2019Q3.

198

Table A.2: Impact of Cross-basis on Firm’s Decisions Conditional on Positive Debt Issuance Including Ratings FE

Dependent variable:

Net PayoutCapital


Intangible


Financial

Investment∆ Cash

(1) (2) (3) (4) (5) (6) (7) (8)

Cross-basis 0.055∗∗∗ 0.040∗∗∗ 0.002 0.005∗∗ 0.007∗ 0.019 0.013 −0.025

(0.018) (0.013) (0.003) (0.002) (0.004) (0.016) (0.010) (0.022)

CDS-spread −0.087∗∗∗ −0.086∗∗∗ −0.005∗∗ −0.002 −0.007∗∗ −0.038∗∗ −0.010∗∗ −0.012

(0.019) (0.019) (0.002) (0.002) (0.003) (0.015) (0.005) (0.019)

log(Total Assets) −0.231∗∗∗ −0.125∗∗∗ −0.109∗∗∗ −0.341∗∗∗ −0.450∗∗∗ −0.281∗∗∗ −0.024 −0.658∗∗∗

(0.041) (0.024) (0.010) (0.008) (0.012) (0.080) (0.033) (0.097)

CHE (% assets) 0.023∗∗∗ −0.0004 −0.0004 −0.005∗∗∗ −0.005∗∗∗ 0.078∗∗∗ −0.005 −0.186∗∗∗

(0.003) (0.001) (0.001) (0.0005) (0.001) (0.009) (0.003) (0.009)

Tobin’s Q 0.018∗∗∗ 0.008∗∗∗ 0.001 0.002∗∗∗ 0.002∗∗ 0.004 −0.002 0.012

(0.005) (0.002) (0.001) (0.001) (0.001) (0.004) (0.002) (0.007)

Firms FE Yes Yes Yes Yes Yes Yes Yes Yes

Time FE Yes Yes Yes Yes Yes Yes Yes Yes

Observations 19,227 19,227 19,227 19,227 19,227 19,227 19,227 19,227

R2 0.391 0.675 0.765 0.939 0.901 0.087 0.056 0.120

Adjusted R2 0.369 0.663 0.757 0.937 0.897 0.053 0.021 0.088

Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows regression results from estimating R3. The y-variable are (2) net payout, which is dividends plus netequity repurchase, (2) capital investment, which is CAPEX plus net PPE bought, (3) R&D expenses, (4) 30% of SG&Aexpenses, (5) intangible investment, which is R&D plus 30% of SG&A expenses, (6) acquisitions. All independentvariables are measured in time t + 1 and normalized by lag total assets. For the x-variables, cross-basis is the firm-level CDS-bond basis minus the basis index, basis index is the face-value weighted average of CDS-bond basis in thecorporate bond market and DebtIssuance is an indicator variable equal to 1 if net debt issuance is strictly positive and0 otherwise. The controls are CDS spread at the firms level, log of total assets, Tobin’s Q, CHE as percentage of totalassets, book leverage ratio and return on assets (ROA). All columns include time, firm and rating buckets fixed effects.All independent variables are measured in quarter t. Standard errors are reported in parenthesis and are clustered bytime. Data is quarterly from 2003Q1 to 2019Q3.

A.5.1 Cash Flow Identity

It is useful to do a complete study of how firms use the proceeds from debt issuance in response

to the cross-basis. We can use the cash-flow statement to do a “follow the money" approach

and how does the firm use the funds from issuance. This identity also justifies my measures of

real investment (capital, intangible and acquisitions) and financial investment. I also estimated

199

Equation (R3) for all variables in (A.51), I did not find any patterns that alters the results and

interpretation in the body of the paper.

To follow the money, for every firm and quarter, consider the augmented budget equation

F − (i+ l′ − (1 + rl)l) = (d− e)− (Pb′ − b),

the prime notation denotes one period ahead values, F as before is the net cash flow from opera-

tions, i is total investment, l − (1 + rl)l is change in cash (or liquid assets), d is total payouts to

shareholders, e is equity issuance, therefore d − e is net payout to equity holders and Pb′ − b net

debt issuance.

I map the budget equation to the cash-flow identity in Compustat data by calculating the fol-

lowing:

Total Net Debt Issuance = Net Payouts + Real Investment +

Financial Investment + ∆Cash − Net Operating Profits + Others(A.51)

I consider further breakdowns for total net debt issuance and real investments. In Compustat

one can disentangle total debt issuance in current debt (less than one year maturity), and long-term

debt(less than one year maturity). This differentiation is interesting because the basis calculations

only captures bond liabilities for time to maturity longer than one year. Therefore long-term debt

issuance is the main dependent variable of interest. Real investment can be broken down in three

categories: capital investment, intangible investment and acquisitions. This is interesting because

forms of investment that are not purely capital expenditure is becoming increasingly important

among the largest US firms.

• Net Operating Profits (F): net operating cash flow net minus intangible investment.

• Total Debt Issuance (TDI): the total net debt issuance, TDI = NDI + NCDI , where NDI is

the non-current net debt issuance and NCDI is the current debt issuance.

200

• Real investment (RINV): RINV = KINV + ACQ + IINV, where KINV = CAPX - SPPE,

AQC is acquisitions and INVV is in intangible investment, IINV = R&D + SG&A .

• ∆Cash is the cash variation, ∆ Cash = CHECH.

• Financial Investment (FINV): FINV = IVCH - SINV - IVSTCH.

• Other: the residual, Others = − IVACO − FIAO − TXBCOF

A.6 CDS-bond Basis

We follow Elizalde et al. (2009) and apply the par equivalent CDS spread methodology, or

PECS, to compute the CDS bond basis.

To fix notation, let r(s) be the instantaneous short interest rate, and λi(s) the instantaneous

hazard rate for issuer i. We define the discount curve Z(t) and the survival probability curve Si(t)

for issuer i as

Z(t) := e−∫ t0 r(s)ds, (A.52)

Si(t) := e−∫ t0 λi(s)ds. (A.53)

(A.54)

Given a bond issued by issuer iwithN interest paymentsCF1, CF2, . . . CFN at times t1, t2, . . . tN ,

principal value V and maturity tN , its dirty price50 is given by the formula

P =N∑n=1

Z(tn)Si(tn)CFn + Z(tN)Si(tN)V. (A.55)

Given a CDS contract withM remaining payments, with payment times t1, t2, . . . , tM and year

50Inclusive of accrued interest.

201

fractions ∆1,∆2, . . . ,∆M the present value of the contract for the protection buyer is given by

πpb = NCM∑m=1

∆mZ(tm)Si(tm) +NC

2

M∑m=1

∆mZ(tm) [S(tm−1)− S(tm)] (A.56)

while for the counter-party of the contract, the protection seller, the net present value is given by

πps = N(1−RR)M∑m=1

Z(tm) [S(tm−1)− S(tm)] . (A.57)

where N is the notional value, RR is the recovery rate, and C is the CDS spread. In particular, the

CDS par spread Cpar is the one for which

πpb(Cpar) = πps (A.58)

The PECS methodology consists in three steps. First, one bootstraps a survival probability

curve SCDSi (t) from CDS prices. Second, one defines the bond-implied survival probability curve

SBondi (t) = SCDSi (t) + βi by choosing the β that minimizes the difference between market prices

PMktj and prices Pj(SBondi ) implied by Formula (A.55), averaging over all issued bonds j for issuer

i

βi := argmin∑j

(PMktj − Pj(SBondi )

)2. (A.59)

In this phase, one needs to recall that bonds in the US market are quoted using the clean price,

hence one must re-add the accrued interest before taking the difference. Finally, one can define the

par equivalent CDS spread Cpecs for a given tenor tM by solving Equation (A.58) with the survival

probability given by SBondi (t). The CDS-bond basis B(t) is given by the difference Cpecs − Cpar.

202

A.7 Alternative Model with Limits to Arbitrage

A.7.1 Explicit Model of Arbitrageurs

At each time t, the arbitrageur must choose his consumption Cat , his investment in risky assets

Qai,t+1, in the uncollateralized loans Ba

u,t+1 and in the collateralized loans Bac,t+1.

Let W at be the wealth of the arbitrageur in time t.

W at = QQQa

tXXX t +Bau,t +Ba

c,t − Cat (A.60)

As in Gârleanu and Pedersen (2011), I assume that arbitrageurs must post margins to finance

their positions. Margins are paid both in long and short positions.

The arbitrageur solves the problem51

max{ct,Qt+1,Bu,t+1,Bc,t+1}∞t=0

Et

[∑t

βtU (Ct)

](A.61)

s.t. Qt+1PPP t +Bu,t+1Pu,t +Bc,t+1Pc,t ≤ Wt (A.62)∑i

mi,t|Qi,t+1|Pi,t +Bu,t+1Pu,t ≤ Wt, (A.63)

where mi,t is the margin requirement of asset i and time t. All other terms are standard.

Suppose that u(c) = c(1−γ)

1−γ and ∆ct+1 = ln(Ct+1

Ct

)= µc + ρc∆ct + σ2

cηc,t+1, where ηc,t+1 ∼

N(0, 1)

From the agent’s a first-order conditions we have:

E(ru,t+1 − rc,t+1) = ψt (A.64)

E(ri,t+1 − rc,t+1) +1

2σ2i = covt

(mat+1, ri,t+1

)− mi,t (A.65)

where ψt = ln(λt+κtλt

)and mi,t = ln

(λt+κtmi,tsign(Qi,t+1)

λt

)51We drop the subscript a to ease the notation.

203

To see this, I solve the Lagrangian,

L = Et

[∑t

βtU (Ct) + λt (Wt −Qt+1PPP t −Bu,t+1Pu,t −Bc,t+1Pc,t) +

κt

(Wt −

∑i

mi,t|Qi,t+1|Pi,t −Bu,t+1Pu,t

)] (A.66)

[∂L

∂Qi,t+1

]: Et [− (λt + κtmi,tsign(Qi,t+1))Pi,t + (λt+1 + κt+1)Xi,t+1] = 0 (A.67)

=⇒ Pi,t = Et[λt+1 + κt+1

λt + κt

λt + κtλt + κtmi,tsign(Qi,t+1)

Xi,t+1

](A.68)

[∂L

∂Bu,t+1

]: Et [− (λt + κt)Pu,t + (λt+1 + κt+1)] = 0 (A.69)

=⇒ Pu,t = Et[λt+1 + κt+1

λt + κt

](A.70)

[∂L

∂Bc,t+1

]: Et [−λtPc,t + λt+1 + κt+1] = 0 (A.71)

=⇒ Pc,t = Et[λt+1 + κt+1

λt

](A.72)

[∂L

∂ct

]: Et [u′(Ct)− λt − κt] = 0 (A.73)

=⇒ u′(Ct) = λt + κt (A.74)

Hence, the price of each asset i must be equal evolves two Lagrange multipliers from the

arbitrageur problem. The first λt is the traditional budget constraint. The second is the κt is the

204

margin constraint. In world in which there is a second set of agents that value safety services, the

arbitrageurs is short in in assets that provide safety services. In this case, the safety premium is

higher, higher the margin of requirement of the asset.

A.7.2 An Example of a Negative CDS-bond Trade

In this subsection I present an example for the negative CDS-bond basis for a Marriot (MAR 3

3/4 10/01/25, rated BBB−), 5-year maturity bought in end of September 2020. The example builds

on negative CDS-basis trade presented in Boyarchenko et al. (2018) and Bai and Collin-Dufresne

(2019). At the time, the CDS-bond basis of MAR was −1.3%. If investors were able to finance

themselves at the UST rate, this would represent an arbitrage trade with profits of 1.3% per year.

The key idea of this example is to flesh out the true funding costs a financial intermediary would

incur if she enters this trade. By looking at the net payouts, one can then calculate the implicit

funding cost that would make this trade unattractive.

Table A.3: Negative CDS-bond Basis Cash Flow Diagram

Funding Market

Bond Dealer

Repo rate

Funding Market

Debt Market Derivatives Market

OIS rate

BondCash Initial Margin+

CDS Trade Spread

Protection Premium (on credit event)

Cash: Bond MV – Haircut

OIS rate

Cash: Haircut

Repo Market

CDS Seller

CDS-Bond Basis Trader

Cash: Margin

Figure A.3 shows a cash-flow diagram for a generic negative trade transaction. In this example,

the investor is a generic intermediary that buys the bond in the bond market, buys CDS protection

in the derivatives market and funds the position in the funding market. Funding markets are of two

types: secured lending and unsecured lending. I assume the bond transacts at par and the investor

205

enters in $10 Million position. All the cash flows are summarized in Table A.4.

The first transaction is to buy the cash bond. The investor uses the bond as collateral in the repo

market and pays the repo rate. I follow Gârleanu and Pedersen (2011) and assume that haircuts for

IG bonds is 25%. She need to finance the haircut on the unsecured funding market.

The second transaction is to buy the CDS protection for the total notional value. The investor

pays the upfront payment and the CDS fixed premium of 1%. She also needs to post margin for

the CDS position. I follow the FINRA guidelines and set the margin to 12.5% of the notional.52

Both CDS payments and margins are funded in the unsecured funding market.

Finally, the investor needs to fund the total of repo haircut, and CDS upfront payment, fixed

premium and margin. The unsecured borrowing cost is investor-specific and it is not easily observ-

able. In general, the funding cost is a base rate, like OIS, plus a spread. As a benchmark, as shown

in Table A.4, I assume this spread is zero and calculate what would be net payoff of this trade.

For the notional of $10 Million, the net payoff of this trade would be $116 Thousand per year, or

1.16%. This number is smaller than the negative CDS-bond basis, but it is still large.

Finally, what would be the minimal funding cost that would make this transaction unattractive

to the investor? This rate is simply the calculation net payoff as a ratio of total unsecured funding

needed. As shown in Table A.5, this number is 3.06%, which means that an unsecured funding

cost of OIS plus a spread of 3.06% would make this trade unprofitable. When compared to the

5-year credit spread of large banks in the US, this number looks high.53 If the credit spread is

the proper cost of funding of large financial institutions, they should have been engaging in the

negative CDS-bond basis trade. Though, the credit spreads do not take into consideration the

regulation burden in taking this trade. This investment, although theoretically risk free, adds to

the risk based capital and liquidity requirements of financial institutions. 3.06% thus represent the

shadow cost of funding of the intermediary, which includes, among others, “balance sheet rental

cost" due to regulation constraints.

52For more details see FINRA website.53For comparison, 5-years yield for unsecured senior debt on September 30, 2020 for JP Morgan was 0.86%, Bank

of America was 0.9% and Citi Bank was 1.15%.

206

https://www.finra.org/rules-guidance/rulebooks/finra-rules/4240

The balance sheet rental cost include costs related stringent regulation such as liquidity cover-

age ratios. For details on how this trade affects the balance sheet of the intermediary, the reader can

consult Boyarchenko et al. (2018). More broadly, a recent literature explores how these balance

sheet costs are linked to prices and liquidity in a variety of markets (Du et al. (2018), Duffie (2018),

Andersen et al. (2019), Fleckenstein and Longstaff (2020), Bolandnazar (2020)).

Table A.4: A Negative CDS-bond Basis Example

Variable Rate Dollar ValueCorporate Yield 3.10%Treasury Yield 0.30%CDS spread 1.50%CDS-bond basis -1.30%Cash positionBond Market Value 10,000,000.00Secured Funding 75.00% 7,500,000.00Repo Haircut 25.00% 2,500,000.00CDS positionNotional Value 10,000,000.00Upfront Payment (52,020.00)Initial margin 12.50% (1,250,000.00)Funding cost of the cash positionRepo Rate on 75% notional 0.38% (28,500.00)OIS Rate on Haircut 0.40% (10,000.00)OIS Upfront Payment 0.40% (208.08)OIS rate on CDS initial margin 0.40% (5,000.00)Trade CashflowBond Coupon 310,000.00CDS fixed premium 1.00% (100,000.00)CDS trade spread (effective) (150,000.00)Total 160,000.00TotalsTotal Payoff 160,000.00Total Funding Cost (43,708.08)Net Payoff 116,291.92

This table shows the annual cash flows for negative CDS-bond trade on bond MAR 3 3/4 10/01/25 (Marriott bond,rated BBB−) bought on Sep 2020. Bond and UST yields are from Bloomberg. Repo rate is from Bloomberg repo ratecalculation. CDS spread and OIS is from Markit. Upfront payment is from Market calculator, which can be foundhere.

207

https://www.markit.com/markit.jsp?jsppage=pv.jsp

Table A.5: Implicit Funding Cost

Variable Rate Dollar ValueNet Payoff 116,291.92Total Unsecured Funding 3,802,020.00Implicit funding rate 3.06%

This table shows the implicit funding cost that would make the negative CDS-bond trade not profitable. The exampleconsiders Marriot bond (MAR 3 3/4 10/01/25, BBB− bond) bought on Sep 2020.

208

Appendix B: Chapter 2

B.1 Notation

1. Asset specifics

N : Number of assets

M : Number of characteristics

rrr : Vector of excess returns (N × 1)

µµµ : Vector of expected excess returns (N × 1)

Σ : Variance-covariance matrix (N ×N )

X : Matrix of characteristics (N ×M)

λλλc : Vector of characteristic premia (M × 1)

2. Factor model

f : Priced factor

λ : Premium on priced factor

βββ : Vector of individual firms’ loadings (βi) on f (N × 1)

g : Vector of unpriced factors ((K − 1)× 1)

γγγi : Vector of asset i’s loadings on the unpriced factors ggg (1× (K − 1))

εi : Idiosyncratic shocks for asset i

3. Characteristic portfolios (CPs)

Wc : Matrix of CPs’ weights (N ×M)

wwwc,m : m−th column of matrix Wc (N × 1)

rrrc ≡ W>c rrr : Vector of CPs’ excess returns (M × 1)

209

µµµc ≡ Errrc : Vector of CPs’ expected excess returns (M × 1)

Σc ≡ var(W>c rrr)

: Covariance matrix of CPs’ returns (M ×M )

B : Matrix of the projection coefficients of rrr on rrrc (N ×M)

bbbm : m−th column of matrix B (N × 1)

4. Characteristic efficient portfolios (CEPs)

W ∗c : Matrix of CEPs’ weights (N ×M)

www∗c,m : m−th column of matrix W ∗c (N × 1)

rrr∗c ≡ W ∗>c rrr : Vector of CEPs’ excess returns (M × 1)

µµµ∗c ≡ Errr∗c : Vector of CEPs’ expected excess returns (M × 1)

Σ∗c ≡ var(W ∗>c r)

: Covariance matrix of CEPs’ returns (M ×M )

B∗ : Matrix of the projection coefficients of rrr on rrr∗c (N ×M)

bbb∗m : m−th column of matrix B∗ (N × 1)

5. Hedge portfolios

Wh : Matrix of hedge portfolios’ weights (N ×M)

wwwh,m : m−th column of matrix Wh (N × 1)

∆ : Matrix of hedge ratios (M ×M)

B.2 Proofs

B.2.1 The characteristic efficient portfolios

For each characteristic m, let the characteristic efficient portfolio (CEP) be the solution of the

problem:

minwwwc,m

1

2www>c,mΣwwwc,m (B.75)

s.t. www>c,mX = eee>m (B.76)

210

Where wwwc,m is an (N × 1) vector of portfolio weights, Σ is the (N ×N) covariance matrix, X an

(N ×M) characteristic matrix and eeem an (M × 1) vector with the mth entry equal to 1 and all

others equal to 0.

The Lagrangian is:

L =1

2www>c,mΣwwwc,m + κκκm

(eeem −X>wwwc,m

)(B.77)

The FOC with respect towww>c,m is given as:

Σwwwc,m −Xκκκ>m = 000 (B.78)

wwwc,m = Σ−1Xκκκ>m (B.79)

The FOC with respect to κκκm is given as:

X>wwwc,m = eeem (B.80)

κκκ>m =(X>Σ−1X

)−1eeem (B.81)

Hence,

www∗c,m = Σ−1X(X>Σ−1X

)−1eeem (B.82)

The set of CEP weights, W ∗c , an (N ×M) matrix, of which the mth column is the vector of

weights of the mth CEP, can be written as

W ∗c = Σ−1X

(X>Σ−1X

)−1(B.83)

211

B.2.2 Proof of Proposition 2.1

Part 1: Let Σ∗c be the covariance matrix of CEP returns and µµµ∗c be the expected excess returns

of the CEPs. Notice that under Assumption A1, we have that

Σ∗c =(X>Σ−1X

)−1and µµµ∗c = λλλc (B.84)

Hence, the maximum squared Sharpe ratio in the space spanned by W ∗c is:

SR∗2 = µµµ∗>c Σ∗−1c µµµ∗c = λλλ>c X

>Σ−1Xλλλc = µµµ>Σ−1µµµ (B.85)

Hence, the CEPs span the MVE portfolio.

Part 2: Let B∗ be the (N ×M) matrix of loadings from a projection of assets’ excess returns

on the the CEPs’ excess returns. Then:

B∗ = ΣW ∗c (W ∗>

c ΣW ∗c )−1 = X (B.86)

B.2.3 Proof of Proposition 2.2

Part 1: For each characteristic portfolio, the weight vector of the optimal hedge portfolio,

wwwh,m, solves

maxwwwh,m

www>h,mbbbm (B.87)

s.t. www>h,mX = 000 (B.88)

1

2www>h,mΣwwwh,m = σ2 (B.89)

Where bbbm is the mth multivariate regression coefficient of rrr on rrrc.

212

The Lagrangian is:

L = www>h,mbbbm − κκκ1,mX>wwwh,m + κ2,m

(σ2 − 1

2www>h,mΣwwwh,m

)(B.90)

The FOC with respect towww>h,m is given by:

bbbm −Xκκκ>1,m − κ2,mΣwwwh,m = 000 =⇒ (B.91)

wwwh,m =1

κ2,m

Σ−1(bbbm −Xκκκ>1,m

)(B.92)

The FOC with respect to κκκ1,m is given by:

X>wwwh,m = 0 (B.93)

Multiplying the transpose of (B.92) by X in both sides and substituting in (B.93), we have:

(bbb>m − κκκ1,mX

>)Σ−1X = 000 =⇒ (B.94)

κκκ1,m = b>mΣ−1X(X>Σ−1X

)−1(B.95)

Replacing κκκ1,m we have:

www∗h,m = Σ−1(B −X

(X>Σ−1X

)−1X>Σ−1B

)e 1κ2,m

(B.96)

where e 1κ2,m

is an (M × 1) vector with the mth entry equal to 1κ2,m

and all others equal to 0.

Substituting B we have:

www∗h,m =(WcΣ

−1c − Σ−1X

(X>Σ−1X

)−1X>WcΣ

−1c

)e 1κ2,m

(B.97)

213

By solving the problem for all characteristics m and substituting W ∗c , we have:

W ∗h =

(Wc −W ∗

cX>Wc

)Σ−1c E−1 (B.98)

where E is an (M ×M) diagonal matrix, with columns eκ2,m . �

Part 2: Rearranging Equation B.97 we have

W ∗cX

>Wc = Wc −WhEΣc (B.99)

Define ∆∗ = EΣc and A = X>Wc. Hence,

W ∗c A = Wc −W ∗

h∆∗ (B.100)

�

B.2.4 The optimal hedge ratio

The last step is to find the optimal hedge ratio ∆∗

Let

W ∗c = W ∗

cX>Wc (B.101)

And

Σ∗c = Var[W ∗>c R

]= W>

c X(X>Σ−1X)−1X>Wc (B.102)

From Equation (B.99) we have:

WhEκ2Σc = Wc − W ∗c (B.103)

214

Var[ΣcEκ2W

>h R]

= Var

[(Wc − W ∗

c

)>R

](B.104)

=⇒ ΣcEκ2ΣhEκ2Σc = Σc − Σ∗c (B.105)

But,

Σ∗c = Σc + ΣcEκ2ΣhEκ2Σc − 2W>c ΣWhEκ2Σc (B.106)

Substituting, we have:

ΣcEκ2ΣhEκ2Σc = W>c ΣWhEκ2Σc =⇒ (B.107)

∆∗ = ΣcEκ2 = W>c ΣWhΣ

−1h (B.108)

B.3 Empirical details

B.3.1 Empirical definition of main variables

We use data from Compustat and CRSP, downloaded directly from the WRDS data service.

215

Book Equity (BE) Stockholders book equity, minus the book value of preferred

stock, plus balance sheet deferred taxes (if available and fiscal

year is < 1993), minus investment tax credit (if available), mi-

nus post-retirement benefit assets (PRBA) if available. Stock-

holders book equity is shareholder equity (SEQ), common eq-

uity (CEQ) plus preferred stock (PSTK) or total assets (AT) mi-

nus liabilities (LT) plus minority interest (MIB, if available) (de-

pending on availability, in that order). Book value of preferred

stock is redemption (PSTKRV), liquidation (PSTKL), or par value

(PSTK) (depending on availability, in that order). Deferred taxes

is deferred taxes and investment tax credit (TXDITC) or deferred

taxes and investment tax credit (TXDB) plus investment tax credit

(ITCB) (depending on availability, in that order).

Market Equity (ME) Total firm market value (|PRC| ∗ SHROUT ) summed over all

securities belonging to a firm, identified by GVKEY, and if miss-

ing, by PERMCO, as of June. We give preference to GVKEY to

correctly account for tracking stocks. To be valid, ME must be

greater than zero.

Book to Market (BEME) Book equity as of December divided by market equity as of De-

cember(BEME

).

Investment (INV) Total asset (AT) growth(

ATtATt−1

− 1)

. We consider PERMCO as

the identification key. AT must be greater than zero to be consid-

ered.

Operating Profitability (OP) Operating profitability to book equity (BE) ratio. Operating prof-

itability is sales (SALE) minus cost of goods sold (COGS), minus

selling, general, and administrative expenses (XSGA), minus in-

terest expense (XINT). In order to be non-missing, SALE must be

non-missing, at least one of the other entries must be non-missing

and BE must be greater than zero.216

B.3.2 Loading estimation

We calculate ex-ante forecasts of loadings for b’s, i.e., loadings on the benchmark characteristic

portfolios; and for δ’s, i.e., the optimal hedge ratio.

To calculate ex-ante forecasts of loadings we follow Frazzini and Pedersen (2014) and use two

different data windows of individual stock returns: 12-months of daily returns for volatility and

60 months of overlapping 3-day-cumulated returns for correlation. For this estimation, we only

consider returns where Pt and Pt−1 are non-missing.

For the estimation of correlations and factor volatilities, we calculate Daniel and Titman (1997)

style pre-formation factor returns. Following their procedure, we use portfolio allocations and

weights as of June 30 (portfolio formation date), and calculate portfolio returns for the preceding

5 years, holding the allocation and weights constant for each day.

Finally, we consider the observation that returns of stocks that will be allocated to a particular

portfolio at the end of June, experience a level-shift in average returns starting already in January,

as described in Daniel and Titman (1997). To account for this, when we calculate b’s, we include a

dummy variable for the rank-year, i.e., a variable that is equal to 1 if the return observation belongs

to the year of portfolio formation.

B.3.3 Dealing with missing prices

CRSP stock files report missing values for returns (RET) if a stock does not have a valid price

for 10 periods or more. The price tolerance period represents 10 months for monthly returns

and 10 days for daily returns. For calculating traded portfolio returns, we instead follow Ken

French’s website and allow for a 200 days price tolerance period. This choice makes daily and

monthly returns comparable. For pre-formation factors as described above, where all returns can

be observed before formation, we exclude any observation with a missing price (as described

above).

217

B.4 Supplemental results

B.4.1 Portfolio bin population

Table B.1: Number of firms in each portfolio

Panel A: ME × BEME

Portfolio No. of firms

ME BEME min mean max

1 1 27 564 1040

1 2 86 552 966

1 3 189 960 1813

2 1 78 255 428

2 2 119 219 322

2 3 88 152 197

3 1 139 255 466

3 2 101 158 221

3 3 45 95 163

Panel B: ME × OP


ME OP min mean max

1 1 100 1175 2137

1 2 30 497 796

1 3 24 380 776

2 1 75 189 347

2 2 68 217 317

2 3 60 207 300

3 1 52 112 199

3 2 114 174 246

3 3 121 218 308

Panel C: ME × INV


ME INV min mean max

1 1 65 911 1458

1 2 29 515 967

1 3 44 699 1296

2 1 68 161 266

2 2 38 201 308

2 3 52 262 389

3 1 51 122 211

3 2 115 192 235

3 3 85 194 321

Time-series minimum/mean/maximum number of firms for which loading forecast could be calculated, within eachsize-characteristic sorted portfolio.

Table B.1 displays the minimum, mean and maximum population of the 3×3 independently

sorted ME × BEME, ME × OP, and ME × INV portfolios, counting only firms for which a

forecast loading is available. Within each of these, stocks are sorted within a characteristic bucket,

on the respective forecast loadings, i.e., when we divide the displayed numbers by three, we have

the minimum/mean/maximum number of firms of any of the 27 portfolios that the hedge portfolios

are based on.

It turns out that the resulting portfolios end up well-populated. The lowest number of firms

occurs in the small/high-OP portfolio, in 1964, when the small/high-OP portfolio contains 24

firms. In general, even portfolios of big/value stocks contain 95 stocks, on average, resulting in

loading-sorted portfolios with more than 30 stocks.

218

B.4.2 High power vs. low power

The achievable improvement on a set of CPs intimately depends on the post-formation loadings

of the hedge portfolios. Since the hedge portfolios have roughly zero expected returns, more

negative loadings means that we are capturing variation in loadings that is not related to returns,

which is translated in positive αs. Therefore, once we add these portfolios to the original CPs we

should expect large Sharpe ratio improvements.

Maximizing the variation in loadings that is not related to returns relates to the power of the

test that has as null hypothesis that the CPs form a valid asset pricing model (Daniel and Titman,

2012).

We show how the use of the methodology to forecast loadings advanced in this paper, which

we refer to as the “high power” methodology, increases the power of standard asset pricing tests.

We illustrate how a standard “low power” methodology used to estimate the loadings (see, e.g.,

Daniel and Titman, 1997; Davis et al., 2000) leads to a failure to reject asset pricing models and

thus imposes too low a bound on the volatility of the stochastic discount factor. We do so by

constructing characteristic balanced portfolios and showing that the ability of standard asset pricing

models to properly account for their average returns depends critically on whether one uses the low

or high power methodology.

The traditional low power approach uses as instruments for future loadings the result of re-

gressing monthly stock excess returns on characteristic portfolio excess returns over a moving

fixed-sized window based on, e.g., 36 or 60 monthly observations, skipping the most recent 6

months.

We first compare the low and high power methodology by looking at the post-formation load-

ings. We estimate the post-formation loadings by running a full-sample time series regression of

the monthly excess returns on the five FF CPs (see equation (2.32)). To check whether our high

power methodology results in larger dispersion of the post-formation loadings when compared

to the low power methodology, Figure B.1 shows the post-formation loadings on the x-axis and

the respective average characteristic on the y-axis for each of the 27 portfolios. Panels A and B

219

correspond to the low and high power methodology, respectively.

Consider, for example, the top panels in Figure B.1, which focus on the loadings on HML for

each of the two estimation methodologies. There are 3 × 3 groups of estimates—connected by

lines—each corresponding to a particular BEME ×ME bin. Each of those lines have three points

corresponding to the three portfolios from the conditional sort on ex-ante estimated loadings.

The ideal output would be to find variation in the loadings that is not associated with variations

in characteristics, our proxy for expected returns. Hence, the ideal equivalent of Figure B.1 would

be one in which the spread from low betas (red dots) and high betas (green dots) is maximized, but

conditional on belonging to a characteristic bucket, there is no additional correlation between the

characteristic and the forecast loading, i.e., the dashed lines are perfectly horizontal.

As it is readily apparent from Figure B.1, the high power methodology generates substantially

more cross-sectional dispersion in post-formation loadings than the low power methodology, which

is key to generating hedge portfolios that are maximally correlated with the candidate characteristic

portfolio. For example, focus on the loadings on HML for the large growth portfolios (portfolio

(1,3)). The low power methodology generates post-formation loadings on HML, bHML, for each of

the three portfolios of −0.43, −0.22 and 0.01, respectively. The high power methodology instead

generates post-formation HML loadings of−0.48,−0.18 and 0.17, respectively. For the low power

methodology, the loading of a portfolios that goes long on the low loading portfolio and short the

high loading portfolios is −0.45 with a t−statistic of −8.89. For the high power methodology the

same post-formation loading is −0.65 with a t−statistic of −12.98.

Notice that, reassuringly, both methodologies generate a positive correlation between pre- and

post-formation loadings for each of the book-to-market and size groupings. This positive correla-

tion between pre- and post extends to the case of CMA. But in the case of the loadings on RMW,

the low power methodology does not produce a consistent positive association between pre- and

post-formation loadings, whereas the high power methodology does.

The spreads in the loadings translate directly into the ability to reject the FF model. Table

B.2 Panel A shows the results of time series regressions of the low power hedge portfolios on the

220

FF CPs. First, notice that the average returns of the hedge portfolios are not statistically different

from zero, as in the high power case (Panel B). Second, notice that the loadings of each hedge

portfolio on the respective FF CP is much lower for the low power, compared to the high power

methodology. For example, the bHML for the high power methodology is 0.8 with a t-statistic of

28.21, whereas for the low power it is 0.54 with a t-statistic of 19.69. Finally, we can look at the

α’s. For the low power method we cannot reject the FF model using any of the hedge portfolios in

isolation, since all α’s are not statistically different from zero. For the equal weighted combinations

of hedge portfolios shown in the last rows of the table, we can see that the t-statistics of the αs are

significantly positive even for the low power methodology. However, the equal weight combination

αs are systematically bigger for the high power methodology.

In sum then our high power methodology forecasts future loadings better than the one used by

Daniel and Titman (1997) or Davis et al. (2000) and, as a result, they translate into more efficient

hedge portfolios as well as asset pricing tests with higher power.

221

Figure B.1: Ex-post loading vs. characteristicH

ML

Panel A: Low power


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


Panel B: High power


0.00

0.25

0.50

0.75

1.00

1.25

1.50

1.75

2.00

BEME


RM

W


−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP



−0.4

−0.2

0.0

0.2

0.4

0.6

0.8

1.0

OP


CM

A


−0.2

0.0

0.2

0.4

0.6

0.8

INV



−0.2

0.0

0.2

0.4

0.6

0.8

INV


This figure shows the time-series average of post-formation characteristic portfolio loading on the x-axis and the time-series average of the respective characteristic on the y-axis of each of the 27 portfolios formed on size, characteristic(book-to-market/operating profitability/investment) and characteristic portfolio loading. Panel A uses the low powermethodology and B uses the high power methodology. The first row uses sorts on book-to-market and HML-loading,the second one operating profitability and RMW-loading and the last one investment and CMA-loading.

222

Table B.2: Results of time-series regressions on characteristic-balanced hedge-portfolios

Panel A: Low power


rh,MktRF 0.13 -0.02 0.29 0.23 -0.07 -0.16 -0.02 0.57

(1.39) (-0.33) (18.54) (10.33) (-2.26) (-5.12) (-0.46)

rh,SMB 0.11 0.02 0.14 0.39 -0.04 -0.10 -0.16 0.59

(1.38) (0.32) (10.30) (19.94) (-1.53) (-3.59) (-3.96)

rh,HML 0.03 -0.05 0.01 0.00 0.54 0.04 -0.45 0.39

(0.46) (-0.83) (0.40) (0.17) (19.69) (1.51) (-10.35)

rh,RMW 0.03 -0.09 -0.03 0.01 0.18 0.30 0.01 0.32

(0.52) (-1.64) (-2.42) (0.71) (6.98) (11.13) (0.33)

rh,CMA -0.01 -0.09 0.03 -0.01 -0.29 -0.01 0.68 0.31

(-0.21) (-1.61) (2.41) (-0.31) (-10.68) (-0.26) (16.03)

EW3 0.02 -0.08 0.00 0.00 0.15 0.11 0.08 0.40

HML,RMW,CMA (0.50) (-2.66) (0.30) (0.36) (10.62) (7.97) (3.82)

EW4 0.04 -0.06 0.07 0.06 0.09 0.05 0.06 0.28

EW3+MktRF (1.40) (-2.20) (10.40) (5.94) (6.85) (3.26) (2.66)

EW5 0.05 -0.05 0.07 0.10 0.08 0.03 0.03 0.32

EW4+SMB (1.55) (-1.89) (10.41) (9.89) (6.07) (2.52) (1.38)

Panel B: High power


rh,MktRF 0.10 -0.18 0.41 0.40 0.05 -0.17 -0.06 0.66

(0.80) (-2.44) (22.39) (15.18) (1.48) (-4.68) (-1.15)

rh,SMB 0.17 0.03 0.17 0.56 -0.01 -0.15 -0.16 0.72

(1.74) (0.50) (12.27) (28.28) (-0.33) (-5.57) (-3.95)

rh,HML 0.07 -0.11 0.03 -0.05 0.80 0.20 -0.54 0.61

(0.74) (-1.86) (1.80) (-2.34) (28.21) (6.68) (-12.03)

rh,RMW 0.08 -0.21 -0.05 0.04 0.31 0.69 0.11 0.65

(0.86) (-3.66) (-3.27) (1.96) (11.69) (24.80) (2.51)

rh,CMA -0.04 -0.20 0.04 0.02 -0.31 0.09 0.96 0.43

(-0.52) (-3.39) (2.60) (1.10) (-10.95) (2.90) (21.13)

EW3 0.04 -0.17 0.01 0.00 0.27 0.32 0.17 0.70

HML,RMW,CMA (0.64) (-5.45) (0.83) (0.39) (17.52) (20.56) (7.30)

EW4 0.05 -0.18 0.11 0.10 0.21 0.20 0.12 0.58

EW3+MktRF (1.17) (-5.92) (14.60) (9.75) (15.08) (13.71) (5.18)

EW5 0.07 -0.15 0.10 0.15 0.19 0.17 0.08 0.57

EW4+SMB (1.57) (-5.01) (14.08) (14.86) (13.60) (11.65) (3.83)

Stocks are first sorted based on size and one of book-to-market, profitability or investment into 3x3 portfolios. Con-ditional on those sorts, they are subsequently sorted into 3 portfolios based on the respective loading, i.e., on HML,RMW or CMA. For MktRF and SMB we use the average of three hedge portfolios, which are based on a 3x3 sort onsize and book-to-market, profitability or investment. The hedge portfolio then goes long the low loading and short thehigh loading portfolios. On the bottom, we form combination-portfolios that put equal weight on three (HML, RMW,CMA), four (HML, RMW, CMA, MktRF) or five (HML, RMW, CMA, MktRF, SMB) hedge portfolios. Monthlyreturns of these portfolios are then regressed on the 5 Fama and French (2015) characteristic portfolios in the sampleperiod from 1963/07 - 2019/12. In Panel A we use the low power and in Panel B we use the high power methodologyfor forecasting loadings.

223

Appendix C: Chapter 3

C.1 Data construction

C.1.1 Firm level observations

We restrict our attention to publicly traded firms in Europe and the US. To achieve this, we

require firms’ primary quotes of major shares to be listed in one of the major stock exchanges in

Belgium, Denmark, Finland, France, Germany, Iceland, Ireland, Italy, the Netherlands, Norway,

Portugal, Spain, Sweden, Switzerland, UK, and US.

To be more specific, from Datastream we select stocks with type Equity whose primary quotes

of major shares are listed in the following stock exchange markets: Euronext Brussel, OMX Nordic

Exchange Copenhagen, Helsinki, Euronext Paris, Berlin, Deutsche Boerse, Munich, NASDAQ

OMX Iceland, Dublin, Milan, Euronext Amsterdam, Oslo Bors, Lisbon, Madrid, Mercado Con-

tinuo, Stockholm, Six Swiss, London Stock Exchange, NYSE, NASDAQ (US) and Amex. We

further delete firms whose main listing place is the OTC market. The European stock exchange

markets we select consist of the 18 largest stock exchange markets in Europe. Notice that Switzer-

land is not part of EEA but given the integrity of the European financial market and the fact that

Switzerland will have to amend its current legislation to ensure that Swiss financial institutions to

have unfettered access to the EU market, we include Switzerland in our sample. Our results are

robust if we exclude Switzerland. We also require firms listed in Europe to be domiciled in one of

these European countries and firms listed in the US to be domiciled in the US. We further select

firms whose financial documents are presented in one of these countries’ local currencies: Den-

mark Krone, Euro, Great Britain Pound, Iceland Krona, Swedish Krona, Swiss Franc and USD.

To be included in our final sample, firms need to have non-empty total assets and positive

54For example, Ambac financial group went into bankruptcy in 2010 and emerged back in 2013. Its total investmentreturn in 2013 is 170455.55%. This is simply because Worldscope assigned the price of this firm from OTC the marketin 2012 to be $0.014 and the publicly traded price in 2013 is $24.56.

224

book-to-market ratio. In some cases, Worldscope records information of firms that went into re-

organization and stopped trading. In these cases, extreme values appear in total investment return

(RET, Worldscope Item 08801).54 We delete observations with total investment return above 99%

or below 1% of the total distribution of this item. Other variables used in our study are: the log

of total market capitalization at fiscal year t in dollars (LNSIZE), the log of book to market ratio

(LNBM), return on assets (ROA) and price volatility (PRICEVOL). The precise construction of all

these variables can be found in Appendix C.1.3. Table C.1 of Appendix C.3 reports the number of

firms, average coverage, average GDP growth rate and average unemployment rate in each country

in our sample.

C.1.2 Choice of data frequency

Following literature, we use annual forecast instead of quarterly forecast data for several rea-

sons (e.g., Hong and Kacperczyk (2010) and Giroud and Mueller (2011)). First, analyst forecasts

exhibit strong frequency seasonality. For example, in the US, most of the new forecasts are clus-

tered at the beginning of the year. Such seasonality varies across countries and even across firms.

It is difficult to attribute changes in quarterly forecast outcomes to MiFID II rather than to season-

ality. Second, quarterly financial reports are not mandatory. Though quarterly filing is common in

the US, it is uncommon in Europe. Data coverage and quality on a quarterly basis are poor, espe-

cially for EU firms. More importantly, since quarterly filings are not mandatory and uncommon

in Europe, focusing on firms with quarterly financial reports and earnings per share (EPS) data

suffers from severe selection bias: firms who report quarterly inherently differs from firms who do

not. Finally, since financial reports normally release with a certain lag, analyst information set will

not be updated timely when they issue quarterly forecasts, i.e., it is likely that they do not observe

firms’ last quarter true earnings when they make quarterly forecasts. Consequently, staleness is

going to be more severe in quarterly forecasts.

225

C.1.3 Variable definitions

Average Coverage of Portfolio Firms The average of the analyst coverage of all the portfolio firms

that the analyst covers in a given year.

Average Forecast Error The average of forecast errors over all the firms the analyst

covers in a given year.

Book to Market (BM) Book to Market is defined as total asset (Worldscope item

02999) minus long-term debt (Worldscope item 03251) over

market value (Worldscope item 08002) (e.g., Fama and

French (1995)).

Coverage Coverage is the number of unique analysts who produce

forecasts for a certain firm during its fiscal year. Analyst

forecast information is obtained from I/B/E/S data set.

Distance (DISTANCE) Number of days between the analyst forecast date and the

firm’s actual earnings report date.

Drop Out A dummy variable equal to 1 if the analyst stops covering

all the firms he used to cover prior to the regulation.

Forecast Dispersion Forecast dispersion is defined as the standard deviation of

all the forecasts over all the analysts following the same

firm in the same year, scaled by the firm’s previous year-

end price. Analyst forecast information is obtained from

the I/B/E/S data set.

Forecast Error Forecast error is defined as the absolute distance between

the firm’s actual annual earnings per share and the mean

of the analyst forecasts, scaled by the firm’s previous year-

end price. Analyst forecast information is obtained from

I/B/E/S data set.

226

Market Capitalization (ME) Worldscope item 08002. It is firm’s market capitalization at the

fiscal year end. We convert it to USD using the exchange rate at

firm’s fiscal year end date.

N of Firms Follows The number of firms the analyst follows in a given fiscal year.

Relative Accuracy The average of all the scores across all the firms the analyst cov-

ers and across all the years prior to the regulation, as defined in

Section 3.4.

Return on Assets (ROA) Worldscope item 08326. Firms’ return on assets.

Return Volatility (RETVOL) Annualized standard deviation of daily returns over a year for a

given firm. It is constructed from Datastream item RI (Return

Index).

Stop A dummy variable equal to 1 if the analyst stops covering a firm

after the regulation.

Tenure The total number of years the analyst appeared in I/B/E/S, starting

from 1995.

Total Investment Return (RET) Worldscope item 08801. It is measured as Pt+Dividendst−Pt−1

MEt−1, P is

the firm’s year-end price.

227

C.2 Additional figures

Figure C.1: Geographic Composition of Stocks in Analyst Portfolio.

0.0

0.1

0.2

0.3

0.4

0.5

0.00 0.25 0.50 0.75 1.00Proportion of EU Firms in Analyst Portfolio Prior to 2018

Fra

ctio

n of

the

Tota

l Num

ber

of A

naly

sts

This figure shows the histogram of the geographic composition of the stocks in the analyst portfolio prior to theregulation. 1 means that all the firms the analyst covers are EU firms. 0 means that all the firms the analyst covers areUS firms.

228

Figure C.2: Average Analyst Coverage (EU vs. US)

0.0

0.1

0.2

0.3

[0,2) [2,4) [4,6) [6,8) [8,10) [10,15) [15,20) [20,20+)Average Analyst Coverage

Fra

ctio

n of

the

Toto

al N

umbe

r of

Firm

s

US EU

This figure shows the histogram of the average coverage of EU firms and US firms. Analyst coverage is the numberof unique analysts covering a specific firm in fiscal year t. We report the averages of the period from 2014 to 2018.

229

Figure C.3: Firm Level Forecast Error Distribution

Pre Post

0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0

0.00

0.05

0.10

Forecast Error (%) of EU Firms

Fra

ctio

n of

the

Tota

l Num

ber

of F

irms

(a)

Pre Post

0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0

0.00

0.05

0.10

Forecast Error (%) of US Firms

Fra

ctio

n of

the

Tota

l Num

ber

of F

irms

(b)

This figure plots the histograms of the forecast errors in the pre- and post-regulation years. To get the forecast errorsin the pre-regulation years, we simply average firms’ forecast errors overall these years.

230

Figure C.4: Average Number of Analysts per Brokerage House Over Time in the EU, Small vs. Large

● ●● ●

●

−4

0

4

2014 2015 2016 2017 2018Adj

uste

d A

vera

ge N

umbe

r of

Ana

lyst

s pe

r B

roke

rage

Hou

se

● Small House Large House

This figure shows the average number of unique analysts one brokerage house hires over the years in the EU betweensmall brokerage houses and large brokerage houses. Small brokerage houses are brokerage houses whose averagenumber of employees (analysts they hire) prior to the regulation falls below the median. We adjusted the number sothat both lines start at 0 in 2014.

231

C.3 Other summary statistics

Table C.1: Summary Statistics (Country Information)

Country N of Firms Coverage Average GDP Growth Rate Average Unemployment Rate

Belgium 70 5.851 1.490 7.648

Denmark 52 10.292 2.064 5.930

Finland 83 7.899 1.438 8.652

France 292 8.384 1.340 9.858

Germany 284 9.487 1.824 4.178

Iceland 3 1.333 3.924 3.504

Ireland 27 9.719 10.046 8.506

Italy 132 7.120 0.770 11.536

Netherlands 60 12.997 2.048 5.804

Norway 95 8.105 1.824 4.108

Portugal 19 7.526 1.828 10.626

Spain 75 15.107 2.654 19.770

Sweden 195 6.591 2.608 7.106

Switzerland 144 9.025 1.620 4.846

United Kingdom 602 8.131 2.022 4.902

United States 2, 259 11.145 2.284 4.922

This table provides information for the number of firms, the average coverage, the average GDP growth rate and theaverage unemployment rate in each country. Coverage is the number of unique analysts covering a certain firm. Wereport the averages of the period from 2014 to 2018 within each country.

232

Table C.2: Summary Statistics (Small vs. Large in Sample for Quantity)

EU Firms (small): 1067 firms in total


Coverage 2.339 2 1 3 2.373Book to Market 2.133 1.117 0.650 1.948 4.802Investment Return (%) 11.957 4.938 −18.651 32.701 51.324Market Capitalization (Bil $) 0.175 0.114 0.044 0.257 0.176Return on Assets (%) −0.456 3.639 −1.042 7.087 21.521Return Volatility (%) 39.285 34.366 26.288 46.162 20.008

US Firms (small): 1130 firms in total


Coverage 4.892 4 2 7 4.092Book to Market 2.130 1.000 0.569 2.103 3.358Investment Return (%) 3.669 0.029 −23.309 23.484 46.361Market Capitalization (Bil $) 0.552 0.391 0.149 0.829 0.516Return on Assets (%) −3.578 1.762 −2.570 5.939 31.654Return Volatility (%) 42.377 36.543 26.659 51.664 21.888

This table provides the summary statistics for important variables in the sample for coverage quantity. We split thesample into small firms and large firms. Small firms are firms whose average fiscal year-end market capitalization fallsbelow the median. We define the median cutoff separately in the EU and the US. Coverage is the number of uniqueanalysts covering a certain firm. Book to Market is defined as total asset (Worldscope item 02999) minus long-termdebt (Worldscope item 03251) over market value (Worldscope item 08002). Investment return is Worldscope item08801, measured as (market price at year t+ dividends−market price at year t−1)/market price at year t−1). Marketcapitalization is Worldscope Item 08002 measured in billion dollars. Return on Assets is Worldscope Item 08326.Return volatility is the annualized daily standard deviation of returns over a year for a given firm.

233

Table 1.18 Continued: Summary Statistics (Small vs. Large in Sample for Quality)

EU Firms (large): 1066 firms in total



US Firms (large): 1129 firms in total



234

Table C.3: Summary Statistics (Small vs. Large in Sample for Quality)

EU Firms (small): 556 firms in total



US Firms (small): 847 firms in total



This table provides the summary statistics for important variables in the Sample for coverage quality. To be includedin this sample, firms need to be covered by at least 2 analysts each year. We split the sample into small firms and largefirms. Small firms are firms whose average fiscal year-end market capitalization fall below the median. We define themedian cutoff separately in the EU and the US. Coverage is the number of unique analysts covering a certain firm.Forecast error is defined as the absolute distance between the firm’s actual annual earnings per share and the meanof the analyst forecasts, scaled by the firm’s previous year-end price. Forecast dispersion is defined as the standarddeviation of all the forecasts across all the analysts following the same firm in the same year, scaled by the firm’sprevious year-end price. Book to Market is defined as total asset (Worldscope item 02999) minus long-term debt(Worldscope item 03251) over market value (Worldscope item 08002). Distance is the average of the days betweenthe analyst forecast date and the firm’s actual EPS report date. We take the average over all the analysts following thesame firm and measure this variable in 2014. Investment return is Worldscope item 08801, measured as (market priceat year t+ dividends−market price at year t−1)/market price at year t−1). Market capitalization is Worldscope Item08002 measured in billion dollars. Return on Assets is Worldscope Item 08326. Return volatility is the annualizedstandard deviation of daily returns over a year for a given firm. We remove observations for which forecast error andforecast dispersion is larger than 10% of the firm’s share price at the end of the previous year.

235

Table 1.19 Continued: Summary Statistics Continued (Small vs. Large in Sample for Quality)

EU Firms (large): 555 firms in total



US Firms (large): 846 firms in total



236

C.4 Additional regression results

C.4.1 Dynamic coefficients point estimates

Table C.4: Firm Level Outcomes (Dynamic Coefficients)

Dependent variable:

Coverage Forecast Error (%) Dispersion (%)

(1) (2) (3)

EU × Y15 0.050 −0.020 −0.026(0.094) (0.052) (0.045)

EU × Y16 −0.134 −0.018 −0.080∗∗∗

(0.121) (0.051) (0.029)

EU × Y17 −0.201 0.033 −0.036(0.193) (0.045) (0.073)

EU × Y18 −0.715∗∗∗ −0.143∗∗∗ −0.199∗∗∗

(0.269) (0.044) (0.038)



Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the dynamic coefficient regressions specified in Model (3.4). The dependent variable incolumn (1), is Coverage, the number of analysts covering a specific firm in fiscal year t. The dependent variable incolumn (2), is Forecast Error, defined as the absolute distance between the firm’s actual annual earnings per shareand the mean of the analyst forecasts, scaled by the firm’s previous year-end price. The dependent variable in column(3) is Forecast Dispersion, defined as the standard deviation of all the forecasts across all the analysts following thesame firm in the same year, scaled by the firm’s previous year-end price. EU is a dummy variable equal to one if thefirm is domiciled and listed in Europe. Y 15, Y 16, Y 17, Y 18 are year dummy variables equal to one if year t is equalto 2015, 2016, 2017, 2018. We include other important firm level controls whose definitions and constructions can befound in the Appendix C.1.3. Standard errors are clustered at the country level.

237

Table C.5: Intensive Margin (Dynamic Coefficients)

Dependent variable:


(1) (2) (3)

EU × Y15 0.031 0.010 0.015(0.039) (0.037) (0.055)

EU × Y16 −0.044 −0.075∗ −0.034(0.036) (0.038) (0.060)

EU × Y17 0.004 −0.008 −0.017(0.037) (0.039) (0.065)

EU × Y18 −0.132∗∗∗ −0.150∗∗∗ −0.158∗∗

(0.041) (0.042) (0.077)

Frim level Controls Yes Yes YesAnalyst level Controls × Post Yes Yes YesAnalyst FE Yes Yes YesFirm FE Yes No NoYear FE Yes Yes NoB House FE No Yes NoB House FE × Year FE No No Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the dynamic coefficient regressions specified in Model (3.4). The dependent variablein column (1), is Forecast Error (Analyst), defined as the absolute distance between the actual annual earningsper share and the analyst’s forecast, scaled by the firm’s previous year-end price. The dependent variable in column(2) and (3) is Average Forecast Error, defined as the average of forecast errors across all the firms the analystcovers in a given year. EU is a dummy variable equal to one if the analyst’s portfolio consists of at least 70% of EUstock and zero if the analyst’s portfolio consists of at most 30% of EU stocks. Y 15, Y 16, Y 17, Y 18 are year dummyvariables equal to one if year t is equal to 2015, 2016, 2017, 2018. We include all the analysts (small and large) usedin the intensive margin analyses. Standard errors are clustered at the firms level in column (1) and at the analyst levelin column (2) and (3).

238

C.4.2 Additional results for firm level coverage

Table C.6: Firm Level Coverage (Sample for Coverage Quality).

Dependent variable:

Coverage


(1) (2) (3)

EU × POST −1.219∗∗∗ −2.496∗∗∗

(0.322) (0.391)


(0.242)

SMALL × POST 1.968∗∗∗ −0.564∗∗∗

(0.229) (0.009)



Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.1) on the sample for coverage quality. Thedependent variable is Coverage, the number of unique analysts covering a specific firm in fiscal year t. EU is adummy variable equal to one if the firm is domiciled and listed in Europe. POST is a dummy variable equal toone if the fiscal year t is equal to 2018. SMALL is a dummy variable equal to one if firms’ average fiscal year-endmarket capitalization over the pre-regulation years falls below the median. To maintain the proportion of EU and USfirms fixed, we calculate the cutoff separately in both regions. Column (1) is the result of the difference-in-differenceregression between EU firms and US firms. Column (2)is the result of the difference-in-difference regression betweensmall firms and larger firms within the EU. Column (3) is the result of the triple-difference regression. Standard errorsare clustered at the country level.

239

Table C.7: Firm Level Coverage (Results in logs).

Dependent variable:

LN (1+Coverage) LN Coverage


(1) (2) (3) (4) (5) (6)

EU × POST −0.008 −0.076∗∗∗ −0.036 −0.116∗∗∗

(0.018) (0.024) (0.023) (0.021)

EU × POST × SMALL 0.137∗∗∗ 0.160∗∗∗

(0.031) (0.017)

SMALL × POST 0.086∗∗∗ −0.054∗∗∗ 0.079∗∗∗ −0.079∗∗∗

(0.031) (0.002) (0.018) (0.001)

Controls Yes Yes Yes Yes Yes YesFirm FE Yes Yes Yes Yes Yes YesYear FE Yes Yes Yes Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of panel regressions specified in Model (3.1). The dependent variable in column (1) tocolumn (3) is Log (1 + Coverage), the log of 1 + the number of unique analysts covering a specific firm in fiscalyear t. The dependent variable in column (4) to column (6) is Log Coverage, log of the number of unique analystscovering a specific firm in fiscal year t. EU is a dummy variable equal to one if the firm is domiciled and listed inEurope. POST is a dummy variable equal to one if the fiscal year t is equal to 2018. SMALL is a dummy variableequal to one if firms’ average fiscal year-end market capitalization over the pre-regulation years falls below the median.To maintain the proportion of EU and US firms fixed, we calculate the cutoff separately in both regions.Column (1)to column (3) are results obtained in the sample for coverage quantity. Since coverage can be 0 in this sample, we put1 + Coverage inside the log function. Column (4) to (6) are the results obtained in the sample for coverage quality.Column (1) and column (4) are the results for the difference-in-difference regression between EU firms and US firms.Column (2) and column (5) are the results for the difference-in-difference regression between small firms and largerfirms within the EU. Column (3) and Column (6) are the results for the triple-difference regression. Standard errorsare clustered at the country level.

240

Table C.8: Firm Level Coverage (EU Small vs. US Small and EU Large vs. US Large)

Dependent variable:

Coverage

Full Small Large Triple Diff

(1) (2) (3) (4)

EU × POST −0.651∗∗∗ 0.373∗∗∗ −1.634∗∗∗ −1.594∗∗∗

(0.196) (0.094) (0.289) (0.268)


(0.200)

SMALL × POST −0.228∗∗∗

(0.018)

Controls Yes Yes Yes YesFirm FE Yes Yes Yes YesYear FE Yes Yes Yes Yes


Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of panel regressions specified in Model (3.1) and Model (3.3). The dependent variable isCoverage, the number of unique analysts covering a specific firm j in fiscal year t. EU is a dummy variable equalto one if the firm is domiciled and listed in Europe. POST is a dummy variable equal to one if the fiscal year t isequal to 2018. SMALL is a dummy variable equal to one if firms’ average fiscal year-end market capitalization overthe pre-regulation years falls below the median. To maintain the proportion of EU and US firms fixed, we calculatethe cutoff separately in both regions. We split the firms into small firms and large firms and perform difference-in-difference analyses separately for them. Column (1) to column (3) are results for all the firms, small firms, and largefirms respectively. Column (4) is the result of the triple-difference regression. Standard errors are clustered at thecountry level.

241

C.4.3 Additional results for analyst informativeness

Table C.9: Analyst Informativeness (Small vs. Large).

Dependent variable:

Analyst Informativeness Aggregate Analyst Informativeness


(1) (2) (3) (4) (5) (6)

EU × POST 0.0003∗∗∗ 0.0004∗∗∗ −0.009∗∗∗ −0.016∗∗∗

(0.0001) (0.0001) (0.001) (0.002)

EU × POST × SMALL −0.0002 0.016∗∗∗

(0.0002) (0.003)

SMALL × POST 0.0002 0.0004∗∗∗ 0.011∗∗∗ −0.006∗∗∗

(0.0002) (0.0001) (0.002) (0.002)



Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of panel regressions similar to Model (3.1). The dependent variable in column (1) tocolumn (3) is Analyst Informativeness. It captures the average informativeness of one forecast revision date. Thedependent variable in column (4) to column (6) is Aggregate Analyst Informativeness. It captures the aggregateinformativeness of all revision dates. EU is a dummy variable equal to one if the firm is domiciled and listed inEurope. POST is a dummy variable equal to one if the fiscal year t is equal to 2018. SMALL is a dummy variableequal to one if firms’ average fiscal year-end market capitalization over the pre-regulation years falls below the median.To maintain the proportion of EU and US firms fixed, we calculate the cutoff separately in both regions. Column (1)and column (4) are the results for the difference-in-difference regression between EU firms and US firms. Column (2)and column (5) are the results for the difference-in-difference regression between small firms and larger firms withinthe EU. Column (3) and Column (6) are the results for the triple-difference regression. Standard errors are clusteredat the firm level.

242

C.5 Propensity score matching

In our difference-in-difference analyses, we use EU firms as the treatment group and US firms

as the control group. One concern is that firms in these two continents are different. For example,

Table 3.1 shows that US firms are in general larger, have lower book-to-market ratios and lower

ROAs. Although we explicitly control for these firm level characteristics, the differences may

still confound the effects of unbundling. In this section, we conduct a standard propensity score

matching to further check whether our results are driven by the observable differences in the firm

level characteristics.

In the sample for coverage quantity, we use 2014 data and match EU firms with US firms ac-

cording to their book to market ratio, investment return, market capitalization, return on assets and

return volatility. We choose the nearest-neighbor matching method and set the caliper parameter to

be 0.25 standard deviation of the estimated propensity scores (e.g., Rosenbaum and Rubin (1985)).

With this restriction, nearest- neighbor matching will be considered only if |Pt − Pc| < 0.25σp,

where Pt and Pc are the propensity scores for the treatment and the control, and σp is the standard

deviation of the estimated propensity score of the sample.55 We repeat a similar process in the

sample for coverage quality but include distance (the average of the days between the analyst fore-

cast date and the firm’s actual EPS report date) as an additional matching covariate. Table C.10

shows the summary statistics of the covariates before and after the matching. After the propensity

score matching, in the sample for coverage quantity, the treatment group and the control group

are only different in the investment return and market capitalization. In the sample for coverage

quality, the treatment group and the control group are comparable along all the firm level char-

acteristics. Figure C.5 shows the histograms of the propensity scores after matching. We can

observe that the distributions between treatment group and control groups are comparable. In fact,

a formal Kolmogoro-Smirnov test on the two samples does not reject the null that the treatment

sample and the control sample are drawn from the same distribution. Hence, the propensity score

55We do not have a large number of control firms to select. Requiring each firm in the treatment group to bematched with a firm in the control group will again introduce dissimilarities between the two groups.

243

matching effectively removes observations that are too different between the two groups based on

observables.

We then repeat the difference-in-difference analyses at the firm level. Table C.11 reports the

results. Consistent with what we find before, analyst coverage drops for EU firms and the drop is

concentrated in large firms. Similarly, forecast errors of EU firms decrease. The magnitude of these

coefficients is comparable with our baseline results without matching. Hence we conclude that

observable differences between EU firms and US firms do not drive our results and US firms could

serve as a valid counterfactual. In untabulated tests, we performance propensity score matching at

the analyst level for the intensive margin analyses and obtain similar results to the baseline model

in which we do not match the observations.

244

Figure C.5: Distributions of Propensity Scores after Matching.

ControlTreated

0.25

0.50

0.75

1.00

0.3 0.2 0.1 0 0.1 0.2 0.3Fraction of the Data

Pro

pens

ity S

core

(a) Sample for Coverage Quantity

ControlTreated

0.25

0.50

0.75

1.00

0.3 0.2 0.1 0 0.1 0.2 0.3Fraction of the Data

Pro

pens

ity S

core

(b) Sample for Coverage Quality

This figure plots the histogram of the propensity scores in the matched sample between the treatment group and thecontrol group.

245

Table C.10: Covariates Balance Table.

Panel A: Before Matching (Sample for Coverage Quantity)

Treated Control Diff P-value

N of Obs 2133 2259 126BM (%) 2.53 1.73 0.80 0.00Investment Return (%) 13.82 8.27 5.55 0.00Market Cap (Bil $) 5.24 9.21 −3.97 0.00ROA (%) 2.87 1.53 1.34 0.09Return Volatility 32.39 32.06 0.33 0.53

Panel B: After Matching (Sample for Coverage Quantity)


N of Obs 2074 2074 0BM (%) 1.97 1.78 0.19 0.11Investment Return (%) 11.83 9.20 2.63 0.02Market Cap (Bil $) 5.25 6.36 1.11 0.04ROA (%) 3.01 2.05 0.96 0.11Return Volatility 32.11 32.01 0.10 0.86

Panel C: Before Matching (Sample for Coverage Quality)


N of Obs 1111 1693 582BM (%) 2.44 1.53 0.91 0.00Distance 120.24 114.43 5.81 0.00Investment Return (%) 15.18 10.10 5.08 0.00Market Cap (Bil $) 9.45 11.96 2.51 0.04ROA (%) 6.08 4.26 1.82 0.00Return Volatility 26.26 28.35 −2.09 0.00

Panel D: After Matching (Sample for Coverage Quality)


N of Obs 1067 1067 0BM (%) 1.74 1.74 0.00 0.96Distance 118.86 117.37 1.49 0.38Investment Return (%) 14.75 13.35 0.408 0.31Market Cap (Bil $) 9.29 8.34 0.95 0.31ROA (%) 5.91 5.24 0.67 0.10Return Volatility 26.34 26.64 −0.30 0.52

This table shows the difference in the average values of matching covariates before and after the propensity scorematching. Book to Market is defined as total asset (Worldscope item 02999) minus long-term debt (Worldscope item03251) over market value (Worldscope item 08002). Distance is the average of the days between the analyst forecastdate and the firm’s actual EPS report date. We take the average over all the analysts following the same firm andmeasure this variable in 2014. Investment return is Worldscope item 08801, measured as (market price at year t+dividends−market price at year t − 1)/market price at year t − 1). Market capitalization is Worldscope Item 08002measured in billion dollars. Return on Assets is Worldscope Item 08326. Return volatility is the annualized standarddeviation of daily returns over a year for a given firm. Panel A and Panel show the results of sample for coveragequantity and Panel C and Panel D shows the results of the sample for coverage quality.

246

Table C.11: Firm Level Outcomes (Propensity Score Matching).

Dependent variable:

Coverage Forecast Error


(1) (2) (3) (4) (5) (6)

EU × POST −0.694∗∗∗ −1.568∗∗∗ −0.141∗∗∗ −0.120∗∗

(0.193) (0.266) (0.036) (0.048)

EU × POST × SMALL 1.753∗∗∗ −0.042(0.198) (0.065)

SMALL × POST 1.662∗∗∗ −0.102∗∗∗ 0.061 0.083∗∗∗

(0.203) (0.017) (0.059) (0.016)



Note: ∗p<0.1; ∗∗p<0.05; ∗∗∗p<0.01

This table shows the results of the panel regressions specified in Model (3.1) and Model (3.3). We use the propensityscore matching and match EU firms with US firms according to their book to market ratio, investment return, marketcapitalization, return on assets and return volatility. For the sample for coverage quality, we include distance as anadditional matching covariate. All variables are measured in 2014. The dependent variable in column (1) to (3) isCoverage, the number of unique analysts covering a specific firm j in fiscal year t. The dependent variable in column(4) to (6) is Forecast Error, defined as the absolute distance between the firm’s actual EPS and the mean of theanalyst forecasts, scaled by the firm’s previous year-end price. EU is a dummy variable equal to one if the firm isdomiciled and listed in Europe. POST is a dummy variable equal to one if the fiscal year t is equal to 2018. SMALLis a dummy variable equal to one if firms’ average fiscal year-end market capitalization over the pre-regulation yearsfalls below the median. To maintain the proportion of EU and US firms fixed, we calculate the cutoff separately inboth regions. Column (1) and column (4) are the results for the difference-in-difference regression between EU firmsand US firms. Column (2) and column (5) are the results for the difference-in-difference regression between smallfirms and larger firms within the EU. Column (3) and Column (6) are the results for the triple-difference regression.Standard errors are clustered at the country level.

247

Date post:	18-Dec-2021
Category:	Documents
Upload:	others
View:	1 times
Download:	0 times

Essays in Financial Economics Lira Rocha da Mota Mertens

Documents