Quo$Vadis,$raters?$A$frontier$approach$to$identify$misratings$in$sovereign$credit$risk
Meryem&Duygun&Huseyin&Ozturk&Mohamed&Shaban&Emili&Tortosa;Ausina&&
2014&/&10&
!
Quo Vadis, raters? A frontier approach to identify misratings in sovereign credit risk
2014 / 10
Abstract
This study attempts to provide one of the first comprehensive analyses on the misratings in sovereign credit ratings. The analyses are performed using partial frontier methods which should be considered innovative in this literature. By combining a robust variant of the Free Disposal Hull (FDH) estimator (order-m), we measure misratings (both under- and overratings) for both individual countries and groups of countries. Particular attention is put on the comparison between pre- and crisis years, in order to assess possible changes in the magnitude of the misratings. Our findings indicate that the degree of both overratings and underratings during the analysed period (1999–2010) is indeed remarkable. These misratings partially vanish during the last years of the sample (2008–2010), corresponding to the financial crisis, when many downgrades took place, especially in Eurozone countries. The results allow us to emphasise the importance of monitoring misratings for sustainable financial stability. These results also show the potential benefits of using partial frontier methods for measuring both under- and overratings.
Keywords: credit rating agency, partial frontier, rating, sovereign credit risk JEL classification: C51, E44, G15, G24
Meryem Duygun University of Leicester School of Management
Emili Tortosa-Ausina IVIE & Universitat Jaume I Department of Economics
Huseyin Ozturk University of Leicester School of Management
Mohamed Shaban University of Leicester School of Management
Quo Vadis, Raters? A Frontier Approach to Identify Misratings in
Sovereign Credit Risk
Meryem Duygun
University of Leicester
Huseyin Ozturk
University of Leicester
Mohamed Shaban
University of Leicester
Emili Tortosa-Ausina
Universitat Jaume I and Ivie
January 18, 2014
Abstract
This study attempts to provide one of the first comprehensive analyses on the misratings in sovereign
credit ratings. The analyses are performed using partial frontier methods which should be considered
innovative in this literature. By combining a robust variant of the Free Disposal Hull (FDH) estimator
(order-m), we measure misratings (both under- and overratings) for both individual countries and groups
of countries. Particular attention is put on the comparison between pre- and crisis years, in order to
assess possible changes in the magnitude of the misratings. Our findings indicate that the degree of
both overratings and underratings during the analysed period (1999–2010) is indeed remarkable. These
misratings partially vanish during the last years of the sample (2008–2010), corresponding to the financial
crisis, when many downgrades took place, especially in Eurozone countries. The results allow us to
emphasise the importance of monitoring misratings for sustainable financial stability. These results also
show the potential benefits of using partial frontier methods for measuring both under- and overratings.
Keywords: credit rating agency, partial frontier, rating, sovereign credit risk.
JEL Classification Numbers: C51, E44, G15, G24.
Communications to: Mohamed Shaban, School of Management, Ken Edwards Building, University of
Leicester, University Road, Leicester, LE1 7RH, UK. Tel.: +44 (0) 116 223 1820, fax: +44 (0) 116 252 3949,
e-mail: [email protected]
1. Introduction
The globalization has brought about the internationalization of financial markets in the last decades.
This has dramatically increased and differentiated investment opportunities across the world by creating
new challenges. Measuring risk in investments has been the core challenge prior to any financial investment
decision in contemporary financial markets. Financing with accurate cost has revealed the necessity of a
credit scoring that the credit rating agencies (CRAs) currently carry out.
The successful sovereign credit rating (SCR), or an early-warning model, will certainly benefit both
international lenders and borrowers due to the fact that repayment difficulties put burden on both sides. In
addition, unanticipated repayment problems can cause financial crises which can have cross country spillover
and contagion effects. These facts explain why sovereign credit ratings have been an indispensable issue
of international investment process particularly in recent years. However, an accurate estimation of credit
risk is a difficult task. Every attempt to measure it suffers from the weaknesses of measurement tools and
analysts’ biases. In addition, CRAs are under harsh criticism after the recent financial crisis. The crux of
the criticism is the poor predictive power of credit ratings.1
The literature on SCRs can be broadly grouped into two areas. The first area deals mainly with the
reactions of financial markets to SCR changes, e.g. government bond markets, swap markets, and interest
rate markets etc. (see e.g. Alsakka and ap Gwilym, 2013; Treepongkaruna and Wu, 2012; Candelon et al.,
2011). Most of these studies investigate the causality pattern between market reactions and sovereign credit
downgrades/upgrades, i.e. which one leads the other. Moreover concerns about the information content of
sovereign credit ratings and their association with bond spreads and default risk were intensified after the
global financial crisis (see e.g. Aizenman et al., 2013).
The second area represents the studies starting with Cantor (1995), and investigates the determinants of
sovereign credit ratings. Many of these investigations conclude that sovereign credit ratings can be explained
to a great extent by the level of GDP per capita, real GDP growth, external debt, the public debt level and
the government budget balance (as one of the latest studies see Hill, 2004; Sy, 2009; Gultekin-Karakas et al.,
2011). Some other contributions within this category examine the relationship between rating outlook and
rating changes (see e.g. Alsakka and ap Gwilym, 2012, 2010). After the 2008 financial crisis, some other
1In general, credit ratings have been on the top agenda of regulatory institutions in the early days of 2008 financial crisis.Credit ratings have been under fire once again after the Eurozone fiscal problems with specific attention to sovereign creditratings. A large body of rules have been adopted to regulate and supervise credit ratings in EU currently (see Darbellay andFrank, 2012). From a regulatory perspective, one of the most important reasons for a better prediction of credit ratings isunderlined by the recommendations of global financial institutions and joint initiatives. Recently the Financial Stability Board(FSB) that was established under the auspices of G-20 published a proposal to reduce over-reliance or CRA credit ratings.The proposal simply recommends financial institutions to carry out their independent credit assessment. In recommending so,the FSB aims at reducing over-reliance on CRA credit ratings that were blamed to be inaccurate especially after the recentfinancial crisis. Therefore, it is not hard to anticipate that developing a reliable credit scoring will be of utmost importance inthe near future for institutions who consider not buying CRA rating.
2
initiatives falling in this category explored further evidence underlying the downgrades of many Eurozone
countries. For instance, Afonso et al. (2012) investigate the relationship between fiscal imbalances and credit
rating downgrades, concluding that fiscal imbalances do actually have a negative impact on sovereign credit
ratings but in a diverse way for each country.
On the estimation of SCR changes, studies generally employ parametric models such as linear discriminant
analysis, principal component analysis, linear regressions, and ordered response models, among others. A
small literature niche investigates credit rating with artificial intelligence (AI) models (Wang et al., 2011;
Huang et al., 2004; Maher and Sen, 1997). Bennell et al. (2006) is one of the first studies introducing AI in
the estimation of sovereign credit ratings. According to their findings, AI models estimate sovereign credit
ratings more accurately compared to other statistical approaches .
This study aims to investigate the information content of sovereign credit ratings with respect to their
fundamental indicators. In previous studies, several statistical methods were employed in predicting sovereign
credit ratings—in particular ordered response models. However, as Wang et al. (2011) argues, multivariate
normality assumptions are frequently violated in statistical models. Furthermore, normality assumptions
for each and every independent variable are not warranted by these models. Therefore, the accuracy of
predictions is frequently low.
We employ a nonparametric partial frontier approach to explore whether credit ratings are in line with
what country fundamentals would suggest. Specifically, we propose using order-m estimators (Cazals et al.,
2002). With respect to their non-robust alternatives in which they are based (Data Envelopment Analysis,
DEA, and its non-convex alternative, Free Disposal Hull, FDH), partial frontier estimators such as order-m
offer several advantages, including their relative immunity to outlying observations, the fact that they are
less affected by the curse of dimensionality, and better properties in general (for instance, they also allow
achieving the√n rate of convergence with asymptotic normality).
We use order-m to estimate whether some inefficiencies might exist when CRAs construct the sovereign
credit ratings. In this particular setting, the rating obtained by each country would be the outputs, and
the several fundamental indicators used by CRAs would be the inputs. Therefore, we may assume that
countries attempt to maximise their credibility (reduce risk) with minimum input usage. In this process,
these inefficiencies would imply some misratings, or misalignments, are generated. These misratings (ineffi-
ciencies) could be of two kinds, namely, underratings and overratings. In this context, using partial frontier
approaches such as order-m is particularly convenient because not only underrated but also overrated coun-
tries are identified. The reason is that order-m frontiers allow estimating both inefficiency (underrating) and
superefficiency2 (overrating)—and, more importantly, which the underrated and overrated countries are.
2See Andersen and Petersen (1993).
3
Contrary to previous literature, an added advantage of using this approach is that inefficiencies (misrated
countries) would be identified contemporaneously, not ex post.
We will explore misratings in SCR for the 1999–2010 period—i.e., it includes both pre-crisis and crisis
years. The analysis will enable identifying which countries, or groups of countries, were those showing the
highest overratings or underratings. Comparing pre- and crisis years will also enable us to assess the changing
behaviour of CRAs once the financial crisis.
The study is structured as follows. Section 2 is devoted to describe the methodology based on a nonpara-
metric partial frontier approach. Section 3 introduces sovereign credit ratings and several country indicators,
defining the main variables considered in the study. Section 4 presents and discusses the results, whereas
Section 5 outlines some conclusions and policy implications.
2. Methodology
Our methods are based on the set of activity analysis techniques initially devised by Georgescu-Roegen
(1951). His ideas were refined in posterior stages in order to model the productive efficiency of decision
making units (DMUs), which may be of very different sorts. This type of units could be restricted to
countries, like it is our case, but a wide variety of organisations such as banks and other financial institutions,
municipalities, hospitals, etc. This implies that measures of performance via efficiency scores have become
widespread for operators in business, government, public transportation, infrastructure, energy production
and other sectors.
There is a wide variety of frontier methods which can be used to measure efficiency. In the case of
economic efficiency, they have been nicely reviewed by Murillo-Zamorano (2004). There are two main groups
of methods to estimate efficiency scores, namely, Stochastic Frontier Analysis, SFA (Aigner et al., 1977;
Meeusen and van den Broeck, 1977), and Data Envelopment Analysis, DEA (Charnes et al., 1978). There
has been a long standing division between SFA and DEA. Both methods have advantages and disadvantages—
the “historically” perceived merit of SFA is that the estimator is stochastic, in the case of DEA is that the
estimator is nonparametric in nature (Badunenko et al., 2012). Therefore, most comparative studies such
as, for instance, Ferrier and Lovell (1990) or Badunenko et al. (2012) conclude that different methods can
be preferable under different circumstances.
Although progress has been made both in the parametric (SFA) and nonparametric (DEA) fields, the
advances have been unequal—especially in terms of applications. According to Badunenko et al. (2012),
recent research has seen a relaxation of functional forms in the parametric field (SFA) and the introduc-
tion of asymptotics in the nonparametric field (DEA). In asymptotic terms, some of the newest estimators
4
introduced based on linear programming perform better than DEA and, in addition, they overcome some
of its disadvantages, including the “curse of dimensionality” (low number of DMUs relative to number of
input-output variables) or the influential role of outliers. Regarding the former, it results from the fact that,
as a given set of n observations are projected in an increasing number of orthogonal directions, the Euclidean
distance between the observations should necessarily increase. Regarding the latter, the envelopment esti-
mators such as DEA are very sensitive to outliers and extreme values, which may disproportionately (and
misleadingly) influence the evaluation of the performance of other DMUs.3
In a series of proposals (Cazals et al., 2002; Daraio and Simar, 2005; Aragon et al., 2005; Daouia and
Simar, 2007), two families of robust estimators—i.e., estimators which are much less sensitive to extrem
observations—have been proposed: (i) order-m frontiers (where m can be viewed as a trimming parameter);
and (ii) order-α quantile frontiers (analogous to traditional quantile functions but adapted to the frontier
problem). These are “partial” frontier estimators, as opposed to the traditional idea of a “full” frontier
that envelops all the data, given that the goal is not to estimate the absolute lowest (uppermost) technically
achievable level of input (output) for a given level of output (input), but rather to estimate something “close”
to these quantities. In addition, both order-m and order-α estimators, apart from not suffering from the curse
of dimensionality and being much more robust than either DEA or its non-convex variant (Free Disposal
Hull, FDH) have generally better properties, since they also allow achieving the√n rate of convergence with
asymptotic normality.
Because of these advantages, partial frontier methods suit particularly well our specific setting, where
the number of dimensions in which a country can be evaluated (i.e., the number of inputs and outputs)
could be high. Therefore, whereas the resulting DEA or FDH estimators could be affected by the curse of
dimensionality, the order-m or order-α estimators are less so.
Following Daraio and Simar (2007),4 order-m estimators are based on FDH estimators. Supposing there
exist m decision making units (i.e. credit rating agencies) using at most input level x, we define the set:
Ψ(x) = {(x′, y′) ∈ RN+M+ |x′ ≤ x, Yi ≤ y′} (1)
where i = 1, . . . ,m, and the Yi are m iid random variables drawn from the conditional M -variate distribution
FY (·|x).
In this context, the output-oriented efficiency score (i.e., our indicator of underrating) can be defined
3As indicated by Simar and Wilson (2008), this drawback is also present in parametric frontier estimators when deterministicfrontier models are considered.
4For recent applications in the field of finance, see for instance, Matallın-Saez et al. (2014), Abdelsalam et al. (2014b) orAbdelsalam et al. (2014a), among others.
5
relative to the Ψm(x) set (which is random, since it depends on random variables) as:
λ(x, y) = sup{λ|(x,λy) ∈ Ψ(x)} = maxi=1,...,m
!
minj=1,...,M
"
Y ji
yi
#$
(2)
For each combination of inputs and outputs, (x, y) ∈ RN+M+ , we will define the output-oriented order-m
efficiency score as an expectation for all x in the interior of the support of X (assuming that the expectation
exists) as:
λm(x, y) = E(λm(x, y)|X ≤ x) (3)
Therefore, in contrast to either FDH or its convex version (DEA), the idea of the order-m consists of
compare each observation with part of the frontier instead of the full frontier—which is the reason why we
refer to order-m as a partial frontier. Hence, for a given set of inputs, the technically feasible maximum
output is defined as the expected maximum output (rating) obtained by selecting randomly from the full
sample any m countries employing at most input levels x.
Following Simar and Wilson (2008), the expected maximum output level will be defined as:
y∂m(x) = yλm (4)
and the order-m version of the production possibilities (i.e., P = {(x, y)|x can produce y}) is:
Pm =%
(x, y)|(x, y) ∈ P, y ≤ y∂m(x).&
(5)
Note that, in the extreme case in which m → ∞ both order-m and FDH estimators converge, and both
approaches yield identical results. However, the most interesting cases will be those in which m has a finite
value, since the estimator will be more robust to outliers—i.e., not all the data points are being enveloped.
An algorithm is followed to compute the efficiency scores (see, for instance Matallın-Saez et al., 2014).
When implementing the estimation for the output-oriented case we are considering, for each observation
(country) we select those dominating it in the input space. We draw a B number of samples of size m from
this subsample (with replacement), which do not (necessarily) include the assessed country itself. Following
(2) λm is computed.
Interestingly, since the country under analysis is not (necessarily) included in the order-m sample (and
there will not necessarily exist other countries which dominate the country analysed in the output), efficiencies
can be either higher or lower than one.
6
Specifically, output-oriented efficiencies based on Shephard distance functions (reciprocal to the Farrell
distance functions) are either equal or lower than unity under FDH (or DEA). However, under order-m some
outlying observations (countries) can reach efficiency levels higher than one, to which the literature usually
refers to as superefficient units (Andersen and Petersen, 1993). We will consider those countries classified
as inefficient (i.e., with scores lower than one) are those being underrated. In contrast, we will refer to the
superefficient units (with values higher than one) as those being overrated.
3. Data and variables
SCRs are assigned through a series of qualitative and quantitative analyses. CRAs use a series of in-
dicators especially for their quantitative analysis by assigning weights to each indicator and update them
regularly. CRAs do not reveal any details regarding the weights they attach to each of these indicators pos-
sibly to avoid any subjective biases. However, these agencies make a list of indicators publicly available in
order to increase transparency of the rating process. In our study, we build a ratings database with sovereign
foreign currency rating provided by a major credit rating agency. The rating of a particular year is the rating
that was attributed at the last day of that year. In our database there are two main blocks of data. Both
sovereign credit ratings and macroeconomic, fiscal and financial indicators used in the analysis are obtained
from the agency. World Governance Indicators which show the quality of institutions are from the World
Bank. These indicators are monitored by CRAs since they clearly reflect the willingness of repayment of
sovereigns.
According to rating classification, there are 20 possible credit ratings for a country: Aaa, Aa1, Aa2, Aa3,
A1, A2, A3, Baa1, Baa2, Baa3, Ba1, Ba2, Ba3, B1, B2, B3, Caa1, Caa2, Caa3, Ca. Aaa is the highest rating
that a country can get and Ca is the lowest. A rating between Aaa and Baa3 signals a good investment
environment for a country whereas any rating between Ba1 and Ca is speculative. The original data had
information on 106 countries for the 1999–2010 periods. Because of the few number of observations for the
Caa2, Caa3, and Ca ratings, countries which have been in the “junk” category (i.e. countries with a credit
rating of Caa1 and lower) at least once over the period examined are excluded during the computations. We
will refer to the output variable as y1.
When we analyse the percentage distribution of credit ratings across in the developed and the developing
countries in Table 2, a clear pattern emerges: that is as income level increases, countries are more conducive
to obtain higher ratings. There exists a clear difference in the credit ratings of the developed and the
developing countries. The developed countries are populated in the credit ratings higher than Ba1, whereas
the highest credit rating a developing country obtains is A1. Moreover, Aaa is the mostly assigned credit
7
rating by 20.08%, followed by Ba1 by 8.01%, that are all given to the developed countries. We will refer to
the credit rating (output) as y1.
In order to better assess sovereign credit ratings, the data is divided into two sub-samples of developed
and developing countries. For this classification we used the World Bank country classification. The high
income OECD and high income non-OECD countries in World Bank’s classification are grouped under the
“developed country” group, while countries under the low income, lower middle income, and upper middle
income categories are grouped under the “developing country” group. Such classification will enable us to
discriminate the efficiency scheme with respect to countries’ development levels.
Table 3 and Table 4 present the variables that can be used as the inputs in our analysis. Although CRAs
use a vast dataset, these variables can be use as the representatives of the performance of a country. Below
we briefly summarise the motivation underlying the variable selection process.
Ratio of current account balance to GDP (balancegdp, x1): the current account (when in deficit) gives
a rough indication of how much net import of capital is needed for a country to meet the gap between
domestic saving and investment. Large and persistent current-account deficits can lead to a distortion
of external debt structure, if the deficits cannot be financed by inflows of direct investment or equity
positions in local companies. However, rapidly-growing countries with high investment rates can sus-
tain large deficits for many years if the investments are conducive to a growing export capacity which
can create the inflow of foreign earnings needed to service a growing debt. Since the nominal current
account will vary with the scale of a country’s size and openness to trade, we divide it by GDP to
allow for cross-country comparisons (Bennell et al., 2006; Gultekin-Karakas et al., 2011; Afonso et al.,
2007).
Ratio of general government financial balance to GDP (financialbalancegdp, x2): The fiscal bal-
ances and debt stocks of the various levels of government are among the most important indicators
examined by sovereign risk analysts. The ability of government to extract revenues from the population
of tax payers and users of services, the elasticity of revenue with respect to the growth or decline of
national income, and the rigidity of the composition of government expenditures are key factors that
determine whether central and local governments will be able to fulfil timely payments of interest and
principal on outstanding debt. We proxy fiscal balances with three indicators: general government
financial balance to GDP, general government primary balance to GDP, and general government debt
to GDP (Bennell et al., 2006; Gultekin-Karakas et al., 2011; Afonso et al., 2007). Ratio of general gov-
ernment financial balance to GDP indicates governments deficit or surplus in GDP. Higher government
deficits can create repayment problems which can be solved by inflationary money creation. Inflation
8
on the other hand can distort the dynamics of growth.
GDP per capita (gdppc, x3): GDP is the standard international measure of the size of an economy. While
frequently criticized for understating output by leaving out or underestimating the accumulation of in-
tangible assets (knowledge, organizational innovation, improved product quality, etc.) or for overstating
it by ignoring resource depletion and environmental degradation, GDP remains the only internationally
comparable standard. Nevertheless, GDP solely gives an aggregate level of the economy. To show the
relative wealth possessed by the average individual within a given country we use GDP per capita
(Bennell et al., 2006; Gultekin-Karakas et al., 2011; Afonso et al., 2007).
Inflation, (inflation, x4): inflation is an important indicator of excess demand pressure or of structural
distortions in the labour and product markets. Under extreme conditions of monetary instability (in
which, for example, central banks create money in order to finance government deficits) inflation can
accelerate to “hyperinflationary” levels that undermine normal productive activity. All in all, it is well
known that inflationary environment in a national economy leads to high uncertainty where production
decisions can hardly be taken (Gultekin-Karakas et al., 2011; Afonso et al., 2007).
Official foreign exchange reserves (foreignexcreserve, x5): foreign exchange reserves held by a coun-
try are the first line of defence against withdrawal of foreign credit. Hence foreign exchange reserves
play as a cushion especially for sudden outflows. Since the ratings we are studying are the ones that
are assigned to foreign exchange debts, ample reserves give further flexibility to the country (Gultekin-
Karakas et al., 2011; Afonso et al., 2007).
Government effectiveness (governmenteffectiveness, x6): this indicator is one of six measures of the
quality of institutions compiled by the World Bank. The index of government effectiveness combines
responses on the quality of public services and the bureaucracy that provides them, the competence
and political independence of civil servants, and the credibility of the government’s commitment to its
policies. Apart from the capacity pay of sovereigns, CRAs attach importance to willingness to pay of
them which can be broadly proxied by the index of government effectiveness (Gultekin-Karakas et al.,
2011; Afonso et al., 2007).
Ratio of general government primary balance to GDP (primarybalancegdp, x7): the primary bal-
ance figures exclude interest expenditures. Positive general government primary balance figures show
how governments progress in narrowing the general government deficits.
Nominal exports of goods and services, % change (exportsprcnt, x8): the percentage change of the
nominal exports of goods and services shows the performance of a country by degree to which the
9
country supplements its domestic saving with foreign export revenues in financing capital investment
(Gultekin-Karakas et al., 2011).
Nominal GDP percentage change (gdpprcnt, x9): the annual percent change in nominal GDP (in local
currency) is important in a sense that a decline in nominal GDP that is a combination of weak or
negative growth and falling prices, may signal a distress in economy that results in a rating downgrade.
In such circumstances, consumers and businesses may postpone purchases, expecting goods to be
cheaper in the future, and the real burden of household and corporate debt will increase (Bennell
et al., 2006; Gultekin-Karakas et al., 2011; Afonso et al., 2007).
Ratio of gross investment to GDP (investgdpratio, x10): investments that add to the country’s capi-
tal stock are the vital contributor to the process of economic growth. Countries with a sustained high
investment rate, especially in productive assets in the business sector and in infrastructure, will tend
to grow faster over the long term (Gultekin-Karakas et al., 2011).
Ratio of domestic savings to GDP (savinggdpratio, x11): the real investment undertaken within a coun-
try is necessarily equal to the sum of the domestic saving generated within its borders plus the use
of foreign saving. If a country cannot generate a high enough saving flow out of the incomes of the
domestic population in order to accelerate growth, it may face balance of payment constraints.
Ratio of general government debt to GDP (debtgdp, x12): general government debt to GDP is a broad
indicator of a government’s total debt stock. High level of debt stock becomes a severe threat to gov-
ernment financing when government revenues are relatively low. If debt is hardly rolled over, the risk
of default increases.
Previous studies offer a great deal of heterogeneity regarding the relevant factors to be considered in the
analysis—which in our case are the inputs. Our empirical strategy has consisted of selecting some variables
which could be deemed as “fundamental”, since they are consistently used in previous models, and introduce
sequentially those other variables whose use is less generalised.
Therefore, we will consider an initial model (Model 1, or “restricted model”) in which the relevant factors
(inputs) considered are the ratio of current account balance to GDP (x1), the ratio of general government
financial balance to GDP (x2), GDP per capita (x3), inflation (x4), official foreign exchange reserves (x5)
and government effectiveness (x6). The rest of the variables are introduced sequentially, constituting a new
model (which we refer to as “unrestricted” model). We then calculate efficiencies using the methods proposed
in Section 2 and test whether the efficiencies generated by each model are statistically significant or not.
10
We test for these differences between both the restricted and unrestricted models using the Li (1996) test.
It is a based on kernel smoothing, and it tests the null hypothesis that the densities corresponding to the
efficiencies generated by each model are equal (f(restricted model) = g(unrestricted model)). For previous
applications of these models see, for instance, Thieme et al. (2013). Results, which are provided in Table 5,5
indicate that only when introducing variables x7, x8 and x9 (ratio of general government primary balance
to GDP, nominal exports of goods and services % change, nominal GDP % change) efficiencies (misratings)
differ statistically. For the rest of the variables (x10, x11, x12, i.e., ratio of gross investment to GDP, ratio
of domestic savings to GDP, and ratio of general government debt to GDP) the differences among models
were not significant and, therefore, were not included in the model.
4. Results
4.1. General tendencies
Results are reported in Tables 6–10. The first of these tables (Table 6) reports summary statistics (mean,
interquartile range, median and standard deviation) for efficiency scores yielded by the order-m estimators.
The last column reports the number of misrated countries—either overrated or underrated—according to
our methods. Results are split into four panels, three of which report information for the different trimming
parameters considered—i.e., the selected value for m. The fourth panel reports a summary of information
on misrating, each row representing the summary statistics corresponding to the sum of underrated and
overrated countries for each m parameter, where the overratings have been inverted for an easier comparison
with the underratings.6
Regardless of the choice ofm, the amount of underrating is remarkable. On average, it ranges from 0.7837
(for mα=.90) to 0.7372 (for mα=.99).7 Recall that these values represent efficiencies and, therefore, the lower
the values, the higher the rating inefficiencies—i.e., the magnitude of the underrating. This would imply
that, for the entire sample, ratings could be improved by more than 20%. Since this is an average, for some
particular countries underrating is actually quite high, since the standard deviation is also relatively high,
ranging between 0.1565 (for mα=.99) to 0.1636 (for mα=.90). Although one may think these average values
are driven by outliers, it is not the case because the median also reveals high inefficiencies, and their values
are relatively close to those of the mean (they range from 0.7500 to 0.8318). The number of underrated
5For the definition of the T -statistic see Li (1996).6Since we adopt an output-oriented approach and efficiency is measured in terms of Shephard (1970) distance functions,
inefficient units are those with values lower than 1.7We have chosen the three values for the trimming parameter (m) based on the proposals by Daouia and Gijbels (2011), who
consider that order-α and order-m estimators are closely related when α = α(m) = (1/2)1/m . Given the general recommendationby Daraio and Simar (2007) to use trimming parameters for order-α equivalent to those generally used in regression analysis(i.e., the usual significance levels), we selected α = 0.90, α = 0.95 and α = 0.99, and the m values are those obtained substitutingin Daouia and Gijbels’s formula.
11
countries is also relatively high (from 72 to 81) compared with the size of the sample (1,023 country-year
pairs).
One of the main advantage of using partial frontier techniques (such as order-m) is their ability to identify
not only inefficiency but also superefficiency. In our particular context, the superefficient units would be
those overrated countries, whose efficiencies lie above unity. In this case, the amount of overrating is also
high, although this partly depends on the choice of trimming parameter, being particularly high for lower
values of m (mα=.90). The average values range from 1.0039 (for mα=.90) to 1.0580 (for mα=.99) and,
similarly to the underrating case, these values are not driven by outliers due to the closeness between the
values for the mean and the median.
Although the effect of the trimming parameter is reflected in the varying number of overrated countries
(the higher the m value, the lower the number of superefficient units, or outliers) and, therefore, it could be
deemed as a pitfall of this technique. However, in our particular setting this is actually an advantage, since we
are obtaining a full ranking of overrated or potentially overrated countries, which could provide a forecast
of those countries whose ratings would have to be corrected in the event of shock. This is of particular
importance for policymakers and especially for the agencies. The results indicate that CRAs assign higher
ratings to superefficient units than what their credentials imply. The results also suggest that superefficient
units need special scrutiny especially during the periods of turmoil. This would also be desirable since what
is simply expected from CRAs is to warn early of a possible credit event.
4.2. Results for different countries and temporal contexts
During 2008 global financial crisis and ongoing Eurozone crisis, many countries have faced frequent
downgrades. Interestingly, the developed countries have been the mostly downgraded countries. In our
analysis we check whether the proposed techniques in this study can capture the misratings especially for
advanced countries. We also particularly investigate the pre- and post crisis periods.
Tables 7, 8 and 9 report results for different groups of countries, depending on countries’ level of develop-
ment, OECD membership, or whether the analysed country has adopted the euro. Table 10 reports results
for pre-crisis (1999–2007) and crisis years (2008–2010).
Results in Table 7 report results for developing and developed countries based on the classification of
countries reported in Table 1. One might consider results for both groups of countries differ because, as
witnessed in recent crises developed countries were the ones who were overrated. On average, the differences
between both groups of countries are remarkable, and these differences are robust to the choice of parameter.
The differences are particularly large for underrated countries, especially for lower values of m. In the case
of mα=.90 the average gap between developed and developing countries is 0.1437, but in all cases it is in the
12
vicinity of 0.1 or above. Although in the case of the median these gaps are lower, they always exist and
are favorable to developed countries. Although one might therefore think there is a tendency to underrate
developing countries, the trend is actually to misrate them, since overrating is also higher compared to
developed countries—regardless of the m parameter considered.
When comparing results for OECD and non-OECD countries (Table 8), results are very similar to those
in Table 7, which was something one might expect. However, for the comparison based on the euro area
criterion, there are some particularities. With the exception of mα=.99, the number of overrated and under-
rated countries is relatively similar (especially for mα=.99). In addition, although, on average, underrating
is higher in the euro area (regardless of the m parameter considered), the median is actually higher (i.e.,
less underrating) for the non-euro countries. In contrast, the amount of overrating in the euro area is much
lower compared with non-euro countries (regardless of the summary statistic considered, either the mean or
the median), and this result is robust to the choice of m. Actually, for mα=.99 we did not find an overrated
country.
We also explore whether the crisis might have played a role when assessing misrating. Therefore, in
Table 10 we compare results during pre-crisis (1999–2007) and crisis (2008–2009) years. On average, the
magnitude of both underrating and overrating was higher during pre-crisis years (i.e. both indicators were
farther from 1), and this result is robust to the value of the trimming parameter. These results are not driven
by outlying observations, since the tendencies for the median coincide. This would confirm the virtues of the
methods we use to assess under- or overrating, since we are actually quantifying that ex post feeling among
practitioners, academics and policy-makers that during the pre-crisis years the ratings were not as accurate
as they should have been. Therefore by using the definition of superefficiency, non-parametric methods can
be used to detect the distressed countries who needs downgrades. This early action could align and allocate
risk ex ante by reducing the damage of financial mayhem. A specific attention should also be given to
inefficient units. Inefficiency in the results indicate that inefficient units deserve upgrades but for particular
reason the CRAs delay taking action. Due to asymmetric information stemming from CRAs’ expertise on
each country, this can be plausible. Yet, inefficiencies should also be at the focus because underrated units
can suffer repayment problems because of low ratings.
Similarly to what we did in Section 2 for choosing our model, we can consider the Li (1996) test in order to
ascertain whether the differences for groups of countries and groups of years are significant or not. Therefore,
we would be testing the null hypothesis that the densities between two particular groups of countries are
statistically different or not—i.e., we do not test whether results differ statistically for a particular statistic
(mean, median) but for the entire distributions of overratings. Results are reported in Table 11. They show
that differences are strongly significant when comparing euro area countries vs. non-euro area, as well as
13
when comparing pre-crisis vs. crisis years. However, in the case of the comparisons based on the level of
development results differ. When the OECD membership criterion is taken into account, differences are only
significant at the 5% significance level; for the comparison based on the level of development, these densities
do not differ statistically. This last is partly to be expected, since the classification criterion used might be
leading to groups of countries some of whose members differ remarkably in many aspects.
A graphical illustration is provided in Figure 1, which displays densities for the misratings corresponding
to all the groups considered. Only in the first of these densities (upper left sub-figure) it is apparent that
the lines almost overlap. In the other three cases the lines corresponding to each density being compared
(solid and dashed lines) are, in general, different, especially when considering the crisis or the euro effect.
5. Conclusions
Credit ratings stimulated much debate during and after 2008 financial crisis. The scope of the debate is
too broad and the content is quite mixed but what is certain is that misratings can create havoc in financial
markets. Even after several measures have been taken to curb the adverse effects of the crisis, credit ratings
still need a proper regulation. The papers investigating SCRs deal with a variety of topics but have only
scarcely taken misratings into the focus. This is partly due to many market players, policymakers and
academics take credit ratings as granted.
This study examined SCRs by proposing a nonparametric partial frontier approach. The main aim of the
study is to explore whether credit ratings are in line with what country fundamentals suggest. Apart from
its originality stemming from the investigation of misratings, this study employs nonparametric techniques
which should be considered as an innovative application in the credit ratings’ literature. The main advantage
of the partial frontier analysis conducted in the study is to measure the magnitude of countries’ misratings,
either over- or underrating, since the order-m estimators provide results for both inefficiencies (overratings)
and superefficiencies (underratings).
Our findings suggest that the magnitude of both overratings and underratings are indeed remarkable. Al-
though partial frontiers require specifying a trimming parameter, it is always possible to detect the potentially
underrated countries and, more interestingly, this can be done contemporaneously. Our results also reveal
differences among groups of countries for both underratings and overratings—specifically, developing coun-
tries receive lower ratings than their developed peers with respect to their fundamentals. These differences
were significant when comparing OECD vs. non-OECD countries or Eurozone vs. non-Eurozone countries,
and results generally suggested that, on average, misratings are higher for non-OECD and non-Eurozone
countries. The other interesting result suggests that the 2007–2008 financial crisis corrects misratings to
14
some extent. This can be explained by the downgrades which many advanced countries faced after 2008.
The importance of our findings should be assessed from financial stability point of view as well. Finan-
cial stability is among the top priorities of policymakers in the aftermath of 2008 financial crisis. Many
precautionary measures are taken to avoid further global crisis in the system. This study complements the
endeavours in this respect and recommends a vigilant monitoring of credit ratings. The methods we propose
using in this context could serve as a reliable basis for more effective monitoring. According to it, misratings
can then measured contemporaneously, guiding the correction of potential misalignments in SCRs in order
to achieve greater financial stability.
15
References
Abdelsalam, O., Duygun, M., Matallın-Saez, J. C., and Tortosa-Ausina, E. (2014a). Do ethics imply persistence?
The case of Islamic and socially responsible funds. Journal of Banking & Finance, 40:182–194.
Abdelsalam, O., Fethi, M. D., Matallın, J. C., and Tortosa-Ausina, E. (2014b). On the comparative performance of
Socially Responsible and Islamic mutual funds. Journal of Economic Behavior & Organization.
Afonso, A., Furceri, D., and Gomes, P. (2012). Sovereign credit ratings and financial markets linkages: Application
to European data. Journal of International Money and Finance, 31(3):606–638.
Afonso, A., Gomes, P. M., and Rother, P. (2007). What ‘hides’ behind sovereign debt ratings? Working Paper 711,
European Central Bank, Frankfurt am Main.
Aigner, D., Lovell, C. A. K., and Schmidt, P. (1977). Formulation and estimation of stochastic frontier production
models. Journal of Econometrics, 6:21–37.
Aizenman, J., Binici, M., and Hutchison, M. M. (2013). Credit ratings and the pricing of sovereign debt during the
euro crisis. NBER Working Papers 19125, National Bureau of Economic Research, Inc.
Alsakka, R. and ap Gwilym, O. (2010). Leads and lags in sovereign credit ratings. Journal of Banking & Finance,
34(11):2614–2626.
Alsakka, R. and ap Gwilym, O. (2012). Rating agencies’ credit signals: An analysis of sovereign watch and outlook.
International Review of Financial Analysis, 21(C):45–55.
Alsakka, R. and ap Gwilym, O. (2013). Rating agencies’ signals during the European sovereign debt crisis: Market
impact and spillovers. Journal of Economic Behavior & Organization, 85(C):144–162.
Andersen, P. and Petersen, N. C. (1993). A procedure for ranking efficient units in Data Envelopment Analysis.
Management Science, 39(10):1261–1264.
Aragon, Y., Daouia, A., and Thomas-Agnan, C. (2005). Nonparametric frontier estimation: A conditional quantile-
based approach. Econometric Theory, 21(2):358–389.
Badunenko, O., Henderson, D. J., and Kumbhakar, S. C. (2012). When, where and how to perform efficiency
estimation. Journal of the Royal Statistical Society, Series A (Statistics in Society), 175(4):863–892.
Bennell, J. A., Crabbe, D., Thomas, S., and ap Gwilym, O. (2006). Modelling sovereign credit ratings: Neural
networks versus ordered probit. Expert Systems with Applications, 30(3):415–425.
Candelon, B., Sy, A. N. R., and Arezki, R. (2011). Sovereign rating news and financial markets spillovers: Evidence
from the european debt crisis. IMF Working Papers 11/68, International Monetary Fund.
16
Cantor, R. Packer, F. (1995). Multiple ratings and credit standards: Differences of opinion in the credit rating
industry. Federal Reserve Bank of New York Research Paper 9527, Federal Reserve Bank of New York.
Cazals, C., Florens, J.-P., and Simar, L. (2002). Nonparametric frontier estimation: a robust approach. Journal of
Econometrics, 106:1–25.
Charnes, A., Cooper, W. W., and Rhodes, E. (1978). Measuring the efficiency of decision making units. European
Journal of Operational Research, 2(6):429–444.
Daouia, A. and Gijbels, I. (2011). Robustness and inference in nonparametric partial frontier modeling. Journal of
Econometrics, 161(2):147–165.
Daouia, A. and Simar, L. (2007). Nonparametric efficiency analsyis: A multivariate conditional quantile approach.
Journal of Econometrics, 140:375–400.
Daraio, C. and Simar, L. (2005). Introducing environmental variables in nonparametric frontier models: a probabilistic
approach. Journal of Productivity Analysis, 24:93–121.
Daraio, C. and Simar, L. (2007). Advanced Robust and Nonparametric Methods in Efficiency Analysis. Methodology
and Applications. Studies in Productivity and Efficiency. Springer, New York.
Darbellay, A. and Frank, P. (2012). Credit rating agencies and regulatory reform. In Hill, C. A. and Krusemark,
J. L., editors, Research Handbook on the Economics of Corporate Law. Edward Elgar Publishing.
Ferrier, G. D. and Lovell, C. A. K. (1990). Measuring cost efficiency in banking: econometric and linear programming
evidence. Journal of Econometrics, 46:229–245.
Georgescu-Roegen, N. (1951). The aggregate linear production function and its applications to von Neumann’s
economic model. In Koopmans, T., editor, Activity Analysis of Production and Allocation, pages 98–115. Wiley,
New York.
Gultekin-Karakas, D., Hisarciklilar, M., and Ozturk, H. (2011). Sovereign risk ratings: Biased toward developed
countries? Emerging Markets Finance and Trade, 47(0):69–87.
Hill, C. A. (2004). Regulating the rating agencies. Washington University Law Quarterly, 82:43.
Huang, Z., Chen, H., Hsu, C.-J., Chen, W.-H., and Wu, S. (2004). Credit rating analysis with support vector machines
and neural networks: a market comparative study. Decision Support Systems, 37(4):543–558.
Li, Q. (1996). Nonparametric testing of closeness between two unknown distribution functions. Econometric Reviews,
15:261–274.
Maher, J. J. and Sen, T. K. (1997). Predicting bond ratings using neural networks: A comparison with logistic
regression. Intelligent Systems in Accounting, Finance & Management, 6(1):59–72.
17
Matallın-Saez, J. C., Soler-Domınguez, A., and Tortosa-Ausina, E. (2014). On the informativeness of persistence for
evaluating mutual fund performance using partial frontiers. Omega, 42:47–64.
Meeusen, W. and van den Broeck, J. (1977). Efficiency estimation from Cobb-Douglas production functions with
composed error. International Economic Review, 18(2):435–555.
Murillo-Zamorano, L. R. (2004). Economic efficiency and frontier techniques. Journal of Economic Surveys, 18(1):33–
77.
Shephard, R. W. (1970). Theory of Cost and Production Functions. Princeton University Press, Princeton.
Simar, L. and Wilson, P. W. (2008). Statistical inference in nonparametric frontier models: Recent developments and
perspectives. In Fried, H., Lovell, C. A. K., and Schmidt, S. S., editors, The Measurement of Productive Efficiency,
chapter 4, pages 421–521. Oxford University Press, Oxford, 2nd edition.
Sy, A. N. R. (2009). The systemic regulation of credit rating agencies and rated markets. IMF Working Papers
09/129, International Monetary Fund, Washington.
Thieme, C., Prior, D., and Tortosa-Ausina, E. (2013). A mutilevel decomposition of school performance using robust
nonparametric frontier techniques. Economics of Education Review, 32:104–121.
Treepongkaruna, S. and Wu, E. (2012). Realizing the volatility impacts of sovereign credit ratings information on
equity and currency markets: Evidence from the Asian Financial Crisis. Research in International Business and
Finance, 26(3):335–352.
Wang, G., Hao, J., Ma, J., and Jiang, H. (2011). A comparative assessment of ensemble learning for credit scoring.
Expert Systems with Applications, 38(1):223–230.
18
Table 1: Countries by country segmentation
Developing countries Developed countries
Albania Latvia Australia LuxembourgAzerbaijan Lebanon Austria MacaoBelarus Lithuania Bahrain MaltaBolivia Malaysia Barbados NetherlandsBosnia and Herzegovina Mauritius Belgium New ZealandBotswana Mexico Canada NorwayBrazil Mongolia Cyprus OmanBulgaria Montenegro Czech Republic PortugalCambodia Morocco Denmark QatarChile Panama Estonia Saudi ArabiaChina Papua New Guinea Finland SlovakiaColombia Peru France SloveniaCosta Rica Philippines Germany South AfricaCroatia Poland Greece SpainDominican Republic Romania Hong Kong SwedenEgypt Russia Hungary SwitzerlandEl Salvador Singapore Iceland TaiwanFiji Islands St. Vincent and the Grenadines Ireland Trinidad & TobagoGuatemala Suriname Israel United Arab EmiratesHonduras Thailand Italy United KingdomIndia Tunisia Japan United States of AmericaIndonesia Turkey KoreaJordan Uruguay KuwaitKazakhstan Vietnam
Note: Developing country: Low income, lower middle income, and upper middle income, Developed country: high income OECD andhigh income non-OECD.
19
Table 2: Ratings by country segmentation
Rating Developing countries Developed countries Full sample
B3 4.92 0.00 2.41B2 8.66 0.00 4.25B1 13.78 0.00 6.76Ba3 6.89 0.00 3.38Ba2 11.61 0.00 5.69Ba1 14.96 1.33 8.01Baa3 12.80 2.84 7.72Baa2 8.07 4.55 6.27Baa1 6.89 4.92 5.89A3 3.35 6.06 4.73A2 6.89 8.71 7.82A1 1.18 10.61 5.98Aa3 0.00 6.44 3.28Aa2 0.00 10.04 5.12Aa1 0.00 5.11 2.61Aaa 0.00 39.39 20.08
Total 100.00 100.00 100.00
Note: Developing country: Low income, lower middle income, and upper middle income,Developed country: high income OECD and high income non-OECD.
20
Table 3: Descriptive statistics of country specific variables
Mean Std. Dev. Min Max
Financial and Macroeconomic Indicators (provided by the CRA)
balancegdp 0.614 11.383 –39.600 131.700expendituregdp 34.184 11.104 11.104 58.600financialbalancegdp –1.178 6.026 –23.100 48.400primarybalancegdp 1.381 5.511 –11.400 48.800exportsprcnt 10.800 15.049 –42.700 74.300gdppc 14,729.060 17,617.156 275.000 118,566.000gdpprcnt 9.215 9.368 –29.000 83.400inflation 4.497 5.590 –4.000 68.800investgdpratio 22.857 5.558 5.558 43.200savinggdpratio 24.987 12.257 –12.000 71.500foreignexcreserve 35.856 87.430 0.000 947.990debtgdp 45.439 30.327 0.000 191.600
Governance Indicators (The World Bank)
governmenteffectiveness 0.651 0.860 –1.169 2.408
21
Table 4: Variable definitions
Variable name Description (CRA’s Financial and Macroeconomic Indicators)
balancegdp Ratio of current account balance to GDPfinancialbalancegdp Ratio of general government financial balance to GDPprimarybalancegdp Ratio of general government primary balance to GDPexportsprcnt Nominal exports of goods and services (percentage change, USD)gdppc GDP per capitagdpprcnt Nominal GDP percentage change (local currency)inflation Inflation (CPI)investgdpratio Ratio of gross investment to GDPsavinggdpratio Ratio of domestic saving to GDPforeignexcreserve Official foreign exchange reserves (billion USD)debtgdp Ratio of general government debt to GDP
Variable name Description (Governance Indicators, The World Bank)
governmenteffectiveness Government effectiveness
22
Table 5: Model selection results based on the Li (1996) test, restricted vs. unrestricted models
Null hypothesis T -statistic p-value
H0 : f(Model 1) = g(Model 2) 1.7787 0.0376H0 : f(Model 2) = g(Model 3) 15.7205 0.0000H0 : f(Model 3) = g(Model 4) 12.6977 0.0000H0 : f(Model 4) = g(Model 5) 0.7868 0.2157H0 : f(Model 4) = g(Model 6) 0.4137 0.3396H0 : f(Model 4) = g(Model 7) 1.0173 0.1545
Model 1: x1, x2, x3, x4, x5, x6, y1Model 2: x1, x2, x3, x4, x5, x6, x7, y1Model 3: x1, x2, x3, x4, x5, x6, x7, x8, y1Model 4: x1, x2, x3, x4, x5, x6, x7, x8, x9, y1Model 5: x1, x2, x3, x4, x5, x6, x7, x8, x9, x10, y1Model 6: x1, x2, x3, x4, x5, x6, x7, x8, x9, x11, y1Model 7: x1, x2, x3, x4, x5, x6, x7, x8, x9, x12, y1
23
Table 6: Order-m efficiencies
Mean 1st quartile Median 3rd quartile Std.dev. #
mα=.90Underrated 0.7837 0.7372 0.8318 0.8919 0.1636 72Overrated 1.0580 1.0064 1.0180 1.0567 0.0982 114
mα=.95Underrated 0.7604 0.7122 0.7857 0.8573 0.1578 80Overrated 1.0281 1.0020 1.0065 1.0304 0.0484 48
mα=.99Underrated 0.7372 0.6667 0.7500 0.8571 0.1565 81Overrated 1.0039 1.0011 1.0016 1.0045 0.0050 4
mα=.90 0.8867 0.8363 0.9337 0.9893 0.1417 186Misrating mα=.95 0.8407 0.7500 0.8639 0.9787 0.1642 128
mα=.99 0.7494 0.6671 0.7500 0.8571 0.1624 85
24
Table 7: Order-m efficiencies, developed vs. developing countries
Mean 1st quartile Median 3rd quartile Std.dev. #
mα=.90
DevelopedUnderrated 0.8336 0.7692 0.8553 0.9128 0.09 47Overrated 1.0434 1.0048 1.0164 1.0450 0.0731 59
DevelopingUnderrated 0.6899 0.6040 0.7562 0.8327 0.2228 25Overrated 1.0736 1.0068 1.0267 1.0793 0.1181 55
mα=.95
DevelopedUnderrated 0.7999 0.7475 0.8066 0.8582 0.0994 51Overrated 1.0187 1.0017 1.0043 1.0208 0.0361 25
DevelopingUnderrated 0.6908 0.6000 0.7415 0.8200 0.2116 29Overrated 1.0383 1.0045 1.0120 1.0470 0.0581 23
mα=.99
DevelopedUnderrated 0.7733 0.7143 0.7505 0.8571 0.1062 51Overrated 1.0016 1.0014 1.0016 1.0019 0.0007 2
DevelopingUnderrated 0.6760 0.6000 0.6905 0.8295 0.2049 30Overrated 1.0062 1.0036 1.0062 1.0088 0.0074 2
mα=.90 0.9052 0.8557 0.9318 0.9894 0.0969 106Miss-rated (developed) mα=.95 0.8601 0.7803 0.8581 0.9787 0.1199 76
mα=.99 0.7818 0.7143 0.7857 0.8571 0.1128 53
mα=.90 0.8621 0.8087 0.9529 0.9884 0.1830 80Miss-rated (developing) mα=.95 0.8124 0.7312 0.9043 0.9755 0.2111 52
mα=.99 0.6958 0.6000 0.7143 0.8750 0.2130 32
25
Table 8: Order-m efficiencies, OECD vs. non-OECD countries
Mean 1st quartile Median 3rd quartile Std.dev. #
mα=.90
OECDUnderrated 0.8276 0.7857 0.8420 0.8666 0.0773 20Overrated 1.0434 1.0039 1.0171 1.0400 0.0782 41
Non-OECDUnderrated 0.7668 0.7173 0.8101 0.9030 0.1844 52Overrated 1.0662 1.0067 1.0200 1.0720 0.1075 73
mα=.95
OECDUnderrated 0.8148 0.7781 0.8182 0.8571 0.0851 25Overrated 1.0194 1.0010 1.0032 1.0199 0.0431 14
Non-OECDUnderrated 0.7356 0.6741 0.7500 0.8578 0.1767 55Overrated 1.0316 1.0031 1.0092 1.0383 0.0506 34
mα=.99
OECDUnderrated 0.7932 0.7500 0.7857 0.8571 0.0833 25Overrated – – – – – 0
Non-OECDUnderrated 0.7123 0.6429 0.7159 0.8571 0.1748 56Overrated 1.0039 1.0011 1.0016 1.0045 0.0050 4
mα=.90 0.9184 0.8664 0.9615 0.9922 0.0905 61Miss-rated (OECD) mα=.95 0.8750 0.7873 0.8579 0.9826 0.1080 39
mα=.99 0.7932 0.7500 0.7857 0.8571 0.0833 25
mα=.90 0.8712 0.8101 0.9286 0.9885 0.1590 125Miss-rated (non-OECD) mα=.95 0.8257 0.7300 0.8771 0.9717 0.1819 89
mα=.99 0.7312 0.6442 0.7232 0.8750 0.1832 60
26
Table 9: Order-m efficiencies, euro vs. non-euro countries
Mean 1st quartile Median 3rd quartile Std.dev. #
mα=.90
EuroUnderrated 0.8308 0.7729 0.8218 0.8503 0.0685 5Overrated 1.0280 1.0042 1.0092 1.0287 0.0422 8
Non-euroUnderrated 0.7802 0.7319 0.8327 0.8943 0.1683 67Overrated 1.0602 1.0067 1.0187 1.0624 0.1009 106
mα=.95
EuroUnderrated 0.7965 0.7500 0.7512 0.8182 0.086 5Overrated 1.0041 1.0007 1.0037 1.0071 0.0040 4
Non-euroUnderrated 0.7580 0.6915 0.7857 0.8575 0.1615 75Overrated 1.0303 1.0021 1.0081 1.0319 0.0500 44
mα=.99
EuroUnderrated 0.7886 0.7500 0.7500 0.8182 0.0952 5Overrated – – – – – 0
Non-euroUnderrated 0.7339 0.6636 0.7500 0.8571 0.1596 76Overrated 1.0039 1.0011 1.0016 1.0045 0.0050 4
mα=.90 0.9190 0.8503 0.9486 0.9922 0.0875 13Miss-rated (euro) mα=.95 0.8852 0.7512 0.9375 0.9934 0.1215 9
mα=.99 0.7886 0.7500 0.7500 0.8182 0.0952 5
mα=.90 0.8842 0.8356 0.9328 0.9885 0.1449 173Miss-rated (non-euro) mα=.95 0.8373 0.7500 0.8585 0.9747 0.1669 119
mα=.99 0.7470 0.6667 0.7500 0.8571 0.1658 80
27
Table 10: Order-m efficiencies, pre-crisis (1999–2007) vs. crisis years (2008–2010)
Mean 1st quartile Median 3rd quartile Std.dev. #
mα=.90
Pre-crisisUnderrated 0.7683 0.7309 0.8011 0.8721 0.1705 58Overrated 1.0682 1.006 1.018 1.087 0.1103 80
CrisisUnderrated 0.8478 0.8286 0.8668 0.9277 0.1152 14Overrated 1.0339 1.0069 1.0185 1.035 0.055 34
mα=.95
Pre-crisisUnderrated 0.7316 0.6866 0.75 0.82 0.161 61Overrated 1.0309 1.0026 1.0092 1.0311 0.0522 38
CrisisUnderrated 0.8529 0.8447 0.8788 0.9288 0.105 19Overrated 1.0175 1.0007 1.0034 1.0112 0.0301 10
mα=.99
Pre-crisisUnderrated 0.7076 0.6523 0.7176 0.8101 0.1591 62Overrated 1.0068 1.0045 1.0068 1.0091 0.0066 2
CrisisUnderrated 0.834 0.8258 0.8571 0.892 0.101 19Overrated 1.0011 1.001 1.0011 1.0011 1e-04 2
mα=.90 0.8702 0.8096 0.9177 0.9884 0.1526 138Miss-rated (pre-crisis) mα=.95 0.8239 0.7432 0.8462 0.978 0.1744 99
mα=.99 0.7165 0.6538 0.7232 0.8252 0.1643 64
mα=.90 0.934 0.9229 0.9706 0.9921 0.0903 48Miss-rated (crisis) mα=.95 0.8979 0.8571 0.9286 0.9868 0.1065 29
mα=.99 0.8497 0.8333 0.8593 0.9231 0.1079 21
28
Table 11: Differences among country classifications, results based on the Li (1996) test
Null hypothesis T -statistic p-value
H0 : f(developed) = g(developing) –0.9785 0.8361H0 : f(OECD) = g(non-OECD) 2.0193 0.0217H0 : f(euro) = g(non-euro) 25.8112 0.0000H0 : f(pre-crisis) = g(crisis) 4.2560 0.0000
29
Figure 1: Kernel density plots, overrated and underrated countries
0.0 0.5 1.0 1.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
Density
ofcountries
Developed ——— Developing -------
0.0 0.5 1.0 1.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
Density
ofcountries
OECD ——— Non-OECD -------
0.0 0.5 1.0 1.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
Density
ofcountries
Euro ——— Non-euro -------
0.0 0.5 1.0 1.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
Density
ofcountries
Pre-crisis ——— Crisis -------
Notes: All figures contain densities estimated using kernel density estimation for themisratings yielded by the order-m estimators. The vertical lines in each plot would rep-resents efficiency. The probability mass below 1 represents the underrated countries, thatabove one represents the overrated ones. A Gaussian kernel is chosen, and bandwidthsare estimated using plug-in methods.
30