D4.5
DELIVERABLE
PROJECT INFORMATION
Project Title: Systemic Seismic Vulnerability and Risk Analysis for Build-
ings, Lifeline Networks and Infrastructures Safety Gain
Acronym: SYNER-G
Project N°: 244061
Call N°: FP7-ENV-2009-1
Project start: 01 November 2009
Duration: 36 months
DELIVERABLE INFORMATION
Document Title:
D4.5 - Collection and harmonization of data for indicators
determined in D4.1-4.4 for European regional context
(NUTS3) through existing standardized data.
Date of issue: January 2011
Work Package: WP4
Deliverable/Task Leader: KIT-U
REVISION: FINAL
Project Coordinator:
Institution:
e-mail:
fax:
telephone:
Prof. Kyriazis Pitilakis
Aristotle University of Thessaloniki
+ 30 2310 995619
+ 30 2310 995693
i
Abstract
The objective of Syner-G Work Package (WP) 4 is the assessment of socio-economic im-
pacts due to seismic damages that influence preparedness and response activities in the
context of short-term emergency relief and recovery (2-3 weeks) in the following socio-
economic sectors:
1. Emergency shelter
2. Health care facilities
The WP will establish appropriate methodologies including indicator based systems for inte-
grating socio-economic impacts with fragility functions and performance models developed
in WP 3 and WP 5. This will include consideration of the interdependencies between ele-
ments and systems in the assessment of socio-economic impacts for each sector above.
For each of the sectors above, specific outputs will be produced. For example, three outputs
will be produced for emergency shelter:
1. DH: Displaced households 2. SSH: Shelter seeking households 3. Gap analysis for shelters including Social vulnerability of sheltered population
The outputs will result from combining physical damage of buildings and utilities with a set of
appropriate socio-economic indicators. The physical damage functions are being developed
in WPs 2, 3 and 5, whereas WP4 will develop the socio-economic indicator system. Choice,
direction (+/-) and weighting of indicators should be performed through the following phases:
1. The pool of indicators for urban level should come from the European Urban Audit, the only publicly available pan-European socio-economic database.
2. Perform principal component analysis (PCA) of all data from the urban audit 3. Calibrate/adjust using results from literature study on indicators 4. Expert elicitation using Analytical Hierarchy Process (AHP). Experts should be cho-
sen with civil protection experience. Experts could be chosen from the following pools:
a. End users in Syner-G project b. Representatives of European National Platforms for Disaster Reduction. c. NGOs
5. Final selection made based on combining results from above four processes
This technical note presents the results from Phases 1 & 2 above. The output is in the form
of eight Principal Components at city level and eight Principal Components at sub-city district
level. The Principal Components are extracted from the European Urban Audit indicators in
the period 2003-2006 believed to best represent the available data considering its relevance
to socio-economic vulnerability as well as data variation in a European context.
Keywords: Principal Component Analysis, European Urban Audit, Socio-Economic Vulner-ability, Emergency Shelter needs, Health Service needs
ii
Acknowledgments
The research leading to these results has received funding from the European Community's
Seventh Framework Programme [FP7/2007-2013] under grant agreement n° 244061.
iii
Deliverable Contributors
NGI Bjørn Vidar Vangelsten
KIT-U Bijan Khazai
James Daniell
Tina Kunz-Plapp
iv
Table of Contents
1 Introduction........................................................................................................................................ 1
1.1 BACKGROUND..........................................................................................................1
1.2 PRINCIPAL COMPONENT ANALYSIS – A BRIEF INTRODUCTION ........................2
2 Description of raw data ..................................................................................................................... 1
2.1 DATA SOURCE .........................................................................................................1
2.2 INDICATORS AND SAMPLED DISTRICTS ...............................................................5
2.3 LACK OF COMPLETENESS......................................................................................6
3 Analysis of 1999-2002 data................................................................................................................ 7
3.1 PREPARATION OF DATA SET .................................................................................7
3.1.1 Improving completeness.................................................................................7
3.1.2 Data transformation......................................................................................10
3.1.3 Correlation and removal of highly correlated variables .................................10
3.1.4 Detailed analysis of data to be included in Principal Component Analysis ....12
3.2 PRINCIPAL COMPONENT ANALYSIS ....................................................................17
3.2.1 Eigenvectors, eigen values and explanation of variability .............................17
3.2.2 Selecting principal components ....................................................................17
3.2.3 Detailed analysis of selected principal components......................................17
4 Analysis of 2003-2006 data.............................................................................................................. 21
4.1 CITY LEVEL ANALYSIS...........................................................................................23
4.1.1 Results from correlation analysis ..................................................................23
4.1.2 Results from PCA analysis ...........................................................................27
4.2 SUB-CITY LEVEL ANALYSIS ..................................................................................39
4.2.1 Results from correlation analysis ..................................................................39
4.2.2 Results from PCA analysis ...........................................................................45
4.3 COMPARISON OF CITY AND SUB-CITY DATA......................................................57
5 Summary and recommendations.................................................................................................... 60
6 References......................................................................................................................................... 63
v
List of Figures
Figure 1 Map of Urban Audit City Participant ........................................................................ 1
Figure 2: Probability density function estimates for indicator no. 1-8 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 13
Figure 3: Probability density function estimates for indicator no. 9-16 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 14
Figure 4: Probability density function estimates for indicator no. 17 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 15
Figure 5: Probability density function estimates for indicator nos. 1-8 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 31
Figure 6: Probability density function estimates for indicator nos. 9-16 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 32
Figure 7: Probability density function estimates for indicator nos. 17-24 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 33
Figure 8: Probability density function estimates for indicator nos. 1-8 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 49
Figure 9: Probability density function estimates for indicator nos. 9-16 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 50
Figure 10: Probability density function estimates for indicator nos. 17-21 (bars). Best fit
Generalised Extreme Value distribution (solid line). ......................................... 51
Analysis of 1999-2002 data
vi
List of Tables Table 1 Compilation of Urban Audit Indicators, available on Sub-City-District.
Highlighted are the indicators being most available over the districts.
Sources: European Commission 2010, p. 228-231) .................................. 3
Table 2: List of 44 indicators containing data in the Urban Audit data set ..................... 5
Table 3: The 20 indicators included in the condensed data .......................................... 7
Table 4: No of districts (out of 2820) having data for two paired indicators ................... 9
Table 5: Correlation matrix between the 20 transformed indicators .............................11
Table 6: The 17 indicators included in the principal component analysis. A total of 9 out
of the 17 indicators have been transformed using the equation in Section
3.1.2.........................................................................................................12
Table 7: Parameters for best fit Generalised Extreme Value probability density function
for the 17 transformed indicators..............................................................16
Table 8: Eigenvectors and eigenvalues resulting from the principal component analysis
sorted by decreasing eigenvalue..............................................................18
Table 9: Analysis of the the ten principal components included. Only eigenvector-
components with absolute value larger than 0.3 are listed, helping to
indentify which indicators dominate the principal component....................19
Table 10: Subjective labelling of the ten included principal components......................20
Table 11: Indicators used in the 2003-2006 data analysis at city and sub-city level and
information on expert judgment prioritation of indicators. Parameters
considered to be of high and medium relevance are given the values “H”
and “M” respectively in the column labelled “Priority”. The two last columns
are checked for the parameters believed to be relevant for the Shelter and
Health models respectively.......................................................................21
Table 12: No of cities (out of 377) having data for two paired indicators. Green colour:
Indicator pair with more than 300 data points. Red colour: Indicator pair
with less than 75 data points. ...................................................................24
Table 13: Correlation matrix between the 29 transformed indicators. Red colour: Abs
>0.8. Green colour: Abs<0.2. ...................................................................26
Table 14: The 24 indicators included in the principal component analysis. A total of 14
out of the 24 indicators have been transformed using the equation in
Section 3.1.2. ...........................................................................................28
Table 15: Correlation matrix between the 24 transformed indicators. Red colour: Abs
>0.5. Green colour: Abs<0.1. ...................................................................29
Table 16: Parameters for best fit Generalised Extreme Value probability density
function for the 24 transformed indicators.................................................34
Table 17: Eigenvectors and eigenvalues resulting from the principal component
analysis sorted by decreasing eigenvalue. Components with absolute
value larger than 0.3 marked in red. Components with absolute value
between 0.1 and 0.3 marked in green. .....................................................35
vii
Table 18: Correlation between principal components (PC1 to PC24) and each of the
original indicators. Green: Strong positive correlation. Yellow: Low
correlation. Red: Strong negative correlation............................................37
Table 19: Subjective labelling of the first eight principal components representing close
to 75% of the variation in the data ............................................................39
Table 20: The 24 indicators included in the correlation analysis. A total of 14 out of the
24 indicators have been transformed using the equation in Section 3.1.2.
.................................................................................................................40
Table 21: No of sub-city districts (out of 2466) having data for two paired indicators.
Green colour: Indicator pair with more than 1200 data points. Red colour:
Indicator pair with less than 200 data points.............................................42
Table 22: Correlation matrix between the 24 transformed indicators. Red colour: Abs
>0.5. Green colour: Abs<0.1. ...................................................................44
Table 23: The 21 indicators included in the principal component analysis. A total of 12
out of the 21 indicators have been transformed using the equation in
Section 3.1.2. ...........................................................................................46
Table 24: Correlation matrix between the 21 transformed indicators. Red colour: Abs
>0.4. Green colour: Abs<0.1. ...................................................................47
Table 25: Parameters for best fit Generalised Extreme Value probability density
function for the 21 transformed indicators.................................................52
Table 26: Eigenvectors and eigenvalues resulting from the principal component
analysis sorted by decreasing eigenvalue. Components with absolute
value larger than 0.3 marked in red. Components with absolute value
between 0.1 and 0.3 marked in green. .....................................................53
Table 27: Correlation between principal components (PC1 to PC21) and each of the
original indicators. Green: Strong positive correlation. Yellow: Low
correlation. Red: Strong negative correlation............................................55
Table 28: Subjective labelling of the first eight principal components representing close
to 75% of the variation in the data ............................................................57
Table 29: Comparison of correlation of data at city and sub-city level (Correlation at
sub-city level from Table 24 minus correlation at city level from Table 15).
Components where data were not available at sub-city level are marked
with NA. Components with absolute value larger than 0.5 are marked in
red. Components with absolute value between 0.2 and 0.5 are marked in
yellow. Components with absolute value between 0.1 and 0.2 are
unmarked(white background). Components with absolute value less than
0.1 are marked in green. ..........................................................................58
Introduction
1
1 Introduction
1.1 BACKGROUND
The objective of Syner-G Work Package (WP) 4 is the assessment of socio-economic
impacts due to seismic damages that influence preparedness and response activities in
the context of short-term emergency relief and recovery (2-3 weeks) in the following
socio-economic sectors:
1. Emergency shelter
2. Health care facilities
The WP will establish appropriate methodologies including indicator based systems for
integrating socio-economic impacts with fragility functions and performance models
developed in WP 3 and WP 5. This will include consideration of the interdependencies
between elements and systems in the assessment of socio-economic impacts for each
sector above.
For each of the sectors above, specific outputs will be produced. For example, three
outputs will be produced for emergency shelter:
1. DH: Displaced households 2. SSH: Shelter seeking households 3. Gap analysis for shelters including Social vulnerability of sheltered population
The outputs will result from combining physical damage of buildings and utilities with a
set of appropriate socio-economic indicators. The physical damage functions are being
developed in WPs 2, 3 and 5, whereas WP4 will develop the socio-economic indicator
system. Choice, direction (+/-) and weighting of indicators should be performed through
the following phases:
1. The pool of indicators for urban level should come from the European Urban Audit, the only publicly available pan-European socio-economic database.
2. Perform principal component analysis (PCA) of all data from the urban audit 3. Calibrate/adjust using results from literature study on indicators 4. Expert elicitation using Analytical Hierarchy Process (AHP). Experts should be
chosen with civil protection experience. Experts could be chosen from the fol-lowing pools:
a. End users in Syner-G project b. Representatives of European National Platforms for Disaster Reduction. c. NGOs
5. Final selection made based on combining results from above four processes
This technical note presents the results from Phases 1 & 2 above.
Analysis of 1999-2002 data
2
1.2 PRINCIPAL COMPONENT ANALYSIS – A BRIEF INTRODUCTION
Principal component analysis (PCA) is a technique to analyse data consisting of multi-
ple observations of a set of variables. PCA calculates inter-correlation between vari-
ables and a new set of transformed variables are created where the importance of
each of the new variables in terms of the variability of the data is identified. Granted
that the original data fulfil the requirement of multivariate normal distribution, the trans-
formed variables are all statistically independent. Even for data that are not multivariate
normally distributed, Jolliffe (2002) states that PCA can be very useful to understand
the structure of the data.
In the preface, Jolliffe (2002) states:
Principal component analysis is probably the oldest and best known of the
techniques of multivariate analysis. It was first introduced by Pearson (1901),
and developed independently by Hotelling (1933).
...
The central idea of principal component analysis (PCA) is to reduce the dimen-
sionality of a data set consisting of a large number of interrelated variables,
while retaining as much as possible of the variation present in the data set. This
is achieved by transforming to a new set of variables, the principal components
(PCs), which are uncorrelated, and which are ordered so that the first few retain
most of the variation present in all of the original variables. Computation of the
principal components reduces to the solution of an eigenvalue-eigenvector
problem for a positive-semidefinite symmetric matrix.
Several different methodological formulations are available for how to perform a PCA.
In the PCA tutorial from 2002, Smith, describes a six step procedure:
1. Get some data 2. Subtract the mean 3. Calculate the covariance matrix 4. Calculate the eigenvectors and eigenvalues of the covariance matrix 5. Choose principal components and form a feature vector 6. Derive the new data set
The purpose of analysing the European Urban Audit data using PCA is to
• identify correlation between indicators
• exclude indicators that have no or poor data, or have little variation
• identify which indicators are key to create socio-economic profiles of European urban areas
In the following, the format and content of the Urban Audit raw data will be presented in
more detail (Chapter 2). Chapter 3 present the necessary pre-processing of the data
before the results from the Principal Component Analysis are given in Chapter 4. Chap-
ter 5 summarises and gives recommendations.
Description of raw data
1
2 Description of raw data
2.1 DATA SOURCE
The Statistical Office of the European Communities, Eurostat, offers a huge range of
data across the 27 member states (Eurostat1). From national to local scale they cover
the topics of general and regional statistics, economy and finance, population and so-
cial conditions, industry, trade and services, agriculture and fisheries, external trade,
transport, environment and energy, and science and technology. Statistics of social
conditions are very broad covering information about demographics, health, education
and training, labour market, income, social Inclusion and living conditions, social pro-
tection, household budget, crime and criminal justice, and cultural aspects (European
Commission 2010). Most of the Eurostat data are provided on levels describing na-
tions, greater regions to large cities. These scales are too small for the purpose of shel-
ter needs estimations after disasters, requiring the sub- city district scale for reliable
outcome and precise located needs estimations.
Figure 1 Map of Urban Audit City Participant
1http://epp.eurostat.ec.europa.eu/portal/page/portal/region_cities/city_urban/urban_audit_data_collections
Analysis of 1999-2002 data
2
Urban Audit focuses on city scale, where data are collected on three levels: The core
cities, larger urban zones (LUZ) including the hinterlands, and sub- city- districts (SCD)
gathering inters- urban discrepancies. Information is covered over nine categories:
Demography, Social Aspects, Economic Aspects, Civic Involvement, Training and
Education, Environment, Travel and Transport, Information Society, and Culture and
Recreation. Eurostat is responsible for coordinating the flow of Urban Audit (European
Commission 2010). Figure 1 shows a map of the participating cities. The latest data
collection took place in 2009 with 322 cities in the EU27 plus 47 cities from Switzer-
land, Norway, Croatia, and Turkey. It covers approximately 20 percent of each national
population. The participation of each capital and main regional city was intended.
The results of the latest data collection are not yet published thus the model refers to
previous data sets. The Urban Audit data are freely accessible from Eurostat. Data on
SCD scale are most preferable. Table 1 shows a compilation of variables that are
available on SCD.
The full list of variables used for the creation of indicators is provided in Table 1. Only
data on SCD level are considered within the model, and listed in the tables. Information
on this scale are available about the population size, density, and sex and age distribu-
tion, mortality, the residents origin, number of households and household size, housing
quality, crime rate, unemployment rate, household income, social security benefiters,
education status, and the green space area (European Commission 2010).
Description of raw data
3
Table 1 Compilation of Urban Audit Indicators, available on Sub-City-District. Highlighted are the indicators being most available
over the districts. Sources: European Commission 2010, p. 228-231)
List of Urban Audit Indicators Code Label Numerator Denominator
DE1001I * Total resident population DE1001V -
DE1040I Proportion of total population aged 0-4 DE1040V DE1001V
DE1003I *Proportion of females to males in total population DE1003V DE1002V
DE1061I Total population change over 1 year DE1001V (t) DE1001V (t-1)
DE1062I * Total annual population change over 5 approx.years DE1001V (t) DE1001V (t-n)
DE2005I * Proportion of Residents who are not EU Nationals and citizens of a country with high HDI DE2005V DE1001V
DE2006I * Proportion of Residents who are not EU Nationals and citizens of a country with a medium or low HDI
DE2006V DE1001V
DE3003I * Total number of households DE3001V -
DE3004I * Average size of households DE3017V DE3001V
DE3002I * Proportion of households that are 1-person households DE3002V DE3001V
DE3005I * Prop. of households that are lone-parent households DE3005V DE3001V
Dem
ogra
phy
DE3008I * Prop. households that are lone-pensioner households DE3008V DE3001V
SA1001I * Number of dwellings SA1001V -
SA1018I * Proportion of dwellings lacking basic amenities SA1018V SA1001V
SA1012I * Proportion of households living in social housing SA1012V DE3001V
SA2029I Crude death rate per 1000 residents SA2019V*1000 DE1001V
SA2030I Crude death rate of male residents per 1000 male residents SA2020V*1000 DE1002V
SA2031I Crude death rate of female residents per 1000 female residents SA2021V*1000 DE1003V
Socia
l A
spects
SA2019I Total deaths per year SA2019V
SA2016I Mortality rate for <65 per year SA2016V
DE1040V + DE1043V + DE1046V + DE1052V + DE1025V
SA3001I * Total Number of recorded crimes per 1000 population SA3001V*1000 DE1001V
EC1201I Annual average change in employment over approx. 5 years EC1001V(t)- EC1001V(t-n)
EC1001V -EC1001V(t-n)
nom
ic
As-
EC1010I Number of unemployed EC1010V -
Analysis of 1999-2002 data
4
EC1020I * Unemployment rate EC1010V EC1001V
EC1148I Proportion of residents unemployed 15-24 EC1148V EC1142V
EC1202I Proportion of unemployed who are under 25 EC1148V EC1010V
EC1005I Net activity rate residents aged 15-64 EC1001V-EC1010V
DE1046V + DE1049V + DE1052V + DE1025V
EC1006I
Net activity rate residents aged 15-24 EC1142V-EC1148V
DE1046V + DE1049V
EC1007I Net activity rate residents aged 55-64 EC1145V-EC1151V
DE1025V
EC3039I * Median disposable annual household income (for city or NUTS 3 region) EC3039V -
EC3057I Percent. households with less than half nat.aver.income EC3057V EC3056V
EC3055I Percent. households with less than 60% of the national median annual disposable income EC3055V EC3056V
EC3060I Proportion of households reliant upon social security EC3060V EC3056V
EC3063I Proportion of individuals reliant on social security EC3063V DE1001V
TE2025I * Prop. of working age population qualified at level 1 or 2 ISCED TE2025V
DE1046V + DE1049V + DE1052V + DE1025V
TE2028I Prop. of working age population qualified at level 3 or 4 ISCED TE2028V
DE1046V + DE1049V + DE1052V + DE1025V
Tra
inin
g a
nd e
ducation
TE2031I Prop. of working age population qualified at level 5 or 6 ISCED TE2031V
DE1046V + DE1049V + DE1052V + DE1025V
EN5003I * Total land area (km2) -according to cadastral register EN5003V -
EN5001I Green space (in m2) to which the public has access per capita EN5001V*10000 DE1001V
EN5012I * Proportion of the area in green space EN5012V EN5003V
Enviro
n-
ment
EN5101I * Population density: total resident pop. per square km DE1001V EN5003V
TOTAL 41 Indicators in 5 categories on SCD level
5
2.2 INDICATORS AND SAMPLED DISTRICTS
In the downloaded Urban Audit data set, a total of 958 indicators are listed grouped as follows:
• Demography (DE)
• Economic aspects (EC)
• Environment (EN)
• Social aspects (SE)
• Training and education (TE)
Out of the 958 indicators, data are collected for only 44 indicators, see Table 2. Data has been col-
lected for two periods, 1999-2002 and 2003-2006. During the first period, data has been collected
for 7856 districts in 321 cities in 30 European countries. During the second period, data has been
collected for 2972 districts in 173 cities in 24 European Countries.
Table 2: List of 44 indicators containing data in the Urban Audit data set
Code Description
DE1001I Total population in Urban Audit cities
DE1003I Proportion of females to males in total population
DE1040I Proportion of total population aged 0-4
DE1061I Total population change over 1 year
DE1062I Total annual population change over approx. 5 years
DE2005I Proportion of Residents who are not EU Nationals and citizens of a country with high HDI
DE2006I Proportion of Residents who are not EU Nationals and citizens of a country with a medium or low HDI
DE3002I Proportion of one-person households in Urban Audit cities - %
DE3003I Total Number of Households
DE3004I Average household size in Urban Audit cities - number of persons per household
DE3005I Proportion of households that are lone-parent households
DE3008I Proportion of households that are lone-pensioner households
EC1005I Net activity rate residents aged 15-64
EC1006I Net activity rate residents aged 15-24
EC1007I Net activity rate residents aged 55-64
EC1010I Number of unemployed
EC1020I Unemployment rate in Urban Audit cities - %
EC1148I Proportion of residents unemployed 15-24
EC1201I Annual average change in employment over approx. 5 years
EC1202I Proportion of unemployed who are under 25
EC3039I Median disposable annual household income
EC3055I Percent. households with less than 60% of the national median annual disposable income
EC3057I Percentage of the households receiving less than half of the na-tional average household income
EC3060I Proportion of households reliant upon social security
Analysis of 1999-2002 data
6
Code Description
EC3063I Proportion of individuals reliant on social security
EC3063I Proportion of individuals reliant on social security
EN5001I Green space (in m2) to which the public has access, per capita
EN5001I Green space (in m2) to which the public has access, per capita
EN5003I Total land area (km2) according to cadastral register
EN5012I Proportion of the area in green space
EN5101I Population density in Urban Audit cities
SA1001I Number of dwellings
SA1012I Proportion of households living in social housing
SA1018I Proportion of dwellings lacking basic amenities
SA2016I Mortality rate for <65 per year
SA2019I Total deaths per year
SA2029I Crude death rate per 1000 residents
SA2030I Crude death rate of male residents per 1000 male residents
SA2031I Crude death rate of female residents per 1000 female residents
SA2031I Crude death rate of female residents per 1000 female residents
SA3001I Total number of recorded crimes per 1000 population
TE2025I Prop. of working age population qualified at level 1 or 2 ISCED
TE2028I Prop. of working age population qualified at level 3 or 4 ISCED
TE2031I Proportion of population aged 15-64 qualified at tertiary level (ISCED 5-6) living in Urban Audit cities - %
2.3 LACK OF COMPLETENESS
There is a lot of missing data in the data set. No district has data for all 44 indicators, and no indica-
tor has been collected for all districts. Organising the data into a matrix with indicator as column
heading and district as row heading, only 32 % of the matrix is filled for the 1999-2002 period and
35% for 2003-2006. To arrive at a data set with reasonable completeness for analysis purposes
while keeping an appropriate mix of indicators and geographical country/city spread, requires a
careful selection indicators and districts.
Analysis of 2003-2006 data
7
3 Analysis of 1999-2002 data
3.1 PREPARATION OF DATA SET
In the following, the data set for the period 1999-2002 is considered.
3.1.1 Improving completeness
To improve completeness of the data set, the following procedure of excluding cities and indicators
have been performed:
1. Sort indicators according to completeness, i.e. starting with the indicator having data for the most districts and ending with the indicator having the least data.
2. Subjectively exclude indicators with the least data. Indicators believed to carry information with significant importance for socio economic vulnerability should be kept when possible.
3. Sort cities according to completeness, i.e. starting with the city having data for the most indi-cators and ending with the city with data for the least indicators.
4. Subjectively exclude cities with the least indicators. Cities believed to have high significance, for example being among the last cities of a country, or representing a certain size or type of city should be kept.
5. If necessary, repeat the procedure.
Following the above procedure, a data set with 20 indicators (Table 3) and 2820 districts was cre-
ated representing 161 cities in 20 European countries. This data set has kept the Urban Audit indi-
cators believed to be key for socio economic vulnerability as well as a good geographical represen-
tation in Europe.
Table 3: The 20 indicators included in the condensed data
DE1001I: Total population in Urban Audit cities
EC1020I: Unemployment rate in Urban Audit cities - %
SA1001I: Number of dwellings
DE3003I: Total Number of Households
DE3002I: Proportion of one-person households in Urban Audit cities
DE3008I: Proportion of households that are lone-pensioner households
DE3005I: Proportion of households that are lone-parent households
EN5003I: Total land area (km2) according to cadastral register
EN5101I: Population density in Urban Audit cities
DE1003I: Proportion of females to males in total population
SA1018I: Proportion of dwellings lacking basic amenities
SA1012I: Proportion of households living in social housing
DE2006I: Proportion of Residents who are not EU Nationals and citizens of a
country with a medium or low HDI
DE2005I: Proportion of Residents who are not EU Nationals and citizens of a
country with high HDI
DE3004I: Average household size in Urban Audit cities - number of persons per
household
DE1062I: Total annual population change over approx_ 5 years
EC3039I: Median disposable annual household income
Analysis of 1999-2002 data
8
TE2025I: Prop. of working age population qualified at level 1 or 2 ISCED
EN5012I: Proportion of the area in green space
SA3001I: Total number of recorded crimes per 1000 population
The condensed data set with 20 indicators and 2820 districts has 68% completeness in comparison
to the 32% completeness of the original data set with 44 indicators and 7856 districts.
The diagonal in Table 4 gives more details on the number of districts (out of the 2820) that have
collected data on the various indicators. The indicator with most data is indicator no. 1“Total popula-
tion in Urban Audit cities” with 2804 out of the 2820 districts covered, whereas only 502 out of the
2820 districts have data on indicator no. 20 “Total number of recorded crimes per 1000 population”.
The off-diagonal elements in Table 4 show how much data can be found on each pair of indicators.
The most data can be found for indicator pair 1 and 2 (“DE1001I: Total population in Urban Audit
cities” and “EC1020I: Unemployment rate in Urban Audit cities - %”) with 2690 districts having cor-
responding data. The least data can be found for indicator pair 17 and 18 (“EC3039I: Median dis-
posable annual household income” and “TE2025I: Prop. of working age population qualified at level
1 or 2”) with only 47 districts having collected both these indicators.
The large variation in pair wise data as seen in Table 4 should be kept in mind when analysing the
results of the principal component analysis below as it clearly weakens the rigor of the analysis.
Analysis of 2003-2006 data
9
Table 4: No of districts (out of 2820) having data for two paired indicators
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
DE
100
1I: T
ota
l po
pula
tion in
Urb
an
Au
dit
citie
s
EC
102
0I: U
ne
mplo
ym
ent ra
te in
Urb
an
Audit c
ities - %
SA
100
1I: N
um
be
r of d
wellin
gs
DE
300
3I: T
ota
l Nu
mbe
r of H
ousehold
s
DE
300
2I: P
ropo
rtion o
f one-p
ers
on h
ou
se-
hold
s in
Urb
an A
udit c
ities
DE
300
8I: P
ropo
rtion o
f househ
old
s th
at
are
lon
e-p
ensio
ner h
ouseh
old
s
DE
300
5I: P
ropo
rtion o
f househ
old
s th
at
are
lon
e-p
are
nt h
ouseh
old
s
EN
500
3I: T
ota
l lan
d a
rea (k
m2
) acco
rdin
g
to c
ada
stra
l regis
ter
EN
510
1I: P
op
ula
tion d
ensity
in U
rba
n
Audit c
ities
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g
basic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l ho
usin
g
DE
200
6I: P
rop. o
f res. n
ot E
U n
at. a
nd
citiz
. of a
c.try
w/ m
ed. o
r low
HD
I
DE
200
5I: P
rop. re
s. w
ho a
re n
ot E
U n
at.
and
citiz
. of a
c.try
with
hig
h H
DI
DE
300
4I: A
vera
ge h
ouseh
old
siz
e in
Urb
an A
udit c
ities –
no p
ers
ons p
er h
h
DE
106
2I: T
ota
l an
nu
al p
op
ula
tion c
ha
nge
over a
ppro
x_ 5
yea
rs
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual
hou
seh
old
inco
me
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion
qualifie
d a
t level 1
or 2
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en
space
SA
300
1I: T
ota
l nu
mb
er o
f reco
rded c
rimes
per 1
000
po
pula
tion
1
280
4
269
0
263
3
262
2
258
3
245
4
244
8
230
0
229
9
224
6
205
2
194
5
186
7
182
3
152
9
959
920
677
611
502
2
270
6
253
5
252
4
248
5
235
6
235
0
220
2
218
5
214
8
204
9
195
8
175
3
170
9
143
0
959
920
598
509
456
3
264
9
251
2
247
3
246
9
246
3
219
3
217
6
221
6
206
8
195
8
169
7
165
3
154
4
823
920
648
501
488
4
263
8
259
9
245
8
246
4
223
1
221
4
209
2
200
7
195
8
168
5
164
1
153
2
907
859
597
547
406
5
259
9
245
1
245
7
222
4
220
7
205
3
200
0
195
1
164
6
160
2
152
5
868
859
565
543
374
6
247
0
244
1
210
7
209
0
204
3
199
3
195
2
151
7
147
3
153
6
747
859
548
431
365
7
246
4
209
2
207
5
206
0
200
3
194
1
152
8
148
4
151
5
733
859
565
412
374
8
231
6
229
9
176
6
170
6
177
8
137
5
135
3
119
3
688
818
381
623
433
9
229
9
174
9
168
9
176
4
137
5
135
3
117
8
687
818
365
610
432
10
226
2
199
6
158
0
136
3
131
9
114
5
425
920
693
321
306
11
206
8
163
4
116
6
112
2
108
0
391
920
460
251
222
12
195
8
107
2
107
1
110
5
519
856
286
249
279
13
186
7
182
3
148
7
801
158
677
530
420
14
182
3
146
5
779
158
633
530
420
15
154
4
609
97
533
363
282
16
959
55
144
398
338
17
920
47
56
56
18
693
162
166
19
623
294
20
502
Analysis of 1999-2002 data
10
3.1.2 Data transformation
Most of the 20 indicators in the condensed data set have zero to infinity range. The exceptions are
indicators measured in percent ranging from zero to hundred. For those indicators, the following
transformation has been performed on each indicator, i, to obtain a transformed indicator, itr, with
zero to infinity range:
3.1.3 Correlation and removal of highly correlated variables
Table 5 shows the correlation between data for the transformed indicators. Based on the results it
was decided to exclude indicator nos. 3, 4 and 5:
• 3. SA1001I: Number of dwellings
• 4. DE3003I: Total Number of Households
• 5. DE3002I: Proportion of one-person households in Urban Audit cities
Indicator nos. 3 and 4 were excluded as the information they provide is believed to be well repre-
sented by indicator no. 1 “Total population in Urban Audit cities” (correlation 1 to 3 = 0.97 and 1 to 4
= 0.98). The information in indicator no. 5 is believed to be well represented by indicator no. 15 “Av-
erage household size in Urban Audit cities – no persons per hh” (correlation 5 to 15 = -0.82).
As a result, the 17 indicators shown in Table 6 were included in the principal component analysis.
Analysis of 2003-2006 data
11
Table 5: Correlation matrix between the 20 transformed indicators
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
SA
300
1I: T
ot. n
o. o
f rec. c
rimes
per 1
000
po
p.
EN
501
2I: P
ropo
rtion o
f the a
rea
in g
ree
n s
pace
TE
20
25
I: P
rop
. of
work
ing
a
ge
pop
ula
tion q
ualifie
d a
t level 1
or
2 IS
CE
D
EC
303
9I:
Me
dia
n
dis
posa
ble
ann
ual h
ouse
hold
incom
e
DE
106
2I:
Tota
l an
nu
al
po
pula
-
tion
ch
an
ge
over
app
rox_
5
years
DE
300
4I: A
vera
ge h
h. s
ize –
no.
of p
ers
ons p
er h
h
DE
200
5I: P
rop. o
f res. w
ho a
re
not E
U N
atio
nals
and c
itizen
s o
f
a c
ountry
with
hig
h H
DI
DE
200
6I: P
rop. o
f res. w
ho a
re
not E
U N
at. a
nd c
itiz. o
f a c
ou
n-
try w
ith a
mediu
m o
r low
HD
I
SA
101
2I:
Pro
p.
of
hh.
livin
g
in
socia
l ho
usin
g
SA
101
8I: P
rop. o
f dw
ell. la
ckin
g
basic
am
enitie
s
DE
100
3I:
Pro
po
rtion
of
fem
ale
s
to m
ale
s
EN
510
1I: P
op
ula
tion d
ensity
EN
500
3I: T
ota
l lan
d a
rea (k
m2
)
DE
300
5I:
Pro
po
rtion
of
lo
ne-
pare
nt h
ousehold
s
DE
300
8I:
Pro
p.
of
lon
e-
pen
sio
ner h
ousehold
s
DE
300
2I:
Pro
p.
of
one
-pers
on
hou
seh
. in c
ities
DE
300
3I:
Tota
l N
um
be
r of
Househ
old
s
SA
100
1I: N
um
be
r of d
wellin
gs
EC
102
0I: U
ne
mpl. ra
te in
Urb
an
Audit c
ities - %
DE
100
1I:
Tota
l pop
. in
U
rban
Audit c
ities
IND
ICA
TO
R
-0.0
8
-0.0
5
-0.1
7
0.2
9
-0.0
5
-0.1
9
0.0
9
0.0
9
-0.0
2
0.0
1
-0.0
1
0.0
0
0.3
6
0.1
9
0.1
1
0.0
4
0.9
8
0.9
7
-0.0
7
1.0
0
1
-0.0
4
0.0
5
-0.0
1
-0.3
9
-0.0
8
0.2
9
-0.1
3
-0.0
6
0.5
4
0.1
4
-0.0
1
0.1
3
-0.1
6
0.3
5
-0.0
5
-0.0
8
-0.1
7
-0.0
8
1.0
0
2
-0.0
7
-0.0
9
-0.2
0
0.3
1
-0.0
7
-0.2
8
0.1
5
0.1
3
-0.0
4
0.0
2
0.0
3
0.0
4
0.3
0
0.1
4
0.1
8
0.1
9
0.9
9
1.0
0
3
-0.0
6
-0.0
8
-0.2
5
0.3
4
-0.0
6
-0.4
9
0.2
6
0.1
9
-0.0
4
0.0
2
0.0
2
0.0
6
0.2
2
0.1
6
0.2
0
0.1
9
1.0
0
4
0.3
9
-0.2
4
-0.3
8
0.1
9
-0.1
6
-0.8
2
0.5
1
0.3
3
-0.0
9
0.1
5
0.2
8
0.4
0
-0.3
6
-0.1
6
0.3
6
1.0
0
5
-0.0
4
-0.0
6
-0.3
4
-0.0
8
-0.2
9
-0.5
8
0.1
8
0.2
3
-0.0
1
0.0
9
0.4
9
0.2
0
-0.1
8
0.0
1
1.0
0
6
-0.1
4
0.1
6
-0.3
6
-0.4
8
-0.1
9
-0.0
4
-0.1
7
0.0
0
0.4
8
0.0
1
-0.0
7
-0.0
7
0.0
1
1.0
0
7
-0.1
3
0.1
6
0.3
6
0.0
7
0.1
4
0.2
6
-0.0
9
-0.1
0
-0.1
3
-0.0
1
-0.3
6
-0.4
0
1.0
0
8
0.0
4
-0.2
2
-0.2
4
0.1
4
-0.1
9
-0.2
5
0.0
5
0.0
7
0.0
6
0.1
4
0.2
8
1.0
0
9
-0.1
5
-0.0
6
-0.0
4
0.0
5
-0.2
2
-0.3
9
-0.2
1
-0.1
4
-0.0
2
-0.0
2
1.0
0
10
-0.1
5
-0.0
1
0.4
3
-0.1
6
-0.0
8
-0.1
0
-0.0
9
-0.0
3
0.0
4
1.0
0
11
-0.0
2
-0.0
1
0.1
4
-0.4
7
-0.0
1
0.1
2
-0.0
6
0.1
1
1.0
0
12
0.0
6
-0.1
4
-0.1
4
-0.2
9
-0.1
6
-0.3
3
0.4
3
1.0
0
13
0.0
8
-0.0
9
-0.3
3
0.3
5
-0.0
7
-0.4
5
1.0
0
14
-0.2
7
0.2
8
0.4
9
0.5
7
0.2
6
1.0
0
15
0.0
2
0.0
2
-0.0
2
0.1
9
1.0
0
16
-0.0
3
0.4
3
-0.7
8
1.0
0
17
-0.0
7
-0.0
5
1.0
0
18
-0.1
0
1.0
0
19
1.0
0
20
Analysis of 1999-2002 data
12
Table 6: The 17 indicators included in the principal component analysis. A total of 9 out of
the 17 indicators have been transformed using the equation in Section 3.1.2.
No Indicator name Trans-formed
1 DE1001I: Total population in Urban Audit cities
2 EC1020I: Unemployment rate in Urban Audit cities - % Y
3 DE3008I: Proportion of households that are lone-pensioner households
Y
4 DE3005I: Proportion of households that are lone-parent households
Y
5 EN5003I: Total land area (km2) according to cadastral reg-ister
6 EN5101I: Population density in Urban Audit cities
7 DE1003I: Proportion of females to males in total population
8 SA1018I: Proportion of dwellings lacking basic amenities Y
9 SA1012I: Proportion of households living in social housing Y
10 DE2006I: Proportion of Residents who are not EU Nation-als and citizens of a country with a medium or low HDI
Y
11 DE2005I: Proportion of Residents who are not EU Nation-als and citizens of a country with high HDI
Y
12 DE3004I: Average household size in Urban Audit cities - number of persons per household
13 DE1062I: Total annual population change over approx_ 5 years
14 EC3039I: Median disposable annual household income
15 TE2025I: Prop. of working age population qualified at level 1 or 2 ISCED
Y
16 EN5012I: Proportion of the area in green space Y
17 SA3001I: Total number of recorded crimes per 1000 popu-lation
3.1.4 Detailed analysis of data to be included in Principal Component Analysis
The bar-diagrams in Figure 2 (indicator number 1-8), Figure 3 (indicator number 9-16) and Figure 4
(indicator number 17) are histograms normalised to estimate probability distribution function for the
17 transformed indicators. The solid line shows a best fit generalised extreme value distribution. The
best fit parameters are listed in Table 7.
Analysis of 2003-2006 data
13
0 50 100 150 200 2500
0.02
0.04
0.06
0.08
0.1
0.12
Transformed value
5.
EN
5003I:
Tota
l la
nd a
rea
(km
2)
accord
ing t
o c
adastr
al
regis
ter
0 1 2 3 4 5 6
x 104
0
1
x 10-4
Transformed value
6.
EN
5101I:
Popula
tion d
ensity
in U
rban A
udit c
itie
s}
40 60 80 100 120 140 1600
0.01
0.02
0.03
0.04
0.05
Transformed value
7.
DE
1003I:
Pro
port
ion o
f fe
male
s
to m
ale
s in t
ota
l popula
tion
0 0.5 1 1.50
2
4
6
8
10
12
14
Transformed value
8.
SA
1018I:
Pro
port
ion o
f dw
elli
ngs
lackin
g b
asic
am
enitie
s
0 1 2 3 4
x 105
0
1
2
3
4
5x 10
-5
Transformed value
1.
DE
1001I:
Tota
l popula
tion
in U
rban A
udit c
itie
s
0 2 4 6 8 100
1
2
3
4
5
6
Transformed value
2.
EC
1020I:
Unem
plo
ym
ent
rate
in U
rban A
udit c
itie
s -
%
0 0.1 0.2 0.3 0.40
2
4
6
8
Transformed value
3.
DE
3008I:
Pro
port
ion o
f household
s
that
are
lone-p
ensio
ner
househ
0 0.05 0.1 0.15 0.2 0.250
5
10
15
20
25
Transformed value
4.
DE
3005I:
Pro
port
ion o
f household
s
that
are
lone-p
are
nt
household
Figure 2: Probability density function estimates for indicator no. 1-8 (bars). Best fit General-
ised Extreme Value distribution (solid line).
Analysis of 1999-2002 data
14
0 5 10 150
0.5
1
1.5
2
2.5
3
Transformed value
9.
SA
1012I:
Pro
port
ion o
f household
s
livin
g in s
ocia
l housin
g
0 0.2 0.4 0.6 0.8 10
5
10
15
20
25
Transformed value
10.
DE
2006I:
Pro
port
ion o
f
Resid
ents
who a
re n
ot
EU
Nationals
and c
itiz
ens o
f a c
ountr
y w
ith
a m
ediu
m o
r lo
w H
DI
0 0.05 0.1 0.15 0.20
50
100
150
200
Transformed value
11.
DE
2005I:
Pro
port
ion o
f
Resid
ents
who a
re n
ot
EU
Nationals
and c
itiz
ens o
f a c
ountr
y w
ith
hig
h H
DI
1 2 3 4 5 60
0.2
0.4
0.6
0.8
1
Transformed value
12.
DE
3004I:
Avera
ge h
ousehold
siz
e in U
rban A
udit c
itie
s
- num
ber
of
pers
ons p
er
household
-20 -10 0 10 200
0.05
0.1
0.15
0.2
0.25
0.3
0.35
Transformed value
13.
DE
1062I:
Tota
l annual popula
tion
change o
ver
appro
x 5 y
ears
0 1 2 3 4
x 104
0
0.2
0.4
0.6
0.8
1
1.2x 10
-4
Transformed value
14.
EC
3039I:
Media
n d
isposable
annual household
incom
e
0 2 4 6 8 100
0.5
1
1.5
2
2.5
Transformed value
15.
TE
2025I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 1 o
r 2 I
SC
ED
0 10 20 30 40 500
0.2
0.4
0.6
0.8
Transformed value
16.
EN
5012I:
Pro
port
ion o
f
the a
rea in g
reen s
pace
Figure 3: Probability density function estimates for indicator no. 9-16 (bars). Best fit General-
ised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
15
0 500 1000 1500 2000 2500 3000 3500 40000
1
2
3
4
5
6
7
8x 10
-3
Transformed value
17.
SA
3001I:
Tota
l num
ber
of
record
ed c
rim
es p
er
1000 p
opula
tion
Figure 4: Probability density function estimates for indicator no. 17 (bars). Best fit General-
ised Extreme Value distribution (solid line).
Analysis of 1999-2002 data
16
Table 7: Parameters for best fit Generalised Extreme Value probability density function for
the 17 transformed indicators
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
No
SA
3001I:
Tota
l num
ber
of
record
ed
crim
es
per
1000
popula
tion
EN
5012I: P
roportio
n o
f the a
rea in
gre
en s
pace
TE
2025I: P
rop. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t level
1 o
r 2 IS
CE
D
EC
3039I: M
edia
n d
isposable
annual h
ousehold
incom
e
DE
1062I: T
ota
l annual p
opula
tion c
hange o
ver a
ppro
x_ 5
years
DE
3004I: A
vera
ge h
ousehold
siz
e in
Urb
an A
udit c
ities -
num
ber o
f pers
ons p
er h
ousehold
DE
2005I: P
roportio
n o
f Resid
ents
who a
re n
ot E
U N
a-
tionals
and c
itizens o
f a c
ountry
with
hig
h H
DI
DE
2006I: P
roportio
n o
f Resid
ents
who a
re n
ot E
U N
a-
tionals
and c
itizens o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
SA
1012I: P
roportio
n o
f household
s liv
ing in
socia
l housin
g
SA
1018I: P
roportio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
DE
1003I: P
roportio
n o
f fem
ale
s to
male
s in
tota
l popula
-
tion
EN
5101I: P
opula
tion d
ensity
in U
rban A
udit c
ities
EN
5003I:
Tota
l la
nd are
a (k
m2)
accord
ing to
cadastra
l
regis
ter
DE
3005I: P
roportio
n of
household
s th
at
are
lo
ne-p
are
nt
household
s
DE
3008I:
Pro
portio
n
of
household
s
that
are
lo
ne-
pensio
ner h
ousehold
s
EC
1020I: U
nem
plo
ym
ent ra
te in
Urb
an A
udit c
ities - %
DE
1001I: T
ota
l popula
tion in
Urb
an A
udit c
ities
Ind
icato
r nam
e
0.3
15
1.0
84
0.8
65
-0.1
73
-0.1
23
-0.0
97
1.8
39
1.8
16
0.9
57
0.4
88
-0.2
78
0.6
16
1.0
26
0.0
28
-0.1
39
0.2
63
0.2
64
Sh
ap
e
pa
-
ram
e-
ter, ξξ ξξ
48
.33
0.2
27
0.2
32
48
26
2.3
26
0.4
08
0.0
02
0.0
13
0.0
82
0.0
29
9.7
51
26
85
3.6
11
0.0
22
0.0
49
0.0
61
83
53
Sc
ale
pa
-
ram
e-
ter, σσ σσ
59
.32
0.1
55
0.2
81
14
29
1
-0.4
66
2.1
07
0.0
01
0.0
07
0.0
62
0.0
28
10
7.9
25
59
3.2
21
0.0
44
0.1
04
0.0
94
15
44
7
Lo
ca
-
tion
pa
r., µµ µµ.
Be
st fit
0.2
37
0.9
44
0.7
85
-0.2
09
-0.1
31
-0.1
20
0.8
87
0.4
39
-0.2
88
0.5
57
0.9
68
0.0
04
-0.1
59
0.2
26
0.2
40
Sh
ap
e
pa
-
ram
e-
ter, ξξ ξξ
44
.40
0.1
99
0.2
09
46
00
2.2
43
0.3
93
0.0
77
0.0
28
9.4
95
25
49
3.3
88
0.0
21
0.0
47
0.0
59
80
81
Sc
ale
pa
-
ram
e-
ter, σσ σσ
54
.48
0.1
33
0.2
62
13
95
0
-0.6
22
2.0
85
0.0
58
0.0
27
10
7.5
24
21
3.0
49
0.0
43
0.1
02
0.0
91
15
10
7
Lo
ca
-
tion
pa
r., µµ µµ.
5%
co
nfid
en
ce
inte
rva
l
va
lue
0.3
93
1.2
24
0.9
46
-0.1
38
-0.1
15
-0.0
74
1.0
28
0.5
38
-0.2
69
0.6
75
1.0
85
0.0
52
-0.1
19
0.3
01
0.2
89
Sh
ap
e
pa
-
ram
e-
ter, ξξ ξξ
52
.61
0.2
59
0.2
57
50
64
2.4
12
0.4
24
0.0
88
0.0
31
10
.01
28
29
3.8
48
0.0
22
0.0
50
0.0
63
86
34
Sc
ale
pa
-
ram
e-
ter, σσ σσ
64
.17
0.1
78
0.3
00
14
63
2
-0.3
10
2.1
29
0.0
67
0.0
30
10
8.3
26
98
3.3
93
0.0
45
0.1
06
0.0
97
15
78
8
Lo
ca
-
tion
pa
r., µµ µµ.
95
% c
on
fiden
ce
inte
rva
l
va
lue
Analysis of 2003-2006 data
17
3.2 PRINCIPAL COMPONENT ANALYSIS
The principal component analysis was performed on a standardised data set, where the standard-
ised variable, ztr, was calculated as:
Here itr is the transformed data, µtr is the mean of the transformed data and σtr is the standard de-
viation of the transformed data.
3.2.1 Eigenvectors, eigen values and explanation of variability
The eigenvectors with corresponding eigenvalues resulting from the principal component analysis
are shown in Table 8. The eigenvectors are sorted by decreasing eigenvalue, showing the eigen-
vector representing the largest variation (17.8%) in the data set in the first column and the eigenvec-
tor representing the least variation (0.2%) in column number 17.
3.2.2 Selecting principal components
The dimensions of the data set can now be reduced by omitting the principal components represent-
ing the least variation in the data set.
The third row in Table 8 shows cumulative explanation of variability as more eigenvectors are in-
cluded in the new data set. For example, including 10 out of the 17 eigenvectors, i.e. all eigenvec-
tors with eigenvalues larger than 0.7, gives a data set containing 85.5 % of the variation in the origi-
nal data set.
3.2.3 Detailed analysis of selected principal components
Base on the above analysis, it is decided to keep the first 10 principal components. In this section,
these components are analysed in order to understand what type of data that dominate each of
them.
Analysis of 1999-2002 data
18
Table 8: Eigenvectors and eigenvalues resulting from the principal component analysis
sorted by decreasing eigenvalue
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
No
SA
3001
EN
5012
TE
2025
EC
3039
DE
1062
DE
3004
DE
2005
DE
2006
SA
1012
SA
1018
DE
1003
EN
5101
EN
5003
DE
3005
DE
3008
EC
1020
DE
1001
Ind
icato
r co
de
Eig
en
valu
e
Cu
mu
lativ
e e
xp
lan
atio
n
of v
aria
bility
(%)
Exp
lan
atio
n o
f varia
bil-
ity (%
)
Eig
en
vecto
r no
-0.0
6
0.1
6
0.3
8
0.1
2
0.2
2
0.4
9
-0.2
5
-0.2
4
-0.0
4
-0.0
2
-0.2
5
-0.2
3
0.3
1
-0.1
1
-0.4
2
0.0
0
-0.0
6
3.0
8
17
.8
17
.8
1
-0.0
8
0.0
9
0.0
2
-0.5
4
-0.1
0
0.0
7
-0.2
0
-0.0
5
0.4
8
0.1
0
-0.0
4
-0.0
5
-0.0
4
0.4
3
-0.0
5
0.4
5
-0.0
4
2.4
7
32
.0
14
.2
2
-0.1
0
0.0
1
0.0
8
0.1
3
0.0
2
0.1
5
-0.3
6
-0.3
5
-0.0
3
0.1
4
0.4
6
0.3
7
-0.4
0
-0.1
9
0.0
5
0.1
1
-0.3
3
1.7
6
42
.2
10
.2
3
-0.3
8
0.3
7
-0.3
1
0.1
6
-0.0
8
0.0
1
-0.2
3
-0.2
8
-0.1
8
-0.1
3
0.2
0
-0.0
7
0.2
0
0.3
5
0.1
9
-0.1
2
0.3
8
1.5
0
50
.9
8.7
4
0.2
5
0.2
1
-0.4
8
0.0
5
0.2
7
0.0
7
0.0
6
-0.1
0
-0.0
2
-0.4
7
-0.1
8
-0.0
7
-0.3
3
0.1
9
-0.1
9
-0.0
3
-0.3
6
1.4
3
59
.1
8.3
5
-0.0
9
-0.5
8
-0.1
4
0.0
2
0.5
2
0.0
0
-0.0
7
-0.1
4
0.1
9
-0.2
0
0.0
5
0.2
4
0.0
2
0.0
2
-0.1
5
0.0
7
0.4
3
1.0
2
65
.0
5.9
6
0.2
5
-0.1
8
0.1
6
-0.3
0
0.0
2
-0.2
3
-0.2
3
-0.2
2
-0.0
1
-0.3
6
0.3
7
-0.5
1
0.1
4
-0.1
4
0.1
9
-0.1
3
-0.1
0
1.0
0
70
.8
5.8
7
0.5
4
0.0
7
-0.1
7
-0.0
3
0.2
1
-0.2
6
-0.1
5
-0.3
6
-0.1
9
0.5
6
-0.1
7
0.0
3
0.1
3
0.0
8
0.0
3
0.0
5
0.1
0
0.9
5
76
.3
5.5
8
-0.4
8
0.1
5
-0.1
1
-0.1
4
0.5
6
-0.1
5
0.2
1
0.0
5
-0.1
1
0.2
1
0.0
2
-0.2
6
0.0
4
-0.2
4
0.2
0
0.2
4
-0.2
3
0.8
7
81
.3
5.0
9
0.2
1
0.4
4
0.2
2
0.0
8
0.3
5
-0.0
2
0.1
6
0.0
7
0.5
4
-0.0
3
0.2
2
0.2
3
0.1
5
0.0
2
0.1
8
-0.3
2
0.0
4
0.7
3
85
.5
4.2
10
-0.2
6
-0.2
6
-0.0
2
0.0
2
0.0
4
-0.1
0
-0.1
0
-0.0
2
0.1
8
0.3
5
-0.0
6
-0.2
3
-0.2
0
0.3
1
-0.1
5
-0.6
5
-0.2
2
0.6
9
89
.4
4.0
11
-0.1
0
0.0
0
0.0
6
0.0
1
-0.2
3
-0.1
0
0.6
3
-0.6
6
0.1
7
-0.0
1
0.0
3
-0.0
6
-0.0
5
-0.0
8
-0.2
0
0.0
3
0.0
4
0.5
6
92
.7
3.2
12
-0.1
7
-0.1
5
-0.0
1
-0.1
6
-0.0
7
-0.1
4
-0.0
7
-0.2
0
-0.0
3
-0.2
0
-0.3
3
0.4
6
0.4
8
0.0
4
0.2
3
-0.1
4
-0.4
4
0.4
0
95
.0
2.3
13
-0.0
8
0.2
2
0.0
7
-0.3
2
0.0
5
-0.3
9
-0.0
1
0.1
6
-0.2
7
-0.0
5
0.2
8
0.2
5
0.1
0
0.0
2
-0.6
5
-0.0
9
0.0
0
0.3
9
97
.2
2.2
14
-0.1
3
0.2
4
-0.0
3
-0.2
8
-0.0
7
-0.1
6
-0.2
7
-0.1
0
0.1
4
-0.0
9
-0.4
4
0.0
5
-0.3
5
-0.4
9
0.0
5
-0.2
1
0.3
0
0.2
9
98
.9
1.7
15
0.0
7
-0.0
4
-0.5
5
-0.3
5
-0.1
2
0.4
8
0.0
8
0.0
5
0.0
9
0.1
8
0.2
3
0.0
2
0.2
6
-0.3
3
-0.0
8
-0.2
2
-0.0
1
0.1
5
99
.8
0.9
16
-0.0
5
0.0
1
-0.2
9
0.4
5
-0.1
8
-0.3
6
-0.2
5
0.0
8
0.4
4
0.0
3
0.0
2
-0.1
4
0.2
6
-0.2
6
-0.2
5
0.2
1
-0.1
4
EIG
EN
VE
CT
OR
CO
MP
ON
EN
TS
0.0
4
10
0
0.2
17
Analysis of 2003-2006 data
19
Table 9: Analysis of the the ten principal components included. Only eigenvector-
components with absolute value larger than 0.3 are listed, helping to indentify which indica-
tors dominate the principal component.
Subjective descriptor for principal
component
Ba
sic
, su
bu
rba
n, fa
mily
are
a
Po
ve
rty a
nd
so
cia
l pro
ble
ms
De
nse
u
rba
nis
m w
ith lo
w im
mig
ra-
tion
De
ve
lop
me
nt a
nd
urb
anis
m
So
cia
l d
eve
lop
men
t a
nd
sm
alln
ess
of d
istric
t
Urb
an
pop
ula
tion
with
imm
igra
tion
Sca
rce
pop
ula
tion a
nd s
ocia
l de
vel-
op
men
t
Po
ve
rty, c
rime
and
so
cia
l pro
ble
ms
Po
pu
latio
n g
row
th a
nd
low
crim
e
Po
pu
latio
n g
row
th an
d socia
l p
rob
-
lem
s
Principal component number 1 2 3 4 5 6 7 8 9 10
NoIndicator name Eigenvector components with absolute value > 0.3
1DE1001I: Total population in Urban
Audit cities -0.3 0.38 -0.4 0.43
2EC1020I: Unemployment rate in
Urban Audit cities - % 0.45 -0.3
3DE3008I: Proportion of households
that are lone-pensioner households -0.4
4DE3005I: Proportion of households
that are lone-parent households 0.43 0.35
5EN5003I: Total land area (km2)
according to cadastral register 0.31 -0.4 -0.3
6EN5101I: Population density in Ur-
ban Audit cities 0.37 -0.5
7DE1003I: Proportion of females to
males in total population 0.46 0.37
8SA1018I: Proportion of dwellings
lacking basic amenities -0.5 -0.4 0.56
9SA1012I: Proportion of households
living in social housing 0.48 0.54
10
DE2006I: Proportion of Residents
who are not EU Nationals and citi-
zens of a country with a medium or
low HDI
-0.4 -0.4
11
DE2005I: Proportion of Residents
who are not EU Nationals and citi-
zens of a country with high HDI
-0.4
12
DE3004I: Average household size in
Urban Audit cities - number of per-
sons per household
0.49
13DE1062I: Total annual population
change over approx_ 5 years 0.52 0.56 0.35
14EC3039I: Median disposable annual
household income -0.5
15TE2025I: Prop. of working age popu-
lation qualified at level 1 or 2 ISCED 0.38 -0.3 -0.5
16EN5012I: Proportion of the area in
green space 0.37 -0.6 0.44
17SA3001I: Total number of recorded
crimes per 1000 population -0.4 0.54 -0.5
Analysis of 1999-2002 data
20
Table 9 shows which of the eigenvector-components in the ten included principal components that
have absolute value larger than 0.3. This helps identify which indicators dominate each of the prin-
cipal components. For the first principal component, the following four indicators dominate:
No Indicator name Eigenvector
component
3 DE3008I: Proportion of households that are lone-pensioner
households -0.4
5 EN5003I: Total land area (km2) according to cadastral register 0.31
12 DE3004I: Average household size in Urban Audit cities - num-
ber of persons per household 0.49
15 TE2025I: Prop. of working age population qualified at level 1
or 2 ISCED 0.38
A high value for this principal component will therefore typically indicate a spatially large district with
large household size and basic education level. Subjectively it has been chosen to label this princi-
pal component “Basic, suburban family area”. In a similar manner, all ten principal components have
been labelled (Table 10). It should be noted that the label is subjective and a high value of the prin-
cipal component may be caused by other phenomena.
Table 10: Subjective labelling of the ten included principal components
No Subjective descriptor for principal component
1 Basic, suburban, family area
2 Poverty and social problems
3 Dense urbanism with low immigration
4 Development and urbanism
5 Social development and smallness of district
6 Urban population with immigration
7 Scarce population and social development
8 Poverty, crime and social problems
9 Population growth and low crime
10 Population growth and social problems
Analysis of 2003-2006 data
21
4 Analysis of 2003-2006 data
The initial selection of indicators for the 2003-2006 analysis is based on subjective expert opinion,
considering which indicators are more relevant for socio-economic vulnerability to earthquakes, in
particular needs and capacity for emergency health care and public emergency shelter. Table 11
shows the results from the selection process. Parameters considered to be of high and medium
relevance are given the values “H” and “M” respectively in the column labelled “Priority”. The two
last columns are checked for the parameters believed to be relevant for the Shelter and Health
models respectively. All parameters listed in the table are Urban Audit Indicators, except the last
three which are Urban Audit Variables. Indicators, as defined in the Urban Audit, are parameters
calculated from other Indicators or Variables. A Variable is a parameter that is directly given by the
collected data. This gives a total list of 30 parameters; 27 indicators and 3 variables; believed to be
relevant for the Shelter and Health models.
As not all data is available at sub-city level, analyses has been carried out at both city and sub-city
level. Which parameters are included in each of the four analyses (two at city level and two at sub-
city level) are also shown in Table 11.
Table 11: Indicators used in the 2003-2006 data analysis at city and sub-city level and infor-
mation on expert judgment prioritation of indicators. Parameters considered to be of high
and medium relevance are given the values “H” and “M” respectively in the column labelled
“Priority”. The two last columns are checked for the parameters believed to be relevant for
the Shelter and Health models respectively.
Parameters in-
cluded in the
analysis
Prio
rity
(H =
hig
h,
M =
me
diu
m)
Sh
elte
r
(x =
rele
va
nt)
He
alth
(x =
rele
va
nt)
Code List of Urban Audit Indicators and Variables
City
level: C
orre
latio
n (2
9)
City
level: P
CA
(24
)
Su
b-C
ity le
ve
l: Co
rrela
tion
(24)
Su
b-C
ity le
ve
l: PC
A (2
1)
DE1001I Total resident population
DE1040I Proportion of total population aged 0-4 x x x x H x x
DE1003I Proportion of females to males in total population x x x x M x
DE1061I Total population change over 1 year
DE1062I Total annual population change over 5 approx.years
DE2005I Proportion of Residents who are not EU Nationals
and citizens of a country with high HDI x x x x H x x D
em
og
rap
hy
DE2006I Proportion of Residents who are not EU Nationals x x x x H x x
Analysis of 2003-2006 data
22
and citizens of a country with a medium or low HDI
DE3003I Total number of households x x x x M x
DE3004I Average size of households x x x x H x x
DE3002I Proportion of households that are 1-person house-
holds
DE3005I Prop. of households that are lone-parent households x x x x M x
DE3008I Prop. households that are lone-pensioner house-
holds x x x x H x x
SA1001I Number of dwellings
SA1018I Proportion of dwellings lacking basic amenities x x x x H x
SA1012I Proportion of households living in social housing x x x x M x x
SA2029I Crude death rate per 1000 residents x x x x H x
SA2030I Crude death rate of male residents per 1000 male
residents x
2) H x
SA2031I Crude death rate of female residents per 1000 fe-
male residents x
2) H x
SA2019I Total deaths per year x 3)
H x
SA2016I Mortality rate for <65 per year x x x x H x
So
cia
l A
sp
ects
SA3001I Total Number of recorded crimes per 1000 popula-
tion x x x x H x
EC1201I Annual average change in employment over approx.
5 years
EC1010I Number of unemployed
EC1020I Unemployment rate x x x x M x x
EC1148I Proportion of residents unemployed 15-24
EC1202I Proportion of unemployed who are under 25
EC1005I Net activity rate residents aged 15-64
EC1006I Net activity rate residents aged 15-24
EC1007I Net activity rate residents aged 55-64
EC3039I Median disposable annual household income (for
city or NUTS 3 region) x x x x M x x
EC3057I Percent. households with less than half
nat.aver.income x x x
7) H x x
EC3055I Percent. households with less than 60% of the na-
tional median annual disposable income x
6) H x x
EC3060I Proportion of households reliant upon social security x x x 7)
M x
Eco
nom
ic A
sp
ects
EC3063I Proportion of individuals reliant on social security
TE2025I Prop. of working age population qualified at level 1
or 2 ISCED x x x x H x x
TE2028I Prop. of working age population qualified at level 3
or 4 ISCED x x x x H x x
Tra
inin
g a
nd
ed
uca
tion
TE2031I Prop. of working age population qualified at level 5
or 6 ISCED x x x x H x x
EN5003I Total land area (km2) -according to cadastral regis-
ter
EN5001I Green space (in m2) to which the public has access
per capita x x x x H x
EN5012I Proportion of the area in green space x x x x H x
En
vir
on
men
t
EN5101I Population density: total resident pop. per square km x x x x H x x
DE1028V Total Resident Population 65-74 x H x x
DE1055V Total Resident Population 75 and over x x
1) x
1) 5)
H x x
Va
ria
ble
s
CI1009V City Elections: Number of voters turned out 4)
H x x
1) The variables “DE1028V Total Resident Population 65-74” and “DE1055V Total Resident Population 75 and over” were merged in this analysis
2) The indicators “SA2030I Crude death rate of male residents per 1000 male residents” and “SA2031I Crude death rate of female residents per 1000 female residents” were excluded in the PCA city level
Analysis of 2003-2006 data
23
analysis as they both were strongly correlated with indicator “SA2029I Crude death rate per 1000 residents” (correlation = 0.93 and 0.94 respectively)
3) The indicator “SA2019I Total deaths per year” was excluded from the PCA city level analysis as it cor-related strongly with indicator “DE3003I Total number of households” (correlation = 0.98)
4) The variable “CI1009V City Elections: Number of voters turned out” was excluded from all analyses as it contained no data both at city and sub-city level.
5) The two remaining Urban Audit Variables included in the study had data at city level only. No sub-city data are available
6) The information contained in the indicator “EC3055I Percentage households with less than 60% of the national median annual disposable income” is represented in the indicator “EC3057I Percent. house-holds with less than half the national average income”. EC3055I was therefore removed.
7) The indicators “EC3057I Percent. households with less than half nat. aver. income” and “EC3060I Proportion of households reliant upon social security” has no pairwise city level data with the three education indicators TE2025I, TE2028I and TE2031I. It is therefore impossible to carry out principal component analysis on these data combinations. EC3057I and EC3060I were therefore removed in the PCA Sub-City level analysis.
4.1 CITY LEVEL ANALYSIS
The city level analysis was carried out in two stages:
1) The first phase studied correlation between all prioritised indicators and variables that had data. A total of 29 parameters were included as shown in Table 11 in the column labelled “City level: Correlation”.
2) The second phase was the principal component analysis. Compared to Phase 1, four indica-tors were removed due to high correlation and the two variables DE1028V and DE1055 were merged into one variable counting population aged 65 and above. Thus a total of 24 pa-rameters were included in the principal component analysis.
4.1.1 Results from correlation analysis
Table 12 shows the number of cities (out of 377) having data for two paired indicators. Green colour
is used for indicator pairs with more than 300 data points. Red colour is used for indicator pair with
less than 75 data points. The diagonal in Table 12 gives more details on the number of cities (out of
the 377) that have collected data on the various indicators. The indicators with most data are indica-
tor nos. 1 and 2 with 367 out of the 377 cities having collected data, whereas only 80 out of the 377
cities have data on indicator no. 29.
The off-diagonal elements in Table 12 show how much data can be found on each pair of indicators.
The most data can be found for indicator pair 1 and 2 with 360 cities having corresponding data.
The least data can be found for indicator pair 25 and 29 (“SA1018I: Proportion of dwellings lacking
basic amenities” and “EC3055I: Percent. households with less than 60% of the national median an-
nual disposable income”) with only 16 of 377 cities having collected both these indicators.
The large amount of missing data this indicates should be kept in mind when interpreting and using
the results from the correlation and PCA analysis, as this significantly weakens the rigour of the re-
sults. However, it is still believed that the results can be of practical use in studying underlying struc-
tures in the societies the data covers.
Analysis of 2003-2006 data
24
Table 12: No of cities (out of 377) having data for two paired indicators. Green colour: Indica-
tor pair with more than 300 data points. Red colour: Indicator pair with less than 75 data
points.
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
78
80
103
112
138
144
144
176
169
181
183
198
199
205
205
205
210
235
250
258
328
328
335
343
323
333
333
360
367
1
78
79
103
111
135
145
143
177
168
183
181
195
198
201
201
203
213
234
249
252
321
321
335
336
324
333
333
367
360
2
79
80
104
111
135
146
144
177
169
183
183
195
199
200
200
204
207
208
246
254
294
294
334
332
321
335
335
333
333
3
79
80
104
111
135
146
144
177
169
183
183
195
199
200
200
204
207
208
246
254
294
294
334
332
321
335
335
333
333
4
76
77
101
110
134
143
138
175
163
181
176
191
193
200
200
197
202
205
243
246
285
285
321
323
324
321
321
324
323
5
78
80
103
113
138
144
144
176
169
181
184
199
199
205
205
206
210
213
249
258
306
306
334
344
323
332
332
336
343
6
78
79
103
111
136
145
143
177
168
182
182
196
198
201
201
204
209
208
246
253
296
296
336
334
321
334
334
335
335
7
75
77
65
110
103
106
106
174
131
179
146
164
161
167
167
168
172
198
211
256
329
329
296
306
285
294
294
321
328
8
75
77
65
110
103
106
106
174
131
179
146
164
161
167
167
168
172
198
211
256
329
329
296
306
285
294
294
321
328
9
62
64
54
95
81
87
87
153
110
182
123
135
140
145
145
143
157
148
190
261
256
256
253
258
246
254
254
252
258
10
73
75
98
105
120
118
122
146
124
130
132
146
154
178
178
153
149
166
250
190
211
211
246
249
243
246
246
249
250
11
75
77
102
92
79
146
146
118
171
106
161
157
201
171
171
167
162
238
166
148
198
198
208
213
205
208
208
234
235
12
65
67
86
93
94
128
126
91
149
116
163
178
151
150
150
185
217
162
149
157
172
172
209
210
202
207
207
213
210
13
76
78
99
100
104
138
134
85
157
98
186
197
159
154
154
208
185
167
153
143
168
168
204
206
197
204
204
203
205
14
63
64
100
98
95
115
137
110
137
88
132
148
167
206
206
154
150
171
178
145
167
167
201
205
200
200
200
201
205
15
63
64
100
98
95
115
137
110
137
88
132
148
167
206
206
154
150
171
178
145
167
167
201
205
200
200
200
201
205
16
68
69
102
83
76
141
145
111
170
100
153
150
201
167
167
159
151
201
154
140
161
161
198
199
193
199
199
198
199
17
69
71
92
97
102
128
124
79
147
91
175
199
150
148
148
197
178
157
146
135
164
164
196
199
191
195
195
195
198
18
76
78
99
78
82
134
128
83
151
92
186
175
153
132
132
186
163
161
132
123
146
146
182
184
176
183
183
181
183
19
58
58
51
73
52
80
67
125
90
184
92
91
100
88
88
98
116
106
130
182
179
179
182
181
181
183
183
183
181
20
65
66
99
74
76
139
144
83
171
90
151
147
170
137
137
157
149
171
124
110
131
131
168
169
163
169
169
168
169
21
68
68
60
78
48
76
74
178
83
125
83
79
111
110
110
85
91
118
146
153
174
174
177
176
175
177
177
177
176
22
65
66
101
73
76
117
146
74
144
67
128
124
145
137
137
134
126
146
122
87
106
106
143
144
138
144
144
143
144
23
71
71
101
75
63
146
117
76
139
80
134
128
141
115
115
138
128
146
118
87
106
106
145
144
143
146
146
145
144
24
Analysis of 2003-2006 data
25
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
16
17
48
42
138
63
76
48
76
52
82
102
76
95
95
104
94
79
120
81
103
103
136
138
134
135
135
135
138
25
78
80
65
115
42
75
73
78
74
73
78
97
83
98
98
100
93
92
105
95
110
110
111
113
110
111
111
111
112
26
66
66
105
65
48
101
101
60
99
51
99
92
102
100
100
99
86
102
98
54
65
65
103
103
101
104
104
103
103
27
80
82
66
80
17
71
66
68
66
58
78
71
69
64
64
78
67
77
75
64
77
77
79
80
77
80
80
79
80
28
80
80
66
78
16
71
65
68
65
58
76
69
68
63
63
76
65
75
73
62
75
75
78
78
76
79
79
78
78
29
Table 13 shows the correlation matrix between the 29 transformed indicators. Red colour is used to
indicate highly correlated indicators (correlation >0.8 or <-0.8). Green colour is used to indicate rela-
tively independent indicators (-0.2 < correlation <0.2). It can be seen that only a few indicator pairs
are highly correlated, a larger number of indicator pairs are somewhat correlated whereas the ma-
jority of indicator pairs are relatively independent.
For the principal component analysis it was decided to exclude some of the indicators that are well
represented by other indicators. The indicators “SA2030I Crude death rate of male residents per
1000 male residents” and “SA2031I Crude death rate of female residents per 1000 female resi-
dents” were excluded as they both are strongly correlated with indicator “SA2029I Crude death rate
per 1000 residents” (correlation = 0.93 and 0.94 respectively). In addition, the indicator “SA2019I
Total deaths per year” was excluded as it correlates strongly with indicator “DE3003I Total number
of households” (correlation = 0.98).
For the principal component analysis it was also decided to merge the two variables counting elderly
people into one variable:
• DE1028V: Total Resident Population 65-74
• DE1055V: Total Resident Population 75 and over
As should be expected, the two variables are highly correlated (0.96). The resulting variable counts
all people aged 65 and over.
The information contained in the indicator “EC3055I Percentage households with less than 60% of
the national median annual disposable income” is well represented in the indicator “EC3057I Per-
cent. households with less than half the national average income” (correlation 0.86). EC3055I was
therefore excluded in the principal component analysis. As a result, the 29 indicators used in the
correlation analysis was reduced to 24 indicators for the principal component analysis.
Analysis of 2003-2006 data
26
Table 13: Correlation matrix between the 29 transformed indicators. Red colour: Abs >0.8.
Green colour: Abs<0.2.
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
5I: P
erc
en
t. hou
seh
old
s w
ith le
ss th
an 6
0%
of
the n
atio
nal m
edia
n a
nnu
al d
isposable
inco
me
EC
305
7I:
Pe
rce
nt.
ho
usehold
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
306
0I: P
rop
ortio
n o
f hou
seh
old
s re
liant u
po
n s
ocia
l
secu
rity
EC
303
9I: M
edia
n d
ispo
sable
an
nu
al h
ouse
hold
inco
me
(for c
ity o
r NU
TS
3 re
gio
n)
SA
101
8I: P
ropo
rtion o
f dw
ellin
gs la
ckin
g b
asic
am
eni-
ties
SA
101
2I:
Pro
portio
n
of
ho
usehold
s
livin
g
in
socia
l
hou
sin
g
DE
300
8I:
Pro
p.
househ
old
s
that
are
lo
ne
-pensio
ner
hou
seh
old
s
EN
500
1I: G
reen s
pace (in
m2) to
whic
h th
e p
ublic
has
access p
er c
apita
DE
300
5I:
Pro
p.
of
househ
old
s
tha
t are
lo
ne-p
are
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 5
or 6
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I:
Pro
po
rtion
of
Resid
ents
w
ho
are
not
EU
Natio
nals
an
d c
itizens o
f a c
oun
try w
ith a
me
diu
m o
r
low
HD
I
DE
200
5I:
Pro
po
rtion
of
Resid
ents
w
ho
are
not
EU
Natio
nals
an
d c
itizens o
f a c
ou
ntry
with
hig
h H
DI
TE
20
28
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I: T
ota
l Num
be
r of re
co
rded c
rime
s p
er 1
000
pop
ula
tion
EN
510
1I:
Pop
ula
tion
de
nsity
: to
tal
resid
en
t p
op.
per
squ
are
km
DE
105
5V
: Tota
l Resid
ent P
op
ula
tion 7
5 a
nd o
ve
r
DE
102
8V
: Tota
l Resid
ent P
op
ula
tion 6
5-7
4
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
203
1I:
Cru
de
de
ath
ra
te
of
fem
ale
re
sid
ents
pe
r
100
0 fe
male
resid
ents
SA
203
0I: C
rud
e d
eath
rate
of m
ale
resid
en
ts p
er 1
000
male
resid
ents
SA
201
9I: T
ota
l dea
ths p
er y
ea
r
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pu
-
latio
n
-0.2
3
-0.1
5
-0.1
5
-0.5
4
0.4
5
-0.3
2
0.3
3
0.0
0
0.0
2
-0.1
8
0.1
0
-0.0
7
-0.0
3
0.0
4
-0.1
7
0.3
6
0.2
5
-0.0
5
-0.2
3
0.0
9
0.0
9
0.0
5
0.3
0
-0.4
0
0.4
5
0.1
2
0.4
5
0.0
5
1.0
0
1
0.0
9
-0.0
7
-0.1
7
0.0
4
0.0
1
0.0
1
0.2
8
-0.0
7
0.0
5
-0.2
1
0.1
0
0.0
0
-0.0
9
0.2
7
0.2
6
-0.1
1
-0.0
7
0.9
8
0.1
4
0.3
4
0.9
6
0.9
8
0.1
3
-0.0
3
0.0
5
0.1
6
0.0
9
1.0
0
0.0
5
2
0.0
9
-0.0
6
-0.0
7
-0.2
1
0.4
5
-0.1
6
0.5
1
-0.0
9
-0.1
9
-0.1
6
-0.3
6
-0.0
7
-0.2
3
0.0
2
-0.0
4
0.3
0
0.0
5
-0.0
7
-0.0
7
0.0
7
0.0
4
0.0
5
0.9
3
-0.4
9
0. 6
4
0.7
6
1.0
0
0.0
9
0.4
5
3
0.4
4
-0.0
7
-0.0
4
0.2
6
0.2
5
-0.0
9
0.4
8
-0.0
9
-0.1
3
-0.0
7
-0.3
3
-0.2
1
-0.4
9
0.1
4
0.2
2
0.1
7
-0.1
5
0.0
1
0.3
1
0.0
6
0.1
5
0.1
2
0.9
4
-0.3
5
0. 3
9
1.0
0
0.7
6
0.1
6
0.1
2
4
-0.1
8
-0.1
0
0.0
8
-0.4
2
0.6
9
-0.1
8
0.3
3
-0.1
0
0.1
4
-0.2
5
-0.2
5
-0.1
6
-0.3
1
0.0
9
-0.0
7
0.5
0
0.1
0
-0.0
3
-0.0
8
-0.0
9
-0.0
2
0.0
0
0.5
4
-0.3
0
1.0
0
0.3
9
0. 6
4
0.0
5
0.4
5
5
-0.0
1
0.1
2
0.3
7
0.3
5
-0.2
1
0.4
8
-0.3
3
-0.0
1
0.5
6
-0.0
4
0.1
3
-0.0
2
0.2
1
0.0
7
-0.1
6
-0.2
3
-0.1
2
0.0
1
0.1
4
-0.1
2
-0.0
1
-0.0
4
-0.4
5
1.0
0
-0.3
0
-0.3
5
-0.4
9
-0.0
3
-0.4
0
6
0.3
0
-0.0
7
-0.0
6
0.0
5
0.3
5
-0.1
2
0.5
1
-0.1
0
-0.1
6
-0.1
2
-0.3
7
-0.1
5
-0.3
9
0.0
9
0.1
1
0.2
4
-0.0
6
-0.0
3
0.1
5
0.0
7
0.1
0
0.0
9
1.0
0
-0.4
5
0. 5
4
0.9
4
0.9
3
0.1
3
0.3
0
7
0.0
9
-0.0
7
-0.2
2
0.0
6
-0.0
8
-0.0
3
0.2
9
-0.0
6
0.0
9
-0.2
0
0.0
4
0.0
4
-0.0
7
0.3
2
0.2
0
-0.0
8
0.0
0
0.9
5
0.1
0
0.3
5
0.9
6
1.0
0
0.0
9
-0.0
4
0.0
0
0.1
2
0.0
5
0.9
8
0.0
5
8
0.0
7
-0.0
7
-0.2
4
0.0
9
-0.1
1
0.0
1
0.3
2
-0.0
6
0.1
1
-0.2
1
0.0
5
0.0
4
-0.0
9
0.3
3
0.2
4
-0.1
1
-0.0
4
0.9
1
0.1
4
0.3
5
1.0
0
0.9
6
0.1
0
-0.0
1
-0.0
2
0.1
5
0.0
4
0.9
6
0.0
9
9
-0.0
6
-0.0
1
-0.1
9
0.0
2
-0.1
7
0.2
1
-0.1
3
-0.1
6
-0.1
2
-0.2
9
-0.0
2
0.2
2
0.0
6
0.4
8
0.2
3
-0.2
1
0.0
4
0.3
0
-0.0
5
1.0
0
0.3
5
0.3
5
0.0
7
-0.1
2
-0.0
9
0.0
6
0.0
7
0.3
4
0.0
9
10
0.4
5
-0.1
1
-0.0
6
0.4
2
-0.1
3
0.1
9
0.1
6
-0.0
9
0.2
3
0.1
5
0.0
3
-0.3
7
-0.6
2
0.1
8
0.2
2
0.0
3
0.1
0
0.2
1
1.0
0
-0.0
5
0.1
4
0.1
0
0.1
5
0.1
4
-0.0
8
0.3
1
-0.0
7
0.1
4
-0.2
3
11
0.0
5
-0.0
7
-0.1
9
0.1
1
0.0
3
0.0
3
0.2
4
-0.0
8
0.0
7
-0.2
3
0.1
5
0.0
0
-0.1
4
0.3
1
0.3
4
-0.1
3
-0.0
2
1.0
0
0.2
1
0.3
0
0.9
1
0.9
5
-0.0
3
0.0
1
-0.0
3
0.0
1
-0.0
7
0.9
8
-0.0
5
12
0.3
7
-0.1
9
0.2
0
-0.1
9
-0.0
8
-0.1
3
0.0
9
0.0
1
0.4
4
0.0
8
-0.0
6
0.2
2
-0.0
4
-0.0
1
-0.2
9
0.2
7
1.0
0
-0.0
2
0.1
0
0.0
4
-0.0
4
0.0
0
-0.0
6
-0.1
2
0.1
0
-0.1
5
0.0
5
-0.0
7
0.2
5
13
-0.2
5
-0.2
2
-0.3
0
-0.3
9
0.3
2
-0.4
0
0.3
7
0.0
5
-0.0
9
0.2
5
-0.0
9
-0.3
6
-0.2
9
0.0
6
-0.1
3
1.0
0
0.2
7
-0.1
3
0.0
3
-0.2
1
-0.1
1
-0.0
8
0.2
4
-0.2
3
0.5
0
0.1
7
0.3
0
-0.1
1
0.3
6
14
-0.1
2
0.0
7
-0.3
1
0.2
7
-0.0
9
-0.2
1
0.0
5
-0.1
3
-0.1
6
-0.1
0
-0.0
9
0.1
0
-0.2
7
0.5
2
1.0
0
-0.1
3
-0.2
9
0.3
4
0.2
2
0.2
3
0.2
4
0.2
0
0.1
1
-0.1
6
-0.0
7
0.2
2
-0.0
4
0.2
6
-0.1
7
15
-0.0
5
-0.0
2
-0.1
8
0.3
1
0.1
1
-0.1
7
0.0
7
-0.1
7
0.2
5
-0.2
2
-0.2
3
0.0
7
-0.1
6
1.0
0
0.5
2
0.0
6
-0.0
1
0.3
1
0.1
8
0.4
8
0.3
3
0.3
2
0.0
9
0.0
7
0.0
9
0.1
4
0.0
2
0.2
7
0.0
4
16
Analysis of 2003-2006 data
27
29
28
27
26
25
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
5I: P
erc
en
t. hou
seh
old
s w
ith le
ss th
an 6
0%
of
the n
atio
nal m
edia
n a
nnu
al d
isposable
inco
me
EC
305
7I:
Pe
rce
nt.
ho
usehold
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
306
0I: P
rop
ortio
n o
f hou
seh
old
s re
liant u
po
n s
ocia
l
secu
rity
EC
303
9I: M
edia
n d
ispo
sable
an
nu
al h
ouse
hold
inco
me
(for c
ity o
r NU
TS
3 re
gio
n)
SA
101
8I: P
ropo
rtion o
f dw
ellin
gs la
ckin
g b
asic
am
eni-
ties
SA
101
2I:
Pro
portio
n
of
ho
usehold
s
livin
g
in
socia
l
hou
sin
g
DE
300
8I:
Pro
p.
househ
old
s
that
are
lo
ne
-pensio
ner
hou
seh
old
s
EN
500
1I: G
reen s
pace (in
m2) to
whic
h th
e p
ublic
has
access p
er c
apita
DE
300
5I:
Pro
p.
of
househ
old
s
tha
t are
lo
ne-p
are
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 5
or 6
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I:
Pro
po
rtion
of
Resid
ents
w
ho
are
not
EU
Natio
nals
an
d c
itizens o
f a c
oun
try w
ith a
me
diu
m o
r
low
HD
I
DE
200
5I:
Pro
po
rtion
of
Resid
ents
w
ho
are
not
EU
Natio
nals
an
d c
itizens o
f a c
ou
ntry
with
hig
h H
DI
TE
20
28
I: Pro
p. o
f work
ing a
ge p
opula
tion q
ualifie
d a
t
level 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I: T
ota
l Num
be
r of re
co
rded c
rime
s p
er 1
000
pop
ula
tion
EN
510
1I:
Pop
ula
tion
de
nsity
: to
tal
resid
en
t p
op.
per
squ
are
km
DE
105
5V
: Tota
l Resid
ent P
op
ula
tion 7
5 a
nd o
ve
r
DE
102
8V
: Tota
l Resid
ent P
op
ula
tion 6
5-7
4
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
203
1I:
Cru
de
de
ath
ra
te
of
fem
ale
re
sid
ents
pe
r
100
0 fe
male
resid
ents
SA
203
0I: C
rud
e d
eath
rate
of m
ale
resid
en
ts p
er 1
000
male
resid
ents
SA
201
9I: T
ota
l dea
ths p
er y
ea
r
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pu
-
latio
n
-0.0
2
0.3
8
0.3
7
-0.2
9
-0.1
0
-0.0
4
-0.4
8
0.0
7
-0.0
3
-0.0
1
-0.2
5
0.5
1
1.0
0
-0.1
6
-0.2
7
-0.2
9
-0.0
4
-0.1
4
-0.6
2
0.0
6
-0.0
9
-0.0
7
-0.3
9
0.2
1
-0.3
1
-0.4
9
-0.2
3
-0.0
9
-0.0
3
17
0.1
0
0.0
4
0.4
9
0.0
1
-0.2
9
0.1
3
-0.3
6
-0.2
6
-0.0
1
-0.2
5
-0.2
8
1.0
0
0.5
1
0.0
7
0.1
0
-0.3
6
0.2
2
0.0
0
-0.3
7
0.2
2
0.0
4
0.0
4
-0.1
5
-0.0
2
-0.1
6
-0.2
1
-0.0
7
0.0
0
-0.0
7
18
0.0
4
0.1
8
0.0
2
0.1
8
0.1
7
0.0
9
-0.1
8
0.1
7
-0.2
5
0.0
0
1.0
0
-0.2
8
-0.2
5
-0.2
3
-0.0
9
-0.0
9
-0.0
6
0.1
5
0.0
3
-0.0
2
0.0
5
0.0
4
-0.3
7
0.1
3
-0.2
5
-0.3
3
-0.3
6
0.1
0
0.1
0
19
-0.1
7
-0.1
8
-0.0
4
-0.2
5
- 0.2
1
-0.1
9
0.0
9
0.2
8
-0.0
5
1.0
0
0.0
0
-0.2
5
-0.0
1
-0.2
2
-0.1
0
0.2
5
0.0
8
-0.2
3
0.1
5
-0.2
9
-0.2
1
-0.2
0
-0.1
2
-0.0
4
-0.2
5
-0.0
7
-0.1
6
-0.2
1
-0.1
8
20
0.2
8
0.1
1
0.2
5
0.3
1
0.0
1
0.2
5
0.0
5
-0.0
8
1.0
0
-0.0
5
-0.2
5
-0.0
1
-0.0
3
0.2
5
-0.1
6
-0.0
9
0.4
4
0.0
7
0.2
3
-0.1
2
0.1
1
0.0
9
-0.1
6
0.5
6
0.1
4
-0.1
3
-0.1
9
0.0
5
0.0
2
21
-0.3
4
-0.3
7
0.3
8
-0.2
4
-0.0
6
0.0
2
-0.0
4
1.0
0
-0.0
8
0.2
8
0.1
7
-0.2
6
0.0
7
-0.1
7
-0.1
3
0.0
5
0.0
1
-0.0
8
-0.0
9
-0.1
6
-0.0
6
-0.0
6
-0.1
0
-0.0
1
-0.1
0
-0.0
9
-0.0
9
-0.0
7
0.0
0
22
0.0
9
-0.1
3
-0.3
9
-0.0
2
0.3
1
-0.2
1
1.0
0
-0.0
4
0.0
5
0.0
9
-0.1
8
-0.3
6
-0.4
8
0.0
7
0.0
5
0.3
7
0.0
9
0.2
4
0.1
6
-0.1
3
0.3
2
0.2
9
0.5
1
-0.3
3
0.3
3
0.4
8
0.5
1
0.2
8
0.3
3
23
0.0
2
0.1
7
0.5
3
0.5
2
-0.2
6
1.0
0
-0.2
1
0.0
2
0.2
5
-0.1
9
0.0
9
0.1
3
-0.0
4
-0.1
7
-0.2
1
-0.4
0
-0.1
3
0.0
3
0.1
9
0.2
1
0.0
1
-0.0
3
-0.1
2
0.4
8
-0.1
8
-0.0
9
-0.1
6
0.0
1
-0.3
2
24
-0.1
6
0.0
6
-0.0
6
-0.5
8
1.0
0
-0.2
6
0. 3
1
-0.0
6
0.0
1
-0.2
1
0.1
7
-0.2
9
-0.1
0
0.1
1
-0.0
9
0.3
2
-0.0
8
0.0
3
-0.1
3
-0.1
7
-0.1
1
-0.0
8
0.3
5
-0.2
1
0.6
9
0.2
5
0.4
5
0.0
1
0.4
5
25
0.3
4
0.4
6
-0.1
2
1.0
0
-0.5
8
0.5
2
-0.0
2
-0.2
4
0. 3
1
-0.2
5
0.1
8
0.0
1
-0.2
9
0.3
1
0.2
7
-0.3
9
-0.1
9
0.1
1
0.4
2
0.0
2
0.0
9
0.0
6
0.0
5
0.3
5
-0.4
2
0.2
6
-0.2
1
0.0
4
-0.5
4
26
0.0
1
0.3
0
1.0
0
-0.1
2
-0.0
6
0.5
3
-0.3
9
0.3
8
0.2
5
-0.0
4
0.0
2
0.4
9
0. 3
7
-0.1
8
-0.3
1
-0.3
0
0.2
0
-0.1
9
-0.0
6
-0.1
9
-0.2
4
-0.2
2
-0.0
6
0.3
7
0.0
8
-0.0
4
-0.0
7
-0.1
7
-0.1
5
27
0.8
6
1.0
0
0.3
0
0.4
6
0.0
6
0.1
7
-0.1
3
-0.3
7
0.1
1
-0.1
8
0.1
8
0.0
4
0.3
8
-0.0
2
0.0
7
-0.2
2
-0.1
9
-0.0
7
-0.1
1
-0.0
1
-0.0
7
-0.0
7
-0.0
7
0.1
2
-0.1
0
-0.0
7
-0.0
6
-0.0
7
-0.1
5
28
1.0
0
0.8
6
0.0
1
0.3
4
-0.1
6
0.0
2
0.0
9
-0.3
4
0. 2
8
-0.1
7
0.0
4
0.1
0
-0.0
2
-0.0
5
-0.1
2
-0.2
5
0.3
7
0.0
5
0.4
5
-0.0
6
0.0
7
0.0
9
0.3
0
-0.0
1
-0.1
8
0.4
4
0.0
9
0.0
9
-0.2
3
29
4.1.2 Results from PCA analysis
Table 14 lists the 24 indicators included in the principal component analysis. A total of 14 out of the
24 indicators have been transformed using the equation in Section 3.1.2.
Analysis of 2003-2006 data
28
Table 14: The 24 indicators included in the principal component analysis. A total of 14 out of
the 24 indicators have been transformed using the equation in Section 3.1.2.
No Indicator
Trans-
formed
1 DE1003I: Proportion of females to males in total population
2 SA2016I: Mortality rate for <65 per year
3 DE1040I: Proportion of total population aged 0-4 Y
4 SA2029I: Crude death rate per 1000 residents
5 DE1028V + DE1055V: Total Resident Population 65 and over
6 EN5101I: Population density: total resident pop. per square km
7 SA3001I: Total Number of recorded crimes per 1000 population
8 DE3003I: Total number of households
9 EC1020I: Unemployment rate Y
10
TE2028I: Prop. of working age population qualified at level 3 or
4 ISCED Y
11
DE2005I: Proportion of Residents who are not EU Nationals
and citizens of a country with high HDI Y
12
DE2006I: Proportion of Residents who are not EU Nationals
and citizens of a country with a medium or low HDI Y
13 DE3004I: Average size of households
14
TE2025I: Prop. of working age population qualified at level 1 or
2 ISCED Y
15
TE2031I: Prop. of working age population qualified at level 5 or
6 ISCED Y
16 EN5012I: Proportion of the area in green space Y
17 DE3005I: Prop. of households that are lone-parent households Y
18
EN5001I: Green space (in m2) to which the public has access
per capita
19
DE3008I: Prop. households that are lone-pensioner house-
holds Y
20 SA1012I: Proportion of households living in social housing Y
21 SA1018I: Proportion of dwellings lacking basic amenities Y
22
EC3039I: Median disposable annual household income (for city
or NUTS 3 region)
23 EC3060I: Proportion of households reliant upon social security Y
24
EC3057I: Percent. households with less than half
nat.aver.income Y
Table 15 shows the correlation matrix between the 24 transformed indicators. As some of the most
correlated indicators are excluded compared to the correlation analysis presented in Table 13, dif-
ferent cut-off values for red and green colour have been used in Table 15. Red colour is used to
indicate highly correlated indicators (correlation >0.5 or <-0.5). Green colour is used to indicate in-
dependent indicators (-0.1 < correlation <0.1).
Analysis of 2003-2006 data
29
Table 15: Correlation matrix between the 24 transformed indicators. Red colour: Abs >0.5.
Green colour: Abs<0.1.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
7I:
Pe
rce
nt.
hou
seh
old
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
306
0I:
Pro
po
rtion
of
househ
old
s
relia
nt
upon
so
cia
l
secu
rity
EC
303
9I:
Me
dia
n dis
po
sable
a
nn
ual
hou
seh
old
in
co
me
(for c
ity o
r NU
TS
3 re
gio
n)
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l hou
s-
ing
DE
300
8I:
Pro
p.
househ
old
s
that
are
lo
ne-p
ensio
ner
hou
seh
old
s
EN
500
1I:
Gre
en s
pace (in
m2)
to w
hic
h th
e public
has
access p
er c
apita
DE
300
5I:
Pro
p.
of
ho
use
hold
s
tha
t are
lo
ne
-pare
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 5
or 6
ISC
ED
TE
20
25
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I:
Pro
portio
n of
Re
sid
ents
w
ho are
n
ot
EU
N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I:
Pro
portio
n of
Re
sid
ents
w
ho are
n
ot
EU
N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
TE
20
28
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
pe
r 10
00
pop
ula
tion
EN
510
1I:
Po
pula
tion
density
: to
tal
resid
en
t pop
. per
squ
are
km
DE
102
8V
+ D
E10
55V
: Tota
l Resid
ent P
opula
tion 6
5 a
nd
over
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
DE
100
3I: P
roportio
n o
f fem
ale
s to
male
s in
tota
l popula
-
tion
-0.1
5
-0.1
5
-0.5
4
0.4
5
-0.3
2
0.3
3
0.0
0
0.0
2
-0.1
8
0.1
0
-0.0
7
-0.0
3
0.0
4
-0.1
7
0.3
6
0.2
5
-0.0
5
-0.2
3
0.0
9
0.0
7
0.3
0
-0.4
0
0.4
5
1.0
0
1
-0.1
0
0.0
8
-0.4
2
0.6
9
-0.1
8
0.3
3
-0.1
0
0.1
4
-0.2
5
-0.2
5
-0.1
6
-0.3
1
0.0
9
-0.0
7
0.5
0
0.1
0
-0.0
3
-0.0
8
-0.0
9
-0.0
1
0.5
4
-0.3
0
1.0
0
0.4
5
2
0.1
2
0.3
7
0.3
5
-0.2
1
0.4
8
-0.3
3
-0.0
1
0.5
6
-0.0
4
0.1
3
-0.0
2
0.2
1
0.0
7
-0.1
6
-0.2
3
-0.1
2
0.0
1
0.1
4
-0.1
2
-0.0
3
-0.4
5
1.0
0
-0.3
0
-0.4
0
3
-0.0
7
-0.0
6
0.0
5
0.3
5
-0.1
2
0.5
1
-0.1
0
-0.1
6
-0.1
2
-0.3
7
-0.1
5
-0.3
9
0.0
9
0.1
1
0.2
4
-0.0
6
-0.0
3
0.1
5
0.0
7
0.0
9
1.0
0
-0.4
5
0.5
4
0.3
0
4
-0.0
7
-0.2
3
0.0
7
-0.0
9
-0.0
1
0.3
0
-0.0
6
0.1
0
-0.2
0
0.0
4
0.0
4
-0.0
8
0.3
3
0.2
2
-0.1
0
-0.0
2
0.9
4
0.1
2
0.3
5
1.0
0
0.0
9
-0.0
3
-0.0
1
0.0
7
5
-0.0
1
-0.1
9
0.0
2
-0.1
7
0.2
1
-0.1
3
-0.1
6
-0.1
2
-0.2
9
-0.0
2
0.2
2
0.0
6
0.4
8
0.2
3
-0.2
1
0.0
4
0.3
0
-0.0
5
1.0
0
0.3
5
0.0
7
-0.1
2
-0.0
9
0.0
9
6
-0.1
1
-0.0
6
0.4
2
-0.1
3
0.1
9
0.1
6
-0.0
9
0.2
3
0.1
5
0.0
3
-0.3
7
-0.6
2
0.1
8
0.2
2
0.0
3
0.1
0
0.2
1
1.0
0
-0.0
5
0.1
2
0.1
5
0.1
4
-0.0
8
-0.2
3
7
-0.0
7
-0.1
9
0. 1
1
0.0
3
0.0
3
0.2
4
-0.0
8
0.0
7
-0.2
3
0.1
5
0.0
0
-0.1
4
0.3
1
0.3
4
-0.1
3
-0.0
2
1.0
0
0.2
1
0.3
0
0.9
4
-0.0
3
0.0
1
-0.0
3
-0.0
5
8
-0.1
9
0.2
0
-0.1
9
-0.0
8
-0.1
3
0.0
9
0.0
1
0.4
4
0.0
8
-0.0
6
0.2
2
-0.0
4
-0.0
1
-0.2
9
0.2
7
1.0
0
-0.0
2
0.1
0
0.0
4
-0.0
2
-0.0
6
-0.1
2
0.1
0
0.2
5
9
-0.2
2
-0.3
0
-0.3
9
0.3
2
-0.4
0
0.3
7
0.0
5
-0.0
9
0.2
5
-0.0
9
-0.3
6
-0.2
9
0.0
6
-0.1
3
1.0
0
0.2
7
-0.1
3
0.0
3
-0.2
1
-0.1
0
0.2
4
-0.2
3
0.5
0
0.3
6
10
0.0
7
-0.3
1
0. 2
7
-0.0
9
-0.2
1
0.0
5
-0.1
3
-0.1
6
-0.1
0
-0.0
9
0.1
0
-0.2
7
0.5
2
1.0
0
-0.1
3
-0.2
9
0.3
4
0.2
2
0.2
3
0.2
2
0.1
1
-0.1
6
-0.0
7
-0.1
7
11
-0.0
2
-0.1
8
0. 3
1
0.1
1
-0.1
7
0.0
7
-0.1
7
0.2
5
-0.2
2
-0.2
3
0.0
7
-0.1
6
1.0
0
0.5
2
0.0
6
-0.0
1
0.3
1
0.1
8
0.4
8
0.3
3
0.0
9
0.0
7
0.0
9
0.0
4
12
0.3
8
0.3
7
-0.2
9
-0.1
0
-0.0
4
-0.4
8
0.0
7
-0.0
3
-0.0
1
-0.2
5
0.5
1
1.0
0
-0.1
6
-0.2
7
-0.2
9
-0.0
4
-0.1
4
-0.6
2
0.0
6
-0.0
8
-0.3
9
0.2
1
-0.3
1
-0.0
3
13
0.0
4
0.4
9
0.0
1
-0.2
9
0.1
3
-0.3
6
-0.2
6
-0.0
1
-0.2
5
-0.2
8
1.0
0
0.5
1
0.0
7
0.1
0
-0.3
6
0.2
2
0.0
0
-0.3
7
0.2
2
0.0
4
-0.1
5
-0.0
2
-0.1
6
-0.0
7
14
0.1
8
0.0
2
0.1
8
0.1
7
0.0
9
-0.1
8
0.1
7
-0.2
5
0.0
0
1.0
0
-0.2
8
-0.2
5
-0.2
3
-0.0
9
-0.0
9
-0.0
6
0.1
5
0.0
3
-0.0
2
0.0
4
-0.3
7
0.1
3
-0.2
5
0.1
0
15
-0.1
8
-0.0
4
-0.2
5
-0.2
1
-0.1
9
0.0
9
0.2
8
-0.0
5
1.0
0
0.0
0
-0.2
5
-0.0
1
-0.2
2
-0.1
0
0.2
5
0.0
8
-0.2
3
0.1
5
-0.2
9
-0.2
0
-0.1
2
-0.0
4
-0.2
5
-0.1
8
16
Analysis of 2003-2006 data
30
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
7I:
Pe
rce
nt.
hou
seh
old
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
306
0I:
Pro
po
rtion
of
househ
old
s
relia
nt
upon
so
cia
l
secu
rity
EC
303
9I:
Me
dia
n dis
po
sable
a
nn
ual
hou
seh
old
in
co
me
(for c
ity o
r NU
TS
3 re
gio
n)
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l hou
s-
ing
DE
300
8I:
Pro
p.
househ
old
s
that
are
lo
ne-p
ensio
ner
hou
seh
old
s
EN
500
1I:
Gre
en s
pace (in
m2)
to w
hic
h th
e public
has
access p
er c
apita
DE
300
5I:
Pro
p.
of
ho
use
hold
s
tha
t are
lo
ne
-pare
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 5
or 6
ISC
ED
TE
20
25
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I:
Pro
portio
n of
Re
sid
ents
w
ho are
n
ot
EU
N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I:
Pro
portio
n of
Re
sid
ents
w
ho are
n
ot
EU
N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
TE
20
28
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
pe
r 10
00
pop
ula
tion
EN
510
1I:
Po
pula
tion
density
: to
tal
resid
en
t pop
. per
squ
are
km
DE
102
8V
+ D
E10
55V
: Tota
l Resid
ent P
opula
tion 6
5 a
nd
over
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
DE
100
3I: P
roportio
n o
f fem
ale
s to
male
s in
tota
l popula
-
tion
0.1
1
0.2
5
0.3
1
0.0
1
0.2
5
0.0
5
-0.0
8
1.0
0
-0.0
5
-0.2
5
-0.0
1
-0.0
3
0.2
5
-0.1
6
-0.0
9
0.4
4
0.0
7
0.2
3
-0.1
2
0.1
0
-0.1
6
0.5
6
0.1
4
0.0
2
17
-0.3
7
0.3
8
-0.2
4
-0.0
6
0.0
2
-0.0
4
1.0
0
-0.0
8
0.2
8
0.1
7
-0.2
6
0.0
7
-0.1
7
-0.1
3
0.0
5
0.0
1
-0.0
8
-0.0
9
-0.1
6
-0.0
6
-0.1
0
-0.0
1
-0.1
0
0.0
0
18
-0.1
3
-0.3
9
-0.0
2
0.3
1
-0.2
1
1.0
0
-0.0
4
0.0
5
0.0
9
-0.1
8
-0.3
6
-0.4
8
0.0
7
0.0
5
0.3
7
0.0
9
0.2
4
0.1
6
-0.1
3
0.3
0
0.5
1
-0.3
3
0.3
3
0.3
3
19
0.1
7
0.5
3
0.5
2
-0.2
6
1.0
0
-0.2
1
0.0
2
0.2
5
-0.1
9
0.0
9
0.1
3
-0.0
4
-0.1
7
-0.2
1
-0.4
0
-0.1
3
0.0
3
0.1
9
0.2
1
-0.0
1
-0.1
2
0.4
8
-0.1
8
-0.3
2
20
0.0
6
-0.0
6
-0.5
8
1.0
0
-0.2
6
0.3
1
-0.0
6
0.0
1
-0.2
1
0.1
7
-0.2
9
-0.1
0
0.1
1
-0.0
9
0.3
2
-0.0
8
0.0
3
-0.1
3
-0.1
7
-0.0
9
0.3
5
-0.2
1
0.6
9
0.4
5
21
0.4
6
-0.1
2
1.0
0
-0.5
8
0.5
2
-0.0
2
-0.2
4
0.3
1
-0.2
5
0.1
8
0.0
1
-0.2
9
0.3
1
0.2
7
-0.3
9
-0.1
9
0.1
1
0.4
2
0.0
2
0.0
7
0.0
5
0.3
5
-0.4
2
-0.5
4
22
0.3
0
1.0
0
-0.1
2
-0.0
6
0.5
3
-0.3
9
0.3
8
0.2
5
-0.0
4
0.0
2
0.4
9
0.3
7
-0.1
8
-0.3
1
-0.3
0
0.2
0
-0.1
9
-0.0
6
-0.1
9
-0.2
3
-0.0
6
0.3
7
0.0
8
-0.1
5
23
1.0
0
0.3
0
0. 4
6
0.0
6
0.1
7
-0.1
3
-0.3
7
0.1
1
-0.1
8
0.1
8
0.0
4
0.3
8
-0.0
2
0.0
7
-0.2
2
-0.1
9
-0.0
7
-0.1
1
-0.0
1
-0.0
7
-0.0
7
0.1
2
-0.1
0
-0.1
5
24
The bar-diagrams in Figure 5 (indicator numbers 1-8), Figure 6 (indicator numbers 9-16) and Figure
7 (indicator numbers 17-24) are histograms normalised to estimate probability distribution function
for the 24 transformed indicators. The solid line shows a best fit generalised extreme value distribu-
tion. The best fit parameters are listed in Table 16.
Analysis of 2003-2006 data
31
80 90 100 110 120 1300
0.02
0.04
0.06
0.08
0.1
Transformed value
1.
DE
1003I:
Pro
port
ion o
f fe
male
s
to m
ale
s in t
ota
l popula
tion
0 1 2 3 4 5 60
0.2
0.4
0.6
0.8
1
Transformed value
2.
SA
2016I:
Mort
alit
y r
ate
for
<65 p
er
year
0.02 0.04 0.06 0.08 0.1 0.12 0.140
10
20
30
40
50
60
Transformed value
3.
DE
1040I:
Pro
port
ion o
f to
tal
popula
tion a
ged 0
-4
0 5 10 150
0.05
0.1
0.15
0.2
0.25
0.3
0.35
Transformed value
4.
SA
2029I:
Cru
de d
eath
rate
per
1000 r
esid
ents
0 2 4 6 8 10
x 105
0
0.5
1
1.5
2
2.5x 10
-5
Transformed value
5.
DE
1028V
+ D
E1055V
: T
ota
l
Resid
ent
Popula
tion 6
5 a
nd
over
0 0.5 1 1.5 2 2.5
x 104
0
1
2
3
4x 10
-4
Transformed value
6.
EN
5101I:
Popula
tion d
ensity:
tota
l re
sid
ent
pop.
per
square
km
0 50 100 150 200 2500
0.002
0.004
0.006
0.008
0.01
0.012
Transformed value
7.
SA
3001I:
Tota
l N
um
ber
of
record
ed c
rim
es p
er
1000 p
opula
tion
0 1 2 3 4
x 106
0
2
4
6
8x 10
-6
Transformed value
8.
DE
3003I:
Tota
l num
ber
of
household
s
Figure 5: Probability density function estimates for indicator nos. 1-8 (bars). Best fit General-
ised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
32
0 0.1 0.2 0.3 0.4 0.50
2
4
6
8
10
Transformed value
9.
EC
1020I:
Unem
plo
ym
ent
rate
0 0.5 1 1.5 2 2.50
0.5
1
1.5
2
2.5
Transformed value
10.
TE
2028I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 3 o
r 4 I
SC
ED
0 0.01 0.02 0.03 0.040
20
40
60
80
100
120
140
Transformed value
11.
DE
2005I:
Pro
port
ion o
f
Resid
ents
who a
re n
ot
EU
Nationals
and c
itiz
ens o
f a c
ountr
y w
ith
hig
h H
DI
0 0.1 0.2 0.3 0.4 0.50
5
10
15
20
Transformed value
12.
DE
2006I:
Pro
port
ion o
f
Resid
ents
who a
re n
ot
EU
Nationals
and c
itiz
ens o
f a c
ountr
y w
ith
a m
ediu
m o
r lo
w H
DI
1.5 2 2.5 3 3.50
0.5
1
1.5
2
2.5
Transformed value
13.
DE
3004I:
Avera
ge s
ize o
f
household
s
0 0.5 1 1.50
1
2
3
4
Transformed value
14.
TE
2025I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 1 o
r 2 I
SC
ED
0 0.5 1 1.50
0.5
1
1.5
2
2.5
3
3.5
Transformed value
15.
TE
2031I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 5 o
r 6 I
SC
ED
0 2 4 6 80
0.2
0.4
0.6
0.8
1
1.2
1.4
Transformed value
16.
EN
5012I:
Pro
port
ion o
f
the a
rea in g
reen s
pace
Figure 6: Probability density function estimates for indicator nos. 9-16 (bars). Best fit Gener-
alised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
33
0 0.1 0.2 0.3 0.40
5
10
15
20
Transformed value
17.
DE
3005I:
Pro
p.
of
household
s
that
are
lone-p
are
nt
household
s
0 1000 2000 3000 40000
0.002
0.004
0.006
0.008
0.01
0.012
Transformed value
18.
EN
5001I:
Gre
en s
pace (
in
m2)
to w
hic
h t
he p
ublic
has
access p
er
capita
0 0.2 0.4 0.6 0.80
2
4
6
8
10
12
14
Transformed value
19.
DE
3008I:
Pro
p.
household
s
that
are
lone-p
ensio
ner
household
s
0 0.5 1 1.50
0.5
1
1.5
2
2.5
3
3.5
Transformed value
20.
SA
1012I:
Pro
port
ion o
f
household
s liv
ing in s
ocia
l
housin
g
0 0.1 0.2 0.3 0.40
5
10
15
20
25
30
35
Transformed value
21.
SA
1018I:
Pro
port
ion o
f
dw
elli
ngs lackin
g b
asic
am
enitie
s
0 1 2 3 4 5
x 104
0
0.2
0.4
0.6
0.8
1x 10
-4
Transformed value
22.
EC
3039I:
Media
n d
isposable
annual household
incom
e (
for
city o
r N
UT
S 3
regio
n)
0 0.5 1 1.50
2
4
6
8
10
Transformed value
23.
EC
3060I:
Pro
port
ion o
f
household
s r
elia
nt
upon s
ocia
l
security
0 1 2 3 4 5 60
1
2
3
4
5
Transformed value
24.
EC
3057I:
Perc
ent.
household
s
with less t
han h
alf n
at.
aver.
incom
e
Figure 7: Probability density function estimates for indicator nos. 17-24 (bars). Best fit Gen-
eralised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
34
Table 16: Parameters for best fit Generalised Extreme Value probability density function for
the 24 transformed indicators
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
7I: P
erc
ent. h
ouseh
old
s w
ith le
ss th
an h
alf
nat.a
ve
r.inco
me
EC
306
0I: P
ropo
rtion o
f househ
old
s re
liant u
pon s
ocia
l security
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual h
ouse
hold
incom
e (fo
r city
or N
UT
S 3
regio
n)
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
EN
500
1I: G
ree
n s
pace
(in m
2) to
whic
h th
e p
ublic
has a
ccess p
er
capita
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 5
or 6
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
TE
20
28
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
DE
102
8V
+ D
E10
55V
: To
tal R
esid
en
t Popula
tion 6
5 a
nd
over
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Ind
ica
tor n
am
e
0.2
85
0.4
70
-0.1
37
0.7
49
0.2
43
-0.0
25
0.6
25
0.0
17
0.6
39
-0.0
36
0.0
54
0.0
12
0.2
33
0.5
84
-0.0
69
0.0
93
0.6
96
-0.0
95
0.3
61
0.7
15
-0.2
32
0.1
02
0.0
06
-0.2
18
Sh
ap
e p
a-
ram
e-
ter, ξ
0.0
80
0.0
91
6983
0.0
16
0.1
23
0.0
42
21.3
94
0.0
29
0.3
97
0.1
26
0.1
50
0.2
55
0.0
25
0.0
03
0.3
03
0.0
48
61827
39.6
973
18157
1.7
48
0.0
08
0.5
85
5.5
06
Sca
le
pa
-
ram
e-
ter, σ
0.1
58
0.1
29
13083
0.0
13
0.1
35
0.1
24
19.1
95
0.0
55
0.3
54
0.2
75
0.2
91
2.0
82
0.0
35
0.0
03
0.6
58
0.0
92
72510
62.1
1194
21321
8.5
79
0.0
46
1.9
52
106.0
Lo
ca
-
t ion
pa
r.,
µ.
Be
st fit
0.1
24
0.3
21
-0.2
07
0.4
82
0.0
59
-0.0
95
0.4
64
-0.0
61
0.4
18
-0.1
23
-0.0
53
-0.1
20
0.1
07
0.3
88
-0.1
57
-0.0
09
0.5
45
-0.1
96
0.2
48
0.5
89
-0.2
81
0.0
20
-0.0
65
-0.2
52
Sh
ap
e p
a-
ram
e-
ter, ξ
0.0
65
0.0
74
6131
0.0
13
0.1
05
0.0
37
17.9
72
0.0
26
0.3
29
0.1
13
0.1
34
0.2
26
0.0
22
0.0
03
0.2
73
0.0
43
52649
35.8
862
15819
1.6
15
0.0
07
0.5
37
5.1
28
Sca
le
pa
-
ram
e-
ter, σ
0.1
38
0.1
10
11706
0.0
09
0.1
11
0.1
16
15.5
83
0.0
51
0.2
81
0.2
55
0.2
68
2.0
41
0.0
31
0.0
03
0.6
12
0.0
85
63371
56.4
1059
19058
8.3
77
0.0
45
1.8
81
105.4
Lo
ca
-
t ion
pa
r.,
µ.
5%
co
nfid
ence
inte
rva
l
va
lue
0.4
46
0.6
19
-0.0
67
1.0
15
0.4
27
0.0
45
0.7
86
0.0
95
0.8
60
0.0
50
0.1
60
0.1
44
0.3
59
0.7
80
0.0
20
0.1
94
0.8
48
0.0
07
0.4
73
0.8
42
-0.1
84
0.1
83
0.0
78
-0.1
84
Sh
ap
e p
a-
ram
e-
ter, ξ
0.0
98
0.1
11
7953
0.0
20
0.1
45
0.0
48
25.4
69
0.0
32
0.4
80
0.1
41
0.1
68
0.2
88
0.0
29
0.0
04
0.3
37
0.0
54
72605
43.9
1098
20839
1.8
91
0.0
09
0.6
38
5.9
11
Sca
le
pa
-
ram
e-
ter, σ
0.1
77
0.1
48
14460
0.0
16
0.1
59
0.1
31
22.8
08
0.0
60
0.4
28
0.2
95
0.3
15
2.1
23
0.0
39
0.0
04
0.7
03
0.0
99
81650
67.7
1329
23584
8.7
81
0.0
47
2.0
23
106.6
Lo
ca
-
t ion
pa
r.,
µ.
95
% c
on
fide
nce
inte
rval
va
lue
The eigenvectors with corresponding eigenvalues resulting from the principal component analysis
are shown in Table 17. To show the importance of each indicator in the various eigenvectors, ei-
genvector components with absolute value larger than 0.3 (high importance) are marked in red and
Analysis of 2003-2006 data
35
eigenvector components with absolute value between 0.1 and 0.3 (medium importance) are marked
in green. Indicators with low importance (absolute value less than 0.1) are written in black.
The eigenvectors are sorted by decreasing eigenvalue, showing the eigenvector representing the
largest variation (16.2%) in the data set in the first column and the eigenvector representing the
least variation (0.01%) in column number 24.
The dimensions of the data set can now be reduced by omitting the principal components represent-
ing the least variation in the data set.
The third row in Table 17 shows cumulative explanation of variability as more eigenvectors are in-
cluded in the new data set. For example, including 10 out of the 24 eigenvectors, i.e. all eigenvec-
tors with eigenvalues larger than 0.99, gives a data set containing 81.3 % of the variation in the
original data set.
Table 17: Eigenvectors and eigenvalues resulting from the principal component analysis
sorted by decreasing eigenvalue. Components with absolute value larger than 0.3 marked in
red. Components with absolute value between 0.1 and 0.3 marked in green.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
3057I
EC
3060I
EC
3039I
SA
1018I
SA
1012I
DE
3008I
EN
5001I
DE
3005I
EN
5012I
TE
2031I
TE
2025I
DE
3004I
DE
2006I
DE
2005I
TE
2028I
EC
1020I
DE
3003I
SA
3001I
EN
5101I
DE
1028V
SA
2029I
DE
1040I
SA
2016I
DE
1003I
Ind
icato
r co
de
Eig
en
valu
e
Cu
mu
lativ
e e
xp
lan
atio
n
of v
aria
bility
(%)
Exp
lan
atio
n o
f varia
bil-
ity (%
)
Eig
en
vecto
r no
0.1
3
0.2
1
0.2
7
-0.3
2
0.2
9
-0.2
7
0.0
3
0.1
1
0.0
1
0.0
8
0.1
6
0.1
9
-0.0
5
-0.0
4
-0.3
1
-0.0
5
-0.0
2
0.0
2
0.0
6
-0.0
5
-0.3
1
0.3
4
-0.3
4
-0.3
1
4.0
7
16.2
16.2
1
-0.0
6
0.2
1
-0.3
0
0.1
1
-0.0
1
-0.1
8
0.1
7
-0.0
4
0.1
5
0.0
0
0.0
3
0.2
5
-0.3
3
-0.3
4
0.1
3
0.0
8
-0.3
7
-0.2
5
-0.2
5
-0.4
0
-0.1
1
0.0
1
0.0
6
0.0
9
3.3
9
29.7
13.4
2
-0.0
4
0.0
0
-0.2
4
0.0
6
-0.1
3
-0.1
8
-0.0
4
-0.1
0
-0.2
7
-0.1
2
0.4
4
0.3
8
0.1
6
0.0
9
-0.1
6
0.0
1
0.1
4
-0.4
2
0.3
5
0.1
7
-0.0
8
-0.1
2
0.0
4
0.1
5
2.5
0
39.6
9.9
5
3
0.0
9
-0.3
6
0.0
3
-0.2
3
-0.3
0
-0.0
5
-0.0
1
-0.5
2
0.2
2
0.1
2
-0.0
3
0.0
1
-0.1
2
0.2
0
-0.0
1
-0.2
7
-0.1
4
-0.0
9
0.0
1
-0.1
4
-0.1
0
-0.2
9
-0.3
1
-0.1
4
2.1
9
48.4
8.7
3
4
0.2
9
0.0
5
0.2
6
0.2
2
0.1
1
-0.0
9
-0.2
1
-0.0
7
-0.3
1
-0.1
6
0.0
7
-0.0
4
0.1
3
0.1
8
-0.1
5
-0.2
5
-0.4
0
-0.0
1
0.0
3
-0.4
0
0.2
8
-0.0
2
0.2
4
-0.0
7
1.8
7
55.8
7.4
5
5
0.2
2
0.0
6
0.0
2
0.3
3
0.1
5
0.0
1
0.1
4
-0.2
3
-0.2
5
0.5
0
-0.1
5
-0.0
3
-0.2
8
-0.1
2
-0.1
9
-0.3
3
0.2
8
-0.1
4
-0.1
5
0.2
0
-0.0
4
0.0
2
0.1
0
0.0
8
1.7
7
62.9
7.0
5
6
0.0
2
-0.2
2
-0.1
0
0.2
4
-0.2
5
-0.3
5
-0.2
0
0.2
0
-0.0
7
0.3
0
-0.2
4
0.0
1
0.3
2
0.1
5
0.1
7
-0.0
2
-0.0
5
0.0
6
0.0
2
-0.1
4
-0.4
2
0.2
9
0.0
1
0.1
2
1.3
3
68.2
5.3
2
7
-0.1
7
0.1
9
0.0
4
-0.0
8
0.1
7
-0.1
3
0.2
1
-0.2
0
-0.1
2
0.4
3
0.0
8
-0.3
0
0.0
6
0.0
2
-0.0
1
0.3
3
-0.1
7
0.1
9
0.4
2
-0.1
8
0.0
2
-0.2
1
-0.0
6
0.2
5
1.1
7
72.8
4.6
5
8
-0.4
7
0.2
3
-0.2
3
0.0
6
0.1
3
-0.2
5
0.3
0
-0.2
0
0.0
6
-0.1
6
-0.1
7
0.0
2
0.1
5
0.2
1
0.1
0
-0.4
5
0.0
7
0.1
4
0.1
0
0.0
2
0.1
0
0.1
3
0.1
0
-0.2
3
1.1
5
77.4
4.5
7
9
0.5
2
0.2
2
0.0
3
0. 0
5
-0.2
2
0.1
1
0.6
4
0.1
3
0.2
1
0.0
0
-0.0
1
0.0
3
0.2
9
0.2
1
0.0
8
0.0
2
-0.0
2
-0.0
9
0.0
4
0.0
2
0.0
2
0.0
1
-0.0
1
0.0
1
0.9
9
81.3
3.9
3
10
-0.0
1
-0.1
3
-0.0
5
-0.2
1
0.1
4
0.2
9
0.0
3
0.1
0
0.0
0
-0.0
5
-0.3
2
0.0
3
0.1
6
-0.3
1
-0.0
5
-0.3
7
-0.2
1
-0.2
2
0.3
4
-0.0
1
0.0
8
0.2
1
-0.2
1
0.4
0
0.7
6
84.4
3.0
3
11
0.3
5
0.1
9
0.0
0
-0.1
0
0.2
2
-0.2
1
-0.3
9
-0.2
0
0.4
0
0.0
8
-0.0
5
0.1
1
0.0
1
-0.0
8
0.4
2
0.0
0
0.1
5
-0.0
6
0.2
7
0.1
0
0.1
4
0.0
3
0.2
2
0.0
0
0.7
2
87.2
2.8
7
12
-0.0
9
0.1
7
-0.2
6
0.0
6
0.2
1
0.3
4
-0.2
3
0.0
9
0.3
1
0.1
9
0.3
4
-0.2
9
0.0
2
0.4
3
-0.0
8
-0.1
5
-0.0
8
-0.2
3
-0.1
3
-0.0
7
-0.1
3
0.1
2
-0.0
5
0.1
0
EIG
EN
VE
CT
OR
CO
MP
ON
EN
TS
0.5
4
89.4
2.1
6
13
Analysis of 2003-2006 data
36
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
3057I
EC
3060I
EC
3039I
SA
1018I
SA
1012I
DE
3008I
EN
5001I
DE
3005I
EN
5012I
TE
2031I
TE
2025I
DE
3004I
DE
2006I
DE
2005I
TE
2028I
EC
1020I
DE
3003I
SA
3001I
EN
5101I
DE
1028V
SA
2029I
DE
1040I
SA
2016I
DE
1003I
Ind
icato
r co
de
Eig
en
valu
e
Cu
mu
lativ
e e
xp
lan
atio
n
of v
aria
bility
(%)
Exp
lan
atio
n o
f varia
bil-
ity (%
)
Eig
en
vecto
r no
0.0
9
0.1
6
-0.2
4
0.2
3
-0.1
8
-0.0
5
-0.1
7
0.0
6
0.3
5
-0.0
3
-0.2
0
0.1
7
-0.1
1
0.0
2
-0.6
1
0.0
4
0.0
1
0.3
7
0.1
9
0.0
0
0.1
1
-0.0
8
-0.0
6
0.1
2
0.4
9
91.3
1.9
5
14
0.0
5
-0.1
2
-0.2
3
0.1
0
0.0
3
0.0
8
0.0
0
0.0
7
0.0
5
0.0
0
-0.1
4
-0.3
5
0.1
2
-0.2
8
-0.1
9
0.1
1
-0.0
1
-0.3
0
0.2
9
-0.0
7
-0.1
5
-0.1
1
0.1
7
-0.6
1
0.4
2
93.0
1.6
9
15
0.1
2
-0.3
4
-0.0
1
-0.1
1
0.3
3
-0.2
7
0.2
1
0.2
5
0.1
2
-0.2
6
-0.0
3
-0.1
5
-0.3
1
0.2
0
-0.0
7
-0.1
0
0.0
2
0.0
3
0.1
0
0.0
2
-0.3
0
-0.2
3
0.3
2
0.2
5
0.3
2
94.3
1.2
8
16
0.1
7
0.3
8
-0.2
5
-0.2
9
-0.1
8
0.0
9
-0.1
5
0.0
0
-0.4
2
-0.1
6
-0.3
7
-0.1
2
-0.3
4
0.3
1
0.1
0
0.0
9
0.0
1
-0.0
6
0.1
0
0.0
0
-0.1
5
0.0
3
-0.0
6
-0.0
2
0.3
0
95.5
1.1
9
17
-0.1
6
0.2
0
0.4
6
0.4
4
-0.2
0
-0.0
2
-0.0
3
-0.0
5
0.1
5
-0.2
4
0.0
1
-0.2
0
-0.2
4
-0.0
3
0.1
1
-0.0
3
-0.2
1
-0.1
4
0.2
5
0.2
9
-0.2
4
0.0
5
-0.1
5
0.0
5
0.2
7
96.6
1.0
9
18
0.0
2
-0.2
1
-0.1
0
-0.0
8
-0.2
8
-0.3
0
0.0
9
0.1
9
0.0
2
0.1
1
0.2
1
-0.2
8
-0.3
4
0.0
3
0. 0
0
-0.0
1
0.0
1
-0.1
1
0.1
3
0.0
4
0.5
1
0.4
3
-0.0
4
-0.0
3
0.2
4
97.6
0.9
7
19
0.0
7
-0.0
4
-0.2
1
-0.0
8
0.1
2
-0.1
4
-0.0
3
-0.0
1
-0.0
6
0.1
1
-0.0
5
0.0
3
0.1
1
0.0
4
-0.0
4
0.0
6
-0.6
3
0.0
4
-0.1
8
0.6
5
0.0
6
-0.0
7
0.0
1
-0.0
9
0.2
0
98.4
0.8
1
20
-0.2
2
-0.1
6
0.2
2
0.0
9
0.1
9
0.1
0
0.0
6
0.0
8
0.0
9
0.1
7
-0.3
7
0.3
8
-0.1
2
0.4
1
-0.0
6
0.3
3
-0.0
4
-0.3
6
0.0
8
-0.0
4
0.2
1
0.0
5
0.0
4
-0.1
1
0.1
7
99.0
0.6
7
21
-0.0
1
0.1
7
0.0
4
0. 0
2
0.0
9
-0.4
3
-0.1
0
0.1
5
0.0
4
-0.1
1
-0.2
1
-0.2
6
0.2
2
-0.0
2
-0.0
8
0.0
1
0.1
8
-0.3
5
-0.2
9
-0.0
2
0.2
1
-0.3
3
-0.3
8
0.1
4
0.1
4
99.6
0.5
6
22
0.2
3
-0.2
5
-0.2
8
0.3
8
0.3
7
0.0
3
0.0
8
-0.2
8
-0.0
9
-0.2
8
0.0
2
-0.0
4
-0.0
3
0.0
6
0.1
3
0.2
5
0.0
5
0.1
2
0.0
4
-0.0
4
0.0
2
0.2
2
-0.4
4
-0.0
1
0.0
9
99.9
0.3
5
23
-0.0
2
0.0
0
0.0
9
-0.1
8
-0.0
7
-0.0
7
0.0
4
-0.5
0
0.1
1
-0.2
0
-0.1
1
-0.2
0
0.1
8
-0.0
1
-0.3
3
0.2
7
0.0
3
-0.1
6
-0.2
1
0.0
4
-0.0
9
0.4
0
0.3
1
0.2
2
0.0
0
100
0.0
1
24
Table 18, which shows correlation between the principal components (PC1 to PC24) and each of
the 24 original indicators, can be used to find which indicators dominate each of the principal com-
ponents. It can be seen that on average, decreasing correlation between the indicators and the
principal components is found for increasing principal component number.
Analysis of 2003-2006 data
37
Table 18: Correlation between principal components (PC1 to PC24) and each of the original
indicators. Green: Strong positive correlation. Yellow: Low correlation. Red: Strong negative
correlation.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
7I: P
erc
ent. h
ouseh
old
s w
ith le
ss th
an h
alf
nat.a
ve
r.inco
me
EC
306
0I: P
ropo
rtion o
f househ
old
s re
liant u
pon s
ocia
l secu
rity
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual h
ouse
hold
in-
com
e (fo
r city
or N
UT
S 3
regio
n)
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
eni-
ties
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l hou
sin
g
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner
hou
seh
old
s
EN
500
1I: G
ree
n s
pace
(in m
2) to
whic
h th
e p
ublic
has
access p
er c
apita
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 5
or 6
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
N
atio
nals
an
d c
itizens o
f a c
ou
ntry
with
a m
ediu
m o
r lo
w H
DI
DE
200
5I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
N
atio
nals
an
d c
itizens o
f a c
ou
ntry
with
hig
h H
DI
TE
20
28
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0
pop
ula
tion
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er
squ
are
km
DE
102
8V
+ D
E10
55V
: To
tal R
esid
en
t Popula
tion 6
5
and
over
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pu
-la
tion
IND
ICA
OT
RS
-0.2
6
-0.4
1
-0.5
4
0.6
4
-0.5
9
0.5
5
-0.0
6
-0.2
1
-0.0
2
-0.1
6
-0.3
3
-0.3
8
0.0
9
0.0
8
0.6
2
0.1
1
0.0
4
-0.0
4
-0.1
1
0.1
0
0.6
2
-0.6
8
0.6
9
0.6
2
PC
1
0.1
1
-0.3
9
0.5
5
-0.2
0
0.0
2
0.3
3
-0.3
2
0.0
6
-0.2
7
-0.0
1
-0.0
5
-0.4
6
0.6
1
0.6
3
-0.2
4
-0.1
5
0.6
8
0.4
6
0.4
6
0.7
3
0.2
1
-0.0
3
-0.1
2
-0.1
7
PC
2
-0.0
7
0.0
1
-0.3
7
0.1
0
-0.2
1
-0.2
8
-0.0
6
-0.1
6
-0.4
3
-0.1
9
0.6
9
0.6
0
0.2
5
0.1
4
-0.2
5
0.0
2
0.2
2
-0.6
6
0.5
6
0.2
6
-0.1
2
-0.1
9
0.0
6
0.2
4
PC
3
-0.1
4
0.5
3
-0.0
4
0.3
3
0.4
4
0.0
8
0.0
1
0.7
6
-0.3
2
-0.1
7
0.0
5
-0.0
2
0.1
8
-0.3
0
0.0
1
0.3
9
0.2
1
0.1
3
-0.0
2
0.2
1
0.1
4
0.4
3
0.4
6
0.2
1
PC
4
-0.3
9
-0.0
7
-0.3
6
-0.3
0
-0.1
5
0.1
2
0.2
8
0.0
9
0.4
2
0.2
2
-0.0
9
0.0
5
-0.1
7
-0.2
4
0.2
1
0.3
4
0.5
5
0.0
1
-0.0
5
0.5
5
-0.3
9
0.0
2
-0.3
3
0.1
0
PC
5
0.2
9
0.0
8
0.0
3
0.4
4
0.2
1
0.0
1
0.1
8
-0.3
0
-0.3
3
0.6
6
-0.1
9
-0.0
5
-0.3
7
-0.1
6
-0.2
5
-0.4
3
0.3
7
-0.1
9
-0.2
0
0.2
7
-0.0
5
0.0
3
0.1
4
0.1
0
PC
6
-0.0
2
0.2
6
0.1
1
-0.2
7
0.2
9
0.4
0
0.2
3
-0.2
3
0.0
8
-0.3
5
0.2
8
-0.0
2
-0.3
7
-0.1
7
-0.2
0
0.0
2
0.0
6
-0.0
7
-0.0
2
0.1
6
0.4
8
-0.3
4
-0.0
1
-0.1
4
PC
7
-0.1
8
0.2
0
0.0
4
-0.0
9
0.1
8
-0.1
4
0.2
3
-0.2
2
-0.1
3
0.4
7
0.0
8
-0.3
3
0.0
7
0.0
2
-0.0
1
0.3
6
-0.1
9
0.2
0
0.4
6
-0.1
9
0.0
3
-0.2
2
-0.0
6
0.2
7
PC
8
0.5
0
-0.2
5
0.2
4
-0.0
6
-0.1
4
0.2
6
-0.3
2
0.2
1
-0.0
6
0.1
7
0.1
9
-0.0
2
-0.1
6
-0.2
3
-0.1
1
0.4
8
-0.0
7
-0.1
5
-0.1
1
-0.0
2
-0.1
0
-0.1
4
-0.1
1
0.2
5
PC
9
0.5
2
0.2
2
0.0
3
0.0
5
-0.2
2
0.1
1
0.6
3
0.1
3
0.2
1
0.0
0
-0.0
1
0.0
3
0.2
8
0.2
1
0.0
8
0.0
2
-0.0
2
-0.0
9
0.0
4
0.0
2
0.0
2
0.0
1
-0.0
1
0.0
1
PC
10
0.0
0
-0.1
1
-0.0
5
-0.1
8
0.1
2
0.2
5
0.0
2
0.0
8
0.0
0
-0.0
4
-0.2
8
0.0
3
0.1
4
-0.2
7
-0.0
4
-0.3
2
-0.1
8
-0.1
9
0.2
9
-0.0
1
0.0
7
0.1
8
-0.1
8
0.3
5
PC
11
0.2
9
0.1
6
0.0
0
-0.0
9
0.1
9
-0.1
7
-0.3
3
-0.1
7
0.3
4
0.0
7
-0.0
4
0.0
9
0.0
1
-0.0
7
0.3
6
0.0
0
0.1
2
-0.0
5
0.2
3
0.0
9
0.1
2
0.0
3
0.1
9
0.0
0
PC
12
-0.0
7
0.1
3
-0.1
9
0.0
5
0.1
5
0.2
5
-0.1
7
0.0
7
0.2
3
0.1
4
0.2
5
-0.2
1
0.0
1
0.3
2
-0.0
6
-0.1
1
-0.0
6
-0.1
7
-0.1
0
-0.0
5
-0.1
0
0.0
9
-0.0
4
0.0
8
PC
13
-0.0
6
-0.1
1
0.1
7
-0.1
6
0.1
2
0.0
3
0.1
2
-0.0
4
-0.2
5
0.0
2
0.1
4
-0.1
2
0.0
8
-0.0
1
0.4
3
-0.0
3
-0.0
1
-0.2
6
-0.1
3
0.0
0
-0.0
8
0.0
6
0.0
4
-0.0
8
PC
14
-0.0
3
0.0
8
0.1
5
-0.0
6
-0.0
2
-0.0
5
0.0
0
-0.0
5
-0.0
3
0.0
0
0.0
9
0.2
3
-0.0
8
0.1
9
0.1
2
-0.0
7
0.0
1
0.2
0
-0.1
9
0.0
5
0.1
0
0.0
7
-0.1
1
0.4
0
PC
15
-0.0
7
0.1
9
0.0
0
0.0
6
-0.1
9
0.1
5
-0.1
2
-0.1
4
-0.0
7
0.1
5
0.0
1
0.0
8
0.1
7
-0.1
1
0.0
4
0.0
6
-0.0
1
-0.0
1
-0.0
6
-0.0
1
0.1
7
0.1
3
-0.1
8
-0.1
4
PC
16
Analysis of 2003-2006 data
38
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
EC
305
7I: P
erc
ent. h
ouseh
old
s w
ith le
ss th
an h
alf
nat.a
ve
r.inco
me
EC
306
0I: P
ropo
rtion o
f househ
old
s re
liant u
pon s
ocia
l secu
rity
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual h
ouse
hold
in-
com
e (fo
r city
or N
UT
S 3
regio
n)
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
eni-
ties
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l hou
sin
g
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner
hou
seh
old
s
EN
500
1I: G
ree
n s
pace
(in m
2) to
whic
h th
e p
ublic
has
access p
er c
apita
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt
hou
seh
old
s
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
TE
20
31
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 5
or 6
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 1
or 2
ISC
ED
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
200
6I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
N
atio
nals
an
d c
itizens o
f a c
ou
ntry
with
a m
ediu
m o
r lo
w H
DI
DE
200
5I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
N
atio
nals
an
d c
itizens o
f a c
ou
ntry
with
hig
h H
DI
TE
20
28
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t le
vel 3
or 4
ISC
ED
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0
pop
ula
tion
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er
squ
are
km
DE
102
8V
+ D
E10
55V
: To
tal R
esid
en
t Popula
tion 6
5
and
over
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pu
-la
tion
IND
ICA
OT
RS
-0.0
9
-0.2
1
0.1
4
0.1
6
0.1
0
-0.0
5
0.0
8
0.0
0
0.2
3
0.0
9
0.2
0
0.0
7
0.1
8
-0.1
7
-0.0
5
-0.0
5
-0.0
1
0.0
3
-0.0
5
0.0
0
0.0
8
-0.0
2
0.0
3
0.0
1
PC
17
0.0
8
-0.1
0
-0.2
4
-0.2
3
0.1
0
0.0
1
0.0
2
0.0
3
-0.0
8
0.1
3
0.0
0
0.1
1
0.1
3
0.0
2
-0.0
6
0.0
2
0.1
1
0.0
7
-0.1
3
-0.1
5
0.1
3
-0.0
3
0.0
8
-0.0
2
PC
18
0.0
1
-0.1
0
-0.0
5
-0.0
4
-0.1
4
-0.1
5
0.0
4
0.0
9
0.0
1
0.0
5
0.1
0
-0.1
4
-0.1
7
0.0
2
0.0
0
-0.0
1
0.0
0
-0.0
5
0.0
6
0.0
2
0.2
5
0.2
1
-0.0
2
-0.0
2
PC
19
-0.0
3
0.0
2
0.0
9
0.0
3
-0.0
5
0.0
6
0.0
1
0.0
0
0.0
3
-0.0
5
0.0
2
-0.0
1
-0.0
5
-0.0
2
0.0
2
-0.0
3
0.2
8
-0.0
2
0.0
8
-0.2
9
-0.0
3
0.0
3
-0.0
1
0.0
4
PC
20
-0.0
9
-0.0
6
0.0
9
0.0
4
0.0
8
0.0
4
0.0
2
0.0
3
0.0
4
0.0
7
-0.1
5
0.1
5
-0.0
5
0.1
7
-0.0
2
0.1
4
-0.0
1
-0.1
5
0.0
3
-0.0
2
0.0
9
0.0
2
0.0
2
-0.0
5
PC
21
0.0
0
-0.0
6
-0.0
1
-0.0
1
-0.0
3
0.1
6
0.0
4
-0.0
6
-0.0
2
0.0
4
0.0
8
0.1
0
-0.0
8
0.0
1
0.0
3
0.0
0
-0.0
7
0.1
3
0.1
1
0.0
1
-0.0
8
0.1
2
0.1
4
-0.0
5
PC
22
-0.0
7
0.0
7
0.0
8
-0.1
1
-0.1
1
-0.0
1
-0.0
2
0.0
8
0.0
3
0.0
8
-0.0
1
0.0
1
0.0
1
-0.0
2
-0.0
4
-0.0
7
-0.0
2
-0.0
4
-0.0
1
0.0
1
-0.0
1
-0.0
6
0.1
3
0.0
0
PC
23
0.0
0
0.0
0
0.0
0
-0.0
1
0.0
0
0.0
0
0.0
0
-0.0
2
0.0
0
-0.0
1
0.0
0
-0.0
1
0.0
1
0.0
0
-0.0
1
0.0
1
0.0
0
-0.0
1
-0.0
1
0.0
0
0.0
0
0.0
2
0.0
1
0.0
1
PC
24
Principal component number one is strongly correlated with the following indicators:
Positive:
• Correlation = +0.69: SA2016I: Mortality rate for <65 per year
• Correlation = +0.64:SA1018I: Proportion of dwellings lacking basic amenities
• Correlation = +0.62: DE1003I: Proportion of females to males in total population
• Correlation = +0.62: SA2029I: Crude death rate per 1000 residents
• Correlation = +0.62: TE2028I: Prop. of working age population qualified at level 3 or 4
ISCED
Negative:
• Correlation = -0.68: DE1040I: Proportion of total population aged 0-4
• Correlation = -0.59: SA1012I: Proportion of households living in social housing
• Correlation = -0.54: EC3039I: Median disposable annual household income (for city or NUTS
3 region)
Analysis of 2003-2006 data
39
The strongly correlated indicators would be characteristic for an on average poor population. Princi-
pal component no 1 could therefore subjectively be termed Proportion of poor people.
The results of a similar analysis for the first eight principal components, representing close to 75%
of the variation in the data, can be found in Table 19. Along with the subjective descriptor is listed
the indicator most strongly correlated (either positive or negative) with the principal component and
the corresponding correlation value.
Table 19: Subjective labelling of the first eight principal components representing close to
75% of the variation in the data
No Subjective descriptor for principal com-
ponent Strongest correlated indicator
Strongest
correlation
value
1 Proportion of poor people SA2016I: Mortality rate for <65 per
year +0.69
2 Proportion of old people DE1028V + DE1055V: Total Resident
Population 65 and over +0.73
3 Proportion of uneducated people TE2025I: Prop. of working age popu-
lation qualified at level 1 or 2 ISCED +0.69
4 Proportion of households that are lone-
parent households
DE3005I: Prop. of households that are
lone-parent households +0.76
5 Size of sub-city district DE3003I: Total number of households +0.55
6 Proportion of highly educated people TE2031I: Prop. of working age popu-
lation qualified at level 5 or 6 ISCED +0.66
7 Crude death rate per 1000 residents SA2029I: Crude death rate per 1000
residents +0.48
8 Population density EN5101I: Population density: total
resident pop. per square km +0.46
4.2 SUB-CITY LEVEL ANALYSIS
The spatial resolution of interest for determining the need and capacity for setting up emergency
shelters and health facilities in a post earthquake situation is normally sub-city. Although not all data
of interest (see Table 11 on page 21) was available in the Urban Audit for 2003-2006, it was de-
cided to analyse the available sub-city level data in order to be able to make comparisons of correla-
tion and principal components to the city level data. A two step procedure was followed for the sub-
city level analysis:
1. The first step was to make a correlation analysis at sub-city level for the same 24 indicators that were included in the principal component analysis at city-level.
2. The second step was to exclude indicators that either were highly correlated to other indica-tors, or indicators with too little data and then perform a principal component analysis.
Details for each of the two steps can be found below.
4.2.1 Results from correlation analysis
Table 20 shows the 24 indicators included in the correlation analysis. A total of 14 out of the 24 indi-
cators have been transformed using the equation in Section 3.1.2.
Analysis of 2003-2006 data
40
Table 20: The 24 indicators included in the correlation analysis. A total of 14 out of the 24
indicators have been transformed using the equation in Section 3.1.2.
No Indicator description Trans-formed
1 DE1003I: Proportion of females to males in total population
2 DE1028V + DE1055V: Total Resident Population 65 and over
3 DE1040I: Proportion of total population aged 0-4 Y
4 DE2005I: Proportion of Residents who are not EU Nationals and citi-zens of a country with high HDI
Y
5 DE2006I: Proportion of Residents who are not EU Nationals and citi-zens of a country with a medium or low HDI
Y
6 DE3003I: Total number of households
7 DE3004I: Average size of households
8 DE3005I: Prop. of households that are lone-parent households Y
9 DE3008I: Prop. households that are lone-pensioner households Y
10 EC1020I: Unemployment rate Y
11 EC3039I: Median disposable annual household income (for city or NUTS 3 region)
12 EC3057I: Percent. households with less than half nat.aver.income Y
13 EC3060I: Proportion of households reliant upon social security Y
14 EN5001I: Green space (in m2) to which the public has access per capita
15 EN5012I: Proportion of the area in green space Y
16 EN5101I: Population density: total resident pop. per square km
17 SA1012I: Proportion of households living in social housing Y
18 SA1018I: Proportion of dwellings lacking basic amenities Y
19 SA2016I: Mortality rate for <65 per year
20 SA2029I: Crude death rate per 1000 residents
21 SA3001I: Total Number of recorded crimes per 1000 population
22 TE2025I: Prop. of working age population qualified at level 1 or 2 ISCED
Y
23 TE2028I: Prop. of working age population qualified at level 3 or 4 ISCED
Y
24 TE2031I: Prop. of working age population qualified at level 5 or 6 ISCED
Y
Table 21 shows the number of sub-city districts (out of 2466) having data for two paired indicators.
Green colour is used for indicator pairs with more than 1200 data points. Red colour is used for indi-
cator pair with less than 200 data points. The diagonal in Table 21 gives more details on the number
of districts (out of the 2466) that have collected data on the various indicators. The indicator with
most data is no. 1 with 2188 out of the 2466 districts having collected data. Apart from indicator no.
2 DE1028V + DE1055V: Total Resident Population 65 and over which has no data at sub-city dis-
trict level, indicator no. 12, EC3057I: Percent. households with less than half nat. aver. income is the
indicator with the least data (216 out of the 2466 districts).
The off-diagonal elements in Table 12 show how much data can be found on each pair of indicators.
The most data can be found for indicator pair 1 and 3 with 2128 districts having corresponding data.
Several data pairs have no data at all. This is the case for the six combinations of indicators 12 and
13 with indicators 22, 23 and 24.
Analysis of 2003-2006 data
41
Analysis of 2003-2006 data
42
Table 21: No of sub-city districts (out of 2466) having data for two paired indicators. Green
colour: Indicator pair with more than 1200 data points. Red colour: Indicator pair with less
than 200 data points.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
5 o
r 6 IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
3 o
r 4 IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
1 o
r 2 IS
CE
D
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
per
100
0
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I:
Pop
ula
tion
den
sity
: to
tal
resid
en
t p
op.
pe
r
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en space (in
m
2)
to w
hic
h th
e pu
blic
h
as
access p
er c
apita
EC
306
0I:
Pro
portio
n
of
ho
use
hold
s
relia
nt
up
on
so
cia
l
secu
rity
EC
305
7I:
Perc
en
t. ho
use
hold
s
with
le
ss
than
h
alf
nat.a
ve
r.inco
me
EC
303
9I:
Media
n
dis
posa
ble
an
nual
hou
seh
old
in
com
e
(for c
ity o
r NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
hou
seh
old
s
tha
t a
re
lon
e-p
ensio
ne
r
hou
seh
old
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
-
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E105
5V
: To
tal R
esid
ent P
opula
tion 6
5 a
nd
over
DE
100
3I: P
roportio
n o
f fem
ale
s to
ma
les in
tota
l po
pula
-
tion
543
542
536
584
104
2
104
1
100
9
117
9
149
1
664
342
589
215
340
127
4
138
0
154
6
154
0
159
4
192
1
192
1
212
8
0
218
8
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
517
516
510
584
104
2
100
4
987
115
8
148
5
638
336
584
210
326
124
8
135
9
152
5
153
2
157
3
191
5
191
5
212
8
3
442
441
438
488
914
856
972
115
0
137
1
569
269
559
200
282
119
2
136
1
149
4
149
4
151
8
198
1
198
1
4
442
441
438
488
914
856
972
115
0
137
1
569
269
559
200
282
119
2
136
1
149
4
149
4
151
8
198
1
5
361
361
358
399
746
689
996
125
4
127
8
408
221
589
215
404
120
1
145
9
174
3
173
7
179
2
6
356
356
353
387
730
684
982
123
6
126
1
397
221
584
215
388
116
1
142
8
171
2
173
7
7
332
331
331
368
711
665
996
125
4
126
7
397
211
589
215
393
117
7
145
8
174
3
8
300
300
300
310
574
528
996
110
8
103
9
351
199
491
215
275
104
8
145
9
9
368
368
368
268
635
559
658
862
898
257
65
381
8
124
133
4
10
8
8
8
292
278
299
213
261
432
304
245
275
216
459
11
0
0
0
196
166
187
201
201
204
210
199
215
216
12
0
0
0
368
528
549
201
555
576
360
199
589
13
135
135
99
274
288
311
232
195
344
344
377
14
Analysis of 2003-2006 data
43
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
5 o
r 6 IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
3 o
r 4 IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
1 o
r 2 IS
CE
D
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
per
100
0
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I:
Pop
ula
tion
den
sity
: to
tal
resid
en
t p
op.
pe
r
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en space (in
m
2)
to w
hic
h th
e pu
blic
h
as
access p
er c
apita
EC
306
0I:
Pro
portio
n
of
ho
use
hold
s
relia
nt
up
on
so
cia
l
secu
rity
EC
305
7I:
Perc
en
t. ho
use
hold
s
with
le
ss
than
h
alf
nat.a
ve
r.inco
me
EC
303
9I:
Media
n
dis
posa
ble
an
nual
hou
seh
old
in
com
e
(for c
ity o
r NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
hou
seh
old
s
tha
t a
re
lon
e-p
ensio
ne
r
hou
seh
old
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
-
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E105
5V
: To
tal R
esid
ent P
opula
tion 6
5 a
nd
over
DE
100
3I: P
roportio
n o
f fem
ale
s to
ma
les in
tota
l po
pula
-
tion
129
128
93
360
559
591
241
353
670
697
15
116
115
79
521
831
829
762
105
1
163
6
16
81
80
80
368
607
569
758
125
4
17
305
305
258
207
163
188
106
4
18
267
267
264
549
977
104
1
19
247
247
244
529
110
2
20
108
109
106
607
21
559
559
559
22
611
612
23
614
24
Table 22 shows the correlation matrix between the 24 transformed indicators. Red colour is used to
indicate highly correlated indicators (correlation >0.5 or <-0.5). Green colour is used to indicate rela-
tively independent indicators (-0.1 < correlation <0.1). It can be seen that only a few indicator pairs
are highly correlated (abs > 0.5), a larger number of indicator pairs are somewhat correlated (0.1 <
abs < 0.5) whereas the majority of indicator pairs are virtually independent (abs<0.1).
Analysis of 2003-2006 data
44
Table 22: Correlation matrix between the 24 transformed indicators. Red colour: Abs >0.5.
Green colour: Abs<0.1.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 5
or 6
ISC
ED
TE
20
28
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 3
or 4
ISC
ED
TE
20
25
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 1
or 2
ISC
ED
SA
300
1I:
Tota
l N
um
be
r of
record
ed
crim
es
per
10
00
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
ropo
rtion o
f househ
old
s liv
ing in
socia
l ho
us-
ing
EN
510
1I:
Pop
ula
tion
de
nsity
: to
tal
resid
ent
po
p.
per
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en sp
ace (in
m
2) to
wh
ich th
e p
ublic
has
access p
er c
apita
EC
306
0I:
Pro
portio
n
of
househ
old
s
relia
nt
up
on
socia
l secu
rity
EC
305
7I:
Pe
rce
nt.
ho
use
hold
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
303
9I:
Me
dia
n dis
posa
ble
a
nn
ual
hou
seh
old
in
co
me
(fo
r city
or N
UT
S 3
regio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
ho
use
hold
s
that
are
lo
ne
-pensio
ner
hou
seh
old
s
DE
300
5I:
Pro
p.
of
household
s
that
are
lo
ne-p
are
nt
hou
seh
old
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I:
Pro
po
rtion of
Resid
ents
w
ho a
re no
t E
U N
a-
tion
als
and
citiz
ens o
f a c
ountry
with
a m
ediu
m o
r low
H
DI
DE
200
5I:
Pro
po
rtion of
Resid
ents
w
ho a
re no
t E
U N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E1
05
5V
: Tota
l Resid
ent P
opula
tion 6
5 a
nd
over
DE
100
3I: P
rop
ortio
n o
f fem
ale
s to
male
s in
tota
l pop
ula
-tio
n
Indic
ato
r nam
e
0.2
5
0.1
1
-0.1
8
-0.2
3
0.1
7
0.0
2
-0.1
6
-0.1
3
0.2
7
0.0
4
0.1
2
-0.0
8
-0.3
5
-0.2
2
-0.1
0
0.4
6
-0.2
4
0.0
2
-0.0
6
-0.1
9
-0.0
6
-0.2
8
NA
1.0
0
1
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
2
-0.0
8
-0.2
5
0. 1
3
-0.2
4
-0.3
4
-0.3
7
-0.0
1
0.3
4
-0.2
7
-0.0
2
-0.1
6
0.4
2
-0.1
0
0.5
2
0.0
2
-0.4
7
0.4
2
0.1
0
0.1
5
-0.0
2
-0.1
7
1.0
0
3
0.1
1
-0.1
5
-0.0
5
0.1
4
0.0
3
0.1
2
-0.0
3
-0.1
8
0.1
0
-0.1
2
0.0
2
-0.4
0
0.3
1
-0.0
8
-0.1
4
0.2
0
-0.0
5
-0.2
7
0.1
4
0.3
7
1.0
0
4
-0.1
9
-0.0
1
0.1
5
0.1
3
-0.0
1
0.2
8
0.0
6
0.0
5
0.1
6
-0.1
3
-0.1
3
-0.1
0
0.6
2
-0.3
5
0.0
8
0.2
1
0.1
1
-0.2
0
0.1
4
1.0
0
5
-0.0
1
-0.4
3
-0.0
7
0.1
7
-0.0
4
0.0
3
-0.2
1
0.0
0
-0.1
2
0.0
1
-0.1
3
-0.4
3
0.3
6
-0.4
9
-0.1
2
0.3
4
0.2
1
-0.2
0
1.0
0
6
-0.3
8
-0.0
2
0.5
2
-0.2
9
-0.1
3
-0.0
7
0.2
3
-0.0
9
0.2
3
0. 1
4
0.1
7
0.2
3
-0.4
8
0.1
6
0.0
2
-0.3
1
-0.2
5
1.0
0
7
-0.1
7
-0.4
1
0.1
9
-0.0
4
-0.1
1
0.2
0
-0.1
3
0.3
5
-0.4
6
-0.0
6
-0.1
2
-0.0
3
0.3
4
-0.2
1
0.0
5
0.0
7
1.0
0
8
0.0
9
0.2
3
-0.2
2
0.1
1
0.4
0
0.6
1
-0.0
5
-0.2
2
0.1
1
0.0
5
0.0
5
-0.4
2
-0.1
4
-0.6
5
0.1
0
1.0
0
9
-0.3
2
0.2
2
0.2
6
0.1
0
0.1
7
0.4
2
0.0
8
0.0
9
0.0
3
-0.0
4
0.0
2
0.6
3
-0.1
1
-0.3
4
1.0
0
10
0.7
1
0.3
4
-0.7
6
-0.1
0
-0.1
8
-0.7
0
-0.4
9
0.2
0
0.1
0
-0.1
2
-0.4
7
0.4
5
-0.7
8
1.0
0
11
NA
NA
NA
0.1
0
-0.0
8
-0.0
4
0.1
9
0.3
9
0.5
7
-0.2
5
-0.2
8
0.1
1
1.0
0
12
NA
NA
NA
-0.1
6
-0.2
8
-0.4
5
0.1
2
0.6
5
0.1
5
-0.0
9
-0.0
4
1.0
0
13
-0.2
1
0.1
4
0.0
7
-0.0
4
0.0
6
0.3
3
0.0
7
-0.0
6
-0.4
6
0. 3
8
1.0
0
14
Analysis of 2003-2006 data
45
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 5
or 6
ISC
ED
TE
20
28
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 3
or 4
ISC
ED
TE
20
25
I: P
rop.
of
work
ing
age
po
pu
latio
n
qu
alifie
d
at
level 1
or 2
ISC
ED
SA
300
1I:
Tota
l N
um
be
r of
record
ed
crim
es
per
10
00
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
ropo
rtion o
f househ
old
s liv
ing in
socia
l ho
us-
ing
EN
510
1I:
Pop
ula
tion
de
nsity
: to
tal
resid
ent
po
p.
per
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en sp
ace (in
m
2) to
wh
ich th
e p
ublic
has
access p
er c
apita
EC
306
0I:
Pro
portio
n
of
househ
old
s
relia
nt
up
on
socia
l secu
rity
EC
305
7I:
Pe
rce
nt.
ho
use
hold
s
with
le
ss
than
half
nat.a
ve
r.inco
me
EC
303
9I:
Me
dia
n dis
posa
ble
a
nn
ual
hou
seh
old
in
co
me
(fo
r city
or N
UT
S 3
regio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
ho
use
hold
s
that
are
lo
ne
-pensio
ner
hou
seh
old
s
DE
300
5I:
Pro
p.
of
household
s
that
are
lo
ne-p
are
nt
hou
seh
old
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I:
Pro
po
rtion of
Resid
ents
w
ho a
re no
t E
U N
a-
tion
als
and
citiz
ens o
f a c
ountry
with
a m
ediu
m o
r low
H
DI
DE
200
5I:
Pro
po
rtion of
Resid
ents
w
ho a
re no
t E
U N
a-
tion
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E1
05
5V
: Tota
l Resid
ent P
opula
tion 6
5 a
nd
over
DE
100
3I: P
rop
ortio
n o
f fem
ale
s to
male
s in
tota
l pop
ula
-tio
n
Indic
ato
r nam
e
-0.2
8
0.4
6
0.1
7
-0.1
2
0.0
3
0.0
7
-0.0
4
-0.0
8
-0.2
1
1.0
0
15
0.3
9
-0.3
3
-0.3
1
0.0
3
-0.1
3
-0.1
5
-0.1
6
-0.0
9
1.0
0
16
-0.2
3
-0.6
0
0.2
4
-0.1
8
-0.3
0
-0.4
1
-0.0
9
1.0
0
17
0.2
0
-0.0
6
0.0
3
-0.0
5
0.0
3
0.3
0
1.0
0
18
-0.2
5
0.2
1
0.2
4
0.0
9
0.5
4
1.0
0
19
0.0
9
0.1
0
-0.0
7
0.1
0
1.0
0
20
0.4
2
-0.4
3
-0.0
7
1.0
0
21
-0.6
2
-0.2
8
1.0
0
22
-0.2
8
1.0
0
23
1.0
0
24
No zero data indicator combinations can be included in the principal component analysis. For this
reason it was decided to exclude indicators number 12 and 13 (EC3057I: Percent. households with
less than half nat.aver.income and EC3060I: Proportion of households reliant upon social security).
Also indicator number two (DE1028V + DE1055V: Total Resident Population 65 and over) has no
data, and was excluded. This leaves 21 indicators for the principal component analysis.
4.2.2 Results from PCA analysis
Table 23 lists the 21 indicators included in the principal component analysis. A total of 12 out of the
21 indicators have been transformed using the equation in Section 3.1.2.
Analysis of 2003-2006 data
46
Table 23: The 21 indicators included in the principal component analysis. A total of 12 out of
the 21 indicators have been transformed using the equation in Section 3.1.2.
No Indicator description Trans-formed
1 DE1003I: Proportion of females to males in total population
2 DE1040I: Proportion of total population aged 0-4 Y
3 DE2005I: Proportion of Residents who are not EU Nationals and citizens of a country with high HDI
Y
4 DE2006I: Proportion of Residents who are not EU Nationals and citizens of a country with a medium or low HDI
Y
5 DE3003I: Total number of households
6 DE3004I: Average size of households
7 DE3005I: Prop. of households that are lone-parent households Y
8 DE3008I: Prop. households that are lone-pensioner house-holds
Y
9 EC1020I: Unemployment rate Y
10 EC3039I: Median disposable annual household income (for city or NUTS 3 region)
11 EN5001I: Green space (in m2) to which the public has access per capita
12 EN5012I: Proportion of the area in green space Y
13 EN5101I: Population density: total resident pop. per square km
14 SA1012I: Proportion of households living in social housing Y
15 SA1018I: Proportion of dwellings lacking basic amenities Y
16 SA2016I: Mortality rate for <65 per year
17 SA2029I: Crude death rate per 1000 residents
18 SA3001I: Total Number of recorded crimes per 1000 population
19 TE2025I: Prop. of working age population qualified at level 1 or 2 ISCED
Y
20 TE2028I: Prop. of working age population qualified at level 3 or 4 ISCED
Y
21 TE2031I: Prop. of working age population qualified at level 5 or 6 ISCED
Y
Table 24 shows the correlation matrix between the 21 transformed indicators. Red colour is used to
indicate relatively highly correlated indicators (correlation >0.4 or <-0.4). Green colour is used to
indicate independent indicators (-0.1 < correlation < 0.1).
Analysis of 2003-2006 data
47
Table 24: Correlation matrix between the 21 transformed indicators. Red colour: Abs >0.4.
Green colour: Abs<0.1.
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 5
or 6
IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 3
or 4
IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 1
or 2
IS
CE
D
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I: G
ree
n s
pace
(in m
2) to
whic
h th
e p
ublic
has a
ccess p
er
capita
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual h
ouse
hold
incom
e (fo
r city
or
NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Indic
ato
r
0.2
5
0.1
1
-0.1
8
-0.2
3
0.1
7
0.0
2
-0.1
6
-0.1
3
0.2
7
0.0
4
0.1
2
-0.2
2
-0.1
0
0.4
6
-0.2
4
0.0
2
-0.0
6
-0.1
9
-0.0
6
-0.2
8
1.0
0
1
-0.0
8
-0.2
5
0.1
3
-0.2
4
-0.3
4
-0.3
7
-0.0
1
0.3
4
-0.2
7
-0.0
2
-0.1
6
0.5
2
0.0
2
-0.4
7
0.4
2
0.1
0
0.1
5
-0.0
2
-0.1
7
1.0
0
2
0.1
1
-0.1
5
-0.0
5
0.1
4
0.0
3
0.1
2
-0.0
3
-0.1
8
0.1
0
-0.1
2
0.0
2
-0.0
8
-0.1
4
0.2
0
-0.0
5
-0.2
7
0.1
4
0.3
7
1.0
0
3
-0.1
9
-0.0
1
0.1
5
0. 1
3
-0.0
1
0.2
8
0.0
6
0.0
5
0.1
6
-0.1
3
-0.1
3
-0.3
5
0.0
8
0.2
1
0. 1
1
-0.2
0
0.1
4
1.0
0
4
-0.0
1
-0.4
3
-0.0
7
0.1
7
-0.0
4
0.0
3
-0.2
1
0.0
0
-0.1
2
0.0
1
-0.1
3
-0.4
9
-0.1
2
0.3
4
0.2
1
-0.2
0
1.0
0
5
-0.3
8
-0.0
2
0.5
2
-0.2
9
-0.1
3
-0.0
7
0.2
3
-0.0
9
0.2
3
0.1
4
0.1
7
0.1
6
0.0
2
-0.3
1
-0.2
5
1.0
0
6
-0.1
7
-0.4
1
0.1
9
-0.0
4
-0.1
1
0.2
0
-0.1
3
0.3
5
-0.4
6
-0.0
6
-0.1
2
-0.2
1
0.0
5
0.0
7
1.0
0
7
0.0
9
0.2
3
-0.2
2
0.1
1
0.4
0
0.6
1
-0.0
5
-0.2
2
0.1
1
0.0
5
0.0
5
-0.6
5
0.1
0
1.0
0
8
-0.3
2
0.2
2
0.2
6
0.1
0
0.1
7
0.4
2
0.0
8
0.0
9
0.0
3
-0.0
4
0.0
2
-0.3
4
1.0
0
9
0.7
1
0.3
4
-0.7
6
-0.1
0
-0.1
8
-0.7
0
-0.4
9
0.2
0
0.1
0
-0.1
2
-0.4
7
1.0
0
10
-0.2
1
0.1
4
0.0
7
-0.0
4
0.0
6
0.3
3
0.0
7
-0.0
6
-0.4
6
0.3
8
1.0
0
11
-0.2
8
0.4
6
0.1
7
-0.1
2
0.0
3
0.0
7
-0.0
4
-0.0
8
-0.2
1
1.0
0
12
0.3
9
-0.3
3
-0.3
1
0.0
3
-0.1
3
-0.1
5
-0.1
6
-0.0
9
1.0
0
13
Analysis of 2003-2006 data
48
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 5
or 6
IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 3
or 4
IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing
age p
opula
tion q
ualifie
d a
t level 1
or 2
IS
CE
D
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I: G
ree
n s
pace
(in m
2) to
whic
h th
e p
ublic
has a
ccess p
er
capita
EC
303
9I: M
edia
n d
ispo
sa
ble
ann
ual h
ouse
hold
incom
e (fo
r city
or
NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
ropo
rtion o
f Resid
ents
who
are
no
t EU
Natio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Indic
ato
r
-0.2
3
-0.6
0
0.2
4
-0.1
8
-0.3
0
-0.4
1
-0.0
9
1.0
0
14
0.2
0
-0.0
6
0.0
3
-0.0
5
0.0
3
0.3
0
1.0
0
15
-0.2
5
0.2
1
0.2
4
0.0
9
0.5
4
1.0
0
16
0.0
9
0.1
0
-0.0
7
0.1
0
1.0
0
17
0.4
2
-0.4
3
-0.0
7
1.0
0
18
-0.6
2
-0.2
8
1.0
0
19
-0.2
8
1.0
0
20
1.0
0
21
Analysis of 2003-2006 data
49
0 50 100 1500
0.01
0.02
0.03
0.04
0.05
0.06
Transformed value
1.
DE
1003I:
Pro
port
ion o
f fe
male
s
to m
ale
s in t
ota
l popula
tion
0 0.05 0.1 0.15 0.2 0.250
10
20
30
40
Transformed value
2.
DE
1040I:
Pro
port
ion o
f to
tal
popula
tion a
ged 0
-4
0 0.1 0.2 0.3 0.40
20
40
60
80
Transformed value
3.
DE
2005I:
Pro
port
ion o
f R
esid
ents
who a
re n
ot
EU
Nationals
and
citiz
ens o
f a c
ountr
y w
ith
hig
h H
DI
0 0.5 1 1.50
2
4
6
8
10
12
Transformed value
4.
DE
2006I:
Pro
port
ion o
f R
esid
ents
who a
re n
ot
EU
Nationals
and
citiz
ens o
f a c
ountr
y w
ith
a
mediu
m o
r lo
w H
DI
0 5 10 15
x 104
0
0.2
0.4
0.6
0.8
1x 10
-4
Transformed value
5.
DE
3003I:
Tota
l num
ber
of
household
s
0 2 4 6 8 100
0.2
0.4
0.6
0.8
1
Transformed value
6.
DE
3004I:
Avera
ge s
ize o
f
household
s
0 0.1 0.2 0.3 0.40
5
10
15
20
25
30
Transformed value
7.
DE
3005I:
Pro
p.
of
household
s
that
are
lone-p
are
nt
household
s
0 0.1 0.2 0.3 0.40
2
4
6
8
10
Transformed value
8.
DE
3008I:
Pro
p.
household
s
that
are
lone-p
ensio
ner
household
s
Figure 8: Probability density function estimates for indicator nos. 1-8 (bars). Best fit General-
ised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
50
0 0.5 1 1.5 20
2
4
6
8
Transformed value
9.
EC
1020I:
Unem
plo
ym
ent
rate
0 1 2 3 4
x 104
0
0.2
0.4
0.6
0.8
1
1.2x 10
-4
Transformed value
10.
EC
3039I:
Media
n d
isposable
annual household
incom
e (
for
city o
r N
UT
S 3
regio
n)
0 200 400 6000
0.005
0.01
0.015
0.02
Transformed value
11.
EN
5001I:
Gre
en s
pace (
in
m2)
to w
hic
h t
he p
ublic
has
access p
er
capita
0 10 20 30 40 50 600
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Transformed value
12.
EN
5012I:
Pro
port
ion o
f
the a
rea in g
reen s
pace
0 2 4 6 8
x 104
0
1
x 10-4
Transformed value
13.
EN
5101I:
Popula
tion d
ensity:
tota
l re
sid
ent
pop.
per
square
km
0 1 2 3 4 5 60
1
2
3
4
Transformed value
14.
SA
1012I:
Pro
port
ion o
f
household
s liv
ing in s
ocia
l
housin
g
0 0.1 0.2 0.3 0.40
20
40
60
80
Transformed value
15.
SA
1018I:
Pro
port
ion o
f
dw
elli
ngs lackin
g b
asic
am
enitie
s
0 5 10 15 200
0.1
0.2
0.3
0.4
0.5
Transformed value
16.
SA
2016I:
Mort
alit
y r
ate
for
<65 p
er
year
Figure 9: Probability density function estimates for indicator nos. 9-16 (bars). Best fit Gener-
alised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
51
0 10 20 30 40 500
0.05
0.1
0.15
0.2
Transformed value
17.
SA
2029I:
Cru
de d
eath
rate
per
1000 r
esid
ents
0 200 400 600 800 1000 12000
2
4
6
8x 10
-3
Transformed value
18.
SA
3001I:
Tota
l N
um
ber
of
record
ed c
rim
es p
er
1000 p
opula
tion
0 0.5 1 1.50
0.5
1
1.5
2
2.5
3
Transformed value
19.
TE
2025I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 1 o
r 2 I
SC
ED
0 1 2 3 40
0.5
1
1.5
2
Transformed value
20.
TE
2028I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 3 o
r 4 I
SC
ED
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.60
0.5
1
1.5
2
2.5
3
Transformed value
21.
TE
2031I:
Pro
p.
of
work
ing
age p
opula
tion q
ualif
ied a
t
level 5 o
r 6 I
SC
ED
Figure 10: Probability density function estimates for indicator nos. 17-21 (bars). Best fit Gen-
eralised Extreme Value distribution (solid line).
Analysis of 2003-2006 data
52
Table 25: Parameters for best fit Generalised Extreme Value probability density function for
the 21 transformed indicators
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f wo
rkin
g age
pop
ula
tion q
ualifie
d at le
vel 5
or 6
IS
CE
D
TE
20
28
I: Pro
p. o
f wo
rkin
g age
pop
ula
tion q
ualifie
d at le
vel 3
or 4
IS
CE
D
TE
20
25
I: Pro
p. o
f wo
rkin
g age
pop
ula
tion q
ualifie
d at le
vel 1
or 2
IS
CE
D
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I: G
ree
n s
pace (in
m2) to
whic
h th
e p
ublic
has a
cce
ss p
er
capita
EC
303
9I:
Me
dia
n dis
posa
ble
a
nn
ual
hou
seh
old
in
co
me (fo
r city
o
r N
UT
S 3
regio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I:
Pro
po
rtion o
f R
esid
ents
w
ho a
re not
EU
N
atio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I:
Pro
po
rtion o
f R
esid
ents
w
ho a
re not
EU
N
atio
nals
an
d
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Ind
icato
r nam
e
0.1
8
-0.0
2
0.2
0
0.2
5
-0.0
6
-0.0
4
4.9
8
0.5
3
0.8
8
1.0
2
0.8
1
-0.3
8
0.0
7
-0.1
1
0.5
5
-0.0
4
0.2
3
0.4
5
0.4
6
0.0
0
-0.2
5
Sh
ap
e
para
me-
ter, ξξ ξξ
0.1
5
0.2
9
0.1
3
50.1
3.3
9
0.9
2
0.0
0
0.1
1
3127
0.2
9
28.8
7
9822
0.0
5
0.0
5
0.0
2
0.4
5
4433
0.0
3
0.0
1
0.0
1
11.3
Scale
p
ara
me-
ter, σσ σσ
0.2
9
0.7
7
0.2
3
52.0
7.4
7
1.5
0
0.0
0
0.1
1
2397
0.2
0
23.3
9
13050
0.1
0
0.0
9
0.0
3
2.1
2
7367
0.0
4
0.0
1
0.0
4
104
Lo
catio
n
par., µµ µµ
.
Best fit
0.1
1
-0.0
6
0.1
2
0.1
8
-0.0
7
-0.0
5
0.4
8
0.8
1
0.8
8
0.6
5
-0.4
2
0.0
4
-0.1
4
0.5
0
-0.0
6
0.2
0
0.3
9
0.4
1
-0.0
2
-0.2
5
Sh
ap
e
para
me-
ter, ξξ ξξ
0.1
4
0.2
8
0.1
2
46.6
3.2
5
0.8
8
0.1
1
2911
0.2
5
25.0
5
9158
0.0
4
0.0
5
0.0
2
0.4
4
4257
0.0
3
0.0
0
0.0
1
11.0
Scale
p
ara
me-
ter, σσ σσ
0.2
8
0.7
5
0.2
1
47.5
7.2
6
1.4
4
0.1
1
2208
0.1
7
19.7
8
12090
0.0
9
0.0
9
0.0
3
2.1
0
7140
0.0
4
0.0
1
0.0
4
103
Lo
catio
n
par., µµ µµ
.
5%
co
nfid
en
ce in
terv
al
valu
e
0.2
5
0.0
2
0.2
8
0.3
2
-0.0
4
-0.0
2
0.5
9
0.9
6
1.1
6
0.9
6
-0.3
4
0.1
0
-0.0
8
0.6
1
-0.0
3
0.2
6
0.5
0
0.5
1
0.0
3
-0.2
4
Sh
ap
e
para
me-
ter, ξξ ξξ
0.1
6
0.3
1
0.1
4
54.0
3.5
4
0.9
7
0.1
2
3359
0.3
2
33.3
10535
0.0
5
0.0
5
0.0
2
0.4
7
4616
0.0
4
0.0
1
0.0
1
11.6
Scale
p
ara
me-
ter, σσ σσ
0.3
0
0.8
0
0.2
4
56.6
7.6
9
1.5
6
0.1
2
2586
0.2
3
27.0
14009
0.1
0
0.1
0
0.0
3
2.1
4
7593
0.0
4
0.0
1
0.0
4
104
Lo
catio
n
par., µµ µµ
.
95%
co
nfid
en
ce in
terv
al
valu
e
The bar-diagrams in Figure 8 (indicator numbers 1-8), Figure 9 (indicator numbers 9-16) and Figure
10 (indicator numbers 17-21) are histograms normalised to estimate probability distribution function
for the 21 transformed indicators. The solid line shows a best fit generalised extreme value distribu-
tion. The best fit parameters are listed in
Analysis of 2003-2006 data
53
Table 25. It is believed that the double peak distribution observed for indicator number 10. EC3039I:
Median disposable annual household income (for city or NUTS 3 region) (Figure 9) is a result of
error in the data, in particular for a number of German cities where income levels seem unrealisti-
cally low.
The eigenvectors with corresponding eigenvalues resulting from the principal component analysis
are shown in Table 26. To show the importance of each indicator in the various eigenvectors, ei-
genvector components with absolute value larger than 0.3 (high importance) are marked in red and
eigenvector components with absolute value between 0.1 and 0.3 (medium importance) are marked
in green. Indicators with low importance (absolute value less than 0.1) are written in black.
The eigenvectors are sorted by decreasing eigenvalue, showing the eigenvector representing the
largest variation (14.4%) in the data set in the first column and the eigenvector representing the
least variation (0.3%) in column number 21.
The dimensions of the data set can now be reduced by omitting the principal components represent-
ing the least variation in the data set.
The third row in Table 26 shows cumulative explanation of variability as more eigenvectors are in-
cluded in the new data set. For example, including 10 out of the 21 eigenvectors, i.e. all eigenvec-
tors with eigenvalues larger than 1.00, gives a data set containing 75 % of the variation in the origi-
nal data set.
Table 26: Eigenvectors and eigenvalues resulting from the principal component analysis
sorted by decreasing eigenvalue. Components with absolute value larger than 0.3 marked in
red. Components with absolute value between 0.1 and 0.3 marked in green.
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1 T
E20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 5
or 6
ISC
ED
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 3
or 4
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 1
or 2
ISC
ED
SA
300
1I: T
ota
l Nu
mb
er o
f reco
rded c
rimes p
er 1
000 p
opula
-tio
n
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
opula
tion d
ensity
: tota
l resid
en
t po
p. p
er s
qu
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en
space
(in
m2)
to
whic
h
the
public
h
as
access p
er c
apita
EC
303
9I: M
edia
n d
isp
osa
ble
ann
ual h
ou
seh
old
inco
me (fo
r city
or N
UT
S 3
regio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop
. ho
use
hold
s th
at a
re lo
ne
-pensio
ner h
ouse-
hold
s
DE
300
5I:
Pro
p.
of
ho
usehold
s th
at
are
lo
ne
-pare
nt
ho
use
-hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
ropo
rtion o
f Resid
en
ts w
ho
are
not E
U N
atio
nals
and
citiz
ens o
f a c
oun
try w
ith a
me
diu
m o
r low
HD
I
DE
200
5I: P
ropo
rtion o
f Resid
en
ts w
ho
are
not E
U N
atio
nals
and
citiz
ens o
f a c
oun
try w
ith h
igh H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Indic
ato
r code
Eig
en v
alu
e
Cum
ula
tive e
xpla
natio
n o
f varia
bility
(%)
Expla
natio
n o
f varia
bility
(%)
Eig
en v
ecto
r no
0.0
0
-0.1
3
0.0
6
-0.1
0
-0.3
2
-0.4
4
-0.0
5
0.2
9
-0.0
4
-0.0
8
-0.1
8
0.4
4
-0.0
8
-0.4
0
0.0
3
0.0
9
0.0
1
-0.1
1
-0.1
0
0.3
7
-0.1
6
4.0
2
18.3
18.3
1
-0.4
1
0.4
7
0.2
3
-0.3
3
-0.0
1
0.1
2
-0.0
2
0.0
4
-0.1
2
0.3
2
0.2
4
0.0
3
0.1
3
-0.1
2
-0.0
4
0.2
5
-0.3
2
-0.0
7
-0.2
1
0.0
5
-0.0
3
2.6
8
30.5
12.2
2
-0.3
2
-0.2
5
0.3
4
0.0
8
-0.0
3
0.2
1
0.0
3
0.1
5
-0.3
0
-0.0
5
0.0
5
-0.1
8
0.1
4
0.0
3
0.4
4
-0.1
2
0.2
7
0.2
7
0.0
4
0.1
8
-0.3
2
2.4
5
41.6
11.1
3
0.1
7
0.2
0
-0.3
9
-0.2
0
0.0
1
-0.0
6
0.0
4
0.1
2
-0.4
8
0.1
6
0.1
2
-0.0
5
-0.1
9
0.0
9
0.3
5
-0.4
0
0.0
9
-0.2
6
-0.1
1
0.1
0
0.1
5
1.7
7
49.7
8.0
4
-0.1
6
0.3
0
-0.1
2
-0.1
6
-0.1
3
-0.0
5
-0.3
8
0.2
0
0.1
6
-0.0
3
-0.1
9
0.1
0
-0.0
4
0.1
4
0.0
0
-0.3
8
-0.1
6
0.4
7
0.3
9
-0.0
3
-0.0
2
1.4
8
56.4
6.7
5
Analysis of 2003-2006 data
54
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1 T
E20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 5
or 6
ISC
ED
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 3
or 4
ISC
ED
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level 1
or 2
ISC
ED
SA
300
1I: T
ota
l Nu
mb
er o
f reco
rded c
rimes p
er 1
000 p
opula
-tio
n
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
opula
tion d
ensity
: tota
l resid
en
t po
p. p
er s
qu
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en
space
(in
m2)
to
whic
h
the
public
h
as
access p
er c
apita
EC
303
9I: M
edia
n d
isp
osa
ble
ann
ual h
ou
seh
old
inco
me (fo
r city
or N
UT
S 3
regio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop
. ho
use
hold
s th
at a
re lo
ne
-pensio
ner h
ouse-
hold
s
DE
300
5I:
Pro
p.
of
ho
usehold
s th
at
are
lo
ne
-pare
nt
ho
use
-hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
ropo
rtion o
f Resid
en
ts w
ho
are
not E
U N
atio
nals
and
citiz
ens o
f a c
oun
try w
ith a
me
diu
m o
r low
HD
I
DE
200
5I: P
ropo
rtion o
f Resid
en
ts w
ho
are
not E
U N
atio
nals
and
citiz
ens o
f a c
oun
try w
ith h
igh H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
Indic
ato
r code
Eig
en v
alu
e
Cum
ula
tive e
xpla
natio
n o
f varia
bility
(%)
Expla
natio
n o
f varia
bility
(%)
Eig
en v
ecto
r no
-0.2
0
-0.2
3
0.1
0
-0.3
4
-0.1
0
-0.0
4
-0.3
3
0.1
7
0.2
9
0.1
0
0.0
3
-0.1
6
-0.2
3
0.2
1
0.0
5
0.1
7
0.4
1
-0.0
7
-0.1
7
-0.0
1
0.4
3
1.4
1
62.8
6.4
6
-0.1
1
-0.0
4
-0.0
4
0.1
8
-0.1
5
-0.0
8
-0.0
8
-0.3
0
-0.0
6
0.4
7
0.2
8
0.0
2
-0.4
9
-0.2
0
-0.1
5
0.0
3
0.2
4
0.0
3
0.3
3
-0.0
6
-0.2
3
1.2
4
68.5
5.7
7
0.1
6
0.0
0
0.0
3
-0.3
3
-0.2
8
0.0
2
0.6
2
0.2
1
0.0
3
-0.1
7
0.1
7
-0.2
2
-0.2
5
-0.0
6
-0.0
2
0.1
6
-0.0
8
0.2
7
0.2
4
-0.0
4
0.1
0
1.1
6
73.8
5.3
8
-0.0
5
0.0
9
-0.0
4
-0.3
4
0.3
3
0.1
4
0.2
2
-0.3
5
-0.0
2
0.0
7
-0.5
5
0.1
5
-0.2
0
0.1
4
0.0
0
0.1
6
0.1
7
0.1
0
0.0
5
0.3
4
-0.1
0
0.9
7
78.2
4.4
9
0.0
1
-0.0
8
0.3
8
0.0
3
0.2
8
-0.0
8
-0.2
2
-0.0
4
-0.2
2
-0.2
7
-0.0
6
0.0
2
-0.4
9
-0.0
1
0.2
5
0.1
1
-0.4
1
-0.1
4
0.1
8
-0.1
6
0.1
4
0.8
4
82.0
3.8
10
-0.0
4
-0.1
0
-0.0
5
-0.2
3
0.2
8
0.0
9
-0.0
9
-0.0
5
-0.0
8
-0.2
5
0.4
4
0.2
4
0.2
7
-0.0
8
-0.1
6
0.0
0
0.1
4
-0.1
6
0.5
0
0.3
0
0.1
9
0.7
2
85.2
3.3
11
-0.2
2
0.0
5
-0.3
5
-0.2
4
-0.0
9
0.1
6
-0.2
0
-0.1
6
-0.0
5
-0.5
7
0.0
7
-0.1
4
-0.1
7
-0.2
1
-0.0
9
0.0
5
0.1
1
-0.0
7
-0.2
1
-0.2
1
-0.3
7
0.6
6
88.2
3.0
12
0.1
5
0.0
9
-0.1
5
0.0
7
-0.4
4
0.1
4
-0.2
1
-0.3
8
0.0
8
-0.0
2
-0.0
8
-0.1
6
0.1
9
-0.1
3
0.4
7
0.3
2
-0.1
2
-0.0
4
0.2
0
0.2
0
0.1
8
0.6
0
91.0
2.7
13
-0.1
0
0.0
3
0.1
2
-0.1
7
-0.1
9
0.0
1
0.1
1
0.1
2
0.0
0
0.0
8
-0.3
2
-0.0
1
0.1
9
0.1
1
0.0
3
-0.0
2
0.1
1
-0.5
8
0.4
1
-0.4
0
-0.2
1
0.4
7
93.1
2.1
14
-0.2
6
0.0
2
0.2
3
0.0
3
-0.3
1
-0.0
8
0.1
3
-0.3
8
-0.3
5
-0.1
6
-0.1
8
0.0
6
0.0
8
-0.1
2
-0.2
6
-0.3
0
0.1
0
0.0
8
-0.0
8
-0.1
1
0.4
7
0.3
9
94.9
1.8
15
0.0
0
0.6
2
0.2
3
0.2
7
0.1
0
-0.3
3
0.0
5
0.0
4
0.1
4
-0.2
6
0.0
3
-0.3
0
-0.0
6
0.0
1
0.0
0
0.0
2
0.3
5
-0.1
3
0.0
6
0.2
2
-0.0
2
0.3
4
96.5
1.5
16
0.0
1
0.0
2
-0.1
2
-0.1
4
0.3
7
-0.3
5
0.0
0
-0.0
1
-0.0
5
0.1
3
-0.0
9
-0.0
8
0.2
5
-0.4
7
0.2
5
0.1
5
0.2
0
0.2
8
0.0
5
-0.4
1
0.1
2
0.2
9
97.8
1.3
17
0.1
5
-0.0
5
0.2
9
-0.2
6
0.0
6
0. 0
8
0.0
7
-0.2
6
0.4
9
0.0
4
0.1
1
-0.0
4
-0.0
3
-0.3
2
0.2
0
-0.5
4
-0.0
7
-0.1
3
-0.1
1
0.0
4
-0.0
9
0.2
6
98.9
1.2
18
-0.3
5
-0.2
4
-0.2
4
0.0
7
0.1
3
-0.1
1
0.0
4
0.1
1
0.0
4
0.1
0
-0.2
0
-0.5
7
0.0
0
-0.2
2
-0.1
8
-0.1
1
-0.3
0
-0.1
6
0.1
3
0.3
1
0.0
8
0.1
4
99.6
0.6
19
0.5
4
0.0
0
0.2
8
-0.2
3
-0.0
4
0.1
1
-0.3
6
0.0
7
-0.3
1
0.1
0
-0.1
3
-0.3
4
0.0
8
-0.1
5
-0.3
7
0.0
2
0.0
6
0.0
6
0.0
0
0.0
8
-0.1
2
0.0
9
100.0
0.4
20
-0.0
1
0.1
7
-0.0
3
0.2
6
0.0
7
0.6
2
0.0
2
0.3
5
0.0
6
0.0
2
-0.1
8
0.1
5
-0.1
8
-0.4
5
-0.0
1
-0.0
1
0.1
7
-0.0
6
0.0
3
-0.0
3
0.2
5
0.0
1
100.0
0.0
21
Table 27, which shows correlation between the principal components (PC1 to PC21) and each of
the 21 original indicators, can be used to find which indicators dominate each of the principal com-
Analysis of 2003-2006 data
55
ponents. It can be seen that on average, decreasing correlation between the indicators and the
principal components is found for increasing principal component number.
Table 27: Correlation between principal components (PC1 to PC21) and each of the original
indicators. Green: Strong positive correlation. Yellow: Low correlation. Red: Strong negative
correlation.
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
5 o
r 6 IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
3 o
r 4 IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
1 o
r 2 IS
CE
D
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
per
100
0
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I:
Pop
ula
tion
den
sity
: to
tal
resid
en
t p
op.
pe
r
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en space (in
m
2)
to w
hic
h th
e pu
blic
h
as
access p
er c
apita
EC
303
9I:
Media
n
dis
posa
ble
an
nual
hou
seh
old
in
com
e
(for c
ity o
r NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
hou
seh
old
s
tha
t a
re
lon
e-p
ensio
ne
r
hou
seh
old
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
-
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
roportio
n o
f fem
ale
s to
ma
les in
tota
l po
pula
-
tion
IND
ICA
OT
RS
-0.0
1
-0.2
7
0.1
1
-0.2
0
-0.6
3
-0.8
8
-0.1
0
0.5
9
-0.0
8
-0.1
6
-0.3
6
0.8
8
-0.1
6
-0.8
0
0.0
7
0.1
7
0.0
2
-0.2
2
-0.2
0
0.7
3
-0.3
1
PC
1
-0.6
8
0.7
7
0.3
8
-0.5
4
-0.0
2
0.2
0
-0.0
4
0.0
6
-0.2
0
0.5
3
0.4
0
0.0
5
0.2
0
-0.1
9
-0.0
6
0.4
0
-0.5
3
-0.1
1
-0.3
4
0.0
8
-0.0
5
PC
2
-0.5
0
-0.4
0
0.5
4
0.1
2
-0.0
5
0.3
3
0.0
5
0.2
3
-0.4
8
-0.0
8
0.0
8
-0.2
8
0.2
3
0.0
5
0.6
8
-0.1
8
0.4
2
0.4
2
0.0
7
0.2
9
-0.5
0
PC
3
0.2
3
0.2
7
-0.5
2
-0.2
7
0.0
1
-0.0
8
0.0
5
0.1
6
-0.6
4
0.2
1
0.1
5
-0.0
7
-0.2
6
0.1
2
0.4
7
-0.5
3
0.1
2
-0.3
4
-0.1
5
0.1
3
0.2
0
PC
4
-0.1
9
0.3
7
-0.1
4
-0.2
0
-0.1
6
-0.0
6
-0.4
6
0.2
4
0.1
9
-0.0
3
-0.2
3
0.1
2
-0.0
5
0.1
6
0.0
0
-0.4
6
-0.2
0
0.5
8
0.4
8
-0.0
4
-0.0
2
PC
5
-0.2
4
-0.2
7
0.1
2
-0.4
1
-0.1
2
-0.0
4
-0.3
9
0.2
0
0.3
5
0.1
2
0.0
4
-0.1
8
-0.2
7
0.2
5
0.0
6
0.2
0
0.4
9
-0.0
9
-0.2
0
-0.0
1
0.5
1
PC
6
-0.1
2
-0.0
5
-0.0
4
0.2
0
-0.1
6
-0.0
9
-0.0
8
-0.3
4
-0.0
7
0.5
2
0.3
1
0.0
2
-0.5
4
-0.2
2
-0.1
7
0.0
3
0.2
6
0.0
3
0.3
7
-0.0
6
-0.2
6
PC
7
0.1
8
0.0
0
0.0
3
-0.3
6
-0.3
0
0.0
3
0.6
7
0.2
3
0.0
4
-0.1
8
0.1
9
-0.2
4
-0.2
7
-0.0
6
-0.0
2
0.1
8
-0.0
9
0.2
9
0.2
5
-0.0
5
0.1
1
PC
8
-0.0
5
0.0
9
-0.0
4
-0.3
3
0.3
2
0.1
4
0.2
1
-0.3
5
-0.0
2
0.0
7
-0.5
4
0.1
4
-0.2
0
0.1
4
0.0
0
0.1
6
0.1
7
0.1
0
0.0
5
0.3
3
-0.1
0
PC
9
0.0
0
-0.0
8
0.3
5
0.0
3
0.2
6
-0.0
8
-0.2
0
-0.0
4
-0.2
0
-0.2
5
-0.0
5
0.0
2
-0.4
5
-0.0
1
0.2
3
0.1
0
-0.3
7
-0.1
2
0.1
6
-0.1
5
0.1
3
PC
10
-0.0
3
-0.0
9
-0.0
4
-0.1
9
0.2
3
0.0
7
-0.0
7
-0.0
4
-0.0
7
-0.2
1
0.3
7
0.2
0
0.2
3
-0.0
7
-0.1
3
0.0
0
0.1
2
-0.1
3
0.4
2
0.2
5
0.1
6
PC
11
Analysis of 2003-2006 data
56
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
TE
20
31
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
5 o
r 6 IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
3 o
r 4 IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing a
ge p
op
ula
tion q
ualifie
d a
t level
1 o
r 2 IS
CE
D
SA
300
1I:
Tota
l N
um
ber
of
record
ed
crim
es
per
100
0
pop
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I:
Pop
ula
tion
den
sity
: to
tal
resid
en
t p
op.
pe
r
squ
are
km
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I:
Gre
en space (in
m
2)
to w
hic
h th
e pu
blic
h
as
access p
er c
apita
EC
303
9I:
Media
n
dis
posa
ble
an
nual
hou
seh
old
in
com
e
(for c
ity o
r NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I:
Pro
p.
hou
seh
old
s
tha
t a
re
lon
e-p
ensio
ne
r
hou
seh
old
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
-
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
a m
ediu
m o
r low
HD
I
DE
200
5I: P
rop
ortio
n o
f Resid
en
ts w
ho a
re n
ot E
U N
atio
n-
als
and c
itize
ns o
f a c
ountry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
100
3I: P
roportio
n o
f fem
ale
s to
ma
les in
tota
l po
pula
-
tion
IND
ICA
OT
RS
-0.1
8
0.0
4
-0.2
8
-0.1
9
-0.0
7
0.1
3
-0.1
6
-0.1
3
-0.0
4
-0.4
6
0.0
5
-0.1
2
-0.1
3
-0.1
7
-0.0
8
0.0
4
0.0
9
-0.0
5
-0.1
7
-0.1
7
-0.3
0
PC
12
0.1
2
0.0
7
-0.1
2
0.0
5
-0.3
4
0.1
1
-0.1
6
-0.2
9
0.0
6
-0.0
1
-0.0
6
-0.1
2
0.1
4
-0.1
0
0.3
7
0.2
5
-0.0
9
-0.0
3
0.1
5
0.1
6
0.1
4
PC
13
-0.0
7
0.0
2
0.0
8
-0.1
2
-0.1
3
0.0
1
0.0
8
0.0
9
0.0
0
0.0
5
-0.2
2
-0.0
1
0.1
3
0.0
7
0.0
2
-0.0
1
0.0
8
-0.4
0
0.2
8
-0.2
8
-0.1
4
PC
14
-0.1
6
0.0
1
0.1
4
0.0
2
-0.1
9
-0.0
5
0.0
8
-0.2
4
-0.2
2
-0.1
0
-0.1
1
0.0
4
0.0
5
-0.0
8
-0.1
6
-0.1
9
0.0
7
0.0
5
-0.0
5
-0.0
7
0.2
9
PC
15
0.0
0
-0.3
6
-0.1
3
-0.1
6
-0.0
6
0.1
9
-0.0
3
-0.0
2
-0.0
8
0.1
5
-0.0
2
0.1
8
0.0
4
-0.0
1
0.0
0
-0.0
1
-0.2
0
0.0
7
-0.0
4
-0.1
3
0.0
1
PC
16
0.0
0
0.0
1
-0.0
6
-0.0
8
0.2
0
-0.1
9
0.0
0
0.0
0
-0.0
3
0.0
7
-0.0
5
-0.0
4
0.1
3
-0.2
6
0.1
3
0.0
8
0.1
1
0.1
5
0.0
3
-0.2
2
0.0
7
PC
17
0.0
7
-0.0
2
0.1
5
-0.1
3
0.0
3
0.0
4
0.0
4
-0.1
3
0.2
5
0.0
2
0.0
5
-0.0
2
-0.0
2
-0.1
6
0.1
0
-0.2
8
-0.0
4
-0.0
7
-0.0
6
0.0
2
-0.0
5
PC
18
0.1
3
0.0
9
0.0
9
-0.0
3
-0.0
5
0.0
4
-0.0
2
-0.0
4
-0.0
1
-0.0
4
0.0
7
0.2
1
0.0
0
0.0
8
0.0
7
0.0
4
0.1
1
0.0
6
-0.0
5
-0.1
2
-0.0
3
PC
19
0.1
6
0.0
0
0.0
8
-0.0
7
-0.0
1
0.0
3
-0.1
1
0.0
2
-0.0
9
0.0
3
-0.0
4
-0.1
0
0.0
3
-0.0
4
-0.1
1
0.0
1
0.0
2
0.0
2
0.0
0
0.0
2
-0.0
4
PC
20
0.0
0
-0.0
2
0.0
0
-0.0
2
-0.0
1
-0.0
6
0.0
0
-0.0
3
-0.0
1
0.0
0
0.0
2
-0.0
1
0.0
2
0.0
4
0.0
0
0.0
0
-0.0
1
0.0
1
0.0
0
0.0
0
-0.0
2
PC
21
Principal component number one is strongly correlated with the following indicators:
Positive:
• Correlation = +0.73: DE1040I: Proportion of total population aged 0-4
• Correlation = +0.88: EC3039I: Median disposable annual household income (for city or
NUTS 3 region)
• Correlation = +0.59: SA1012I: Proportion of households living in social housing
Negative:
Analysis of 2003-2006 data
57
• Correlation = -0.80: DE3008I: Prop. households that are lone-pensioner households
• Correlation = -0.88: SA2016I: Mortality rate for <65 per year
• Correlation = -0.63: SA2029I: Crude death rate per 1000 residents
It should be noted that less emphasis should be put on the median income indicator (EC3039I) as
the data is believed to contain errors. The correlation with the remaining five indicators would be
characteristic for a young population. Principal component no 1 could therefore subjectively be
termed Proportion of young people.
The results of a similar analysis for the first eight principal components, representing close to 75%
of the variation in the data, can be found in Table 28. Along with the subjective descriptor is listed
the indicator most strongly correlated (either positive or negative) with the principal component and
the corresponding correlation value.
Table 28: Subjective labelling of the first eight principal components representing close to
75% of the variation in the data
No Subjective descriptor for principal com-
ponent Strongest correlated indicator
Strongest
correlation
value
1 Proportion of young people SA2016I: Mortality rate for <65 per
year -0.88
2 Green, suburban, medium educated family
area
TE2028I: Prop. of working age popu-
lation qualified at level 3 or 4 ISCED +0.77
3 Low education, single parent households DE3005I: Prop. of households that are
lone-parent households +0.68
4 Suburban, well educated, single people EN5101I: Population density: total
resident pop. per square km -0.64
5 Medium educated, small household, immi-
grant areas
DE2006I: Proportion of Residents who
are not EU Nationals and citizens of a
country with a medium or low HDI
+0.58
6 Large, male dominates districts with social
problems
DE1003I: Proportion of females to
males in total population +0.51
7 Suburban districts with high employment EC1020I: Unemployment rate -0.54
8 Slum areas SA1018I: Proportion of dwellings lack-
ing basic amenities +0.67
4.3 COMPARISON OF CITY AND SUB-CITY DATA.
Table 29 shows a comparison of correlation of data at city and sub-city level (Correlation at sub-city
level from Table 24 minus correlation at city level from Table 15). Components where data were not
available at sub-city level are marked with NA. Components with absolute value larger than 0.5 are
marked in red. Components with absolute value between 0.2 and 0.5 are marked in yellow. Compo-
nents with absolute value between 0.1 and 0.2 are unmarked (white background). Components with
absolute value less than 0.1 are marked in green.
It can be seen from the table that although a majority of the data correlates similarly at city and sub-
city level (green and white), still there are a significant number of variables that correlate somewhat
(yellow) or highly (red) different at sub-city from city level.
Analysis of 2003-2006 data
58
Table 29: Comparison of correlation of data at city and sub-city level (Correlation at sub-city
level from Table 24 minus correlation at city level from Table 15). Components where data
were not available at sub-city level are marked with NA. Components with absolute value
larger than 0.5 are marked in red. Components with absolute value between 0.2 and 0.5 are
marked in yellow. Components with absolute value between 0.1 and 0.2 are unmarked(white
background). Components with absolute value less than 0.1 are marked in green.
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1 T
E20
31
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 5
or 6
IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 3
or 4
IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 1
or 2
IS
CE
D
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I: G
ree
n s
pace (in
m2) to
whic
h th
e p
ublic
has a
ccess p
er
capita
EC
306
0I: P
ropo
rtion o
f househ
old
s re
liant u
pon s
ocia
l security
EC
305
7I: P
erc
ent. h
ouseh
old
s w
ith le
ss th
an h
alf n
at.a
ver.in
co
me
EC
303
9I: M
edia
n d
isp
osable
an
nu
al h
ousehold
incom
e (fo
r city
or
NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I:
Pro
portio
n o
f R
esid
en
ts w
ho are
not
EU
N
atio
nals
a
nd
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I:
Pro
portio
n o
f R
esid
en
ts w
ho are
not
EU
N
atio
nals
a
nd
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E10
55V
: To
tal R
esid
en
t Popula
tion 6
5 a
nd
over
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
IND
ICA
TO
R
0.1
5
-0.2
5
-0.1
1
0.0
0
-0.1
3
-0.4
3
-0.6
1
0.1
9
0.1
8
0.2
2
0.1
2
0.0
7
-0.2
1
0.3
2
-0.3
5
0.1
3
-0.2
6
0.0
6
-0.0
1
-0.2
3
0.1
2
0.1
2
NA
0.0
0
1
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
2
-0.2
1
-0.0
2
0.1
5
-0.3
8
0.1
1
-0.0
7
0.2
0
-0.1
4
-0.1
5
0.0
2
-0.1
4
0.0
5
-0.2
1
0.1
7
0.1
4
-0.1
4
-0.1
5
-0.1
1
0.1
4
-0.0
9
0.0
0
0.0
0
NA
0.1
2
3
0.2
0
-0.0
2
-0.1
4
-0.0
8
-0.0
9
0.2
0
0.0
6
0.0
3
-0.1
3
-0.0
2
0.1
5
-0.0
9
0.2
4
-0.3
5
0.1
5
0.1
5
0.1
1
0.0
0
-0.1
9
-0.1
4
0.0
0
0.0
0
NA
0.1
2
4
0.0
3
-0.0
7
0.0
8
-0.0
5
-0.1
0
0.1
9
-0.0
5
0.2
2
-0.3
2
0.0
8
0.0
5
0.0
7
0.6
4
-0.6
6
0.0
9
0.1
4
-0.1
4
-0.0
3
-0.1
6
0.0
0
-0.1
4
-0.0
9
NA
-0.2
3
5
-0.1
6
-0.3
0
-0.0
7
-0.0
4
-0.0
1
0.0
6
-0.2
4
-0.0
3
-0.4
3
0.2
5
-0.0
5
-0.2
4
0.4
3
-0.6
0
-0.1
0
0.0
9
0.1
5
-0.0
7
0.0
0
-0.1
6
-0.1
9
0.1
4
NA
-0.0
1
6
-0.1
3
0.2
7
0.0
1
0.3
3
0.2
6
0.2
4
0.3
3
-0.0
5
0.1
7
0.1
4
0.1
0
-0.1
4
-0.8
5
0.4
5
0.0
5
0.1
6
-0.2
2
0.0
0
-0.0
7
-0.0
3
0.0
0
-0.1
1
NA
0.0
6
7
0.0
8
-0.3
1
0.2
0
-0.2
7
0.0
6
0.0
6
-0.1
4
0.1
0
-0.3
4
0.0
0
-0.0
3
-0.2
8
0.2
3
-0.5
2
-0.3
9
0.0
2
0.0
0
-0.2
2
0.1
5
-0.1
4
0.1
1
-0.1
5
NA
-0.2
6
8
0.2
7
-0.1
4
0. 1
3
-0.0
5
-0.1
1
0.2
7
-0.3
6
-0.0
1
0.2
4
-0.0
4
0.0
9
-0.0
4
-0.0
2
-0.6
3
0.0
1
0.0
0
0.0
2
0.1
6
0.0
9
0.1
4
0.1
5
-0.1
4
NA
0.1
3
9
-0.2
6
-0.0
5
0.0
4
0.0
0
0.2
3
0.3
2
0.1
6
0.2
2
-0.0
1
-0.1
2
0.0
1
0.4
3
0.0
8
-0.1
5
0.0
0
0.0
1
-0.3
9
0.0
5
-0.1
0
0.0
9
0.1
5
0.1
4
NA
-0.3
5
10
0.5
3
0.7
3
-0.7
7
-0.5
2
-0.2
4
-0.2
7
0.0
9
-0.3
3
0.0
9
0.1
3
-0.2
3
0.5
6
-1.2
4
0.0
0
-0.1
5
-0.6
3
-0.5
2
0.4
5
-0.6
0
-0.6
6
-0.3
5
0.1
7
NA
0.3
2
11
Analysis of 2003-2006 data
59
24
23
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1 T
E20
31
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 5
or 6
IS
CE
D
TE
20
28
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 3
or 4
IS
CE
D
TE
20
25
I: Pro
p. o
f work
ing
age
po
pula
tion
qu
alifie
d a
t level 1
or 2
IS
CE
D
SA
300
1I: T
ota
l Num
be
r of re
co
rde
d c
rimes p
er 1
00
0 p
op
ula
tion
SA
202
9I: C
rude d
eath
rate
pe
r 100
0 re
sid
en
ts
SA
201
6I: M
orta
lity ra
te fo
r <6
5 p
er y
ear
SA
101
8I: P
rop
ortio
n o
f dw
ellin
gs la
ckin
g b
asic
am
enitie
s
SA
101
2I: P
rop
ortio
n o
f ho
use
hold
s liv
ing in
socia
l housin
g
EN
510
1I: P
op
ula
tion d
ensity
: tota
l resid
ent p
op. p
er s
qua
re k
m
EN
501
2I: P
ropo
rtion o
f the a
rea in
gre
en s
pace
EN
500
1I: G
ree
n s
pace (in
m2) to
whic
h th
e p
ublic
has a
ccess p
er
capita
EC
306
0I: P
ropo
rtion o
f househ
old
s re
liant u
pon s
ocia
l security
EC
305
7I: P
erc
ent. h
ouseh
old
s w
ith le
ss th
an h
alf n
at.a
ver.in
co
me
EC
303
9I: M
edia
n d
isp
osable
an
nu
al h
ousehold
incom
e (fo
r city
or
NU
TS
3 re
gio
n)
EC
102
0I: U
ne
mplo
ym
ent ra
te
DE
300
8I: P
rop. h
ou
seh
old
s th
at a
re lo
ne-p
ensio
ner h
ouse
hold
s
DE
300
5I: P
rop. o
f househ
old
s th
at a
re lo
ne
-pare
nt h
ouse
hold
s
DE
300
4I: A
vera
ge s
ize o
f househ
old
s
DE
300
3I: T
ota
l nu
mb
er o
f ho
use
hold
s
DE
200
6I:
Pro
portio
n o
f R
esid
en
ts w
ho are
not
EU
N
atio
nals
a
nd
citiz
en
s o
f a c
ou
ntry
with
a m
ediu
m o
r low
HD
I
DE
200
5I:
Pro
portio
n o
f R
esid
en
ts w
ho are
not
EU
N
atio
nals
a
nd
citiz
en
s o
f a c
ou
ntry
with
hig
h H
DI
DE
104
0I: P
ropo
rtion o
f tota
l pop
ula
tion
age
d 0
-4
DE
102
8V
+ D
E10
55V
: To
tal R
esid
en
t Popula
tion 6
5 a
nd
over
DE
100
3I: P
ropo
rtion o
f fem
ale
s to
male
s in
tota
l po
pula
tion
IND
ICA
TO
R
NA
NA
NA
0.2
1
-0.0
1
0.0
6
0.1
3
0.2
2
0.5
8
-0.0
7
0.0
9
-0.1
9
0.0
0
-1.2
4
0.0
8
-0.0
2
0.2
3
-0.8
5
0.4
3
0.6
4
0.2
4
-0.2
1
NA
-0.2
1
12
NA
NA
NA
-0.1
0
-0.2
2
-0.5
2
0.1
8
0.1
2
0.3
4
-0.0
5
-0.4
2
0.0
0
-0.1
9
0.5
6
0.4
3
-0.0
4
-0.2
8
-0.1
4
-0.2
4
0.0
7
-0.0
9
0.0
5
NA
0.0
7
13
-0.3
8
0.0
9
0.3
3
0.0
6
0.1
6
0.4
3
0.1
3
-0.0
8
-0.3
0
0.1
0
0.0
0
-0.4
2
0.0
9
-0.2
3
0.0
1
0.0
9
-0.0
3
0.1
0
-0.0
5
0.0
5
0.1
5
-0.1
4
NA
0.1
2
14
-0.2
7
0.2
1
0.4
3
-0.2
7
0.1
5
0.3
3
0.1
7
0.1
1
0.0
8
0.0
0
0.1
0
-0.0
5
-0.0
7
0.1
3
-0.1
2
-0.0
4
0.0
0
0.1
4
0.2
5
0.0
8
-0.0
2
0.0
2
NA
0.2
2
15
0.4
2
-0.1
3
-0.5
3
0.0
8
-0.2
0
-0.0
6
0.0
0
-0.3
0
0.0
0
0.0
8
-0.3
0
0.3
4
0.5
8
0.0
9
-0.0
1
0.2
4
-0.3
4
0.1
7
-0.4
3
-0.3
2
-0.1
3
-0.1
5
NA
0.1
8
16
-0.3
2
-0.2
1
0.1
1
-0.3
7
-0.1
7
-0.2
4
0.1
7
0.0
0
-0.3
0
0.1
1
-0.0
8
0.1
2
0.2
2
-0.3
3
0.2
2
-0.0
1
0.1
0
-0.0
5
-0.0
3
0.2
2
0.0
3
-0.1
4
NA
0.1
9
17
0.0
2
-0.3
8
0.3
2
0.0
9
-0.3
2
-0.3
9
0.0
0
0.1
7
0.0
0
0.1
7
0.1
3
0.1
8
0.1
3
0.0
9
0.1
6
-0.3
6
-0.1
4
0.3
3
-0.2
4
-0.0
5
0.0
6
0.2
0
NA
-0.6
1
18
0.0
0
-0.2
9
0.4
0
0.1
6
0.0
0
0.0
0
-0.3
9
-0.2
4
-0.0
6
0.3
3
0.4
3
-0.5
2
0.0
6
-0.2
7
0.3
2
0.2
7
0.0
6
0.2
4
0.0
6
0.1
9
0.2
0
-0.0
7
NA
-0.4
3
19
0.4
5
-0.1
5
0.0
8
-0.0
4
0.0
0
0.0
0
-0.3
2
-0.1
7
-0.2
0
0.1
5
0.1
6
-0.2
2
-0.0
1
-0.2
4
0.2
3
-0.1
1
0.0
6
0.2
6
-0.0
1
-0.1
0
-0.0
9
0.1
1
NA
-0.1
3
20
0.3
8
-0.4
6
0.3
1
0.0
0
-0.0
4
0.1
6
0.0
9
-0.3
7
0.0
8
-0.2
7
0.0
6
-0.1
0
0.2
1
-0.5
2
0.0
0
-0.0
5
-0.2
7
0.3
3
-0.0
4
-0.0
5
-0.0
8
-0.3
8
NA
0.0
0
21
-0.3
3
0.0
9
0.0
0
0.3
1
0.0
8
0.4
0
0.3
2
0.1
1
-0.5
3
0.4
3
0.3
3
NA
NA
-0.7
7
0.0
4
0.1
3
0.2
0
0.0
1
-0.0
7
0.0
8
-0.1
4
0.1
5
NA
-0.1
1
22
-0.1
9
0.0
0
0.0
9
-0.4
6
-0.1
5
-0.2
9
-0.3
8
-0.2
1
-0.1
3
0.2
1
0.0
9
NA
NA
0.7
3
-0.0
5
-0.1
4
-0.3
1
0.2
7
-0.3
0
-0.0
7
-0.0
2
-0.0
2
NA
-0.2
5
23
0.0
0
-0.1
9
-0.3
3
0.3
8
0.4
5
0.0
0
0.0
2
-0.3
2
0.4
2
-0.2
7
-0.3
8
NA
NA
0.5
3
-0.2
6
0.2
7
0.0
8
-0.1
3
-0.1
6
0.0
3
0.2
0
-0.2
1
NA
0.1
5
24
Summary and recommendations
60
5 Summary and recommendations
The objective of the work reported herein has been to review the only publicly available pan-
European socio-economic indicator database, the European Urban Audit. The review has attempted
to identify indicators suitable to describe levels of socio-economic vulnerability related to the need
for emergency shelter as well as pressure on and functioning of the health system in a post-
earthquake situation.
In the downloaded Urban Audit data set, a total of 958 indicators are included grouped as follows:
• Demography (DE)
• Economic aspects (EC)
• Environment (EN)
• Social aspects (SE)
• Training and education (TE)
Out of the 958 indicators, data are collected for only 44 indicators. For these 44 indicators, data has
been collected for two periods, 1999-2002 and 2003-2006. During the first period, data has been
collected for 7856 districts in 321 cities in 30 European countries. During the second period, data
has been collected for 2972 districts in 173 cities in 24 European Countries.
Identification of indicators has been made through the following procedure:
1. Removal of indicators with no or very little data. A subjective selection procedure considering the relevance of the indicators to emergency shelter and health has been carried out. The re-sulting matrix with 44 indicators, organised with indicator as column heading and district as row heading, had completeness degrees of 32 % for 1999-2002 and 35% for 2003-2006.
2. Improving completeness : To improve completeness of the data set, the following procedure of excluding cities and indicators have been performed:
a. Sort indicators according to completeness, i.e. starting with the indicator having data for the most districts and ending with the indicator having the least data.
b. Subjectively exclude indicators with the least data. Indicators believed to carry infor-mation with significant importance for socio economic vulnerability should be kept when possible.
c. Sort cities according to completeness, i.e. starting with the city having data for the most indicators and ending with the city with data for the least indicators.
d. Subjectively exclude cities with the least indicators. Cities believed to have high sig-nificance, for example being among the last cities of a country, or representing a cer-tain size or type of city should be kept.
e. If necessary, repeat the procedure. For 1999-2002, this resulted in a data set with 20 indicators and 2820 districts, representing
161 cities in 20 European countries. For 2003-2006, analysis was carried out both at city and
sub-city district level. At city level a total of 29 indicators was included, whereas 24 indicators
were included at sub-city district level.
3. Correlation analysis and removal of highly correlated indicators. For 1999-2002 this resulted
in the removal of three indicators, leaving 17 indicators for further analysis. For 2003-2006,
five indicators were removed at city level, leaving 24 indicators for further analysis. At sub-
city district level, three indicators were removed leaving 21 indicators for further analysis.
4. Principal component analysis with the objective to reduce the dimensionality of the data set
while retaining as much as possible of the variation present in the data set. This was
achieved by transforming the data to a new set of variables, the principal components (PCs),
Summary and recommendations
61
which are uncorrelated, and which are ordered so that the first few retain most of the varia-
tion present in all of the original variables. For 1999-2002, this resulted in ten principal com-
ponents that represent 85% of the variation in the 17 input indicators. For 2003-2006, the
city level data are represented by the first eight principal components representing close to
75% of the variation in the 24 original indicators. At sub-city district level, the same is found
with the first eight principal components representing close to 75% of the variation in the
data. It should be noted that the eight principal components at sub-city district level are not
the same as the ones identified at city level.
It is expected that Urban Audit Data for the period 2007-2010 will be made available shortly. These
data will have to be analysed following the same procedure to be able to identify data gaps and pat-
terns of variation in the data. Until then, the most recent available data are from 2003-2006. For this
period, the following principal components are found to represent the Urban Audit Data well in terms
of its relevance to socio-economic vulnerability considering emergency shelter and health care
needs :
City level:
No Subjective descriptor for principal com-
ponent Strongest correlated indicator
Strongest
correlation
value
1 Proportion of poor people SA2016I: Mortality rate for <65 per
year +0.69
2 Proportion of old people DE1028V + DE1055V: Total Resident
Population 65 and over +0.73
3 Proportion of uneducated people TE2025I: Prop. of working age popu-
lation qualified at level 1 or 2 ISCED +0.69
4 Proportion of households that are lone-
parent households
DE3005I: Prop. of households that are
lone-parent households +0.76
5 Size of sub-city district DE3003I: Total number of households +0.55
6 Proportion of highly educated people TE2031I: Prop. of working age popu-
lation qualified at level 5 or 6 ISCED +0.66
7 Crude death rate per 1000 residents SA2029I: Crude death rate per 1000
residents +0.48
8 Population density EN5101I: Population density: total
resident pop. per square km +0.46
Sub-city district level :
No Subjective descriptor for principal com-
ponent Strongest correlated indicator
Strongest
correlation
value
1 Proportion of young people SA2016I: Mortality rate for <65 per
year -0.88
2 Green, suburban, medium educated family
area
TE2028I: Prop. of working age popu-
lation qualified at level 3 or 4 ISCED +0.77
3 Low education, single parent households DE3005I: Prop. of households that are
lone-parent households +0.68
4 Suburban, well educated, single people EN5101I: Population density: total
resident pop. per square km -0.64
Summary and recommendations
62
5 Medium educated, small household, immi-
grant areas
DE2006I: Proportion of Residents who
are not EU Nationals and citizens of a
country with a medium or low HDI
+0.58
6 Large, male dominates districts with social
problems
DE1003I: Proportion of females to
males in total population +0.51
7 Suburban districts with high employment EC1020I: Unemployment rate -0.54
8 Slum areas SA1018I: Proportion of dwellings lack-
ing basic amenities +0.67
It should be noted that the subjective descriptors as given in the tables above should be used with
care, and that a full understanding of each of the eight principal components included both at city
and sub-city district level can only be obtained by considering how the individual indicators contrib-
ute to each principal component as detailed in Table 17 (city level) and Table 26 (sub-city district
level).
References
63
6 References
Jolliffe, I.T. (2002)Principal Component Analysis, Second Edition, Springer Series in Statistics, ISBN
0-387-95442-2, 2002