Date post: | 20-Mar-2017 |
Category: |
Documents |
Upload: | austin-white |
View: | 7 times |
Download: | 1 times |
Austin White
Niko Kritsonis
Ivan Aguilar
Stephen RogersMultiple Regression – 35 points
(5 pages plus output)
ModelingHigher grades on the final project are given to those teams who take the time to analyze the data and relate it to the topic studied. Just evaluating the numbers without referencing the bigger context of why you are doing this analysis will not get you an A grade. Make sure you show how much you love your data in this final team project.
Introduction
In the world today, the health of a nation’s economy is gaged by GDP per Capita. GDP per Capita has become the standard measure for determining economic prosperity of a country. A country with a high GDP per Capita typically contains a strong industrial sector, a strong centralized government, and a large presence in international trade, while a country with a low GDP per Capita typically contains a weak industrial sector, a weak centralized government, and/or a weak international trade presence.
What we wanted to analyse is whether social/economic factors such as Poverty Rate, Fertility Rate, Inflation Rate, and unemployment rate had a strong correlation with the GDP per Capita of a country. For Example, does the 5% U.S Unemployment rate have anything to do with the U.S’s $54,000 GDP per Capita? Or, Does the GDP per Capita of Argentina have any correlation with its 23.7% inflation rate in 2014? These are questions that are central to modern economic and social science.
Data from individual reports (Revised from the individual sections. Select the best of the individual reports.)
Limitations: The data does not take into account economic conditions of the countries outside of GDP per Capita. The data does not take into account different laws and regulations affecting each country. The data does not take into account the social stability or instability of each country.
Adequacy: The data in our analysis was gathered from Published Sources. We gathered a large portion of our Data from the World Bank because of their reliability and their broad range of data on countries all across the world.
Independent Variables:Poverty Rate is defined as minimum line of income that is deemed adequate for a particular country.
Fertility Rate is defined as the ratio of living births in an area to the population in an area. Typically, show as 1000 population per area.Inflation is defined as the general increase in prices and the depreciation and appreciation of the value of money. Unemployment Rate is defined as the percentage of people who are in the labor force who are looking for a job.
Dependent Variable:GDP per Capita is defined as the average income per person in a country. Since this is a major economic representation of a country its should have a strong relationship with the rest of the economic measurements.
Hypothesis: We hypothesize that the independent variables will all be negatively correlated with our dependent variable since they all seem to reflect economic factors that would seem to diminish with higher levels of GDP per Capita. this means that economic policies that decrease our x-values (poverty rate, fertility rate, unemployment rate, inflation) will increase our y-value (GDP per Capita).
Austin White Ivan Aguilar
Stephen Rogers
Niko Kritsonis Y Variable
Variable Poverty Rate Fertility Rates
Inflation Rate Unemployment rate
GDP per Capita
Mean 17.284 1.937 .974 7.32 30712.648Median 15.15 1.8 1 6.2 24406.5Mode 6.2 1.8 .4, -0.6, 1 6.2 N/AStandard Deviation 12.259 .752 6.1788 5.461 25241.66Min 1.5 1.18 -23.3 .03 1086.825 percentile 8.875 12.5 -0.4 4.1 9707.4575 percentile 21.175 37.5 2.7 8.5 49944.825Max 70 5.7 23.7 26.3 97429.7Satisfy empirical rule as normal?
No (2 out of 3 are yes)
no no yes yes
Y-Value (GDP per Capita):Mean: the average GDP per Capita among our 50 countries is $30712.648.Median: the GDP per Capita that is at the 50% point of our sorted data is $24406.5.Mode: there is no two countries with the same GDPStandard Deviation: the amount of variation between the GDP per Capita of our countries as a whole is $24406.5.Min: the lowest GDP per Capita of our countries is $1086.8. (Bangladesh)25 percentile: the GDP per capita at the lower 25% of our data is $9707.45.75 percentile: the GDP per capita at the upper 25% of our data is $49944.825.Max: the highest GDP per capita of our countries is $97429.7. (Norway)Empirical rule?: It does satisfy the empirical rule meaning 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.
Austin White:Mean: the average poverty rate among our 50 countries is 17.284%.Median: the poverty rate that is between 50% of our 50 countries is 15.15%. Mode: the poverty rate that appeared the most often was 6.2% (Australia and Austria).Standard Deviation: the amount of variation in the poverty rate of our countries as a whole is 12.259%.Min: the lowest poverty rate of the countries is 1.5% (Taiwan).25 percentile: the poverty rate at the lowest 25% is 8.875%.75 percentile: the poverty rate at the upper 25% is 21.175%. Max: the highest poverty rate of the countries is 70% (Nigeria).Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.
Niko Kritsonis:Mean: the average unemployment rate is 7.32%.Median: the unemployment rate that is at the 50% point in the data is 6.2%.Mode: the most common unemployment rate is 6.2% (U.K., Indonesia, and Czech Republic)Standard Deviation: the amount of variation in the unemployment rate of our countries as a whole is 5.461%.Min: the lowest unemployment rate in the data is .03%. (Qatar)25 percentile: the unemployment rate at the lowest 25% is 4.1%.
75 percentile: the unemployment rate at the upper 25% is 8.5%.Max: the highest unemployment rate is 26.3%. (Greece) Empirical Rule?: It does satisfy the empirical rule meaning 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.
Stephen Rogers:Mean: the average inflation rate is .974%.Median: the inflation rate that is at the 50% point in our sorted data is 1%.Mode: there are 3 common inflation rates, they are .4%, -0.6%, and 1%. (Netherlands, Poland, Finland) (Canada, Greece, Australia) (US, India, Denmark)Standard Deviation: the amount of variance in inflation rates of our countries is 6.18%Min: the lowest inflation rate is -23.3%. (Qatar)25 percentile: the inflation rate at the lowest 25% is -0.4%.75 percentile: the inflation rate at the upper 25% is 2.7%.Max: the highest inflation rate is 23.7%. (Argentina)Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.
Ivan Aguilar:Mean: the average fertility rate of our countries is 1.937%Median: the fertility rate that is at the 50% of our data is 1.8%Mode: the most common fertility rate is 1.8% (U.K., Brazil, Belgium, Norway, Chile, Finland)Standard Deviation: the amount of variation in the fertility rate of our countries is .752%Min: the lowest fertility rate of our countries is 1.18% (Taiwan)25 percentile: the fertility rate of the lowest 25% is 12.5%75 percentile: the fertility rate of the upper 25% is 37.5%Max: the highest fertility rate of our countries is 5.7% (NIgeria) Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.
Fill in the following correlation table from your individual projects and draw conclusions from your analysis of the correlations:
y (GDP per Capita
Poverty Rate (Austin)
Fertility Rate (name)
Unemployment Rate [Name]
Inflation [Name]
y (GDP per Capita) -0.381 -0.306 -0.159 -0.376Poverty Rate (Austin) -0.111 0.156 -0.083fertility rate[name] -0.068 0.023unemployment rate[Name]
0.156
inflation [Name]
As a team, we have decided that the x variables that should be used for final analysis are the Poverty Rate, The Fertility Rate, and the Inflation. These variables were chosen because they have the strongest correlation with our y-value (GDP per Capita) and therefore have the strongest chance of having a relationship with our Y-value (GDP per Capita).
Team Member
X variable Correlation with y
Parameter validity (Intercept)
Parameter validity (Slope)
R square
Austin White Poverty Rate -0.381 Yes Yes 0.157
Ivan Aguilar Fertility Rate -0.306 Yes No 0.0937
Stephen Rogers
Inflation Rate -0.376 Yes Yes 0.1417
Niko Kritsonis
unemployment rate
-0.159 Yes No 0.02539
Modeling
Comparison Table of Model Significance
Combination 1 Combination 2 Combination 3 Combination 4Poverty Rate -424.006 -646.213 -629.955
Fertility Rate -6023.43 -4335.42 -9991.93
Inflation -1338.75 -1260.32 -1509.66
Model Significance .003212 .001812 .018602 .002139
Adjusted R-Square .208155 .203059 .12004 .197416
Parameter significance(Poverty Rate)
.207096 .020286 .068845 N/A
Parameter significance (Fertility Rate)
.259671 N/A .435991 .0245
Parameter significanceInflation
.016207 .022386 N/A .005865
Parameter significanceIntercept
6.9E-07 6.16E-10 2.55E-06 5.9E-07
Assumption Checks XXXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
Mean of 0 PR: noFR: noI: no
PR: yesI: no
PR: noFR:no
FR: noI: no
Variance constancy PR: noFR: noI: yes
PR: noI: no
PR: noFR: no
FR: noI: no
Independence PR: Independant
PR: Independant
PR: Independant
FR: Independant
FR: IndependantI: Dependant
I: Dependant FR: Dependant I: Dependant
Normality no no no no
We chose combination #2 as our best model (poverty rate and inflation). We took into account three different factors when deciding our best model. we first took a look at the model significance and noticed that all combinations were valid (F significant is less than .05). We then took a look at parameter significance and noticed that combination #2 and #4 both had valid parameters (parameter significance is less than .05). Lastly, we looked at Adjusted R-Squared and noticed that Combination #2 had the largest adjusted R-Squared (largest percent of variation explained by the graph) while meeting our first two significance checks. After these three checks, we came to the conclusion that Combination #2 (poverty rate and inflation) was our best model.
Best Model Improvement:
Final modeling:
Best Model(combination
#2)
Best Model with first outliers removal
Best Model with second outliers removal
Intercept 43109.34 42723.18 40315.71
Slope 1 -646.213 -641.691 -576.627
Slope 2 -1260.32 -1261.65 -1186.08
Slope 3 N/A N/A N/A
Model Significance .001812 0.002033 0.002697
Adjusted R-Square .203059 0.203015 0.19703
Parameter significance(Poverty Rate)
.020286 0.021979 0.028298
Parameter significance (Fertility Rate)
N/A N/A N/A
Parameter significance(Inflation)
.022386 0.023093 0.022837
Assumption Checks XXXXXXXX XXXXXXXXXX XXXXXXXXXX
Mean of 0 PR: yesI: no
PR: yesI: no
PR: yesI: no
Variance constancy PR: noI: yes
PR: noI: yes
PR: noI: yes
Independence PR: IndependentI: Dependent
PR: IndependentI: Dependent
PR: IndependentI: Dependent
Normality no no no
Compared to the original model, the new model has a higher model significance meaning that it is closer to accepting the null. The parameter significance for both x’s are different but not enough to really affect how we see or read the data and because it's still a small number we can accept the value. The slope and the intercepts are getting closer to 0 which means the data is becoming more and more reliable. For the final best model there is one normal outlier of 2.3 on the 19th data point but besides that there are no extreme outliers.
y=-1762.707x+40315.71. The equation for the multiple regression here explains a best fit line for our data points since no line actually fits though every single point. We are given an intercept and slopes from the regression summary and that line is the best explanation of how the data is acting.
Prediction Interval:
X variable t value (alpha/2,df)
Point Prediction
Prediction Interval Confidence Interval Actual y value
Poverty Rate
2.682 15.1 (U.S.A) -31360-96479.37 23244.68-41604.67 54398.5
Inflation 2.898 2.1 (Germany)
-60405.421-81107.021
9728.176-28973.423
10350.8
Fertility Rate
2.898 1.9 (Australia)
-16670.173-125467.173
-64362.2922-44434.7078
54398.5
The prediction interval is an estimate interval of where an observation should be, given a probability. In our case alpha = .01. For the poverty Rate, given that the U.S’s Poverty Rate was 15.1% in 2014, the prediction interval predicted that the U.S’s GDP per capita was between $23244.68-$41604.67. The actual U.S GDP per Capita in 2014 was $54398.5, which is inside the prediction interval. This meant that the prediction interval accurately predicted where the U.S’s GDP per Capita would land (in an interval) given the only the U.S’s Poverty Rate. This same trend occurred for Inflation with Germany and Fertility Rate with Australia.
The prediction interval is fairly similar to the confidence interval in both what it’s trying to capture and in formula. However, the prediction interval is a much larger interval than the confidence interval and therefore doesn’t give us as much accuracy as possibly desired. The Prediction Interval is a useful tool in statistics because of it’s strong predictive characteristics but does have the drawback that its interval is so large that it can be more advantageous to use a confidence interval given certain circumstances.
Discussion for Final Model (I will look to this section to see if you really understood what you did.)
Summary:
Although the correlations weren’t strong, there is some evidence to show that economic factors such as inflation and poverty rate are interconnected with a country’s GDP per Capita. Our final model (which contained Poverty Rate and Inflation as the x variables) proved that with all valid parameters from the x values and intercepts we received a adjusted R-Squared of 0.197. While our adjusted R-Squared is fairly small, almost 20% of our regression plots can be explained by our multiple regression model. This means that there is sufficient evidence to prove that the size of a country’s GDP per Capita has an effect on the poverty rate and inflation of that country.
In our final model, our data contained no extreme outliers (z score greater than or equal to 3 or less than or equal to -3). However, our data did contain one outlier which was Turkey with a z score of 2.32.
Our model appears to be valid based on Parameter significance and Model Significance. The Parameter significance for all three values (two x values and one intercept) had a significance level less than .05 meaning the parameters are valid. The Model Significance level was also below .05 meaning the model is valid. However, it must be noted that our assumption check for our two x variables gave mixed results as to whether a linear regression model was valid for our final values. in our mean of 0 test and independence test we found that the poverty rate passed both tests while inflation did not pass either. For the variance constancy test, inflation passed while the poverty rate did not. Neither values had a Normal distribution meaning a linear regression model may not be acceptable for these values.
Our model would not be a reliable predictor of GDP per Capita of a country. Although we believe that there is sufficient evidence to prove that there is a correlation between the poverty rate and inflation of a country and its GDP per Capita, that correlation according to our data is small. based on our adjusted R-Squared, our regression can only explain about 19.7% of the whole model distribution. We do not believe our model to be a reliable predictor because of its low ability to explain the variation of data points on our model.
In statistical terms, this model is recommended because of its parameter validity and its model validity. Having validity to these pieces of data is highly important to the success of a model. If our model did not have valid parameters or a valid F significance than the model could not be used to determine the GDP per Capita of a country and our model would then be useless. That being said, in the real world our model allows us to see how poverty rates and inflation correlate with the size of a country’s GDP per Capita, which is something that economists and social scientists study frequently in today’s world.
The drawbacks of our model is that the Adjusted R-Squared is vary low (0.197) indicating that our model is not a very good indicator of the correlation between GDP per Capita and the poverty rate and inflation of a country.
Conclusion (I will look to this section to see if you love your data.)There are a myriad of ways to make improvements to our study in the future. In this study we used GDP per Capita of a country as our y-value to find out whether there was a correlation between this and our x-values poverty rate, fertility rate, unemployment rate, and inflation. In the future I would not use the GDP per Capita of a country but rather the increase or decrease of a country’s GDP per Capita from a specific year to another. This would probably give us a better correlation with our x-values because it would reflect the current prosperity of a country rather than comparing to a country's prosperity that may have been accumulated in the past and then became stagnant economically. I would also change x-values such as poverty rate and Fertility Rate to something more economically focused such as interest rates and international trade. Using poverty rate and fertility rate as an x-value was not bad, however, these measures are more socially driven statistics rather than economically driven like GDP per Capita. If we were to use interest rates and international trade of a country instead, we would probably see better regression models being made from the data that can be attained.
From our final model, we can see that the two slopes of our x-values (poverty rate and inflation) were negative. meaning as x increased, y decreased. This matches our hypothesis to a point. Any policies that decrease poverty such as work programs, training programs, educational programs, etc that are used to decrease poverty would increase a nation's GDP per Capita. Also, policies that are aimed at decrease inflation would increase GDP per Capita according to our model. Our model must be taken with a grain of salt though because of the low adjusted R-Squared that has already been noted which questions the reliability of our model.
Going back to our original hypothesis, we found that none of our variables had a strong correlation with GDP per Capita. When we correlated each x-value with our y-value (GDP per Capita) we found the poverty rate, fertility rate, and unemployment rate to have moderate negative correlations with our y (GDP per Capita. However, our R-Squared was very low (.157 for poverty rate, .0937 for fertility rate, and .1417 for inflation). When we combined these x-values in our multiple regression, our Adjusted R-Squared wasn’t reading much better than our individual x R-Squared values.
Works Cited
Unemployment, total (% of total labor force) (modeled ILO estimate). (n.d.). Retrieved
November 20, 2016, from http://data.worldbank.org/indicator/SL.UEM.TOTL.ZS
Inflation, GDP deflator (annual %). (n.d.). Retrieved November 20, 2016, from
http://data.worldbank.org/indicator/NY.GDP.DEFL.KD.ZG
Fertility Rate, Total (births per woman). (n.d.). Retrieved November 20, 2016, from
http://data.worldbank.org/indicator/SP.DYN.TFRT.IN?
Appendix
(Note: Standard Residuals shaded yellow represent normal outliers)
Combination 1
SUMMARY OUTPUT
Regression StatisticsMultiple R 0.506592R Square 0.256635Adjusted R Square 0.208155Standard Error 22461.46Observations 50
ANOVA
df SS MS FSignificanc
e F
Regression 38.01E+0
92.67E+0
95.29360
2 0.003212
Residual 462.32E+1
05.05E+0
8
Total 493.12E+1
0
Coefficients
Standard Error t Stat P-value Lower 95%
Upper 95%
Lower 95.0%
Upper 95.0%
Intercept 51013.718874.91
2 5.74808 6.9E-07 33149.4368877.9
833149.4
368877.9
8
X Variable 1 -424.006331.353
5-
1.279620.20709
6 -1090.99 242.974-
1090.99 242.974
X Variable 2 -6023.435277.90
9-
1.141250.25967
1 -16647.34600.44
7-
16647.34600.44
7
X Variable 3 -1338.75536.356
6-
2.496010.01620
7 -2418.38-
259.121-
2418.38-
259.121
RESIDUAL OUTPUTPROBABILITY OUTPUT
Observation Predicted Y Residuals
Standard Residuals
Percentile Y
131827.9414
122570.5585
9 1.03710662 1 1086.8
239459.1525
4
-31871.8525
4 -1.46449673 3 1315.3
333119.3007
63033.39923
9 0.139383277 5 1576.8
433197.4286
814569.5713
2 0.669464994 7 2052.3
532901.0037
913377.4962
1 0.614689699 9 2872.5
634010.6916
98536.10830
8 0.392230188 11 3203.2
722583.3360
7
-21006.5360
7 -0.965240516 13 3499.6
828832.1177
96347.58220
7 0.291668436 15 5442.9
920387.7877
8
-8658.98778
4 -0.397876442 17 5969.9
1038193.8076
211991.6923
8 0.551012661 19 6472.1
1134056.2369
3
-6066.83693
4 -0.278768322 21 7587.3
1237743.5974
324252.2025
7 1.114377375 23 7918.1
1325801.4195
3
-11899.3195
3 -0.546768172 2510303.
9
14 33433.4647
-3714.96470
2 -0.170700892 2710350.
8
1512282.1595
8
-1931.35958
1 -0.08874507 2911307.
1
1627298.9971
6
-23799.3971
6 -1.093571178 3111728.
8
1736379.9150
115758.7849
9 0.724108806 3312324.
9
1821158.1611
1
-10854.2611
1 -0.498748225 3513154.
819 40496.4876 45114.3123 2.072981572 37 13902.
2 8 1
2052807.3539
6
-28400.8539
6 -1.305005967 3914337.
2
21
-7288.77470
819613.6747
1 0.901239187 4114566.
1
2241663.5448
8
-19044.5448
8 -0.875087938 4319502.
4
23
-16882.6613
420085.8613
4 0.922935942 4521627.
4
2434057.5145
124842.4854
9 1.141500599 4722124.
4
2538153.2787
3
-23816.0787
3 -1.094337688 49 22619
2632521.7591
314777.2408
7 0.679007312 5124406.
5
2739773.8055
757655.8944
3 2.649261406 5327989.
4
2838869.3376
9
-33426.4376
9 -1.535929191 5529718.
5
2935980.0516
4
-30010.1516
4 -1.378952444 5735179.
7
3037810.0607
813338.3392
2 0.612890454 5936152.
7
3145558.6715
9
-1595.97159
1 -0.073334149 6137206.
2
3218198.8235
3
-11726.7235
3 -0.538837466 6340215.
7
3338493.4617
6
-27186.3617
6 -1.249200619 6542546.
8
3430253.9379
99961.76201
5 0.457738309 6743962.
7
3522644.3739
4
-19771.8739
4 -0.908508368 6946278.
5
3619822.3099
917383.8900
1 0.798781622 71 47299
3730017.0837
125990.2162
9 1.194238293 73 47767
3833753.4381
527577.4618
5 1.267171485 7549864.
6
3922223.4325
4
-14305.3325
4 -0.657323345 7750185.
5
4015188.3850
7
-13873.0850
7 -0.637461776 7951148.
4
4128012.4055
8
-13446.3055
8 -0.617851458 8152138.
7
4236498.3821
313366.2178
7 0.614171465 8354321.
3
4329539.4262
624781.8737
4 1.138715518 8554398.
5
4416507.3311
7
-15420.5311
7 -0.708566201 8756007.
3
4534443.3227
5
-32391.0227
5 -1.488352359 89 58900
4633609.8501
5
-11485.4501
5 -0.527751067 9161330.
9
47 33708.0826-
20553.2826 -0.94441373 9361995.
8
4835506.3728
5
-13878.9728
5 -0.637732317 9585610.
8
49 64138.8584 32593.6416 1.4976626 9796732.
5
5036886.1718
1
-17383.7718
1 -0.798776191 9997429.
7
0 10 20 30 40 50 60 70 80-40000-20000
020000400006000080000
Poverty Residual Plot
X Variable 1
Resid
uals
0 1 2 3 4 5 6-40000-20000
020000400006000080000
Fertility Residual Plot
X Variable 2
Resid
uals
-30 -20 -10 0 10 20 30-40000-20000
020000400006000080000
Inflation Residual Plot
X Variable 3
Resid
uals
Combination 2
Note: Residual output for Combination #2 is the same as the best model and can be found under Best Model data.
SUMMARY OUTPUT
Regression StatisticsMultiple R 0.485374R Square 0.235588Adjusted R Square 0.203059Standard Error 22533.62Observations 50
ANOVA
df SS MS FSignifican
ce F
Regression 2 7.36E+093.68E+0
97.24256
4 0.001812
Residual 47 2.39E+105.08E+0
8Total 49 3.12E+10
Coefficients
Standard Error t Stat P-value
Lower 95%
Upper 95%
Lower 95.0%
Upper 95.0%
Intercept 43109.34 5567.0687.74363
46.16E-
10 31909.8454308.8
431909.8
454308.8
4
X Variable 1 -646.213 268.9759-
2.402490.02028
6 -1187.32-
105.103-
1187.32-
105.103
X Variable 2 -1260.32 533.6441-
2.361720.02238
6 -2333.87-
186.764-
2333.87-
186.764
0 10 20 30 40 50 60 70 80-40000-20000
020000400006000080000
Poverty Rate Residual Plot
Poverty Rate
Resid
uals
-30 -20 -10 0 10 20 30-40000-20000
020000400006000080000
Inflation Residual Plot
Inflation
Resid
uals
Combination 3
SUMMARY OUTPUT
Regression StatisticsMultiple R 0.394914R Square 0.155957Adjusted R Square 0.12004Standard Error 23678.24Observations 50
ANOVA
df SS MS FSignifican
ce F
Regression 2 4.87E+092.43E+0
94.34218
2 0.018602
Residual 47 2.64E+105.61E+0
8Total 49 3.12E+10
Coefficients
Standard Error t Stat P-value
Lower 95%
Upper 95%
Lower 95.0%
Upper 95.0%
Intercept 49999.36 9345.867 5.349892.55E-
06 31197.968800.8
231197.
968800.8
2
X Variable 1 -629.955 338.3005-
1.862120.06884
5 -1310.5350.6177
6
-1310.5
350.6177
6
X Variable 2 -4335.42 5517.958-
0.785690.43599
1 -15436.16765.28
2
-15436.
16765.28
2
RESIDUAL OUTPUTPROBABILITY OUTPUT
Observation Predicted Y Residuals
Standard Residuals
Percentile Y
132249.7474
222148.7525
8 0.955100367 1 1086.8
239219.9683
5
-31632.6683
5 -1.364066578 3 1315.33 33850.4964 2302.20351 0.099275813 5 1576.8
9 2
434165.4740
313601.5259
7 0.586526144 7 2052.3
531990.3385
614288.1614
4 0.616135296 9 2872.5
636351.8822
96194.91771
3 0.267137761 11 3203.2
720821.6990
7
-19244.8990
7 -0.829880152 13 3499.6
825094.1208
410085.5791
6 0.43491119 15 5442.9
928714.5721
3
-16985.7721
3 -0.732461891 17 5969.9
1037141.1165
813044.3834
2 0.562500996 19 6472.1
1134717.5799
4
-6728.17994
4 -0.29013314 21 7587.3
1237856.3476
624139.4523
4 1.040943489 23 7918.1
1335699.6467
2
-21797.5467
2 -0.939955638 2510303.
9
14 31071.2673
-1352.76729
7 -0.058334145 2710350.
8
157577.78868
52773.01131
5 0.119578027 2911307.
1
1633177.6778
6
-29678.0778
6 -1.279780563 3111728.
8
1736896.5613
715242.1386
3 0.65727278 3312324.
9
1830248.7448
2
-19944.8448
2 -0.860063271 3513154.
8
1938708.5774
546902.2225
5 2.022521575 3713902.
1
2031371.6562
7
-6965.15626
6 -0.300352053 3914337.
221 21129.2497 - -0.37966191 41 14566.
88804.34978
4 1
2243938.6369
8
-21319.6369
8 -0.919347174 4319502.
4
23
-18809.3722
622012.5722
6 0.949227987 4521627.
4
2437352.3835
921547.6164
1 0.929178122 4722124.
4
2537685.7956
6
-23348.5956
6 -1.006840101 49 22619
2632620.2936
414678.7063
6 0.632976407 5124406.
5
2737029.9792
260399.7207
8 2.604561826 5327989.
4
2830848.9925
8
-25406.0925
8 -1.095563656 5529718.
5
2935180.8289
9
-29210.9289
9 -1.25963613 5735179.
7
3040024.0562
911124.3437
1 0.479704882 5936152.
7
3129911.4867
914051.2132
1 0.605917595 6137206.
2
3219876.7664
5
-13404.6664
5 -0.578037151 6340215.
7
3339368.2398
5
-28061.1398
5 -1.210054827 6542546.
8
3432449.7416
57765.95835
2 0.334884308 6743962.
7
3520299.3004
8
-17426.8004
8 -0.751479952 6946278.
536 23330.5117 13875.6883 0.598348595 71 47299
3727984.4873
928022.8126
1 1.208402076 73 47767
3834187.7545
227143.1454
8 1.170468996 7549864.
639 21162.5379 - -0.571127765 77 50185.
713244.4379
7 5
4020343.8614
5
-19028.5614
5 -0.820551223 7951148.
4
4132683.2891
5
-18117.1891
5 -0.78125095 8152138.
7
4237533.9432
812330.6567
2 0.531723613 8354321.
3
4337863.7744
816457.5255
2 0.709682796 8554398.
5
4420617.8588
9
-19531.0588
9 -0.84221996 8756007.
3
4534210.0350
1
-32157.7350
1 -1.386708547 89 58900
4633457.6697
8
-11333.2697
8 -0.48871421 9161330.
9
4734954.9734
1
-21800.1734
1 -0.940068906 9361995.
8
4831764.2178
9
-10136.8178
9 -0.437120711 9585610.
8
4932383.1652
764349.3347
3 2.77487741 9796732.
5
5037322.6762
7
-17820.2762
7 -0.768447448 9997429.
7
0 10 20 30 40 50 60 70 80-40000-20000
020000400006000080000
Poverty Rate Residual Plot
Poverty Rate
Resid
uals
0 1 2 3 4 5 6-40000-20000
020000400006000080000
Fertility Rate Residual Plot
Fertility
Resid
uals
Combination 4
SUMMARY OUTPUT
Regression StatisticsMultiple R 0.479765R Square 0.230174Adjusted R Square 0.197416Standard Error 22613.26Observations 50
ANOVA
df SS MS FSignifican
ce F
Regression 2 7.19E+093.59E+0
97.02639
7 0.002139
Residual 47 2.4E+105.11E+0
8Total 49 3.12E+10
Coefficients
Standard Error t Stat P-value
Lower 95%
Upper 95%
Lower 95.0%
Upper 95.0%
Intercept 51539.42 8925.3125.77452
3 5.9E-07 33584.0169494.8
333584.0
169494.8
3
X Variable 1 -9991.93 4299.48-
2.32399 0.0245 -18641.4-
1342.49-
18641.4-
1342.49
X Variable 2 -1509.66 522.9722-
2.886690.00586
5 -2561.74-
457.572-
2561.74-
457.572
RESIDUAL OUTPUTPROBABILITY OUTPUT
Observation Predicted Y Residuals
Standard Residuals
Percentile Y
1 31045.0967 23353.4033 1.054474455 1 1086.8
2 36307.1606-
28719.8606 -1.296785696 3 1315.3
334531.4047
11621.29528
7 0.073206224 5 1576.8
434380.4390
413386.5609
6 0.604442376 7 2052.3
533101.0493
413177.4506
6 0.595000434 9 2872.5
629743.9724
112802.8275
9 0.578085107 11 3203.2
726049.1319
8
-24472.3319
8 -1.104997357 13 3499.6
836342.9927
7
-1163.29276
6 -0.05252607 15 5442.9
921476.6926
7
-9747.89266
7 -0.440145861 17 5969.9
1036458.1262
713727.3737
3 0.619831069 19 6472.1
1136227.8592
6-
8238.45926 -0.371990529 21 7587.3
1233460.5474
428535.2525
6 1.288450104 23 7918.1
1322928.7826
2
-9026.68262
5 -0.407581119 2510303.
9
1437644.1170
5
-7925.61705
3 -0.357864788 2710350.
8
15 25783.0328-
15432.2328 -0.696810442 2911307.
1
1623416.4549
8
-19916.8549
8 -0.899304249 3111728.
8
1733949.2766
218189.4233
8 0.82130566 3312324.
918 19233.9421
9-
8930.04218-0.40321752 35 13154.
8
9
1938514.0789
247096.7210
8 2.12655469 3713902.
1
2049528.1123
4
-25121.6123
4 -1.134314265 3914337.
2
21
-7220.88242
319545.7824
2 0.882549237 4114566.
1
2237937.3545
6
-15318.3545
6 -0.691668506 4319502.
4
23
-9792.58294
512995.7829
5 0.586797606 4521627.
4
2429686.4056
629213.5943
4 1.319079219 4722124.
4
25 37946.0484-
23608.8484 -1.066008548 49 22619
2632195.2553
215103.7446
8 0.681978243 5124406.
5
2737026.1567
960403.5432
1 2.727396624 5327989.
4
28 41346.5945-
35903.6945 -1.621156806 5529718.
5
2936098.6281
8
-30128.7281
8 -1.360400188 5735179.
7
30 35135.2674 16013.1326 0.723039767 5936152.
7
3148952.4448
1
-4989.74481
1 -0.225301571 6137206.
2
3221822.0931
9
-15349.9931
9 -0.693097083 6340215.
7
33 33158.6161-
21851.5161 -0.986659856 6542546.
8
3433661.4428
5 6554.25715 0.295943878 6743962.
7
3522620.3907
2
-19747.8907
2 -0.891675018 6946278.
536 16488.3649 20717.8350 0.935470841 71 47299
6 4
3736134.4603
419872.8396
6 0.897316828 73 47767
3833043.4825
928287.4174
1 1.277259622 7549864.
6
3928629.6459
6
-20711.5459
6 -0.93518687 7750185.
5
4010284.6748
6
-8969.37486
5 -0.404993506 7951148.
4
41 27062.4225-
12496.3225 -0.564245506 8152138.
7
4232950.0836
716914.5163
3 0.763739878 8354321.
3
43 23554.3799 30766.9201 1.38921642 8554398.
5
4420650.1999
8
-19563.3999
8 -0.883344722 8756007.
3
4531857.4918
1
-29805.1918
1 -1.345791575 89 58900
4636680.7562
7
-14556.3562
7 -0.657262055 9161330.
9
47 25919.9009-
12765.1009 -0.576381637 9361995.
8
4839455.7051
1
-17828.3051
1 -0.805000114 9585610.
8
4966730.5618
430001.9381
6 1.354675248 9796732.
5
5035494.7654
9
-15992.3654
9 -0.722102071 9997429.
7
0 1 2 3 4 5 6
-60000-40000-20000
020000400006000080000
Fertility Rate Residual Plot
Fertility
Resid
uals
-30 -20 -10 0 10 20 30
-60000-40000-20000
020000400006000080000
Inflation Residual Plot
Inflation
Resid
uals
Normality Plot for Combinations 1-4
0 20 40 60 80 100 1200
20000
40000
60000
80000
100000
120000
Normal Probability Plot
Sample Percentile
Y
Best Model w/ Residual Output
SUMMARY OUTPUT
Regression Statistics
Multiple R0.48537358
7
R Square0.23558751
8Adjusted R Square
0.203059328
Standard Error22533.6195
9Observations 50
ANOVA
df SS MS FSignificanc
e F
Regression 2 7355027188 36775135947.24256447
80.00181218
5
Residual 472386490855
0 507764011.7
Total 493121993573
8
Coefficients
Standard Error t Stat P-value Lower 95% Upper 95%
Lower 95.0%
Intercept43109.3417
35567.06840
3 7.7436342816.16287E-
1031909.8446
854308.8387
831909.8446
X Variable 1
-646.212953
4268.975907
3 -2.4024938140.02028626
6
-1187.32268
3
-105.103223
51187.32268
X Variable 2
-1260.31729
6533.644143
7 -2.3617185930.02238568
5-
2333.87084
-186.763752
5 2333.87084
RESIDUAL OUTPUT PROBABILITY OUTPUT
Observation Predicted Y ResidualsStandard Residuals Percentile Y
132091.2088
422307.2911
6 1.010799431 1 1086.8
239797.6013
7
-32210.3013
7 -1.459529714 3 1315.3
330249.2998
95903.40011
4 0.267497897 5 1576.8
430446.3746
317320.6253
7 0.78484107 7 2052.35 32262.5967 14015.9033 0.635095807 9 2872.5
636491.8786
56054.92135
4 0.274363705 11 3203.2
722591.8784
3
-21015.0784
3 -0.952246024 13 3499.6
822779.3205
912400.3794
1 0.561892359 15 5442.9
919197.8461
6
-7469.04615
9 -0.338441255 17 5969.9
1037791.1303
512394.3696
5 0.561620041 19 6472.1
1129997.2364
3
-2007.83642
7 -0.090980115 21 7587.312 39859.0118 22136.7882 1.003073513 23 7918.113 26296.5560
6-
12394.4560-0.561623957 25 10303.9
6
1428718.0580
41000.44196
1 0.04533254 27 10350.8
156226.23232
54124.56767
5 0.186894528 29 11307.1
1630255.3175
3
-26755.7175
3 -1.21236881 31 11728.8
1736724.6769
415414.0230
6 0.698448127 33 12324.918 22735.9631 -12432.0631 -0.563328026 35 13154.8
1939836.5357
745774.2642
3 2.074146963 37 13902.1
2058130.8058
1
-33724.3058
1 -1.52813306 39 14337.2
21
-6146.56679
318471.4667
9 0.836988587 41 14566.1
2240627.6415
5
-18008.6415
5 -0.816016812 43 19502.4
23
-5780.48516
48983.68516
4 0.407073355 45 21627.424 36191.2482 22708.7518 1.028990621 47 22124.4
2535755.3575
1
-21418.1575
1 -0.97051055 49 22619
2632152.6192
715146.3807
3 0.68632058 51 24406.527 40709.1253 56720.5747 2.570151804 53 27989.4
2836696.5873
4
-31253.6873
4 -1.416183128 55 29718.5
2934201.2355
6
-28231.3355
6 -1.279232772 57 35179.7
3037086.3137
514062.0862
5 0.637188472 59 36152.7
3143363.4255
7599.274434
8 0.027154631 61 37206.2
3218093.6705
7
-11621.5705
7 -0.526602572 63 40215.7
3341157.8594
3
-29850.7594
3 -1.352612938 65 42546.8
3425528.3303
914687.3696
1 0.665521633 67 43962.7
3526866.9205
8
-23994.4205
8 -1.087247505 69 46278.5
3626136.0130
111070.1869
9 0.501617997 71 47299
3724291.2972
731716.0027
3 1.4371318 73 47767
3833189.7708
628141.1291
4 1.275145292 75 49864.6
3918701.3531
9
-10783.2531
9 -0.488616305 77 50185.5
4024287.6823
4
-22972.3823
4 -1.040936383 79 51148.4
4127932.1617
6
-13366.0617
6 -0.605649853 81 52138.7
4237823.2389
612041.3610
4 0.545624334 83 54321.3
4332875.4888
221445.8111
8 0.971763607 85 54398.5
4415317.7616
5
-14230.9616
5 -0.644840641 87 56007.3
4536059.1988
2
-34006.8988
2 -1.54093806 89 58900
4629082.9057
1
-6958.50570
9 -0.315307383 91 61330.9
4740818.6986
5
-27663.8986
5 -1.253520779 93 61995.8
4830941.2730
4
-9313.87304
3 -0.422034997 95 85610.849 63298.5108 33433.9892 1.514978085 97 96732.5
5035894.2326
8
-16391.8326
8 -0.742755139 99 97429.7
Best Model With First Outlier Removed
SUMMARY OUTPUT
Regression Statistics
Multiple R0.48602746
3
R Square0.23622269
5Adjusted R Square
0.203014986
Standard Error22665.1761
6Observations 49
ANOVA
df SS MS FSignificanc
e F
Regression 2 7308544561 36542722807.11348968
10.00203345
8
Residual 462363066967
5 513710210.3
Total 483093921423
5
Coefficients
Standard Error t Stat P-value Lower 95% Upper 95%
Lower 95.0%
Intercept 42723.17785628.69686
5 7.5902431461.19595E-
0931393.1986
654053.1569
531393.1986
X Variable 1
-641.690872
2 270.629122 -2.371107985 0.02197892
-1186.43904
1
-96.9427035
81186.43904
X Variable 2
-1261.64773
8536.763299
9 -2.3504731760.02309328
5
-2342.09622
2-
181.1992542342.09622
RESIDUAL OUTPUT PROBABILITY OUTPUT
Observation Predicted Y ResidualsStandard Residuals Percentile Y
1 31771.9979 22626.5021 1.0197648171.02040816
3 1086.8
239439.6873
5
-31852.3873
5 -1.435570722 3.06122449 1315.33 29932.8283 6219.87162 0.280326417 5.10204081 1576.8
7 8 6
430127.5090
317639.4909
7 0.7950027897.14285714
3 2052.3
531949.2913
514329.2086
5 0.6458100679.18367346
9 2872.5
636139.8426
36406.95737
2 0.288758275 11.2244898 3203.2
722339.1420
7
-20762.3420
7 -0.93574808313.2653061
2 3499.6
822527.3025
312652.3974
7 0.57023704915.3061224
5 5442.9
918897.8112
3
-7169.01123
4 -0.32310365117.3469387
8 5969.9
1037448.2722
512737.2277
5 0.574060306 19.3877551 6472.1
1129680.4988
2
-1691.09882
5 -0.07621695521.4285714
3 7587.3
1239501.6830
422494.1169
6 1.01379828723.4693877
6 7918.1
1325949.8906
3
-12047.7906
3 -0.5429877325.5102040
8 10303.9
1428426.5117
61291.98824
3 0.05822924627.5510204
1 10350.8
156072.79492
94278.00507
1 0.19280748929.5918367
3 11307.1
16 29916.4741 -26416.8741 -1.19059493431.6326530
6 11728.8
1736379.1317
715759.5682
3 0.71027563833.6734693
9 12324.9
1822416.2440
3
-12112.3440
3 -0.54589711835.7142857
1 13154.8
1939486.4692
346124.3307
7 2.07879987437.7551020
4 13902.1
2057814.1029
1
-33407.6029
1 -1.50566348839.7959183
7 14337.2
21
-6428.59975
318753.4997
5 0.84521059241.8367346
9 14566.1
2240246.6642
1
-17627.6642
1 -0.79446976343.8775510
2 19502.423 - 9057.16169 0.408201621 45.9183673 21627.4
5853.961691 1 5
24 35834.211 23065.789 1.03956325247.9591836
7 22124.4
2535416.5954
6
-21079.3954
6 -0.950037516 50 22619
2640363.1024
557066.5975
5 2.57196221352.0408163
3 24406.5
2736400.9733
1
-30958.0733
1 -1.39526444854.0816326
5 27989.4
2833874.3639
7
-27904.4639
7 -1.25763984556.1224489
8 29718.5
2936726.0580
214422.3419
8 0.65000753858.1632653
1 35179.7
3043079.0127
2883.687276
9 0.03982733160.2040816
3 36152.7
31 17843.9921 -11371.8921 -0.5125253362.2448979
6 37206.2
3240789.4115
8
-29482.3115
8 -1.32875262664.2857142
9 40215.7
3325225.6105
314990.0894
7 0.67559562666.3265306
1 42546.8
3426601.5231
1
-23729.0231
1 -1.06945487268.3673469
4 43962.7
3525841.2205
911364.9794
1 0.51221377970.4081632
7 46278.5
3624020.5787
531986.7212
5 1.44162508272.4489795
9 47767
3732862.8723
828468.0276
2 1.28303936974.4897959
2 49864.6
3818459.6021
6
-10541.5021
6 -0.47510008376.5306122
4 50185.5
3923997.7042
7
-22682.4042
7 -1.02228429978.5714285
7 51148.4
4027608.5603
6
-13042.4603
6 -0.587816983 80.6122449 52138.7
4137470.0062
512394.5937
5 0.55861796882.6530612
2 54321.3
42 32507.145 21814.155 0.98315275184.6938775
5 54398.5
4315066.1936
7
-13979.3936
7 -0.63004408686.7346938
8 56007.344 35724.4005 -33672.1005 -1.517584258 88.7755102 58900
45 28775.6114
-6651.21140
2 -0.29976667890.8163265
3 61330.9
4640457.6991
5
-27302.8991
5 -1.23052762892.8571428
6 61995.8
47 30646.349
-9018.94900
2 -0.40647939594.8979591
8 85610.8
4863007.5597
133724.9402
9 1.5199657296.9387755
1 96732.5
4935551.4538
4
-16049.0538
4 -0.72332260598.9795918
4 97429.7
0 10 20 30 40 50 60 70 80
-40000
-20000
0
20000
40000
60000
80000
X Variable 1 Residual Plot
X Variable 1
Resid
uals
-30 -20 -10 0 10 20 30
-40000
-20000
0
20000
40000
60000
80000
X Variable 2 Residual Plot
X Variable 2
Resid
uals
0 20 40 60 80 100 1200
20000
40000
60000
80000
100000
120000
Normal Probability Plot
Sample Percentile
Y
Best Model with Second Outlier Removed
SUMMARY OUTPUT
Regression Statistics
Multiple R0.48083128
9
R Square0.23119872
8Adjusted R Square
0.197029783
Standard Error21216.9822
6Observations 48
ANOVA
df SS MS FSignificanc
e F
Regression 2 6091876477 30459382396.76634077
30.00269660
3
Residual 452025721513
7 450160336.4
Total 472634909161
4
Coefficients
Standard Error t Stat P-value Lower 95% Upper 95%
Lower 95.0%
Intercept40315.7117
25341.93880
2 7.5470186411.58554E-
0929556.4946
851074.9287
629556.4946
X Variable 1
-576.626861
4254.449725
5 -2.2661720710.02829838
7
-1089.11491
6
-64.1388069
51089.11491
X Variable 2-
1186.07979503.224460
4 -2.3569597330.02283661
1
-2199.62588
1
-172.533698
62199.62588
RESIDUAL OUTPUT PROBABILITY OUTPUT
Observation Predicted Y ResidualsStandard Residuals Percentile Y
130422.5663
223975.9336
8 1.1548739561.04166666
7 1086.8
237391.3277
6
-29804.0277
6 -1.435601879 3.125 1315.3
328717.5223
67435.17764
2 0.3581380035.20833333
3 1576.8
428887.2278
118879.7721
9 0.9094017987.29166666
7 2052.3
530618.5326
315659.9673
7 0.75431008 9.375 2872.5
634337.0637
78209.73623
3 0.3954469811.4583333
3 3203.2
721946.1514
6
-20369.3514
6 -0.98115192513.5416666
7 3499.6
822125.7047
313053.9952
7 0.628785488 15.625 5442.9
918487.2585
7
-6758.45856
8 -0.32554176617.7083333
3 5969.9
10 35607.0671 14578.4329 0.70221467519.7916666
7 6472.1
11 28480.3064
-490.906399
7 -0.023646004 21.875 7587.3
1237452.2730
524543.5269
5 1.18221381623.9583333
3 7918.1
1324840.0018
6
-10937.9018
6 -0.52685739626.0416666
7 10303.9
1427437.2370
7 2281.26293 0.109883985 28.125 10350.8
15 7250.59008 3100.20992 0.14933106430.2083333
3 11307.116 28587.6423
2-
25088.0423-1.20844206 32.2916666
711728.8
2
1734593.9753
617544.7246
4 0.845095161 34.375 12324.9
1821675.1193
4
-11371.2193
4 -0.54772945436.4583333
3 13154.8
19 37475.2513 48135.5487 2.31859548338.5416666
7 13902.1
2054777.0274
3
-30370.5274
3 -1.462889063 40.625 14337.2
21
-5093.18513
717418.0851
4 0.83899518442.7083333
3 14566.1
2238027.4756
8
-15408.4756
8 -0.74219621644.7916666
7 19502.4
23
-3487.79996
86690.99996
8 0.322292416 46.875 21627.4
2434025.7720
924874.2279
1 1.19814303648.9583333
3 22124.4
2533729.0350
7
-19391.8350
7 -0.93406686851.0416666
7 22619
2634870.1484
6
-29427.2484
6 -1.417453155 53.125 24406.5
2732348.4132
1
-26378.5132
1 -1.27060152555.2083333
3 27989.4
2834842.8975
216305.5024
8 0.78540424857.2916666
7 29718.5
2941169.5017
82793.19822
4 0.13454291 59.375 35179.7
3017760.1877
6
-11288.0877
6 -0.54372516861.4583333
3 36152.7
3138598.9615
6
-27291.8615
6 -1.31459573363.5416666
7 37206.2
3224388.1140
615827.5859
4 0.76238394 65.625 40215.7
3325865.3557
4
-22992.8557
4 -1.10752100967.7083333
3 42546.8
34 25004.1322 12202.0678 0.58774980469.7916666
7 43962.7
3523425.6856
632581.6143
4 1.569392806 71.875 46278.5
3631402.8319
929928.0680
1 1.44157665673.9583333
3 47767
37 18376.2059 -10458.1059 -0.50374656176.0416666
7 49864.638
23305.65345
-21990.3534
5
-1.059232429 78.125 50185.5
3926508.5030
2
-11942.4030
2
-0.57524225780.2083333
3
51148.4
4035574.2410
314290.3589
7 0.68833871582.2916666
7 52138.741 30858.0411 23463.2589 1.130179412 84.375 54321.342
15154.09483
-14067.2948
3
-0.67759415186.4583333
3
54398.5
4334037.0441
4
-31984.7441
4
-1.54064273388.5416666
7
56007.3
4427682.8766
1
-5558.47661
4
-0.267740976 90.625 58900
4538327.0611
6
-25172.2611
6
-1.21249871792.7083333
3
61330.9
4629494.8223
7
-7867.42236
5
-0.37895838894.7916666
7
61995.8
4759763.2693
936969.2306
1 1.780735723 96.875 85610.848
33834.51262
-14332.1126
2
-0.69034990898.9583333
3
96732.5
0 10 20 30 40 50 60 70 80
-40000-30000
-20000-10000
010000
200003000040000
5000060000
X Variable 1 Residual Plot
X Variable 1
Resid
uals
-30 -20 -10 0 10 20 30
-40000-30000-20000-10000
0100002000030000400005000060000
X Variable 2 Residual Plot
X Variable 2
Resid
uals
0 20 40 60 80 100 1200
20000
40000
60000
80000
100000
120000
Normal Probability Plot
Sample Percentile
Y