BA240 Final Project

Austin White

Niko Kritsonis

Ivan Aguilar

Stephen RogersMultiple Regression – 35 points

(5 pages plus output)

ModelingHigher grades on the final project are given to those teams who take the time to analyze the data and relate it to the topic studied. Just evaluating the numbers without referencing the bigger context of why you are doing this analysis will not get you an A grade. Make sure you show how much you love your data in this final team project.

Introduction

In the world today, the health of a nation’s economy is gaged by GDP per Capita. GDP per Capita has become the standard measure for determining economic prosperity of a country. A country with a high GDP per Capita typically contains a strong industrial sector, a strong centralized government, and a large presence in international trade, while a country with a low GDP per Capita typically contains a weak industrial sector, a weak centralized government, and/or a weak international trade presence.

What we wanted to analyse is whether social/economic factors such as Poverty Rate, Fertility Rate, Inflation Rate, and unemployment rate had a strong correlation with the GDP per Capita of a country. For Example, does the 5% U.S Unemployment rate have anything to do with the U.S’s $54,000 GDP per Capita? Or, Does the GDP per Capita of Argentina have any correlation with its 23.7% inflation rate in 2014? These are questions that are central to modern economic and social science.

Data from individual reports (Revised from the individual sections. Select the best of the individual reports.)

Limitations: The data does not take into account economic conditions of the countries outside of GDP per Capita. The data does not take into account different laws and regulations affecting each country. The data does not take into account the social stability or instability of each country.

Adequacy: The data in our analysis was gathered from Published Sources. We gathered a large portion of our Data from the World Bank because of their reliability and their broad range of data on countries all across the world.

Independent Variables:Poverty Rate is defined as minimum line of income that is deemed adequate for a particular country.

Fertility Rate is defined as the ratio of living births in an area to the population in an area. Typically, show as 1000 population per area.Inflation is defined as the general increase in prices and the depreciation and appreciation of the value of money. Unemployment Rate is defined as the percentage of people who are in the labor force who are looking for a job.

Dependent Variable:GDP per Capita is defined as the average income per person in a country. Since this is a major economic representation of a country its should have a strong relationship with the rest of the economic measurements.

Hypothesis: We hypothesize that the independent variables will all be negatively correlated with our dependent variable since they all seem to reflect economic factors that would seem to diminish with higher levels of GDP per Capita. this means that economic policies that decrease our x-values (poverty rate, fertility rate, unemployment rate, inflation) will increase our y-value (GDP per Capita).

Austin White Ivan Aguilar

Stephen Rogers

Niko Kritsonis Y Variable

Variable Poverty Rate Fertility Rates

Inflation Rate Unemployment rate

GDP per Capita

Mean 17.284 1.937 .974 7.32 30712.648Median 15.15 1.8 1 6.2 24406.5Mode 6.2 1.8 .4, -0.6, 1 6.2 N/AStandard Deviation 12.259 .752 6.1788 5.461 25241.66Min 1.5 1.18 -23.3 .03 1086.825 percentile 8.875 12.5 -0.4 4.1 9707.4575 percentile 21.175 37.5 2.7 8.5 49944.825Max 70 5.7 23.7 26.3 97429.7Satisfy empirical rule as normal?

No (2 out of 3 are yes)

no no yes yes

Y-Value (GDP per Capita):Mean: the average GDP per Capita among our 50 countries is $30712.648.Median: the GDP per Capita that is at the 50% point of our sorted data is $24406.5.Mode: there is no two countries with the same GDPStandard Deviation: the amount of variation between the GDP per Capita of our countries as a whole is $24406.5.Min: the lowest GDP per Capita of our countries is $1086.8. (Bangladesh)25 percentile: the GDP per capita at the lower 25% of our data is $9707.45.75 percentile: the GDP per capita at the upper 25% of our data is $49944.825.Max: the highest GDP per capita of our countries is $97429.7. (Norway)Empirical rule?: It does satisfy the empirical rule meaning 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.

Austin White:Mean: the average poverty rate among our 50 countries is 17.284%.Median: the poverty rate that is between 50% of our 50 countries is 15.15%. Mode: the poverty rate that appeared the most often was 6.2% (Australia and Austria).Standard Deviation: the amount of variation in the poverty rate of our countries as a whole is 12.259%.Min: the lowest poverty rate of the countries is 1.5% (Taiwan).25 percentile: the poverty rate at the lowest 25% is 8.875%.75 percentile: the poverty rate at the upper 25% is 21.175%. Max: the highest poverty rate of the countries is 70% (Nigeria).Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.

Niko Kritsonis:Mean: the average unemployment rate is 7.32%.Median: the unemployment rate that is at the 50% point in the data is 6.2%.Mode: the most common unemployment rate is 6.2% (U.K., Indonesia, and Czech Republic)Standard Deviation: the amount of variation in the unemployment rate of our countries as a whole is 5.461%.Min: the lowest unemployment rate in the data is .03%. (Qatar)25 percentile: the unemployment rate at the lowest 25% is 4.1%.

75 percentile: the unemployment rate at the upper 25% is 8.5%.Max: the highest unemployment rate is 26.3%. (Greece) Empirical Rule?: It does satisfy the empirical rule meaning 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.

Stephen Rogers:Mean: the average inflation rate is .974%.Median: the inflation rate that is at the 50% point in our sorted data is 1%.Mode: there are 3 common inflation rates, they are .4%, -0.6%, and 1%. (Netherlands, Poland, Finland) (Canada, Greece, Australia) (US, India, Denmark)Standard Deviation: the amount of variance in inflation rates of our countries is 6.18%Min: the lowest inflation rate is -23.3%. (Qatar)25 percentile: the inflation rate at the lowest 25% is -0.4%.75 percentile: the inflation rate at the upper 25% is 2.7%.Max: the highest inflation rate is 23.7%. (Argentina)Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.

Ivan Aguilar:Mean: the average fertility rate of our countries is 1.937%Median: the fertility rate that is at the 50% of our data is 1.8%Mode: the most common fertility rate is 1.8% (U.K., Brazil, Belgium, Norway, Chile, Finland)Standard Deviation: the amount of variation in the fertility rate of our countries is .752%Min: the lowest fertility rate of our countries is 1.18% (Taiwan)25 percentile: the fertility rate of the lowest 25% is 12.5%75 percentile: the fertility rate of the upper 25% is 37.5%Max: the highest fertility rate of our countries is 5.7% (NIgeria) Empirical Rule?: It does not satisfy the empirical rule which means one of the 3 criterias are not met: 68% of the countries fall within the first standard deviation, 95% of the countries fall within the second standard deviation, 99% of the countries fall within the third standard deviation.

Fill in the following correlation table from your individual projects and draw conclusions from your analysis of the correlations:

y (GDP per Capita

Poverty Rate (Austin)

Fertility Rate (name)

Unemployment Rate [Name]

Inflation [Name]

y (GDP per Capita) -0.381 -0.306 -0.159 -0.376Poverty Rate (Austin) -0.111 0.156 -0.083fertility rate[name] -0.068 0.023unemployment rate[Name]

0.156

inflation [Name]

As a team, we have decided that the x variables that should be used for final analysis are the Poverty Rate, The Fertility Rate, and the Inflation. These variables were chosen because they have the strongest correlation with our y-value (GDP per Capita) and therefore have the strongest chance of having a relationship with our Y-value (GDP per Capita).

Team Member

X variable Correlation with y

Parameter validity (Intercept)

Parameter validity (Slope)

R square

Austin White Poverty Rate -0.381 Yes Yes 0.157

Ivan Aguilar Fertility Rate -0.306 Yes No 0.0937

Stephen Rogers

Inflation Rate -0.376 Yes Yes 0.1417

Niko Kritsonis

unemployment rate

-0.159 Yes No 0.02539

Modeling

Comparison Table of Model Significance

Combination 1 Combination 2 Combination 3 Combination 4Poverty Rate -424.006 -646.213 -629.955

Fertility Rate -6023.43 -4335.42 -9991.93

Inflation -1338.75 -1260.32 -1509.66

Model Significance .003212 .001812 .018602 .002139

Adjusted R-Square .208155 .203059 .12004 .197416

Parameter significance(Poverty Rate)

.207096 .020286 .068845 N/A

Parameter significance (Fertility Rate)

.259671 N/A .435991 .0245

Parameter significanceInflation

.016207 .022386 N/A .005865

Parameter significanceIntercept

6.9E-07 6.16E-10 2.55E-06 5.9E-07

Assumption Checks XXXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX

Mean of 0 PR: noFR: noI: no

PR: yesI: no

PR: noFR:no

FR: noI: no

Variance constancy PR: noFR: noI: yes

PR: noI: no

PR: noFR: no

FR: noI: no

Independence PR: Independant

PR: Independant

PR: Independant

FR: Independant

FR: IndependantI: Dependant

I: Dependant FR: Dependant I: Dependant

Normality no no no no

We chose combination #2 as our best model (poverty rate and inflation). We took into account three different factors when deciding our best model. we first took a look at the model significance and noticed that all combinations were valid (F significant is less than .05). We then took a look at parameter significance and noticed that combination #2 and #4 both had valid parameters (parameter significance is less than .05). Lastly, we looked at Adjusted R-Squared and noticed that Combination #2 had the largest adjusted R-Squared (largest percent of variation explained by the graph) while meeting our first two significance checks. After these three checks, we came to the conclusion that Combination #2 (poverty rate and inflation) was our best model.

Best Model Improvement:

Final modeling:

Best Model(combination

#2)

Best Model with first outliers removal

Best Model with second outliers removal

Intercept 43109.34 42723.18 40315.71

Slope 1 -646.213 -641.691 -576.627

Slope 2 -1260.32 -1261.65 -1186.08

Slope 3 N/A N/A N/A

Model Significance .001812 0.002033 0.002697

Adjusted R-Square .203059 0.203015 0.19703

Parameter significance(Poverty Rate)

.020286 0.021979 0.028298

Parameter significance (Fertility Rate)

N/A N/A N/A

Parameter significance(Inflation)

.022386 0.023093 0.022837

Assumption Checks XXXXXXXX XXXXXXXXXX XXXXXXXXXX

Mean of 0 PR: yesI: no

PR: yesI: no

PR: yesI: no

Variance constancy PR: noI: yes

PR: noI: yes

PR: noI: yes

Independence PR: IndependentI: Dependent

PR: IndependentI: Dependent

PR: IndependentI: Dependent

Normality no no no

Compared to the original model, the new model has a higher model significance meaning that it is closer to accepting the null. The parameter significance for both x’s are different but not enough to really affect how we see or read the data and because it's still a small number we can accept the value. The slope and the intercepts are getting closer to 0 which means the data is becoming more and more reliable. For the final best model there is one normal outlier of 2.3 on the 19th data point but besides that there are no extreme outliers.

y=-1762.707x+40315.71. The equation for the multiple regression here explains a best fit line for our data points since no line actually fits though every single point. We are given an intercept and slopes from the regression summary and that line is the best explanation of how the data is acting.

Prediction Interval:

X variable t value (alpha/2,df)

Point Prediction

Prediction Interval Confidence Interval Actual y value

Poverty Rate

2.682 15.1 (U.S.A) -31360-96479.37 23244.68-41604.67 54398.5

Inflation 2.898 2.1 (Germany)

-60405.421-81107.021

9728.176-28973.423

10350.8

Fertility Rate

2.898 1.9 (Australia)

-16670.173-125467.173

-64362.2922-44434.7078

54398.5

The prediction interval is an estimate interval of where an observation should be, given a probability. In our case alpha = .01. For the poverty Rate, given that the U.S’s Poverty Rate was 15.1% in 2014, the prediction interval predicted that the U.S’s GDP per capita was between $23244.68-$41604.67. The actual U.S GDP per Capita in 2014 was $54398.5, which is inside the prediction interval. This meant that the prediction interval accurately predicted where the U.S’s GDP per Capita would land (in an interval) given the only the U.S’s Poverty Rate. This same trend occurred for Inflation with Germany and Fertility Rate with Australia.

The prediction interval is fairly similar to the confidence interval in both what it’s trying to capture and in formula. However, the prediction interval is a much larger interval than the confidence interval and therefore doesn’t give us as much accuracy as possibly desired. The Prediction Interval is a useful tool in statistics because of it’s strong predictive characteristics but does have the drawback that its interval is so large that it can be more advantageous to use a confidence interval given certain circumstances.

Discussion for Final Model (I will look to this section to see if you really understood what you did.)

Summary:

Although the correlations weren’t strong, there is some evidence to show that economic factors such as inflation and poverty rate are interconnected with a country’s GDP per Capita. Our final model (which contained Poverty Rate and Inflation as the x variables) proved that with all valid parameters from the x values and intercepts we received a adjusted R-Squared of 0.197. While our adjusted R-Squared is fairly small, almost 20% of our regression plots can be explained by our multiple regression model. This means that there is sufficient evidence to prove that the size of a country’s GDP per Capita has an effect on the poverty rate and inflation of that country.

In our final model, our data contained no extreme outliers (z score greater than or equal to 3 or less than or equal to -3). However, our data did contain one outlier which was Turkey with a z score of 2.32.

Our model appears to be valid based on Parameter significance and Model Significance. The Parameter significance for all three values (two x values and one intercept) had a significance level less than .05 meaning the parameters are valid. The Model Significance level was also below .05 meaning the model is valid. However, it must be noted that our assumption check for our two x variables gave mixed results as to whether a linear regression model was valid for our final values. in our mean of 0 test and independence test we found that the poverty rate passed both tests while inflation did not pass either. For the variance constancy test, inflation passed while the poverty rate did not. Neither values had a Normal distribution meaning a linear regression model may not be acceptable for these values.

Our model would not be a reliable predictor of GDP per Capita of a country. Although we believe that there is sufficient evidence to prove that there is a correlation between the poverty rate and inflation of a country and its GDP per Capita, that correlation according to our data is small. based on our adjusted R-Squared, our regression can only explain about 19.7% of the whole model distribution. We do not believe our model to be a reliable predictor because of its low ability to explain the variation of data points on our model.

In statistical terms, this model is recommended because of its parameter validity and its model validity. Having validity to these pieces of data is highly important to the success of a model. If our model did not have valid parameters or a valid F significance than the model could not be used to determine the GDP per Capita of a country and our model would then be useless. That being said, in the real world our model allows us to see how poverty rates and inflation correlate with the size of a country’s GDP per Capita, which is something that economists and social scientists study frequently in today’s world.

The drawbacks of our model is that the Adjusted R-Squared is vary low (0.197) indicating that our model is not a very good indicator of the correlation between GDP per Capita and the poverty rate and inflation of a country.

Conclusion (I will look to this section to see if you love your data.)There are a myriad of ways to make improvements to our study in the future. In this study we used GDP per Capita of a country as our y-value to find out whether there was a correlation between this and our x-values poverty rate, fertility rate, unemployment rate, and inflation. In the future I would not use the GDP per Capita of a country but rather the increase or decrease of a country’s GDP per Capita from a specific year to another. This would probably give us a better correlation with our x-values because it would reflect the current prosperity of a country rather than comparing to a country's prosperity that may have been accumulated in the past and then became stagnant economically. I would also change x-values such as poverty rate and Fertility Rate to something more economically focused such as interest rates and international trade. Using poverty rate and fertility rate as an x-value was not bad, however, these measures are more socially driven statistics rather than economically driven like GDP per Capita. If we were to use interest rates and international trade of a country instead, we would probably see better regression models being made from the data that can be attained.

From our final model, we can see that the two slopes of our x-values (poverty rate and inflation) were negative. meaning as x increased, y decreased. This matches our hypothesis to a point. Any policies that decrease poverty such as work programs, training programs, educational programs, etc that are used to decrease poverty would increase a nation's GDP per Capita. Also, policies that are aimed at decrease inflation would increase GDP per Capita according to our model. Our model must be taken with a grain of salt though because of the low adjusted R-Squared that has already been noted which questions the reliability of our model.

Going back to our original hypothesis, we found that none of our variables had a strong correlation with GDP per Capita. When we correlated each x-value with our y-value (GDP per Capita) we found the poverty rate, fertility rate, and unemployment rate to have moderate negative correlations with our y (GDP per Capita. However, our R-Squared was very low (.157 for poverty rate, .0937 for fertility rate, and .1417 for inflation). When we combined these x-values in our multiple regression, our Adjusted R-Squared wasn’t reading much better than our individual x R-Squared values.

Works Cited

Unemployment, total (% of total labor force) (modeled ILO estimate). (n.d.). Retrieved

November 20, 2016, from http://data.worldbank.org/indicator/SL.UEM.TOTL.ZS

Inflation, GDP deflator (annual %). (n.d.). Retrieved November 20, 2016, from

http://data.worldbank.org/indicator/NY.GDP.DEFL.KD.ZG

Fertility Rate, Total (births per woman). (n.d.). Retrieved November 20, 2016, from

http://data.worldbank.org/indicator/SP.DYN.TFRT.IN?

Appendix

(Note: Standard Residuals shaded yellow represent normal outliers)

Combination 1

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.506592R Square 0.256635Adjusted R Square 0.208155Standard Error 22461.46Observations 50

ANOVA

df SS MS FSignificanc

e F

Regression 38.01E+0

92.67E+0

95.29360

2 0.003212

Residual 462.32E+1

05.05E+0

8

Total 493.12E+1

0

Coefficients

Standard Error t Stat P-value Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept 51013.718874.91

2 5.74808 6.9E-07 33149.4368877.9

833149.4

368877.9

8

X Variable 1 -424.006331.353

5-

1.279620.20709

6 -1090.99 242.974-

1090.99 242.974

X Variable 2 -6023.435277.90

9-

1.141250.25967

1 -16647.34600.44

7-

16647.34600.44

7

X Variable 3 -1338.75536.356

6-

2.496010.01620

7 -2418.38-

259.121-

2418.38-

259.121

RESIDUAL OUTPUTPROBABILITY OUTPUT

Observation Predicted Y Residuals

Standard Residuals

Percentile Y

131827.9414

122570.5585

9 1.03710662 1 1086.8

239459.1525

4

-31871.8525

4 -1.46449673 3 1315.3

333119.3007

63033.39923

9 0.139383277 5 1576.8

433197.4286

814569.5713

2 0.669464994 7 2052.3

532901.0037

913377.4962

1 0.614689699 9 2872.5

634010.6916

98536.10830

8 0.392230188 11 3203.2

722583.3360

7

-21006.5360

7 -0.965240516 13 3499.6

828832.1177

96347.58220

7 0.291668436 15 5442.9

920387.7877

8

-8658.98778

4 -0.397876442 17 5969.9

1038193.8076

211991.6923

8 0.551012661 19 6472.1

1134056.2369

3

-6066.83693

4 -0.278768322 21 7587.3

1237743.5974

324252.2025

7 1.114377375 23 7918.1

1325801.4195

3

-11899.3195

3 -0.546768172 2510303.

9

14 33433.4647

-3714.96470

2 -0.170700892 2710350.

8

1512282.1595

8

-1931.35958

1 -0.08874507 2911307.

1

1627298.9971

6

-23799.3971

6 -1.093571178 3111728.

8

1736379.9150

115758.7849

9 0.724108806 3312324.

9

1821158.1611

1

-10854.2611

1 -0.498748225 3513154.

819 40496.4876 45114.3123 2.072981572 37 13902.

2 8 1

2052807.3539

6

-28400.8539

6 -1.305005967 3914337.

2

21

-7288.77470

819613.6747

1 0.901239187 4114566.

1

2241663.5448

8

-19044.5448

8 -0.875087938 4319502.

4

23

-16882.6613

420085.8613

4 0.922935942 4521627.

4

2434057.5145

124842.4854

9 1.141500599 4722124.

4

2538153.2787

3

-23816.0787

3 -1.094337688 49 22619

2632521.7591

314777.2408

7 0.679007312 5124406.

5

2739773.8055

757655.8944

3 2.649261406 5327989.

4

2838869.3376

9

-33426.4376

9 -1.535929191 5529718.

5

2935980.0516

4

-30010.1516

4 -1.378952444 5735179.

7

3037810.0607

813338.3392

2 0.612890454 5936152.

7

3145558.6715

9

-1595.97159

1 -0.073334149 6137206.

2

3218198.8235

3

-11726.7235

3 -0.538837466 6340215.

7

3338493.4617

6

-27186.3617

6 -1.249200619 6542546.

8

3430253.9379

99961.76201

5 0.457738309 6743962.

7

3522644.3739

4

-19771.8739

4 -0.908508368 6946278.

5

3619822.3099

917383.8900

1 0.798781622 71 47299

3730017.0837

125990.2162

9 1.194238293 73 47767

3833753.4381

527577.4618

5 1.267171485 7549864.

6

3922223.4325

4

-14305.3325

4 -0.657323345 7750185.

5

4015188.3850

7

-13873.0850

7 -0.637461776 7951148.

4

4128012.4055

8

-13446.3055

8 -0.617851458 8152138.

7

4236498.3821

313366.2178

7 0.614171465 8354321.

3

4329539.4262

624781.8737

4 1.138715518 8554398.

5

4416507.3311

7

-15420.5311

7 -0.708566201 8756007.

3

4534443.3227

5

-32391.0227

5 -1.488352359 89 58900

4633609.8501

5

-11485.4501

5 -0.527751067 9161330.

9

47 33708.0826-

20553.2826 -0.94441373 9361995.

8

4835506.3728

5

-13878.9728

5 -0.637732317 9585610.

8

49 64138.8584 32593.6416 1.4976626 9796732.

5

5036886.1718

1

-17383.7718

1 -0.798776191 9997429.

7

0 10 20 30 40 50 60 70 80-40000-20000

020000400006000080000

Poverty Residual Plot

X Variable 1

Resid

uals

0 1 2 3 4 5 6-40000-20000

020000400006000080000

Fertility Residual Plot

X Variable 2

Resid

uals

-30 -20 -10 0 10 20 30-40000-20000

020000400006000080000

Inflation Residual Plot

X Variable 3

Resid

uals

Combination 2

Note: Residual output for Combination #2 is the same as the best model and can be found under Best Model data.

SUMMARY OUTPUT


ANOVA

df SS MS FSignifican

ce F

Regression 2 7.36E+093.68E+0

97.24256

4 0.001812

Residual 47 2.39E+105.08E+0

8Total 49 3.12E+10

Coefficients

Standard Error t Stat P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept 43109.34 5567.0687.74363

46.16E-

10 31909.8454308.8

431909.8

454308.8

4

X Variable 1 -646.213 268.9759-

2.402490.02028

6 -1187.32-

105.103-

1187.32-

105.103

X Variable 2 -1260.32 533.6441-

2.361720.02238

6 -2333.87-

186.764-

2333.87-

186.764

0 10 20 30 40 50 60 70 80-40000-20000

020000400006000080000

Poverty Rate Residual Plot

Poverty Rate

Resid

uals

-30 -20 -10 0 10 20 30-40000-20000

020000400006000080000


Inflation

Resid

uals

Combination 3

SUMMARY OUTPUT


ANOVA


ce F


94.34218

2 0.018602

Residual 47 2.64E+105.61E+0

8Total 49 3.12E+10

Coefficients


Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept 49999.36 9345.867 5.349892.55E-

06 31197.968800.8

231197.

968800.8

2

X Variable 1 -629.955 338.3005-

1.862120.06884

5 -1310.5350.6177

6

-1310.5

350.6177

6

X Variable 2 -4335.42 5517.958-

0.785690.43599

1 -15436.16765.28

2

-15436.

16765.28

2



Standard Residuals

Percentile Y

132249.7474

222148.7525

8 0.955100367 1 1086.8

239219.9683

5

-31632.6683

5 -1.364066578 3 1315.33 33850.4964 2302.20351 0.099275813 5 1576.8

9 2

434165.4740

313601.5259

7 0.586526144 7 2052.3

531990.3385

614288.1614

4 0.616135296 9 2872.5

636351.8822

96194.91771

3 0.267137761 11 3203.2

720821.6990

7

-19244.8990

7 -0.829880152 13 3499.6

825094.1208

410085.5791

6 0.43491119 15 5442.9

928714.5721

3

-16985.7721

3 -0.732461891 17 5969.9

1037141.1165

813044.3834

2 0.562500996 19 6472.1

1134717.5799

4

-6728.17994

4 -0.29013314 21 7587.3

1237856.3476

624139.4523

4 1.040943489 23 7918.1

1335699.6467

2

-21797.5467

2 -0.939955638 2510303.

9

14 31071.2673

-1352.76729

7 -0.058334145 2710350.

8

157577.78868

52773.01131

5 0.119578027 2911307.

1

1633177.6778

6

-29678.0778

6 -1.279780563 3111728.

8

1736896.5613

715242.1386

3 0.65727278 3312324.

9

1830248.7448

2

-19944.8448

2 -0.860063271 3513154.

8

1938708.5774

546902.2225

5 2.022521575 3713902.

1

2031371.6562

7

-6965.15626

6 -0.300352053 3914337.

221 21129.2497 - -0.37966191 41 14566.

88804.34978

4 1

2243938.6369

8

-21319.6369

8 -0.919347174 4319502.

4

23

-18809.3722

622012.5722

6 0.949227987 4521627.

4

2437352.3835

921547.6164

1 0.929178122 4722124.

4

2537685.7956

6

-23348.5956

6 -1.006840101 49 22619

2632620.2936

414678.7063

6 0.632976407 5124406.

5

2737029.9792

260399.7207

8 2.604561826 5327989.

4

2830848.9925

8

-25406.0925

8 -1.095563656 5529718.

5

2935180.8289

9

-29210.9289

9 -1.25963613 5735179.

7

3040024.0562

911124.3437

1 0.479704882 5936152.

7

3129911.4867

914051.2132

1 0.605917595 6137206.

2

3219876.7664

5

-13404.6664

5 -0.578037151 6340215.

7

3339368.2398

5

-28061.1398

5 -1.210054827 6542546.

8

3432449.7416

57765.95835

2 0.334884308 6743962.

7

3520299.3004

8

-17426.8004

8 -0.751479952 6946278.

536 23330.5117 13875.6883 0.598348595 71 47299

3727984.4873

928022.8126

1 1.208402076 73 47767

3834187.7545

227143.1454

8 1.170468996 7549864.

639 21162.5379 - -0.571127765 77 50185.

713244.4379

7 5

4020343.8614

5

-19028.5614

5 -0.820551223 7951148.

4

4132683.2891

5

-18117.1891

5 -0.78125095 8152138.

7

4237533.9432

812330.6567

2 0.531723613 8354321.

3

4337863.7744

816457.5255

2 0.709682796 8554398.

5

4420617.8588

9

-19531.0588

9 -0.84221996 8756007.

3

4534210.0350

1

-32157.7350

1 -1.386708547 89 58900

4633457.6697

8

-11333.2697

8 -0.48871421 9161330.

9

4734954.9734

1

-21800.1734

1 -0.940068906 9361995.

8

4831764.2178

9

-10136.8178

9 -0.437120711 9585610.

8

4932383.1652

764349.3347

3 2.77487741 9796732.

5

5037322.6762

7

-17820.2762

7 -0.768447448 9997429.

7

0 10 20 30 40 50 60 70 80-40000-20000

020000400006000080000

Poverty Rate Residual Plot

Poverty Rate

Resid

uals

0 1 2 3 4 5 6-40000-20000

020000400006000080000

Fertility Rate Residual Plot

Fertility

Resid

uals

Combination 4

SUMMARY OUTPUT


ANOVA


ce F


97.02639

7 0.002139

Residual 47 2.4E+105.11E+0

8Total 49 3.12E+10

Coefficients


Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept 51539.42 8925.3125.77452

3 5.9E-07 33584.0169494.8

333584.0

169494.8

3

X Variable 1 -9991.93 4299.48-

2.32399 0.0245 -18641.4-

1342.49-

18641.4-

1342.49

X Variable 2 -1509.66 522.9722-

2.886690.00586

5 -2561.74-

457.572-

2561.74-

457.572



Standard Residuals

Percentile Y

1 31045.0967 23353.4033 1.054474455 1 1086.8

2 36307.1606-

28719.8606 -1.296785696 3 1315.3

334531.4047

11621.29528

7 0.073206224 5 1576.8

434380.4390

413386.5609

6 0.604442376 7 2052.3

533101.0493

413177.4506

6 0.595000434 9 2872.5

629743.9724

112802.8275

9 0.578085107 11 3203.2

726049.1319

8

-24472.3319

8 -1.104997357 13 3499.6

836342.9927

7

-1163.29276

6 -0.05252607 15 5442.9

921476.6926

7

-9747.89266

7 -0.440145861 17 5969.9

1036458.1262

713727.3737

3 0.619831069 19 6472.1

1136227.8592

6-

8238.45926 -0.371990529 21 7587.3

1233460.5474

428535.2525

6 1.288450104 23 7918.1

1322928.7826

2

-9026.68262

5 -0.407581119 2510303.

9

1437644.1170

5

-7925.61705

3 -0.357864788 2710350.

8

15 25783.0328-

15432.2328 -0.696810442 2911307.

1

1623416.4549

8

-19916.8549

8 -0.899304249 3111728.

8

1733949.2766

218189.4233

8 0.82130566 3312324.

918 19233.9421

9-

8930.04218-0.40321752 35 13154.

8

9

1938514.0789

247096.7210

8 2.12655469 3713902.

1

2049528.1123

4

-25121.6123

4 -1.134314265 3914337.

2

21

-7220.88242

319545.7824

2 0.882549237 4114566.

1

2237937.3545

6

-15318.3545

6 -0.691668506 4319502.

4

23

-9792.58294

512995.7829

5 0.586797606 4521627.

4

2429686.4056

629213.5943

4 1.319079219 4722124.

4

25 37946.0484-

23608.8484 -1.066008548 49 22619

2632195.2553

215103.7446

8 0.681978243 5124406.

5

2737026.1567

960403.5432

1 2.727396624 5327989.

4

28 41346.5945-

35903.6945 -1.621156806 5529718.

5

2936098.6281

8

-30128.7281

8 -1.360400188 5735179.

7

30 35135.2674 16013.1326 0.723039767 5936152.

7

3148952.4448

1

-4989.74481

1 -0.225301571 6137206.

2

3221822.0931

9

-15349.9931

9 -0.693097083 6340215.

7

33 33158.6161-

21851.5161 -0.986659856 6542546.

8

3433661.4428

5 6554.25715 0.295943878 6743962.

7

3522620.3907

2

-19747.8907

2 -0.891675018 6946278.

536 16488.3649 20717.8350 0.935470841 71 47299

6 4

3736134.4603

419872.8396

6 0.897316828 73 47767

3833043.4825

928287.4174

1 1.277259622 7549864.

6

3928629.6459

6

-20711.5459

6 -0.93518687 7750185.

5

4010284.6748

6

-8969.37486

5 -0.404993506 7951148.

4

41 27062.4225-

12496.3225 -0.564245506 8152138.

7

4232950.0836

716914.5163

3 0.763739878 8354321.

3

43 23554.3799 30766.9201 1.38921642 8554398.

5

4420650.1999

8

-19563.3999

8 -0.883344722 8756007.

3

4531857.4918

1

-29805.1918

1 -1.345791575 89 58900

4636680.7562

7

-14556.3562

7 -0.657262055 9161330.

9

47 25919.9009-

12765.1009 -0.576381637 9361995.

8

4839455.7051

1

-17828.3051

1 -0.805000114 9585610.

8

4966730.5618

430001.9381

6 1.354675248 9796732.

5

5035494.7654

9

-15992.3654

9 -0.722102071 9997429.

7

0 1 2 3 4 5 6

-60000-40000-20000

020000400006000080000

Fertility Rate Residual Plot

Fertility

Resid

uals

-30 -20 -10 0 10 20 30

-60000-40000-20000

020000400006000080000


Inflation

Resid

uals

Normality Plot for Combinations 1-4

0 20 40 60 80 100 1200

20000

40000

60000

80000

100000

120000

Normal Probability Plot

Sample Percentile

Y

Best Model w/ Residual Output

SUMMARY OUTPUT

Regression Statistics

Multiple R0.48537358

7

R Square0.23558751

8Adjusted R Square

0.203059328

Standard Error22533.6195

9Observations 50

ANOVA


e F

Regression 2 7355027188 36775135947.24256447

80.00181218

5

Residual 472386490855

0 507764011.7

Total 493121993573

8

Coefficients

Standard Error t Stat P-value Lower 95% Upper 95%

Lower 95.0%

Intercept43109.3417

35567.06840

3 7.7436342816.16287E-

1031909.8446

854308.8387

831909.8446

X Variable 1

-646.212953

4268.975907

3 -2.4024938140.02028626

6

-1187.32268

3

-105.103223

51187.32268

X Variable 2

-1260.31729

6533.644143

7 -2.3617185930.02238568

5-

2333.87084

-186.763752

5 2333.87084

RESIDUAL OUTPUT PROBABILITY OUTPUT

Observation Predicted Y ResidualsStandard Residuals Percentile Y

132091.2088

422307.2911

6 1.010799431 1 1086.8

239797.6013

7

-32210.3013

7 -1.459529714 3 1315.3

330249.2998

95903.40011

4 0.267497897 5 1576.8

430446.3746

317320.6253

7 0.78484107 7 2052.35 32262.5967 14015.9033 0.635095807 9 2872.5

636491.8786

56054.92135

4 0.274363705 11 3203.2

722591.8784

3

-21015.0784

3 -0.952246024 13 3499.6

822779.3205

912400.3794

1 0.561892359 15 5442.9

919197.8461

6

-7469.04615

9 -0.338441255 17 5969.9

1037791.1303

512394.3696

5 0.561620041 19 6472.1

1129997.2364

3

-2007.83642

7 -0.090980115 21 7587.312 39859.0118 22136.7882 1.003073513 23 7918.113 26296.5560

6-

12394.4560-0.561623957 25 10303.9

6

1428718.0580

41000.44196

1 0.04533254 27 10350.8

156226.23232

54124.56767

5 0.186894528 29 11307.1

1630255.3175

3

-26755.7175

3 -1.21236881 31 11728.8

1736724.6769

415414.0230

6 0.698448127 33 12324.918 22735.9631 -12432.0631 -0.563328026 35 13154.8

1939836.5357

745774.2642

3 2.074146963 37 13902.1

2058130.8058

1

-33724.3058

1 -1.52813306 39 14337.2

21

-6146.56679

318471.4667

9 0.836988587 41 14566.1

2240627.6415

5

-18008.6415

5 -0.816016812 43 19502.4

23

-5780.48516

48983.68516

4 0.407073355 45 21627.424 36191.2482 22708.7518 1.028990621 47 22124.4

2535755.3575

1

-21418.1575

1 -0.97051055 49 22619

2632152.6192

715146.3807

3 0.68632058 51 24406.527 40709.1253 56720.5747 2.570151804 53 27989.4

2836696.5873

4

-31253.6873

4 -1.416183128 55 29718.5

2934201.2355

6

-28231.3355

6 -1.279232772 57 35179.7

3037086.3137

514062.0862

5 0.637188472 59 36152.7

3143363.4255

7599.274434

8 0.027154631 61 37206.2

3218093.6705

7

-11621.5705

7 -0.526602572 63 40215.7

3341157.8594

3

-29850.7594

3 -1.352612938 65 42546.8

3425528.3303

914687.3696

1 0.665521633 67 43962.7

3526866.9205

8

-23994.4205

8 -1.087247505 69 46278.5

3626136.0130

111070.1869

9 0.501617997 71 47299

3724291.2972

731716.0027

3 1.4371318 73 47767

3833189.7708

628141.1291

4 1.275145292 75 49864.6

3918701.3531

9

-10783.2531

9 -0.488616305 77 50185.5

4024287.6823

4

-22972.3823

4 -1.040936383 79 51148.4

4127932.1617

6

-13366.0617

6 -0.605649853 81 52138.7

4237823.2389

612041.3610

4 0.545624334 83 54321.3

4332875.4888

221445.8111

8 0.971763607 85 54398.5

4415317.7616

5

-14230.9616

5 -0.644840641 87 56007.3

4536059.1988

2

-34006.8988

2 -1.54093806 89 58900

4629082.9057

1

-6958.50570

9 -0.315307383 91 61330.9

4740818.6986

5

-27663.8986

5 -1.253520779 93 61995.8

4830941.2730

4

-9313.87304

3 -0.422034997 95 85610.849 63298.5108 33433.9892 1.514978085 97 96732.5

5035894.2326

8

-16391.8326

8 -0.742755139 99 97429.7

Best Model With First Outlier Removed

SUMMARY OUTPUT



3

R Square0.23622269

5Adjusted R Square

0.203014986


6Observations 49

ANOVA


e F

Regression 2 7308544561 36542722807.11348968

10.00203345

8

Residual 462363066967

5 513710210.3

Total 483093921423

5

Coefficients


Lower 95.0%

Intercept 42723.17785628.69686

5 7.5902431461.19595E-

0931393.1986

654053.1569

531393.1986

X Variable 1

-641.690872

2 270.629122 -2.371107985 0.02197892

-1186.43904

1

-96.9427035

81186.43904

X Variable 2

-1261.64773

8536.763299

9 -2.3504731760.02309328

5

-2342.09622

2-

181.1992542342.09622



1 31771.9979 22626.5021 1.0197648171.02040816

3 1086.8

239439.6873

5

-31852.3873

5 -1.435570722 3.06122449 1315.33 29932.8283 6219.87162 0.280326417 5.10204081 1576.8

7 8 6

430127.5090

317639.4909

7 0.7950027897.14285714

3 2052.3

531949.2913

514329.2086

5 0.6458100679.18367346

9 2872.5

636139.8426

36406.95737

2 0.288758275 11.2244898 3203.2

722339.1420

7

-20762.3420

7 -0.93574808313.2653061

2 3499.6

822527.3025

312652.3974

7 0.57023704915.3061224

5 5442.9

918897.8112

3

-7169.01123

4 -0.32310365117.3469387

8 5969.9

1037448.2722

512737.2277

5 0.574060306 19.3877551 6472.1

1129680.4988

2

-1691.09882

5 -0.07621695521.4285714

3 7587.3

1239501.6830

422494.1169

6 1.01379828723.4693877

6 7918.1

1325949.8906

3

-12047.7906

3 -0.5429877325.5102040

8 10303.9

1428426.5117

61291.98824

3 0.05822924627.5510204

1 10350.8

156072.79492

94278.00507

1 0.19280748929.5918367

3 11307.1

16 29916.4741 -26416.8741 -1.19059493431.6326530

6 11728.8

1736379.1317

715759.5682

3 0.71027563833.6734693

9 12324.9

1822416.2440

3

-12112.3440

3 -0.54589711835.7142857

1 13154.8

1939486.4692

346124.3307

7 2.07879987437.7551020

4 13902.1

2057814.1029

1

-33407.6029

1 -1.50566348839.7959183

7 14337.2

21

-6428.59975

318753.4997

5 0.84521059241.8367346

9 14566.1

2240246.6642

1

-17627.6642

1 -0.79446976343.8775510

2 19502.423 - 9057.16169 0.408201621 45.9183673 21627.4

5853.961691 1 5

24 35834.211 23065.789 1.03956325247.9591836

7 22124.4

2535416.5954

6

-21079.3954

6 -0.950037516 50 22619

2640363.1024

557066.5975

5 2.57196221352.0408163

3 24406.5

2736400.9733

1

-30958.0733

1 -1.39526444854.0816326

5 27989.4

2833874.3639

7

-27904.4639

7 -1.25763984556.1224489

8 29718.5

2936726.0580

214422.3419

8 0.65000753858.1632653

1 35179.7

3043079.0127

2883.687276

9 0.03982733160.2040816

3 36152.7

31 17843.9921 -11371.8921 -0.5125253362.2448979

6 37206.2

3240789.4115

8

-29482.3115

8 -1.32875262664.2857142

9 40215.7

3325225.6105

314990.0894

7 0.67559562666.3265306

1 42546.8

3426601.5231

1

-23729.0231

1 -1.06945487268.3673469

4 43962.7

3525841.2205

911364.9794

1 0.51221377970.4081632

7 46278.5

3624020.5787

531986.7212

5 1.44162508272.4489795

9 47767

3732862.8723

828468.0276

2 1.28303936974.4897959

2 49864.6

3818459.6021

6

-10541.5021

6 -0.47510008376.5306122

4 50185.5

3923997.7042

7

-22682.4042

7 -1.02228429978.5714285

7 51148.4

4027608.5603

6

-13042.4603

6 -0.587816983 80.6122449 52138.7

4137470.0062

512394.5937

5 0.55861796882.6530612

2 54321.3

42 32507.145 21814.155 0.98315275184.6938775

5 54398.5

4315066.1936

7

-13979.3936

7 -0.63004408686.7346938

8 56007.344 35724.4005 -33672.1005 -1.517584258 88.7755102 58900

45 28775.6114

-6651.21140

2 -0.29976667890.8163265

3 61330.9

4640457.6991

5

-27302.8991

5 -1.23052762892.8571428

6 61995.8

47 30646.349

-9018.94900

2 -0.40647939594.8979591

8 85610.8

4863007.5597

133724.9402

9 1.5199657296.9387755

1 96732.5

4935551.4538

4

-16049.0538

4 -0.72332260598.9795918

4 97429.7

0 10 20 30 40 50 60 70 80

-40000

-20000

0

20000

40000

60000

80000

X Variable 1 Residual Plot

X Variable 1

Resid

uals

-30 -20 -10 0 10 20 30

-40000

-20000

0

20000

40000

60000

80000


X Variable 2

Resid

uals

0 20 40 60 80 100 1200

20000

40000

60000

80000

100000

120000


Sample Percentile

Y

Best Model with Second Outlier Removed

SUMMARY OUTPUT



9

R Square0.23119872

8Adjusted R Square

0.197029783


6Observations 48

ANOVA


e F

Regression 2 6091876477 30459382396.76634077

30.00269660

3

Residual 452025721513

7 450160336.4

Total 472634909161

4

Coefficients


Lower 95.0%

Intercept40315.7117

25341.93880

2 7.5470186411.58554E-

0929556.4946

851074.9287

629556.4946

X Variable 1

-576.626861

4254.449725

5 -2.2661720710.02829838

7

-1089.11491

6

-64.1388069

51089.11491

X Variable 2-

1186.07979503.224460

4 -2.3569597330.02283661

1

-2199.62588

1

-172.533698

62199.62588



130422.5663

223975.9336

8 1.1548739561.04166666

7 1086.8

237391.3277

6

-29804.0277

6 -1.435601879 3.125 1315.3

328717.5223

67435.17764

2 0.3581380035.20833333

3 1576.8

428887.2278

118879.7721

9 0.9094017987.29166666

7 2052.3

530618.5326

315659.9673

7 0.75431008 9.375 2872.5

634337.0637

78209.73623

3 0.3954469811.4583333

3 3203.2

721946.1514

6

-20369.3514

6 -0.98115192513.5416666

7 3499.6

822125.7047

313053.9952

7 0.628785488 15.625 5442.9

918487.2585

7

-6758.45856

8 -0.32554176617.7083333

3 5969.9

10 35607.0671 14578.4329 0.70221467519.7916666

7 6472.1

11 28480.3064

-490.906399

7 -0.023646004 21.875 7587.3

1237452.2730

524543.5269

5 1.18221381623.9583333

3 7918.1

1324840.0018

6

-10937.9018

6 -0.52685739626.0416666

7 10303.9

1427437.2370

7 2281.26293 0.109883985 28.125 10350.8

15 7250.59008 3100.20992 0.14933106430.2083333

3 11307.116 28587.6423

2-

25088.0423-1.20844206 32.2916666

711728.8

2

1734593.9753

617544.7246

4 0.845095161 34.375 12324.9

1821675.1193

4

-11371.2193

4 -0.54772945436.4583333

3 13154.8

19 37475.2513 48135.5487 2.31859548338.5416666

7 13902.1

2054777.0274

3

-30370.5274

3 -1.462889063 40.625 14337.2

21

-5093.18513

717418.0851

4 0.83899518442.7083333

3 14566.1

2238027.4756

8

-15408.4756

8 -0.74219621644.7916666

7 19502.4

23

-3487.79996

86690.99996

8 0.322292416 46.875 21627.4

2434025.7720

924874.2279

1 1.19814303648.9583333

3 22124.4

2533729.0350

7

-19391.8350

7 -0.93406686851.0416666

7 22619

2634870.1484

6

-29427.2484

6 -1.417453155 53.125 24406.5

2732348.4132

1

-26378.5132

1 -1.27060152555.2083333

3 27989.4

2834842.8975

216305.5024

8 0.78540424857.2916666

7 29718.5

2941169.5017

82793.19822

4 0.13454291 59.375 35179.7

3017760.1877

6

-11288.0877

6 -0.54372516861.4583333

3 36152.7

3138598.9615

6

-27291.8615

6 -1.31459573363.5416666

7 37206.2

3224388.1140

615827.5859

4 0.76238394 65.625 40215.7

3325865.3557

4

-22992.8557

4 -1.10752100967.7083333

3 42546.8

34 25004.1322 12202.0678 0.58774980469.7916666

7 43962.7

3523425.6856

632581.6143

4 1.569392806 71.875 46278.5

3631402.8319

929928.0680

1 1.44157665673.9583333

3 47767

37 18376.2059 -10458.1059 -0.50374656176.0416666

7 49864.638

23305.65345

-21990.3534

5

-1.059232429 78.125 50185.5

3926508.5030

2

-11942.4030

2

-0.57524225780.2083333

3

51148.4

4035574.2410

314290.3589

7 0.68833871582.2916666

7 52138.741 30858.0411 23463.2589 1.130179412 84.375 54321.342

15154.09483

-14067.2948

3

-0.67759415186.4583333

3

54398.5

4334037.0441

4

-31984.7441

4

-1.54064273388.5416666

7

56007.3

4427682.8766

1

-5558.47661

4

-0.267740976 90.625 58900

4538327.0611

6

-25172.2611

6

-1.21249871792.7083333

3

61330.9

4629494.8223

7

-7867.42236

5

-0.37895838894.7916666

7

61995.8

4759763.2693

936969.2306

1 1.780735723 96.875 85610.848

33834.51262

-14332.1126

2

-0.69034990898.9583333

3

96732.5

0 10 20 30 40 50 60 70 80

-40000-30000

-20000-10000

010000

200003000040000

5000060000


X Variable 1

Resid

uals

-30 -20 -10 0 10 20 30

-40000-30000-20000-10000

0100002000030000400005000060000


X Variable 2

Resid

uals

0 20 40 60 80 100 1200

20000

40000

60000

80000

100000

120000


Sample Percentile

Y

Date post:	20-Mar-2017
Category:	Documents
Upload:	austin-white
View:	7 times
Download:	1 times

BA240 Final Project

Documents