+ All Categories
Home > Documents > How to Do a Regression in Excel 2007

How to Do a Regression in Excel 2007

Date post: 04-Apr-2018
Category:
Upload: minh-thang
View: 221 times
Download: 0 times
Share this document with a friend

of 10

Transcript
  • 7/29/2019 How to Do a Regression in Excel 2007

    1/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    1 | P a g e

    How to do a regression in Excel 2007+

    The following procedure can be followed to do a regression in Excel using office

    2007 or 2010.

    Step 1 : Input your data.

    Step 2 : Click on Data - Click on the Data Analysis Tab

    Just enter

    your data.

  • 7/29/2019 How to Do a Regression in Excel 2007

    2/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    2 | P a g e

    i. Click on the Office button and Select Excel Options

    ii. Click on the Excel options and Select Add-ins

  • 7/29/2019 How to Do a Regression in Excel 2007

    3/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    3 | P a g e

    iii. Select Analysis Tool Pak or The first Add-in which appear on your version.

    You may have Analysis ToolPak or Analysis ToolPak-VBA from the Add-In Window

    iv. Click on Go and Select activate Analysis ToolPak

  • 7/29/2019 How to Do a Regression in Excel 2007

    4/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    4 | P a g e

    Step 3 : Start your regression by clicking on the Data Analysis Tab and SelectRegression from the Menu which appears.

    Step 4 : Input your Dependent Variable in (Y) and your Independent Variable in X. For this exampleour dependent Variable is Sales and our Independent Variable is Advertising.

  • 7/29/2019 How to Do a Regression in Excel 2007

    5/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    5 | P a g e

    Click on Labels, Confidence Level and Line Fit Plots etc :

    Step 5 : Click on Ok Interpret your results and Add Trend Line by Clicking on the

    Scatter plots.

  • 7/29/2019 How to Do a Regression in Excel 2007

    6/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    6 | P a g e

    SUMMARY OUTPUT

    Regression Statistics

    Multiple R 0.640163

    R Square 0.409808

    Adjusted R Square 0.26226

    Standard Error 0.666976

    Observations 6

    ANOVA

    df SS MS F Significance F

    Regression 1 1.235571 1.235571 2.777456647 0.170928

    Residual 4 1.779429 0.444857

    Total 5 3.015

    CoefficientsStandard

    Error t Stat P-value Lower 95%Upper95%

    Intercept 6.890714 1.265289 5.445961 0.005521213 3.377709 10.40372

    Advert Exp ('000) 0.006643 0.003986 1.66657 0.170928062 -0.00442 0.01771

    The Regression has three components:

    1. Regression Statistics Table2. ANOVA Table3. Regression Coefficient Table

    Regression Statistics Table:

    SUMMARYOUTPUT

    Regression Statistics

    Multiple R 0.64016256

    R Square 0.4098081

    Adjusted R Square 0.26226013

    Standard Error 0.66697612

    Observations 6

    Explanation :

    The above gives the goodness-of-fit measures :R2 = 40.98%

  • 7/29/2019 How to Do a Regression in Excel 2007

    7/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    7 | P a g e

    The correlation between the dependent variable (Sales) and Independent Variable (AdvertisingExpenditure is 0.64-Multiple R. When this is squared we get R2. There 0< R2 < 1. The more closer weare to 1 the more the variation is explained.

    Therefore R Square = 0.409 or 40.98% means that 40.98% of the variations in sales is explained by theAdvertising. When there is more than more than one independent variable it is better to use theAdjusted R square.

    The standard error of the regression is 0.66697

    Analysis of the Variance : ANOVA

    ANOVA

    df SS MS F Significance F

    Regression 1 1.235571 1.235571 2.777456647 0.170928

    Residual 4 1.779429 0.444857

    Total 5 3.015

    This splits the sum of squares into its components:SS, MS will be discussed later

    8

    This splits the sum of squares into its components:SS, MS will be discussed later

    Statistical Significance of the regression as whole.Here we use the F statistic. From the above the F statistic is 2.777 with a probabilityof 0.1709. It is normally the same as the p-value when only one independent variableis used. This means the entire regression is not statistically significant at the 95%confidence level (5% level of significance)

    3. Regression Coefficient Table

    CoefficientsStandard

    Error t Stat P-valueLower95%

    Upper95%

    Intercept 6.890714286 1.265288948 5.445961016 0.0055212 3.377709 10.40372Advert Exp('000) 0.006642857 0.003985945 1.666570325 0.1709281

    -0.0044239 0.0177096

    a. Regression Equation :Sales = 6.890 + 0.0066Advert [Using the Coefficient Column]

    This should be explained under the following headings :

    Size and Sign :

  • 7/29/2019 How to Do a Regression in Excel 2007

    8/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    8 | P a g e

    Sign :From the very beginning we expect Sales to be related to advertising. It is seenfrom the equation that sales is positively related to advertising (If we needmore sales we need to do advert). This can also be supported with theMultiple R which is the same as the correlation.

    Size :If there is no advert (i.e Advert = 0) then Sales will be 6.890 (m). Sales willrise by 0.0066 (000,000) or 6600 for a unit increase in advertising (ie 1000).

    b. Statistical Significance of Coefficients :This can be done using the p-values or the critical t values. If we performregression at the 95% level or at the 5% percent level of significance then thefollowing applies :

    If p < 0.05 : Statistically significant ; p > 0.05 : Statistically not significant.

    From the regression above the p>0.05 which means that the relationship foundbetween advertising and sales is not statistically significant.

    Example 2 Multiple Regression

    A car dealer believes the number of cars sold each month is related to the number of

    years of sales experience and the age of the sales person. The data for a randomsample of 10 sales persons are as follows:

    Y X1 X2

    17 2 23

    23 6 33

    20 8 30

    18 11 35

    19 4 24

    22 7 49

    21 7 36

    28 14 40

    26 12 46

    12 3 51

    Y = number of cars sold per month, X1 = Number of years of sales experience and X2=Ageof Sales person (Years).

    Required :Using a multiple regression analysis and use your output to perform the following :

  • 7/29/2019 How to Do a Regression in Excel 2007

    9/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    9 | P a g e

    i. What is the estimated regression Equation?

    ii. Interpret the coefficient s on X1 and X2.

    iii. State whether or not there is a significant relationship between sales and bothexplanatory variables, taken together, at the 5% level of significance.

    iv. State whether or not the explanatory variables X1 and X2 are statisticallysignificant at the 5% level . Be sure to state the null and alternative hypothesisand on what your conclusions are based (i.e. the test procedure used).

    Example 3

    MonthsSales(000) Price ()

    Advert Exp() Mean Daily Hours

    January 75 6.8 2 2.4

    February 90 6.5 5 4

    March 148 6 6 5.2

    April 183 3.5 7 6.8

    May 242 3 22 8

    June 263 2.9 25 8.4July 278 2.6 28 10.4

    August 318 2.1 30 11.5

    September 256 3.1 22 9.6

    October 200 3.6 18 6.1

    November 140 4.2 10 3.4

    December 80 5.2 2 2

    Perform a regression analysis and Discuss under the following headings:

    a. Explanatory power of the regressionb. The regression Equationc. The Statistical Significance of the Independent Variable

  • 7/29/2019 How to Do a Regression in Excel 2007

    10/10

    David Acquaye BSc. MBA, PhD-St. (2010/11)

    10 | P a g e

    When the trend line is added, this is what we get:

    Reference:

    Fleming M.C and Nellis, J.G (1997) Statistics for Business, Prentice Hall, UK

    Curwin, J and Slater, R (2004), Quantitative Methods for Business Decisions, 5th Edition,

    Thomson, UK

    y = 0.0066x + 6.8907

    0

    2

    4

    6

    8

    10

    12

    0 100 200 300 400 500

    Sa

    les('m)

    Advert ('000)

    Regression Model for Sales on Advertising

    Sales

    Predicted Sales

    Linear (Predicted Sales)


Recommended