CIVL 7012/8012
Simple Linear Regression
Lecture 2
Correlation
โข Correlation is the degree to which two continuous variables are
linearly associated.
โข This is most often represented by a scatterplot and the Pearson
correlation coefficient, denote by (๐).
โข The scatterplot provides a visual as to how the two continuous
variable are correlated.
โข The coefficient is a measure of the linear association between the
two variables.
Correlation
โข If there is no correlation between the two variables, the points will
form a horizontal or vertical line or complete randomness (no obvious
patterns).
โข Note that it does not matter which variable is on x-axis and which is
on the y-axis.
โข The pattern the two variables form determines the strength and
direction of their correlation.
Correlation
โข The stronger the correlation, the more
linearly distinct the pattern will be.
โข The coefficient is between -1 and 1.
+1 indicates a perfect positive correlation
-1 indicates a perfect negative correlation
0 indicates no correlation
โข No strict rules for interpretation, however,
as a guideline, it is suggested:
0 < |๐| < 0.3: weak correlation
0.3 < |๐| < 0.7: moderate correlation
|๐| > 0.7: strong correlation
Correlation
Snapshot from Multivariate Lecture 6
๐๐๐ is the correlation notation for the entire population.
Pearson correlation coefficient (๐) is for our sample representing
the population.
๐ = ๐ฅ๐ โ ๐ฅ ๐ฆ๐ โ ๐ฆ
๐ฅ๐ โ ๐ฅ 2 ๐ฆ๐ โ ๐ฆ 2
Correlation calculation
Meal
Bill ($)
Tip ($)
Bill deviations
Tip deviations
Deviations products
Bill deviations squared
Tip deviations squared
๐ฅ ๐ฆ ๐ฅ๐ โ ๐ฅ ๐ฆ๐ โ ๐ฆ (๐ฅ๐ โ ๐ฅ )(๐ฆ๐ โ ๐ฆ ) ๐ฅ๐ โ ๐ฅ 2 ๐ฆ๐ โ ๐ฆ 2
1 35 6 -37.5 -4 150 1406.25 16
2 110 18 37.5 8 300 1406.25 64
3 66 11 -6.5 1 -6.5 42.25 1
4 75 7 2.5 -3 -7.5 6.25 9
5 100 14 27.5 4 110 756.25 16
6 49 4 -23.5 -6 141 552.25 36
687 4169.5 142
๐ = ๐ฅ๐ โ ๐ฅ ๐ฆ๐ โ ๐ฆ
๐ฅ๐ โ ๐ฅ 2 ๐ฆ๐ โ ๐ฆ 2=
687
(4169.5)(142) = 0.892
Correlation significance test (t-test)
โข Is it statistically significant?
โข Conduct a t-test
โข ๐ป0: ๐ = 0 ๐ฃ๐ . ๐ป1: ๐ โ 0 ๐๐ก ๐ผ = 0.05
โข ๐ก = ๐๐โ2
1โ๐2, df=n-2
โข ๐ก = 0.8926โ2
1โ0.8922= 3.947
๐ = 0.892
Correlation significance test (t-test)
โข ๐ป0: ๐ = 0 ๐ฃ๐ . ๐ป1: ๐ โ 0 ๐๐ก ๐ผ = 0.05
โข ๐ก = ๐๐โ2
1โ๐2, df=n-2
โข ๐ก = 0.8926โ2
1โ0.8922= 3.947
โข ๐ก๐๐๐๐ > ๐ก๐๐๐๐ก. โโโ ๐๐๐๐๐๐ก ๐๐ข๐๐
SLR Lecture 1 Recap
Recap - Quick Review
โข SLR is a comparison of 2 models:
โข One is where the independent variable does not exist
โข And the other uses the best-fit regression line
โข If there is only one variable, the best prediction for other
values is the mean of the dependent variable.
โข The distance between the best-fit line and the observed
value is called residual (or error).
โข The residuals are squared and added together to
generate sum of squares residuals/error (SSE).
โข SLR is designed to find the best fitting line through the
data that minimizes the SSE.
Recap - Example
0
2
4
6
8
10
12
14
16
18
20
0 1 2 3 4 5 6 7
Tip
($
)
Meal #
Tips for service ($)
๐ฆ =10
Best-fit line
Meal # Tip ($)
1 6
2 18
3 11
4 7
5 14
6 4
0
2
4
6
8
10
12
14
16
18
20
0 1 2 3 4 5 6 7
Tips for service ($)
16 1
16
64
9 36
Recap - Residuals (Errors)
+8
+1
โ3
+4
โ6 Squared Residuals (Errors)
# Residual Residual2
1 โ4 16
2 +8 64
3 +1 1
4 โ3 9
5 +4 16
6 โ6 36
Sum of squared errors (SSE)
= 142
๐น๐๐๐๐ ๐๐๐๐๐ = ๐๐๐
โ4
Recap โ Population vs. Sample Eq.
โข If we knew our โpopulationโ parameters, ๐ฝ0, ๐ฝ1, then we could use the SLR eq. as is.
โข In reality, we almost never have the population parameters. Therefore we have to estimate them using sample data. With sample data, SLR eq. changes a bit.
โข Where ๐ฆ โy-hatโ is the point estimator of ๐ธ ๐ฆ .
โข Or, ๐ฆ is the mean value of ๐ฆ for a given ๐ฅ.
๐ธ ๐ฆ = ๐ฝ0 + ๐ฝ1๐ฅ
๐ฆ = ๐0 + ๐1๐ฅ
Recap โ OLS criterion
๐ฆ๐ = observed value of dependent variable (tip amount).
๐ฆ ๐ =estimated (predicted) value of the dependent variable
(predicted tip amount based on regression model).
min ๐ฆ๐ โ ๐ฆ ๐2
0
5
10
15
20
0 50 100 150
observed
predicted
Recap - SLR parameter equations
๐ฆ ๐ = ๐0 + ๐1๐ฅ
๐1 = ๐ฅ๐ โ ๐ฅ ๐ฆ๐ โ ๐ฆ
๐ฅ๐ โ ๐ฅ 2
slope
๐ฅ = mean of the independent variable ($
bill)
๐ฆ = mean of the dependent variable ($ tip)
๐ฅ๐ = value of the independent variable
๐ฆ๐ = value of the dependent variable
๐0 = ๐ฆ โ ๐1๐ฅ
intercept
Recap - OLS Calculations
Meal Bill ($) Tip ($) Bill deviations
(๐๐ฅ) Tip deviations Deviations products
Bill deviations squared ๐๐ฅ
2
๐ฅ ๐ฆ ๐ฅ๐ โ ๐ฅ ๐ฆ๐ โ ๐ฆ (๐ฅ๐ โ ๐ฅ )(๐ฆ๐ โ ๐ฆ ) ๐ฅ๐ โ ๐ฅ 2
1 35 6 -37.5 -4 150 1406.25
2 110 18 37.5 8 300 1406.25
3 66 11 -6.5 1 -6.5 42.25
4 75 7 2.5 -3 -7.5 6.25
5 100 14 27.5 4 110 756.25
6 49 4 -23.5 -6 141 552.25
๐ฅ = 72.5 ๐ฆ = 10 687 4169.5
Recap - OLS Calculations
Deviations products Bill deviations squared
(๐๐ โ ๐ )(๐๐ โ ๐ ) ๐๐ โ ๐ ๐
150 1406.25
300 1406.25
-6.5 42.25
-7.5 6.25
110 756.25
141 552.25
๐๐๐ ๐๐๐๐. ๐
๐๐ = ๐๐ โ ๐ ๐๐ โ ๐
๐๐ โ ๐ ๐
๐๐ =๐๐๐
๐๐๐๐. ๐
๐๐ = ๐. ๐๐๐๐
Recap - OLS Calculations
๐๐ = ๐๐ โ ๐. ๐๐๐๐(๐๐. ๐)
๐๐ = ๐. ๐๐๐๐
๐๐ = ๐ + ๐๐๐
Bill ($) Tip ($)
๐ ๐
35 6
110 18
66 11
75 7
100 14
49 4
๐ฅ = 72.5 ๐ฆ = 10
๐๐ = ๐๐ โ ๐๐. ๐๐๐๐
๐๐ = โ๐. ๐๐๐๐
Recap โ New Best-Fit Line & Parameters
๐ฆ ๐ = ๐0 + ๐1๐ฅ
๐ฆ ๐ = โ1.9457 +0.1648๐ฅ
๐0 = โ1.9457
intercept
๐1 = 0.1648
slope
๐ฆ ๐ = 0.1648๐ฅ โ 1.9457
OR
Recap - Final SLR line
0
2
4
6
8
10
12
14
16
18
20
0 20 40 60 80 100 120
Tip
($
)
Bill ($)
Bill vs. Tip Amount ($)
๐ ฬ_๐ =โ๐.๐๐๐๐ +๐.๐๐๐๐๐
๐๐=โ๐.๐๐๐๐
๐๐๐๐๐ ๐๐ = ๐. ๐๐๐๐
Recap - SLR Model Interpretation
๐ฆ ๐ = โ1.9457 +0.1648๐ฅ
For every $1 the bill amount (๐ฅ) increases, we would expect the tip
amount to also increase by $0.1648 or
about 16 cents (positive coefficient).
If the bill amount (๐ฅ) is zero, then the
expected/predicted tip amount is $-
1.9457 or negative $1.95!
Does this make any sense? NO In real
world problems, the intercept may or
may not make sense.
SLR โ Lecture 2
0
2
4
6
8
10
12
14
16
18
20
0 50 100 150
Bills vs Tips ($)
0
5
10
15
20
0 1 2 3 4 5 6 7
Tips ($)
Model fit and Coefficient of Determination
๐บ๐บ๐ฌ = ๐๐๐
๐บ๐บ๐ฌ = ๐บ๐บ๐ป
With only the DV, the only sum
of squares is due to error.
Therefore, it is also the total,
and MAX sum of squares for
this data sample. ๐บ๐บ๐ป = ๐๐๐
With both the IV and DV, SST
remains the same. But the SSE
is reduced significantly. The
difference between the SSE
and SST is due to regression
(SSR).
๐บ๐บ๐ป = ๐๐๐
๐บ๐บ๐ฌ = ?
๐บ๐บ๐ป โ ๐บ๐บ๐ฌ = ๐บ๐บ๐น
Estimate regression values
Meal Bill ($) Tip ($) ๐ ๐ = โ๐. ๐๐๐๐ +๐. ๐๐๐๐๐ ๐ ๐ (predicted tip $)
๐ฅ๐ ๐ฆ๐
1 35 6 ๐ฆ ๐ = โ1.9457 +0.1648(35) 3.8212
2 110 18 ๐ฆ ๐ = โ1.9457 +0.1648(110) 16.1788
3 66 11 ๐ฆ ๐ = โ1.9457 +0.1648(66) 8.9290
4 75 7 ๐ฆ ๐ = โ1.9457 +0.1648(75) 10.4119
5 100 14 ๐ฆ ๐ = โ1.9457 +0.1648(100) 14.5311
6 49 4 ๐ฆ ๐ = โ1.9457 +0.1648(49) 6.1280
๐ฅ = 72.5 ๐ฆ = 10
min ๐ฆ๐ โ ๐ฆ ๐2
Regression errors (residuals)
Meal Bill ($) Tip ($) ๐ ๐ (predicted tip $) Error (๐ โ ๐ ๐)
๐ฅ ๐ฆ (observed-predicted)
1 35 6 3.8212 6 โ 3.8212 = 2.1788
2 110 18 16.1788 18 โ 16.1788 = 1.8212
3 66 11 8.9290 11 โ 8.9290 = 2.0710
4 75 7 10.4119 7 โ 10.4119 = -3.4119
5 100 14 14.5311 14 โ 14.5311 = -0.5311
6 49 4 6.1280 4 โ 6.1280 = -2.1280
๐ฅ = 72.5 ๐ฆ = 10
Meal Bill ($) Tip ($) ๐ ๐ (predicted tip $) Error (๐ โ ๐ ๐) (๐ โ ๐ ๐)๐
๐ฅ ๐ฆ
1 35 6 3.8212 2.1788 4.7472
2 110 18 16.1788 1.8212 3.3168
3 66 11 8.9290 2.0710 4.2890
4 75 7 10.4119 -3.4119 11.6412
5 100 14 14.5311 -0.5311 0.2821
6 49 4 6.1280 -2.1280 4.5282
Regression errors (residuals) - SSE
๐ฅ = 72.5 ๐ฆ = 10 ๐๐๐ธ = 28.8044
SSE comparison
Sum of squared error (SSE) Comparison
D.V. (tip $) ONLY
+ + + + + = SSE = 28.8044
16 1 16 64 9 36 + + + + + = SSE = 142
D.V. & I.V (tip $ as a function of bill $)
Comparison of two lines
โข When we conducted the regression, the SSE decreased
from 142 to 28.8044.
โข 28.8044 was explained by (allocated to) ERROR.
โข What happen to the difference (113.1956)?
โข 113.1956 is the sum of squares due to REGRESSION
(SSR).
โข ๐๐๐ = ๐๐๐ + ๐๐๐ธ
โข In this case:
142 = 113.1956 + 28.8044
0
2
4
6
8
10
12
14
16
18
20
0 50 100 150
Bills vs Tips ($)
0
5
10
15
20
0 1 2 3 4 5 6 7
Tips ($)
Comparison of two lines
๐บ๐บ๐ฌ = ๐๐๐
๐บ๐บ๐ฌ = ๐บ๐บ๐ป
๐บ๐บ๐ป = ๐๐๐
๐บ๐บ๐ป = ๐๐๐
๐บ๐บ๐ฌ = ๐๐. ๐๐๐๐
๐บ๐บ๐ป โ ๐บ๐บ๐ฌ = ๐บ๐บ๐น = ๐๐๐. ๐๐๐๐
Coefficient of Determination (๐2)
โข How well does the estimated regression equation fit our
data?
โข This is where regression starts to look a lot like ANOVA,
where the SST is partitioned into SSE & SSR.
โข The larger the SSR the smaller the SSE.
โข The Coefficient of Determination quantifies this ratio as a
percentage (%).
SSE
SST
SSR
๐ถ๐๐๐๐๐๐๐๐๐๐ก ๐๐ ๐ท๐๐ก๐๐๐๐๐๐๐ก๐๐๐ = ๐2 =๐๐๐
๐๐๐
Coefficient of Determination (๐2)
โข How well does the estimated regression equation fit our
data?
โข This is where regression starts to look a lot like ANOVA,
where the SST is partitioned into SSE & SSR.
โข The larger the SSR the smaller the SSE.
โข The Coefficient of Determination quantifies this ratio as a
percentage (%).
SSE
SST
SSR
ANOVA
df SS MS F Significance F
Regression 1 113.1956 113.1956 15.7192 0.016611541
Residual 4 28.80441 7.201103
Total 5 142
๐2 Interpretation
โข ๐ถ๐๐๐๐๐๐๐๐๐๐ก ๐๐ ๐ท๐๐ก๐๐๐๐๐๐๐ก๐๐๐ = ๐2 =๐๐๐
๐๐๐
โข ๐ถ๐๐๐๐๐๐๐๐๐๐ก ๐๐ ๐ท๐๐ก๐๐๐๐๐๐๐ก๐๐๐ = ๐2 =113.1956
142
โข ๐ถ๐๐๐๐๐๐๐๐๐๐ก ๐๐ ๐ท๐๐ก๐๐๐๐๐๐๐ก๐๐๐ = ๐2 = 0.7972 ๐๐ 79.72%
โข We can conclude that 79.72% of the total sum of squares
can be explained using the estimates from the regression
equation to predict the tip amount. And that the remainder
(20.28%) is error.
โข This is a โGood fitโ!
0
2
4
6
8
10
12
14
16
18
20
30 40 50 60 70 80 90 100 110
Tip
($
)
Bill ($)
3 squared differences
๐ ๐ = โ๐. ๐๐๐๐ +๐. ๐๐๐๐๐
Bills vs. Tips ($)
๐ = ๐๐
SSE= (๐ฆ๐ โ ๐ฆ ๐)2
SST= (๐ฆ๐ โ ๐ฆ )2
SSR= (๐ฆ ๐ โ ๐ฆ )2
Model fit
๐ฆ ๐ = โ1.9457 +0.1648๐ฅ
Questions:
โข Once a regression line is calculated, how much better is it than only
using the mean of the dependent variable line alone? (coefficient of
determination (๐2)
โข How confident are we in the significance of the relationship between x
and y? (t-test of slope)
Regression with Excel
โข Produce SLR model in Excel.
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.892834
R Square 0.797152
Adjusted R Square 0.74644
Standard Error 2.683487
Observations 6
ANOVA
df SS MS F Significance F
Regression 1 113.1956 113.1956 15.7192 0.016611541
Residual 4 28.80441 7.201103
Total 5 142
Coefficien
ts Standard
Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept -1.94568 3.205964 -0.60689 0.576683 -10.84685887 6.955504991 -10.84685887 6.955504991
X Variable 1 0.164768 0.041558 3.964745 0.016612 0.049383684 0.280152232 0.049383684 0.280152232
Testing slope -1
โข Is the relationship between ๐ฆ and ๐ฅ significant?
โข Test the slope ๐ฝ1. (two-tailed t-test)
โข Remember ๐1is for our sample and ๐ฝ1 is for the population
โข We will use our sample slope ๐1 to test if the true slope of
the population ๐ฝ1 is significantly different than 0.
๐ฆ ๐ = โ1.9457 +0.1648๐ฅ
Testing slope -2
Steps to conduct a t-test on slope ๐ฝ1:
โข Step 1: Specify hypothesis:
โข ๐ป0: ๐ฝ1 = 0 ๐ฃ๐ . ๐ป1: ๐ฝ1 โ 0 ๐๐ก ๐ผ = 0.05
โข Step 2: Determine the test statistic:
๐ก =๐1โ๐ฝ1
๐๐ธ๐1
โข where ๐ฝ1 is true coefficient for all population
โข where ๐๐ธ๐1 =๐๐๐ธ๐โ2
(๐ฅโ๐ฅ )2
= standard error of the slope ๐1
Testing slope -3
โข Step 2 calculation:
โข ๐๐ธ๐1 =๐๐๐ธ๐โ2
(๐ฅโ๐ฅ )2
=28.8044(6โ2)
4169.5
= 0.0416
โข ๐ก =๐1โ๐ฝ1
๐๐ธ๐1=
0.1648โ0
0.0416= 3.9615
โข Step 3: Quantify the evidence of the test
โข Method 1: Critical value method
โข Compare calculated t to critical t
โข ยฑ๐ก1โ๐ผ
2,๐โ2 = ยฑ๐ก0.975,4
๐ฆ ๐ = โ1.9457 +0.1648๐ฅ
Testing slope -4
โข Step 3: Quantify the evidence of the test
โข Method 1: Critical value method
โข Compare calculated ๐ก to critical ๐ก (remember ๐ผ = 0.05)
โข ยฑ๐ก1โ๐ผ
2,๐โ2 = ยฑ๐ก0.975,4 = 2.776
Testing slope -5
โข Step 3: Method 1: Critical value method
โข Compare calculated ๐ก to critical ๐ก (remember ๐ผ = 0.05)
โข ๐ก๐๐๐๐๐ข๐๐๐ก๐๐ = 3.9615 > ๐ก๐๐๐๐ก๐๐๐๐ = 2.776
โข T calc is in the critical region so Reject null hypothesis ๐ป0: ๐ฝ1 = 0
meaning that our ๐ฝ1 โ 0 and we do have a statistically significant
relationship between ๐ฅ and ๐ฆ. .
0.95
0.025 0.025
Testing slope -6
โข Step 3: Method 2: p-value method
โข Compare calculated/estimated ๐ value to desired significance
level. (remember ๐ผ = 0.05)
โข ๐๐๐๐๐๐ข๐๐๐ก๐๐/๐๐ ๐ก๐๐๐๐ก๐๐ = 2๐ ๐ก > ๐๐๐๐๐ข๐ก๐๐ ๐ก = 2๐(๐ก > 3.9615) โ
0.03
โข ๐ ๐ฃ๐๐๐ข๐ ๐๐ 0.03 < ๐ผ = 0.05, therefore reject null hypothesis
๐ป0: ๐ฝ1 = 0 meaning that our ๐ฝ1 โ 0 and we do have a statistically
significant relationship between ๐ฅ and ๐ฆ. .
SLR Example with R
โข Start R session
โข Import dataset โairqualityโ included in R base
โข Explore and plot data
โข Run a simple linear regression model with
โOzoneโ as a DV (๐ฆ)
โTempโ as an IV (๐ฅ)
โข Follow in R session and model results are as follows:
SLR Example with R
โข Dataset = airquality ----> 153 obs. of 6 variables
โข Start R session and follow instructions in code
โข Use simple linear regression to predict ozone levels โOzoneโ based on the
temperature โTempโ.
ID Ozone Solar.R Wind Temp Month Day
1 41 190 7.4 67 5 1
2 36 118 8 72 5 2
3 12 149 12.6 74 5 3
4 18 313 11.5 62 5 4
5 NA NA 14.3 56 5 5
6 28 NA 14.9 66 5 6
7 23 299 8.6 65 5 7
8 19 99 13.8 59 5 8
9 8 19 20.1 61 5 9
10 NA 194 8.6 69 5 10
Step 1: scatter plot
Ozone Temp
41 67
36 72
12 74
18 62
NA 56
28 66
23 65
19 59
8 61
NA 69
STEP 3: CORRELATION (Ozone vs Temp)
โข What is the correlation coefficient (r) for Ozone vs. Temp? (see R session)
In this case, ๐ = .698
โข Is the relationship strong?
MODERATE! --------> RUN MODEL see R session
Model results (model m1)
โข ๐ฆ = ๐ฝ0 + ๐ฝ1๐ฅ
โข ๐ฝ0 = โ146.996 (Intercept) ๐ฝ1 = +2.429 (Slope)
โข Regression line for this model ---> ๐ฆ = โ146.996 +2.429(๐ฅ)
Results interpretation (model m1) -1
Residuals:
โข Residuals are the differences between the actual observed response values
(distance to Ozone levels in our case) and the response values that the
model predicted.
โข The โResidualsโ section of the model output breaks it down into 5 summary
points to assess how well the model fit the data.
โข A good fit model will show symmetry from the min to max around the mean
value (0).
โข We do not have a very good symmetry here.
โข So, the model is predicting certain points that fall far away from the actual
observed points.
Results interpretation (model m1) -2
Model Coefficients:
โข ๐ฝ0 = โ146.996 (๐ฆ โ ๐ผ๐๐ก๐๐๐๐๐๐ก)
No interpretational meaning; but it is the Ozone level value when Temp = 0
โข ๐ฝ1 = +2.429 (๐๐๐๐๐)
For every 1 degree โ the temperature increases (๐ฅ), it is expected that the
Ozone level to also increase by 2.429 units.
โข ๐ ๐ก๐. ๐๐๐๐๐ = 0.2331
We can say that Ozone level/units can vary by 0.2331.
โข t-value for โTempโ = ๐๐๐๐๐๐๐๐๐๐๐ก
๐ ๐ก๐. ๐๐๐๐๐ =
2.429
0.233 = 10.418
t-value is significant Pr (> |๐ก|) = 2๐โ16 ; which is significant at any level of
significance (you could say at 99.99% level of confidence or 0.001).
Results interpretation (model m1) -3
โข Residual Standard Error = 23.71 on 114 degrees of freedom
โข The Residual Standard Error is the average amount that the response
โOzoneโ will deviate from the true regression line.
โข In our example, the actual Ozone level can deviate from the true regression
line by approximately 23.71 units, on average.
โข Degrees of freedom are the actual number of data points (observations)
minus 2 (taking into account the parameters for the โinterceptโ and the
โOzoneโ variables).
So, we started the model with 153 data point in the โairqualityโ dataset
We removed 37 data points that were N/Aโs
We are left with 116 data points
116 data points will lead to (116-2 parameters) = 114 DF
Results interpretation (model m1) -4
โข ๐ -squared = 0.4877 (๐ 2 = coefficient of determination)
๐ 2 varies from 0 ๐ก๐ 1; in this case, 48.77% of (๐ฆ) is explained by (๐ฅ)
โข Adjusted ๐ 2 = 0.4832
Adjusted ๐ 2 accounts for how many independent variables entered the
model. Typically lower than ๐ 2 based on how much contribution
additional independent variables (๐ฅโ๐ )added to explaining (๐ฆ)
A sharp drop in the adjusted ๐ 2 versus ๐ 2 indicates a bad model.
๐ญ-Test (F-value is used for measuring the overall model significance).
โข At the desired level of significance (say 95%), the statistical significance of
the ๐น-test will show how good of a model this is.
โข In this model, the ๐น-statistic = 108.5 on 1 variable with 114
โข The ๐น-statistic level of significance is Pr (> ๐น) = 2.2๐โ16; that is the ๐น-statistic
is significant at any reasonable level of significance (or you could say @
99.99%).
SLR โ R code