+ All Categories
Home > Documents > Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental...

Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental...

Date post: 12-Jun-2018
Category:
Upload: doanthu
View: 220 times
Download: 0 times
Share this document with a friend
37
Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are also unknown unknowns. There are things we don't know we don't know," Donald Rumsfeld
Transcript
Page 1: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Fundamental Probability and Statistics

"There are known knowns. These are things we know that we know. There areknown unknowns. That is to say, there are things that we know we don't know.But there are also unknown unknowns. There are things we don't know we don'tknow," Donald Rumsfeld

Page 2: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Probability Theory

Probability Space:

Reference: G.R. Grimmett and D.R. Stirzaker, Probability and RandomProcesses, Oxford Science Publications, 1997

Example: Toss possibly biased coin once

Take

Note: Fair coin if p = 1/2

Page 3: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Probability TheoryExample: Two coins tossed possibly multiple times and outcome is ordered pair

Let

Then

Definition: Events A and B are independent if

Page 4: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Random Variables and DistributionsDefinition:

Definition:

Definition:

Example:

Page 5: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Distributions and DensitiesDefinition:

Definition:

Definition:

PDF Properties:

Page 6: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Density PropertiesExample:

Example:

Page 7: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Density PropertiesAdditional Properties:

Page 8: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Multivariate Distributions

Note: Important for longitudinal data

Joint CDF:

Joint Density (if it exists):

Example:

Page 9: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Multivariate Distributions

Example:

Note:

Note:

Page 10: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Multivariate Distributions

Definition:

Definition: Marginal density function of X

Definition: X and Y are independent if and only if

or

Note:

Page 11: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Estimators and EstimatesDefinition: An estimator is a function or procedure for deriving an estimate fromobserved data. An estimator is a random variable whereas an estimate is a realnumber.

Example:

Page 12: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Other EstimatorsCommonly Employed Estimators:

• Maximum likelihood

• Bayes estimators

• Particle filter (Sequential Monte Carlo (SMC))

• Markov chain Monte Carlo (MCMC)

• Kalman filter

• Wiener filter

Page 13: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Linear RegressionConsider

Example:

Page 14: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Linear RegressionStatistical Model:

Assumptions:

Goals:

Page 15: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Least Squares ProblemMinimize

Note: General result for quadratic forms

Thus

where

Least Squares Estimate:

Least Squares Estimator:

Note:

Page 16: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Parameter Estimator Properties

Estimator Mean:

Estimator Covariance:

Page 17: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator PropertiesGoal:

Residual:

Variance Estimator:

Note:

Page 18: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator PropertiesNote:

Page 19: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator PropertiesNote:

Unbiased Estimator:

Unbiased Estimate:

Page 20: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Parameter Estimator PropertiesProperties of B:

Central Limit Theorem:

Page 21: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

ExampleExample: Consider the height-weight data from the 1975 World Almanac and Bookof Facts

164159154150146142139135132129126123120117115Weight(lbs)

727170696867666564636261605958Height(in)

Consider the model

Page 22: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

ExampleLeast Square Estimate:

Here

Note:

Note:

Page 23: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

ExampleVariance Estimate:

Parameter Covariance Estimate:

Note: This yields variances and standard deviations for parameter estimates

Goal: Can we additionally compute confidence intervals? Yes, but we needa little more statistics.

Page 24: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Example

Hypothesis: One way to check the hypothesis of iid is to plot the residuals

Page 25: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Random Variables Related to the Normal

Chi-Square Random Variables:

T Random Variables:

Page 26: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator Properties

Assumption:

Page 27: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator Properties

Page 28: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Variance Estimator Properties

Confidence Interval:

Page 29: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

ExamplePrevious Example:

Note:

Page 30: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Summary of Linear TheoryStatistical Model:

Assumptions:

Covariance Estimator and Estimate:

Least Squares Estimator and Estimate:

Variance Estimator and Estimate:

Page 31: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Summary of Linear TheoryStatistical Properties:

Page 32: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis TestingStatistical Testing:

• An objective of statistics is to make inferences about unknown populationparameters and models based on information in sample data.

• Inferences may be estimates of parameters or tests of hypotheses regardingtheir values.

Hypothesis Testing:• Largely originated with Ronald Fisher, Jerzy Neyman, Karl Pearson and EgonPearson

• Fisher: Agricultural statistician: emphasized rigorous experiments and designs

• Neyman: Emphasized mathematical rigor

• Early Paper: R. Fisher, ``Mathematics of a Lady Tasting Tea,’’ 1956

-- Question: Could lady determine means of tea preparation based on taste?

-- Null Hypothesis: Lady had no such ability

-- Fisher asserted that no alternative hypothesis was required

Page 33: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis TestingElements of Test:

Strategy:

Page 34: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis TestingElements of Test:

Definitions:• Test Statistic: Function of sample measurement upon which decision is made.

• Rejection Region: Value of test statistic for which null hypothesis is rejected.

Definitions:

Page 35: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis TestingExample: Adam is running for Student body president and thinks he will gainmore than 50% of the votes and hence win. His committee is very pragmaticand wants to test the hypothesis that he will receive less than 50% of thevote. Here we take

Page 36: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis TestingExample: Is this test equally protect us from erroneously concluding thatAdam is the winner when, in fact, he will lose? Suppose that he will reallywin 30% of the vote so that p = 0.3. What is the probability of a Type IIerror?

Note: The test using this rejection region protects Adam from Type I errors but notType II errors.

Page 37: Fundamental Probability and Statistics - ncsu.edursmith/MA797V_S12/Statistics.pdf · Fundamental Probability and Statistics "There are known knowns. These are things we know that

Hypothesis Testing

One Solution: Use a larger critical or rejection region.

Conclusion: This provides a better balance between Type I and Type II errors.

Question: How can we reduce both errors?


Recommended