+ All Categories
Home > Documents > Sample Size & Power Estimation

Sample Size & Power Estimation

Date post: 25-Feb-2016
Category:
Upload: anika
View: 71 times
Download: 3 times
Share this document with a friend
Description:
Sample Size & Power Estimation. Computing for Research April 21, 2014. General Comments. Can consume much of a collaborative biostatistician’s time Really only relevant in the context of hypothesis testing and in estimation of precision - PowerPoint PPT Presentation
Popular Tags:
26
Sample Size & Power Estimation Computing for Research April 21, 2014
Transcript
Page 1: Sample Size & Power Estimation

Sample Size & Power Estimation

Computing for ResearchApril 21, 2014

Page 2: Sample Size & Power Estimation

General Comments

• Can consume much of a collaborative biostatistician’s time

• Really only relevant in the context of hypothesis testing and in estimation of precision

• If there are multiple Aims within a proposal, make sure that each is properly powered.

• It can be helpful to perform computations in two or more different software programs.

Page 3: Sample Size & Power Estimation

More General Comments

• Can be somewhat of an art form• Before proposing a sample size, get a sense

from the other investigators what sample sizes are even feasible (know resource limitations).

• Make sure you understand the hypotheses that are to be tested.

• Make sure you understand the study design.

Page 4: Sample Size & Power Estimation

More General Comments• A well-written sample size estimation section in a grant can

convince the reviewers that you know what you’re doing.

• A poorly-written sample size estimation section in a grant can convince the reviewers that you don’t know what you’re doing.

• Sometimes PIs will calculate a sample size on their own. Double check these, and make sure their rationale is sound. Don’t be afraid to ask how they arrived at their estimate.

Page 5: Sample Size & Power Estimation

Understanding the Term “Effect Size”

• In a very general sense, this is the magnitude of the summary statistic you plan to use for your hypothesis test– Difference in means– Difference in proportions– Odds ratio, Risk ratio– Correlation

Page 6: Sample Size & Power Estimation

Understanding the Term “Effect Size”

• Often this refers to Cohen’s D:– Small: 0.2– Medium: 0.5– Large: >0.8

• An effect size of 1 is equivalent of a 1 standard deviation unit difference between groups.

• Can be helpful when trying to justify a sample size when little pilot data exist.

• Ex. “With 20 subjects per group, we’ll be able to detect an effect size of 0.9 (i.e. a large effect) with 80% power, assuming 2-sided hypothesis testing and an alpha level of 0.05.”

21

Page 7: Sample Size & Power Estimation

Software

• Free (online, downloadable) – careful!• Moderately priced• Expensive

Page 8: Sample Size & Power Estimation

Sample Size Survey Results*

• 14 Faculty – PhD• 7 Faculty – RA• 9 Students

• * paul nietert’s results from 2011

Page 9: Sample Size & Power Estimation

Sample size software used

CRAB/online calculators

EAST

Gpower

Minitab

None

nQuery

Own calculations

Own simulations

PASS

Power and Precision

Power(Piantadosi)

PROC POWER

PS

R package/library

StatCalc

0 2 4 6 8 10 12 14 16 18

Student RA PhD Total

Page 10: Sample Size & Power Estimation

Examples

• GPower• R• Stata• Power and Precision• Power (Simon two-stage design)• Simulation (simple independent sample T-test

example)

Page 11: Sample Size & Power Estimation

Two sample t-test

• A scientist wants to compare tumor size at 12 weeks in two groups of mice. Based on preliminary data, the expects the average tumor size to be 400mm2 in the control and states that a 50% decrease in mean tumor size would be relevant.

• Based on the preliminary data, the SD of tumor size is estimated to be 150 mm2

• What is the effect size?• Assume 80% power, alpha of 0.05. • What is the required sample size?

Page 12: Sample Size & Power Estimation

. sampsi 400 200, sd(150)

Estimated sample size for two-sample comparison of means

Test Ho: m1 = m2, where m1 is the mean in population 1 and m2 is the mean in population 2Assumptions:

alpha = 0.0500 (two-sided) power = 0.9000 m1 = 400 m2 = 200 sd1 = 150 sd2 = 150 n2/n1 = 1.00

Estimated required sample sizes:

n1 = 12 n2 = 12

Page 13: Sample Size & Power Estimation

n Number of observations (per group)delta True difference in meanssd Standard deviationsig.level Significance level (Type I error probability)power Power of test (1 minus Type II error

probability)type Type of t testalternative One- or two-sided teststrict Use strict interpretation in two-sided case

R: Power calculations for one and two sample t tests

Description Compute power of test, or determine parameters to obtain target power.

power.t.test(n = NULL, delta = NULL, sd = 1, sig.level = 0.05, power =

NULL, type = c("two.sample", "one.sample", "paired"), alternative = c("two.sided", "one.sided"), strict =

FALSE)

Page 14: Sample Size & Power Estimation

One sample test of proportion

• A clinical trial is being planned in a cancer patient population.

• The standard of care response rate is 0.20. • The new treatment would be considered worth

further study if the response rate were 0.40.• What is the needed sample size for a one arm

trial to detect this response rate with 90% power using a one-sided alpha of 0.10?

Page 15: Sample Size & Power Estimation

. sampsi 0.20 0.40, onesample

Estimated sample size for one-sample comparison of proportion to hypothesized value

Test Ho: p = 0.2000, where p is the proportion in the population

Assumptions:

alpha = 0.0500 (two-sided) power = 0.9000 alternative p = 0.4000

Estimated required sample size:

n = 50

Page 16: Sample Size & Power Estimation

Simon two-stage design

• A study design appropriate for a binary endpoint that is quick to evaluate in a single arm study.

• Ethical it is appropriate to consider early stopping for futility.

• It is quite rare to NOT include interim analyses to consider stopping for ethical reasons.

• Simon, Controlled Clinical Trials, 1989.

Page 17: Sample Size & Power Estimation

Two-Stage Designs• What if by the 15th patient you’ve seen no responses?• Is it worth proceeding?• Maybe you should have considered a design with an early stopping rule• Two-stage designs:

Stage 1: enroll N1 patients

X1 or more respond

Stage 2: Enroll an additional N2 patients Stop trial

Fewer than X1 respond

Page 18: Sample Size & Power Estimation

Example• An investigator wishes to investigate a 2-way

interaction between 2 risk factors (RF) for a disease. – The prevalence of RF1 and RF2 is 20% and 30%,

respectively. 5% have both RF1 and RF2.– The baseline rate of developing disease within a year is

known to be 10% (no RFs). – The RR of developing disease within 1 year associated

with each of the RFs is 1.5, but the hypothesis is that if both RF1 and RF2 are present, the RR is 5.0.

– How many subjects are needed to detect this interaction effect?

Page 19: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for Prevalence Estimates

RF2

RF1 + - Total

+ 5% 20%

-

Total 30% 100%

Page 20: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for Prevalence Estimates

RF2

RF1 + - Total

+ 5% 15% 20%

- 25% 55% 80%

Total 30% 70% 100%

Page 21: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for RR Estimates

RF2

RF1 + - Total

+ 5%(Risk = 50%)

15% 20%

- 25% 55%(Risk = 10%)

80%

Total 30% 70% 100%

Page 22: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for RR Estimates

RF2

RF1 + - Total

+ 5%(Risk = 50%)

15% 20%(Risk = 15%)

- 25% 55%(Risk = 10%)

80%

Total 30%(Risk = 15%)

70% 100%

Page 23: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for RR Estimates

RF2

RF1 + - Total

+ 5%(Risk = 50%)

15%(Risk = 3.33%)

20%(Risk = 15%)

- 25%(Risk = 8%)

55%(Risk = 10%)

80%

Total 30%(Risk = 15%)

70% 100%

Page 24: Sample Size & Power Estimation

Example (Cont.): CalculationsSolve for RR Estimates

RF2

RF1 + - Total

+ 50(Risk = 50%)

(n=25 of 50 Diseased)

150(Risk = 3.33%)

(n=5 of 150 Diseased)

200(Risk = 15%)

(n=30 of 200 Diseased)- 250

(Risk = 8%)(n=20 of 250 Diseased)

550(Risk = 10%)

(n=55 of 550 Diseased)

800

Total 300(Risk = 15%)

(n=45 of 300 Diseased)

700 1000

Page 25: Sample Size & Power Estimation

R Simulation Code

Page 26: Sample Size & Power Estimation

Recommended