Power is a critical concept when planning or evaluating a radiology study:
- power = (1 - β)
Conventionally, power is set at 0.80-0.85.
When reviewing a sample data set, the mean of the value in question of the experimental population is likely different from the overall population. In other words, after you do something to your experimental sample, you expect the variable you're watching to change.
For instance, imagine you want to evaluate the size of the pancreatic duct after administering secretin. You think secretin will increase the size of the pancreatic duct. Pathologic data is your gold standard, and using this technique, it's been shown that an average adult patient has a mean size duct (μ) with a standard deviation (σ). Now you give a sample population a bolus of secretin and start counting...
The mean size of the pancreatic duct from the post-secretin experimental population (X) is greater than expected from the path data... but is this a real effect or is this just due to chance?
If we set the p-value to 0.05, then we know we have only a 5% chance of making a type I error (α)... the error of saying that the increase in size of the pancreatic duct is a real effect, when it really isn't.
If our study does not show that the increase is significant to the 0.05 level, then we cannot believe that secretin made a difference, but we run the risk of a type II error (β)... the error of saying that there is no difference when there really is.
The question then becomes, how do we know we have enough people in our experimental group to show a difference if there were one? This is how you power a study effectively. If the sample size is too small (underpowered), then the risk of a type II error increases.
You can imagine two tests for the pancreatic duct: one with 15 post-secretin patients and one with 115 post-secretin patients. Both may fail to meet the p-value, but intuitively we know that the second test did a better job of trying to show a difference if there were one. This is what we're trying to capture with the concept of power.
Post hoc power analysis
The use of post hoc power analysis (i.e. calculating power after the study has concluded) is controversial as it is thought to be unreliable 2.
Related Radiopaedia articles
- clinical trials
- descriptive studies
- Bayes' theorem
- sensitivity and specificity
- positive predictive value (PPV)
- negative predictive value (NPV)
- likelihood ratio (LR)
- normal distribution
- type I error
- type II error
- confidence interval
- ROC curve
- retrospective studies
- prospective studies
- analyzes of variance
- non-parametric statistics
- cognitive bias in image perception