Statistical Test Selection Flowchart
Use this table to find the right statistical test. Start with your research goal in the left column, then match your data type and study design to arrive at the recommended test.
Comparing Groups
| Research Goal | DV Data Type | Design | Recommended Test | Non-Parametric Alternative |
|---|---|---|---|---|
| Compare 2 group means | Continuous | Between-subjects (independent groups) | Independent-samples t-test | Mann-Whitney U |
| Compare 2 related means | Continuous | Within-subjects (repeated measures / matched pairs) | Paired-samples t-test | Wilcoxon signed-rank |
| Compare 3+ group means | Continuous | Between-subjects, 1 IV | One-way ANOVA | Kruskal-Wallis H |
| Compare 3+ related means | Continuous | Within-subjects, 1 IV | Repeated-measures ANOVA | Friedman test |
| Compare means across 2+ IVs | Continuous | Between-subjects, 2+ IVs | Factorial ANOVA | Aligned rank transform |
Associations & Relationships
| Research Goal | Variable Types | Design | Recommended Test | Non-Parametric Alternative |
|---|---|---|---|---|
| Strength of linear relationship | 2 continuous | Bivariate | Pearson r | Spearman ρ / Kendall τ |
| Association between categories | 2 categorical | Cross-tabulation | Chi-square test of independence | Fisher's exact test (small n) |
| Goodness of fit to expected proportions | 1 categorical | Observed vs. expected | Chi-square goodness-of-fit | Binomial test (2 categories) |
Prediction
| Research Goal | DV Type | Predictors | Recommended Test | Notes |
|---|---|---|---|---|
| Predict a continuous outcome | Continuous | 1 continuous | Simple linear regression | Check linearity, normality of residuals |
| Predict a continuous outcome | Continuous | 2+ continuous/categorical | Multiple linear regression | Check multicollinearity (VIF) |
| Predict group membership | Binary categorical | 1+ continuous/categorical | Logistic regression | Reports odds ratios; no normality assumption for DV |
Reliability & Agreement
| Research Goal | Data Type | Design | Recommended Measure | Notes |
|---|---|---|---|---|
| Internal consistency of a scale | Continuous / Likert items | Multi-item measure | Cronbach's α | α ≥ .70 generally acceptable; use McDonald's ω for tau-nonequivalent items |
| Agreement between raters | Continuous | 2+ raters scoring same targets | Intraclass Correlation (ICC) | Choose ICC form based on rater model (one-way, two-way random, two-way mixed) |
| Agreement between raters | Categorical | 2 raters, nominal categories | Cohen's κ | Corrects for chance agreement; use Fleiss' κ for 3+ raters |
Quick Decision Rules
- 2 groups → t-test. 3 or more groups → ANOVA.
- Same participants measured twice → paired / repeated-measures. Different participants → independent / between-subjects.
- DV is continuous → parametric tests. DV is ordinal or non-normal → non-parametric alternatives.
- DV is categorical → chi-square or logistic regression.
- Want to predict an outcome → regression. Want to describe a relationship → correlation.
- Assumptions violated? Try a transformation first, then a non-parametric test, then bootstrapping.