Assumption Checking Checklist

Before running any parametric test, verify its assumptions. This checklist tells you what to check, how to check it, and what to do when an assumption is violated.

Independent-Samples t-Test

Assumption	How to Test	If Violated
Independence of observations	Study design — each participant contributes one data point; no clustering or repeated measures.	Use a paired test, multilevel model, or GEE to account for dependency.
Normality (of DV in each group)	Shapiro-Wilk test (p > .05 = normal). Also inspect Q-Q plots and histograms. Skewness and kurtosis within +/- 2.	If n > 30 per group, the t-test is robust via CLT. Otherwise use Mann-Whitney U or apply a log/square-root transformation.
Homogeneity of variance	Levene's test (p > .05 = equal variances). Compare group SDs — ratio > 2:1 is a concern.	Use Welch's t-test (does not assume equal variances). Most software offers this as the default option.
No significant outliers	Boxplots. Values beyond 1.5 * IQR flagged. Check if outlier is a data-entry error.	Winsorize, trim, or use a robust test (e.g., bootstrapped t-test). Report results with and without the outlier.

Paired-Samples t-Test

Assumption	How to Test	If Violated
Related pairs	Study design — same participants measured at two time points, or matched pairs.	If observations are independent, use an independent-samples t-test instead.
Normality of difference scores	Compute difference scores (post − pre). Run Shapiro-Wilk on the differences. Inspect Q-Q plot of differences.	Use Wilcoxon signed-rank test or bootstrapping. Robust with n > 30.
No extreme outliers in differences	Boxplot of difference scores.	Investigate outlier. Winsorize or use a robust method. Report sensitivity analysis.

One-Way ANOVA

Assumption	How to Test	If Violated
Independence of observations	Study design. No participant appears in more than one group.	Use repeated-measures ANOVA or mixed-effects model.
Normality (DV within each group)	Shapiro-Wilk per group. Q-Q plots. ANOVA is robust when n ≥ 15 per group and groups are roughly equal in size.	Use Kruskal-Wallis H test. Or transform the DV (log, square root, Box-Cox).
Homogeneity of variance	Levene's test (p > .05). Compare largest and smallest group SDs.	Use Welch's ANOVA (`oneway.test()` in R) with Games-Howell post hoc. Do not use Tukey HSD.
No significant outliers	Boxplots per group. Studentized residuals > \|3\| are extreme.	Remove only with justification. Use robust ANOVA (trimmed means) or report with and without.

Pearson Correlation

Assumption	How to Test	If Violated
Both variables continuous	Verify measurement scale. Interval or ratio data required.	For ordinal data, use Spearman's ρ or Kendall's τ. For one dichotomous variable, use point-biserial r.
Linearity	Scatterplot of X vs. Y. Look for a straight-line pattern (not a curve).	Transform one or both variables. Or use Spearman ρ (which only requires a monotonic relationship).
Bivariate normality	Shapiro-Wilk on each variable individually. Joint normality is hard to test — if both are univariate-normal and the scatterplot is elliptical, you're fine.	Use Spearman ρ or bootstrapped Pearson r.
No significant outliers	Scatterplot — look for points far from the cloud. A single outlier can dramatically inflate or deflate r.	Winsorize or remove with justification. Report r with and without the outlier.
Independence of observations	Study design. Each pair of scores comes from a different participant.	Use multilevel modeling or repeated-measures correlation if data are nested.

Linear Regression (Simple & Multiple)

Assumption	How to Test	If Violated
Linearity	Scatterplot of predicted vs. residuals — should show no pattern (random cloud around zero).	Add polynomial terms, transform predictors, or use non-linear regression.
Independence of residuals	Durbin-Watson test (value near 2.0 = independent). Values < 1.5 or > 2.5 are concerning.	Use generalized least squares, add a lagged DV, or use a time-series model.
Normality of residuals	Q-Q plot of standardized residuals. Shapiro-Wilk on residuals. Histogram of residuals.	Transform the DV (log, Box-Cox). Use bootstrapped regression for inference. Robust with large n.
Homoscedasticity (constant variance of residuals)	Plot residuals vs. predicted values — spread should be constant (no funnel shape). Breusch-Pagan test (p > .05 = homoscedastic).	Use robust (HC3) standard errors. Or transform the DV. Weighted least squares for severe cases.
No multicollinearity (multiple regression only)	VIF (Variance Inflation Factor) for each predictor. VIF > 5 is problematic; VIF > 10 is severe. Tolerance < 0.2 is a concern.	Remove or combine correlated predictors. Use ridge regression or PCA to reduce dimensionality.
No influential outliers	Cook's distance (> 1 is influential, some use > 4/n). Leverage values, studentized residuals > \|3\|.	Investigate the case. Report results with and without influential points. Use robust regression.

At-a-Glance Decision Table

Assumption	Test It With	Rule of Thumb
Normality	Shapiro-Wilk + Q-Q plot	p > .05 = normal. Visual check is often more useful than the test.
Equal variances	Levene's test	p > .05 = equal. Largest SD / smallest SD < 2.
Linearity	Scatterplot / residual plot	No curved pattern in the residual plot.
Homoscedasticity	Residuals vs. fitted plot, Breusch-Pagan	Even spread of residuals. No fan/funnel shape.
No multicollinearity	VIF / Tolerance	VIF < 5. Tolerance > 0.2.
Independence	Study design / Durbin-Watson	DW near 2.0. No clustering in the data.