Paired Samples t-Test

Purpose

Compares two means from the same participants (or matched pairs) to determine if the difference is statistically significant.

When to Use

When the same participants are measured at two time points (pre/post), under two conditions (crossover design), or when participants are matched on key characteristics.

Data Type

Continuous (interval or ratio) dependent variable measured twice on the same participants

Key Assumptions

The difference scores are approximately normally distributed. Observations within each pair are dependent, but pairs are independent of each other.

Tools

Effect Size Calculator on Subthesis →

What Is the Paired Samples t-Test?

The paired samples t-test (also called the dependent samples t-test or repeated-measures t-test) compares two means from the same group of participants. Instead of comparing two separate groups, you compare each participant's scores under two different conditions or at two different time points.

The key insight is that this test works with difference scores. For each participant, you calculate the difference between their two measurements, and then you test whether the average of those differences is significantly different from zero.

D_i = X_{i,2} - X_{i,1}

If the treatment has no effect, the average difference score $\bar{D}$ should be close to zero.

When to Use It

Use a paired samples t-test when:

The same participants are measured at two time points (pre-test/post-test design)
The same participants are tested under two conditions (crossover or within-subjects design)
Matched pairs are used — each participant in one condition is matched with a similar participant in the other condition

Examples:

Blood pressure before and after taking a medication
Student test scores at the beginning and end of a semester
Reaction time under caffeine vs. placebo in the same participants
Anxiety ratings before and after a therapy session

Why not just use an independent t-test? Because the paired t-test accounts for individual differences. Each person serves as their own control, which dramatically reduces variability and increases statistical power. This is one of the biggest advantages of within-subjects designs.

Assumptions

Continuous dependent variable. The outcome must be measured on an interval or ratio scale.
Related samples. The two measurements come from the same participants (or matched pairs). Each observation in one condition has a corresponding observation in the other.
Normality of difference scores. The differences between paired observations ( $D_i$ ) should be approximately normally distributed. This is less strict than it sounds — with $n > 30$ , the test is robust to non-normality. For smaller samples, check with a Shapiro-Wilk test or Q-Q plot of the differences.
No significant outliers in the difference scores. Extreme outliers in the differences can distort results. Inspect a boxplot of the difference scores.

Note: The paired t-test does not require homogeneity of variances (unlike the independent t-test), because it operates on a single set of difference scores.

Formula

t = \frac{\bar{D}}{s_D / \sqrt{n}}

Where:

$\bar{D}$ = mean of the difference scores
$s_D$ = standard deviation of the difference scores
$n$ = number of pairs

The denominator $s_D / \sqrt{n}$ is the standard error of the mean difference.

Degrees of freedom:

df = n - 1

The mean difference and its standard deviation are calculated as:

\bar{D} = \frac{1}{n}\sum_{i=1}^{n} D_i

s_D = \sqrt{\frac{\sum_{i=1}^{n}(D_i - \bar{D})^2}{n - 1}}

Worked Example

Scenario: A health researcher measures resting heart rate (BPM) in 12 participants before and after an 8-week mindfulness meditation program.

Participant	Pre ( $X_1$ )	Post ( $X_2$ )	Difference ( $D$ )
1	78	72	-6
2	85	80	-5
3	72	70	-2
4	90	82	-8
5	68	68	0
6	82	76	-6
7	76	74	-2
8	88	79	-9
9	74	72	-2
10	80	75	-5
11	70	69	-1
12	84	78	-6

Step 1: State the hypotheses.

$H_0$ : $\mu_D = 0$ (no change in heart rate after the program)
$H_1$ : $\mu_D \neq 0$ (heart rate changed after the program)

Step 2: Calculate the mean and standard deviation of differences.

Sum of differences: $(-6) + (-5) + (-2) + (-8) + 0 + (-6) + (-2) + (-9) + (-2) + (-5) + (-1) + (-6) = -52$

\bar{D} = \frac{-52}{12} = -4.33

Sum of squared deviations from the mean:

\sum(D_i - \bar{D})^2 = (−1.67)^2 + (−0.67)^2 + (2.33)^2 + (−3.67)^2 + (4.33)^2 + (−1.67)^2 + (2.33)^2 + (−4.67)^2 + (2.33)^2 + (−0.67)^2 + (3.33)^2 + (−1.67)^2

= 2.79 + 0.45 + 5.43 + 13.47 + 18.75 + 2.79 + 5.43 + 21.81 + 5.43 + 0.45 + 11.09 + 2.79 = 90.67

s_D = \sqrt{\frac{90.67}{11}} = \sqrt{8.24} = 2.87

Step 3: Calculate the t-statistic.

t = \frac{-4.33}{2.87 / \sqrt{12}} = \frac{-4.33}{2.87 / 3.46} = \frac{-4.33}{0.829} = -5.22

Step 4: Determine degrees of freedom and find the p-value.

df = 12 - 1 = 11

For $|t| = 5.22$ with $df = 11$ (two-tailed): $p < .001$ .

Step 5: Calculate the effect size (Cohen's $d_z$ ).

For a paired design, the effect size is:

d_z = \frac{\bar{D}}{s_D} = \frac{4.33}{2.87} = 1.51

Interpretation

Since $p < .001$ , we reject the null hypothesis. There is a statistically significant decrease in resting heart rate after the 8-week mindfulness program.

The mean reduction was 4.33 BPM ( $SD = 2.87$ ), and the effect size is very large ( $d_z = 1.51$ ). This means the average reduction was over 1.5 standard deviations of the within-person variability — a strong and consistent effect across participants.

The 95% confidence interval for the mean difference:

\bar{D} \pm t_{0.025, 11} \times SE = -4.33 \pm 2.201 \times 0.829 = [-6.16, -2.51]

We are 95% confident that the true mean reduction in heart rate is between 2.51 and 6.16 BPM.

Common Mistakes

Using an independent t-test when data are paired. This is the most common error. If the same people are measured twice, you must use a paired t-test. An independent t-test ignores the within-person correlation and loses statistical power.
Checking normality of the raw scores instead of the differences. The assumption is that the difference scores are normally distributed, not the individual measurements at each time point.
Ignoring the direction of subtraction. Be consistent. If you calculate Post - Pre, a negative difference means a decrease. Flipping the direction mid-analysis leads to sign errors.
Assuming causation in pre/post designs without a control group. A significant pre-post difference could be due to the intervention, practice effects, regression to the mean, or maturation. A control group strengthens causal claims.
Not reporting the effect size. A significant paired t-test without $d_z$ leaves readers unable to judge practical significance. Always include it.
Ignoring outliers in the difference scores. One participant with an extreme change can dominate the results. Check for outliers with a boxplot of $D_i$ values.

How to Run It

```r # Paired samples t-test in R t.test(mydata$pre, mydata$post, paired = TRUE)

Effect size (Cohen's d for paired data)

library(effsize) cohen.d(mydata$post, mydata$pre, paired = TRUE)

```python
from scipy import stats
import pingouin as pg

# Using scipy
t_stat, p_value = stats.ttest_rel(pre_scores, post_scores)

# Using pingouin (includes effect size)
result = pg.ttest(pre_scores, post_scores, paired=True)
print(result)
```


Go to Analyze > Compare Means > Paired-Samples T Test
Select your two measurement variables (e.g., Pre and Post) and move them into the Paired Variables box as a pair
Click OK

SPSS outputs the means of each measurement, the correlation between them, the mean difference, the t-statistic, degrees of freedom, and the p-value (two-tailed).


Use the T.TEST function:
=T.TEST(array1, array2, tails, type)

array1: range of pre-test scores
array2: range of post-test scores
tails: 2 (two-tailed)
type: 1 (paired)

Example: =T.TEST(A2:A13, B2:B13, 2, 1)
For the full output, use Data > Data Analysis > t-Test: Paired Two Sample for Means.



## How to Report in APA Format

> A paired samples t-test was conducted to evaluate the effect of an 8-week mindfulness meditation program on resting heart rate. Results indicated a statistically significant decrease from pre-program ($M$ = 78.92, $SD$ = 6.85) to post-program ($M$ = 74.58, $SD$ = 4.64), $t$(11) = -5.22, $p$ < .001, $d_z$ = 1.51, 95% CI [-6.16, -2.51]. On average, participants' resting heart rate decreased by 4.33 BPM after the mindfulness program.

Ready to calculate?

Now that you understand the concept, use the free Effect Size Calculator on Subthesis to run your own analysis.

Calculate Effect Size for Your t-Test on Subthesis

Paired Samples t-Test

Paired Samples t-Test

What Is the Paired Samples t-Test?

When to Use It

Assumptions

Formula

Worked Example

Interpretation

Common Mistakes

How to Run It

Effect size (Cohen's d for paired data)

Related Concepts

Independent Samples t-Test

Effect Size

Statistical Power & Power Analysis

Wilcoxon Signed-Rank Test

Repeated Measures ANOVA

One-Way ANOVA