Stats for Scholars

Cronbach's Alpha

beginner · Reliability & Validity

Purpose
Measures the internal consistency of a scale — the degree to which a set of items measures a single underlying construct.
When to Use
When you have a multi-item scale or questionnaire and want to demonstrate that the items consistently measure the same construct.
Data Type
Continuous or ordinal item scores (e.g., Likert-scale responses) from a single administration.
Key Assumptions
Items measure a single unidimensional construct, items are scored in the same direction (reverse-code where needed), tau-equivalence (equal true-score variances across items).
Tools
Reliability Calculator on Subthesis →

What Is Cronbach's Alpha?

Cronbach's alpha (α) is the most widely used measure of internal consistency reliability in the social and behavioral sciences. It estimates how well a set of items on a questionnaire or scale "hang together" — that is, how consistently they measure the same underlying construct.

When you create a scale with multiple items (e.g., a 10-item anxiety questionnaire), you expect that someone who scores high on one item will tend to score high on the others. Cronbach's alpha quantifies this expectation. A high alpha means the items are intercorrelated and likely measuring the same thing; a low alpha means the items may be measuring different constructs or contain too much measurement error.

Alpha is expressed as a number between 0 and 1. Technically, alpha can be negative if items are negatively correlated (usually indicating a scoring error), but in practice you should see values between 0 and 1.

Interpretation Benchmarks

Cronbach's α   Interpretation
≥ .90          Excellent
.80 – .89      Good
.70 – .79      Acceptable
.60 – .69      Questionable
.50 – .59      Poor
< .50          Unacceptable

The threshold of α ≥ .70 is the most commonly cited minimum for research purposes (Nunnally, 1978). For high-stakes decisions (e.g., clinical diagnosis), α ≥ .90 is recommended.

When to Use It

Use Cronbach's alpha when:

  • You have a multi-item scale (e.g., a 15-item self-esteem questionnaire) and want to show the items are internally consistent.
  • You are developing or validating a new instrument and need to report reliability.
  • You want to justify combining multiple items into a single composite score (e.g., summing or averaging items).

Do not use Cronbach's alpha when:

  • You have a single-item measure — alpha requires at least two items.
  • Your items measure different constructs by design (e.g., a comprehensive exam covering distinct topics). Use subscale alphas instead.
  • You want to assess test-retest reliability (stability over time) or inter-rater reliability (agreement between raters). Those require different methods.

Assumptions

  1. Unidimensionality. The items should measure a single underlying construct. If your scale has multiple subscales (e.g., cognitive anxiety and somatic anxiety), compute alpha separately for each subscale. Running a factor analysis first can confirm dimensionality.

  2. Tau-equivalence. Strictly, alpha assumes that each item contributes equally to the total score (equal factor loadings). When items have very different loadings, alpha can underestimate true reliability. In such cases, consider McDonald's omega (ω) as an alternative.

  3. Items scored in the same direction. If some items are positively worded and others negatively worded, you must reverse-code the negatively worded items before computing alpha. Failing to do so will artificially deflate alpha.

  4. Continuous or ordinal item data. Alpha is designed for items with continuous or quasi-continuous scores (e.g., Likert scales with 5+ points). For truly dichotomous items (yes/no), the Kuder-Richardson 20 (KR-20) formula is the appropriate equivalent.
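The reverse-coding rule from assumption 3 is mechanical to apply. A minimal sketch in Python (illustrative, not tied to any statistics package), assuming a 1–7 Likert item:

```python
# Reverse-coding a negatively worded Likert item (assumption 3).
# The rule is reversed = (scale_min + scale_max) - raw, so 1 maps to 7 and 7 maps to 1.

def reverse_code(score, scale_min=1, scale_max=7):
    """Reverse-score one Likert response."""
    return (scale_min + scale_max) - score

raw = [1, 3, 7, 5]
recoded = [reverse_code(x) for x in raw]  # [7, 5, 1, 3]
```

Apply this to every negatively worded item before computing alpha; leaving even one item un-recoded can push alpha far below its true value.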

Formula

Cronbach's alpha is defined as:

\alpha = \frac{k}{k - 1}\left(1 - \frac{\sum_{i=1}^{k} s_i^2}{s_t^2}\right)

Where:

  • k = the number of items on the scale
  • sᵢ² = the variance of item i
  • sₜ² = the variance of the total score (sum of all items)
  • Σsᵢ² = the sum of all individual item variances
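The formula translates directly into code. A minimal sketch using only the Python standard library (the data are hypothetical, chosen for illustration):

```python
from statistics import variance  # sample variance, n - 1 denominator

def cronbach_alpha(items):
    """Cronbach's alpha from a list of k item-score columns (one list per item)."""
    k = len(items)
    sum_item_var = sum(variance(col) for col in items)  # sum of s_i^2
    totals = [sum(row) for row in zip(*items)]          # each respondent's total score
    return (k / (k - 1)) * (1 - sum_item_var / variance(totals))

# Three hypothetical items answered by four respondents:
items = [[2, 4, 3, 5],
         [3, 4, 3, 5],
         [2, 5, 3, 4]]
cronbach_alpha(items)  # ≈ 0.916
```

Note that each inner list is one item's scores across all respondents, not one respondent's row; transpose your data if it is stored the other way around.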

Intuition Behind the Formula

The ratio Σsᵢ²/sₜ² compares the variability within individual items to the total variability. If the items are perfectly correlated, the item variances will be small relative to the total variance, and alpha will approach 1. If items are uncorrelated (pure noise), the sum of item variances will equal the total variance, and alpha will approach 0.

The term k/(k − 1) is a correction factor that adjusts for the number of items. More items generally produce a higher alpha, even if inter-item correlations stay the same.

Standardized Alpha

When items are on different scales, the standardized alpha uses the average inter-item correlation (r̄):

\alpha_{\text{standardized}} = \frac{k\bar{r}}{1 + (k - 1)\bar{r}}

This form makes clear that alpha depends on two things: the number of items (k) and the average correlation among items (r̄).
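That dependence on k is easy to see numerically. A quick sketch (the correlation value .30 is chosen purely for illustration):

```python
def standardized_alpha(k, r_bar):
    """Standardized alpha from item count k and mean inter-item correlation r_bar."""
    return (k * r_bar) / (1 + (k - 1) * r_bar)

# Same average inter-item correlation (.30), different scale lengths:
standardized_alpha(4, 0.30)   # ≈ 0.63
standardized_alpha(10, 0.30)  # ≈ 0.81
```

With r̄ fixed at .30, lengthening the scale from 4 to 10 items lifts alpha from roughly .63 to .81, which is why a high alpha by itself says little about item quality.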

Worked Example

Scenario: A researcher develops a 4-item scale measuring academic motivation. Five students respond on a 1–7 scale.

Student Item 1 Item 2 Item 3 Item 4 Total
A 6 5 6 7 24
B 4 4 3 4 15
C 5 5 5 6 21
D 3 2 3 3 11
E 7 6 7 7 27

Step 1: Compute item variances.

For each item, calculate s² = Σ(X − X̄)²/(n − 1) (using n − 1 for the sample variance).

  • Item 1: X̄₁ = 5.0, s₁² = (1 + 1 + 0 + 4 + 4)/4 = 2.50
  • Item 2: X̄₂ = 4.4, s₂² = (0.36 + 0.16 + 0.36 + 5.76 + 2.56)/4 = 2.30
  • Item 3: X̄₃ = 4.8, s₃² = (1.44 + 3.24 + 0.04 + 3.24 + 4.84)/4 = 3.20
  • Item 4: X̄₄ = 5.4, s₄² = (2.56 + 1.96 + 0.36 + 5.76 + 2.56)/4 = 3.30

Σsᵢ² = 2.50 + 2.30 + 3.20 + 3.30 = 11.30

Step 2: Compute total score variance.

Total scores: 24, 15, 21, 11, 27. Mean total = 19.6.

sₜ² = (4.4² + 4.6² + 1.4² + 8.6² + 7.4²)/4 = (19.36 + 21.16 + 1.96 + 73.96 + 54.76)/4 = 171.20/4 = 42.80

Step 3: Apply the formula.

α = [4/(4 − 1)] × (1 − 11.30/42.80) = (4/3) × (1 − 0.264) = 1.333 × 0.736 = 0.981

This value looks suspiciously high for a 4-item scale with n = 5, but it is plausible: with a small sample and highly correlated items, α > .90 can occur. Here the items covary strongly; students who score high on one item score high on all of them.

Result: α = .98, indicating excellent internal consistency.
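The three steps above can be checked with a few lines of standard-library Python:

```python
from statistics import variance  # sample variance (n - 1 denominator)

# Item columns from the table (students A-E)
item1 = [6, 4, 5, 3, 7]
item2 = [5, 4, 5, 2, 6]
item3 = [6, 3, 5, 3, 7]
item4 = [7, 4, 6, 3, 7]
items = [item1, item2, item3, item4]

# Step 1: sum of item variances
sum_item_var = sum(variance(col) for col in items)  # 2.50 + 2.30 + 3.20 + 3.30 = 11.30

# Step 2: variance of the total scores
totals = [sum(row) for row in zip(*items)]          # [24, 15, 21, 11, 27]
total_var = variance(totals)                        # 42.80

# Step 3: apply the formula
k = len(items)
alpha = (k / (k - 1)) * (1 - sum_item_var / total_var)
print(round(alpha, 3))  # 0.981
```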

Interpretation

  • α = .98 suggests the four items are very highly intercorrelated and appear to measure the same construct consistently.
  • In practice, an alpha this high might indicate item redundancy — the items may be so similar that some could be removed without losing information.
  • Remember that alpha is inflated by the number of items. A 50-item scale can achieve α > .90 even with modest inter-item correlations.

What to Do if Alpha Is Low

If α < .70, consider the following steps:

  1. Check for reverse-coded items that were not recoded. This is the most common cause of low alpha.
  2. Examine the "alpha if item deleted" column. Most statistics software (SPSS, R, jamovi) provides this. If removing a particular item increases alpha substantially, consider dropping it.
  3. Check item-total correlations. Items with corrected item-total correlations below .30 may not belong to the scale.
  4. Run a factor analysis. Low alpha sometimes means your items are actually measuring two or more factors. Split them into subscales and compute alpha for each.
  5. Rewrite or replace poor items. Items that do not correlate well with the total may be ambiguous or off-topic.
  6. Add more items. The Spearman-Brown prophecy formula can estimate how many items you need to reach a target alpha.
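Steps 2 and 6 can be sketched with the same standard-library approach. The helper names below are illustrative, not from any particular package (SPSS, R's psych, and jamovi all provide built-in equivalents):

```python
from statistics import variance  # sample variance (n - 1 denominator)

def cronbach_alpha(items):
    """Alpha from a list of item-score columns (one list per item)."""
    k = len(items)
    totals = [sum(row) for row in zip(*items)]
    return (k / (k - 1)) * (1 - sum(variance(c) for c in items) / variance(totals))

def alpha_if_deleted(items):
    """Step 2: alpha recomputed with each item dropped in turn (needs k >= 3)."""
    return [cronbach_alpha(items[:i] + items[i + 1:]) for i in range(len(items))]

def spearman_brown_factor(current_alpha, target_alpha):
    """Step 6: how many times longer the scale must be to reach target_alpha."""
    return (target_alpha * (1 - current_alpha)) / (current_alpha * (1 - target_alpha))

# A scale at alpha = .60 would need about 2.7x as many items to reach .80:
spearman_brown_factor(0.60, 0.80)  # ≈ 2.67
```

If any value in `alpha_if_deleted` is clearly higher than the full-scale alpha, the corresponding item is a candidate for removal.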

Common Mistakes

  1. Reporting alpha without checking dimensionality. A high alpha does not prove unidimensionality. Items measuring two correlated factors can still produce a high alpha. Always run a factor analysis alongside reliability analysis.

  2. Forgetting to reverse-code items. If item 3 is "I rarely feel motivated" while the other items are positively worded, you must reverse-score item 3 before computing alpha. Failing to do so will drastically lower alpha.

  3. Assuming higher is always better. Very high alpha (> .95) often signals item redundancy rather than excellent measurement. Aim for .80 – .90 in most research contexts.

  4. Treating alpha as a fixed property of the scale. Alpha is a property of the scores in your sample, not the instrument itself. Always compute and report alpha for your own data, even if the original scale developers reported α = .90.

  5. Using alpha for multidimensional scales. If your scale has subscales (e.g., verbal and quantitative sections of a test), report alpha for each subscale separately. A total-scale alpha is misleading if the construct is not unidimensional.

  6. Confusing reliability with validity. High reliability does not mean you are measuring what you intend to measure. A scale can be internally consistent yet measure the wrong construct entirely.

How to Report in APA Format

Internal consistency for the 4-item Academic Motivation Scale was excellent (Cronbach's α = .98).

For a more detailed report:

The 10-item perceived stress scale demonstrated good internal consistency in the current sample (Cronbach's α = .84). Item-total correlations ranged from .41 to .68, and no item deletion would have substantially improved alpha.

If alpha is below the threshold:

Internal consistency for the 6-item scale was questionable (α = .63). Examination of item-total correlations revealed that item 4 (corrected r = .11) contributed poorly. Removing this item improved alpha to .74.

Key elements:

  • Name of the scale and number of items
  • The alpha value
  • Note on item-total correlations or items deleted if relevant
  • Use the phrase "current sample" to emphasize that reliability is sample-dependent

Ready to calculate?

Now that you understand the concept, use the free Reliability Calculator on Subthesis to run your own analysis.

Calculate Reliability on Subthesis

Related Concepts

Inter-Rater Reliability

Learn how to assess inter-rater reliability using Cohen's Kappa and ICC. Understand percent agreement, chance correction, and interpretation benchmarks.

Descriptive Statistics

Master descriptive statistics: learn about mean, median, mode, standard deviation, variance, and range. Know when to use each measure for your research data.

Effect Size

Learn what effect size is, why it matters more than p-values alone, and how to calculate and interpret Cohen's d, Hedges' g, and eta-squared for your research.

© 2026 Angel Reyes / Subthesis. All rights reserved.
