Glossary: M08 — Hypothesis Testing

Module: M08 Formulas: Formula Sheet Concept page: Hypothesis Testing Concept

Hypothesis Testing

A formal statistical procedure for evaluating a claim (hypothesis) about a population parameter using sample data. Involves specifying null and alternative hypotheses, computing a test statistic, and making a decision.

LOS: 8.a | See: 6-Step Process | Related: Confidence Interval

Null Hypothesis

The hypothesis to be tested. Denoted $H_{0}$ . Typically states that a parameter equals a specific value, or that there is no effect. The null hypothesis is assumed true unless evidence strongly contradicts it.

LOS: 8.a | Key: We never “prove” the null; we either reject it or fail to reject it.

Alternative Hypothesis

The hypothesis that is accepted if the null hypothesis is rejected. Denoted $H_{a}$ or $H_{1}$ . Represents what the analyst is trying to find evidence for.

LOS: 8.a | Related: Two-Tailed Test, One-Tailed Test

Two-Tailed Test

A test where the alternative hypothesis specifies that the parameter is different from (≠) the null value, in either direction. The rejection region is split between both tails.

$H_{0} : μ = μ_{0} H_{a} : μ \neq = μ_{0}$

LOS: 8.b | Use: When the analyst has no prior directional belief.

One-Tailed Test

A test where the alternative hypothesis specifies a direction — either greater than or less than the null value. The rejection region is entirely in one tail.

LOS: 8.b | See: Right-Tail Test, Left-Tail Test

Right-Tail Test

A one-tailed test where the alternative hypothesis specifies that the parameter is greater than the null value.

$H_{0} : μ \leq μ_{0} H_{a} : μ > μ_{0}$

LOS: 8.b | Rejection: Reject $H_{0}$ when test statistic exceeds the upper critical value.

Left-Tail Test

A one-tailed test where the alternative hypothesis specifies that the parameter is less than the null value.

$H_{0} : μ \geq μ_{0} H_{a} : μ < μ_{0}$

LOS: 8.b | Rejection: Reject $H_{0}$ when test statistic is less than the lower critical value.

Test Statistic

A standardized value computed from sample data used to evaluate the null hypothesis. Measures how far the sample estimate is from the hypothesized value in standard error units.

$Test statistic = \frac{Sample statistic - Hypothesized value}{Standard error of statistic}$

LOS: 8.c | Examples: z-Test, t-Test, Chi-Square Test, F-Test

Critical Value

The boundary value of the test statistic that separates the rejection region from the non-rejection region. Determined by the chosen Level of Significance and the distribution of the test statistic.

LOS: 8.c | Related: Rejection Region, Reliability Factor

Rejection Region

The set of test statistic values for which the null hypothesis is rejected. Also called the critical region.

LOS: 8.c | Rule: Reject $H_{0}$ if the test statistic falls in the rejection region (i.e., $∣ t ∣ > t_{critical}$ for a two-tailed test).

Level of Significance

The probability of rejecting a true null hypothesis (Type I Error). Denoted $α$ . Chosen before conducting the test.

LOS: 8.d | Common levels: 1%, 5%, 10%. | Related: Power of a Test, Estimation context

Type I Error

Rejecting a true null hypothesis. The probability of a Type I error equals $α$ , the level of significance. Also called a “false positive.”

$P (Type I Error) = α$

LOS: 8.d | Trade-off: Reducing $α$ reduces Type I errors but increases Type II Error probability.

Type II Error

Failing to reject a false null hypothesis. Also called a “false negative.” The probability of a Type II error is denoted $β$ .

$P (Type II Error) = β$

LOS: 8.d | Related: Power of a Test = $1 - β$

Power of a Test

The probability of correctly rejecting a false null hypothesis.

$Power = 1 - β = 1 - P (Type II Error)$

LOS: 8.d | Key: Higher power is better. Power increases with sample size, larger effect size, and higher $α$ .

p-Value

The smallest level of significance at which the null hypothesis can be rejected, given the observed test statistic. Equivalently, the probability of obtaining a test statistic at least as extreme as the observed value, assuming $H_{0}$ is true.

LOS: 8.e | Decision rule: Reject $H_{0}$ if $p$ -value $< α$ . | Key: A smaller $p$ -value provides stronger evidence against $H_{0}$ .

Statistical Significance

A result is statistically significant if the probability of observing it by chance (when $H_{0}$ is true) is less than $α$ . Formally: the null hypothesis is rejected.

LOS: 8.f | Key warning: Statistical significance does not imply economic significance.

Economically Significant

A result is economically significant if the magnitude of the effect is large enough to be practically meaningful in a financial context — particularly after accounting for transaction costs and risk.

LOS: 8.f | Key: A statistically significant result (e.g., 0.01% excess return) may be economically trivial.

z-Test

A hypothesis test using the standard normal distribution. Used when the population variance is known, or when the sample size is large (typically $n \geq 30$ ) so the CLT applies.

$z = \frac{X ˉ - μ _{0}}{σ / n}$

LOS: 8.g | Related: Standard Normal Distribution

t-Test

A hypothesis test using the t-distribution. Used when the population variance is unknown and the sample size is small. Applied to tests of means, regression coefficients, and correlations.

$t = \frac{X ˉ - μ _{0}}{s / n} df = n - 1$

LOS: 8.g | Related: Student’s t-Distribution, Regression t-Test

Chi-Square Test

A hypothesis test using the chi-square distribution. Used to test hypotheses about a population variance or to test independence of categorical variables.

For variance: $χ^{2} = \frac{( n - 1 ) s ^{2}}{σ _{0}^{2}} df = n - 1$

LOS: 8.h | Related: Chi-Square Distribution, Chi-Square Test of Independence

F-Test

A hypothesis test using the F-distribution. Used to test equality of two population variances, or the overall significance of a regression model.

$F = \frac{s _{1}^{2}}{s _{2}^{2}} d f_{1} = n_{1} - 1, d f_{2} = n_{2} - 1$

LOS: 8.h | Related: F-Distribution, Regression F-Statistic

Pooled Estimator

A combined estimate of a parameter (typically variance) from two or more samples, used when the parameter is assumed equal across groups. Applied in the pooled two-sample t-test.

$s_{p}^{2} = \frac{( n _{1} - 1 ) s _{1}^{2} + ( n _{2} - 1 ) s _{2}^{2}}{n _{1} + n _{2} - 2}$

LOS: 8.g | Condition: Use only when population variances are assumed equal.

Paired Comparison Test

A t-test applied to the differences between paired observations. Used when two sets of observations are related (e.g., before/after measurements for the same subjects).

$t = \frac{d ˉ - μ _{d_{0}}}{s _{d} / n} df = n - 1$

where $\overset{ˉ}{d}$ = mean of paired differences.

LOS: 8.g | Advantage: Controls for individual differences, increasing statistical power.

Parametric Test

A statistical test that makes assumptions about the distribution of the population (typically normality) and tests hypotheses about population parameters.

LOS: 8.i | Examples: z-Test, t-Test, F-Test, Chi-Square Test. | Contrast: Nonparametric Test

Nonparametric Test

A statistical test that does not rely on assumptions about the population distribution or that tests hypotheses not about population parameters. Used when data are ranked, not normally distributed, or sample sizes are very small.

LOS: 8.i | Examples: Spearman rank correlation, sign test, Wilcoxon signed-rank test. | See: Spearman Rank Correlation

Wiki Hub

Explorer

Glossary: M08 — Hypothesis Testing

Glossary: M08 — Hypothesis Testing

Hypothesis Testing

Null Hypothesis

Alternative Hypothesis

Two-Tailed Test

One-Tailed Test

Right-Tail Test

Left-Tail Test

Test Statistic

Critical Value

Rejection Region

Level of Significance

Type I Error

Type II Error

Power of a Test

p-Value

Statistical Significance

Economically Significant

z-Test

t-Test

Chi-Square Test

F-Test

Pooled Estimator

Paired Comparison Test

Parametric Test

Nonparametric Test

Graph View

Table of Contents