Glossary: M08 — Hypothesis Testing

Module: M08 Formulas: Formula Sheet Concept page: Hypothesis Testing Concept


Hypothesis Testing

A formal statistical procedure for evaluating a claim (hypothesis) about a population parameter using sample data. Involves specifying null and alternative hypotheses, computing a test statistic, and making a decision.

LOS: 8.a | See: 6-Step Process | Related: Confidence Interval


Null Hypothesis

The hypothesis to be tested. Denoted . Typically states that a parameter equals a specific value, or that there is no effect. The null hypothesis is assumed true unless evidence strongly contradicts it.

LOS: 8.a | Key: We never “prove” the null; we either reject it or fail to reject it.


Alternative Hypothesis

The hypothesis that is accepted if the null hypothesis is rejected. Denoted or . Represents what the analyst is trying to find evidence for.

LOS: 8.a | Related: Two-Tailed Test, One-Tailed Test


Two-Tailed Test

A test where the alternative hypothesis specifies that the parameter is different from (≠) the null value, in either direction. The rejection region is split between both tails.

LOS: 8.b | Use: When the analyst has no prior directional belief.


One-Tailed Test

A test where the alternative hypothesis specifies a direction — either greater than or less than the null value. The rejection region is entirely in one tail.

LOS: 8.b | See: Right-Tail Test, Left-Tail Test


Right-Tail Test

A one-tailed test where the alternative hypothesis specifies that the parameter is greater than the null value.

LOS: 8.b | Rejection: Reject when test statistic exceeds the upper critical value.


Left-Tail Test

A one-tailed test where the alternative hypothesis specifies that the parameter is less than the null value.

LOS: 8.b | Rejection: Reject when test statistic is less than the lower critical value.


Test Statistic

A standardized value computed from sample data used to evaluate the null hypothesis. Measures how far the sample estimate is from the hypothesized value in standard error units.

LOS: 8.c | Examples: z-Test, t-Test, Chi-Square Test, F-Test


Critical Value

The boundary value of the test statistic that separates the rejection region from the non-rejection region. Determined by the chosen Level of Significance and the distribution of the test statistic.

LOS: 8.c | Related: Rejection Region, Reliability Factor


Rejection Region

The set of test statistic values for which the null hypothesis is rejected. Also called the critical region.

LOS: 8.c | Rule: Reject if the test statistic falls in the rejection region (i.e., for a two-tailed test).


Level of Significance

The probability of rejecting a true null hypothesis (Type I Error). Denoted . Chosen before conducting the test.

LOS: 8.d | Common levels: 1%, 5%, 10%. | Related: Power of a Test, Estimation context


Type I Error

Rejecting a true null hypothesis. The probability of a Type I error equals , the level of significance. Also called a “false positive.”

LOS: 8.d | Trade-off: Reducing reduces Type I errors but increases Type II Error probability.


Type II Error

Failing to reject a false null hypothesis. Also called a “false negative.” The probability of a Type II error is denoted .

LOS: 8.d | Related: Power of a Test =


Power of a Test

The probability of correctly rejecting a false null hypothesis.

LOS: 8.d | Key: Higher power is better. Power increases with sample size, larger effect size, and higher .


p-Value

The smallest level of significance at which the null hypothesis can be rejected, given the observed test statistic. Equivalently, the probability of obtaining a test statistic at least as extreme as the observed value, assuming is true.

LOS: 8.e | Decision rule: Reject if -value . | Key: A smaller -value provides stronger evidence against .


Statistical Significance

A result is statistically significant if the probability of observing it by chance (when is true) is less than . Formally: the null hypothesis is rejected.

LOS: 8.f | Key warning: Statistical significance does not imply economic significance.


Economically Significant

A result is economically significant if the magnitude of the effect is large enough to be practically meaningful in a financial context — particularly after accounting for transaction costs and risk.

LOS: 8.f | Key: A statistically significant result (e.g., 0.01% excess return) may be economically trivial.


z-Test

A hypothesis test using the standard normal distribution. Used when the population variance is known, or when the sample size is large (typically ) so the CLT applies.

LOS: 8.g | Related: Standard Normal Distribution


t-Test

A hypothesis test using the t-distribution. Used when the population variance is unknown and the sample size is small. Applied to tests of means, regression coefficients, and correlations.

LOS: 8.g | Related: Student’s t-Distribution, Regression t-Test


Chi-Square Test

A hypothesis test using the chi-square distribution. Used to test hypotheses about a population variance or to test independence of categorical variables.

For variance:

LOS: 8.h | Related: Chi-Square Distribution, Chi-Square Test of Independence


F-Test

A hypothesis test using the F-distribution. Used to test equality of two population variances, or the overall significance of a regression model.

LOS: 8.h | Related: F-Distribution, Regression F-Statistic


Pooled Estimator

A combined estimate of a parameter (typically variance) from two or more samples, used when the parameter is assumed equal across groups. Applied in the pooled two-sample t-test.

LOS: 8.g | Condition: Use only when population variances are assumed equal.


Paired Comparison Test

A t-test applied to the differences between paired observations. Used when two sets of observations are related (e.g., before/after measurements for the same subjects).

where = mean of paired differences.

LOS: 8.g | Advantage: Controls for individual differences, increasing statistical power.


Parametric Test

A statistical test that makes assumptions about the distribution of the population (typically normality) and tests hypotheses about population parameters.

LOS: 8.i | Examples: z-Test, t-Test, F-Test, Chi-Square Test. | Contrast: Nonparametric Test


Nonparametric Test

A statistical test that does not rely on assumptions about the population distribution or that tests hypotheses not about population parameters. Used when data are ranked, not normally distributed, or sample sizes are very small.

LOS: 8.i | Examples: Spearman rank correlation, sign test, Wilcoxon signed-rank test. | See: Spearman Rank Correlation