Permutation test

Permutation test example animation

Permutation tests, also known as randomization tests, re-randomization tests, or exact tests, are a family of statistical significance tests used in inference statistics. They provide a non-parametric alternative to traditional parametric tests, such as the t-test or ANOVA, which rely on assumptions about the underlying probability distribution of the data. Permutation tests are particularly useful when those assumptions are questionable or when dealing with small sample sizes.

Overview[edit | edit source]

Permutation tests operate on the principle that under the null hypothesis, all permutations of the observed data are equally likely. By comparing the observed test statistic to the distribution of test statistics computed from a large number of random permutations of the data, one can assess the p-value, which indicates the probability of observing a test statistic as extreme as, or more extreme than, the observed one under the null hypothesis.

Procedure[edit | edit source]

The basic steps in conducting a permutation test are as follows:

Formulate the null and alternative hypothesis.
Calculate the test statistic for the observed data.
Permute the data or labels of the data randomly and recalculate the test statistic for each permutation.
Compare the original test statistic to the distribution of permuted test statistics to calculate the p-value.

Applications[edit | edit source]

Permutation tests can be applied in a wide range of scenarios, including but not limited to testing the difference in means or medians between two groups, analyzing the association between two variables, and assessing the goodness-of-fit of a model. They are widely used in fields such as psychology, ecology, and genomics, where the assumptions of parametric tests are often violated.

Advantages[edit | edit source]

Non-parametric: They do not require assumptions about the form of the distribution of the data.
Flexibility: They can be applied to a wide variety of test statistics and experimental designs.
Exact: Under the null hypothesis, the p-value is exact, not an approximation, assuming all possible permutations are considered.

Limitations[edit | edit source]

Computational Intensity: For large datasets or complex test statistics, the number of possible permutations can be computationally prohibitive.
Dependence on the Test Statistic: The choice of test statistic can influence the power of the test.
Sample Size Restrictions: In cases with very small sample sizes, the number of unique permutations may be too limited to provide a meaningful p-value.

Comparison with Parametric Tests[edit | edit source]

While permutation tests offer several advantages over parametric tests, especially in terms of flexibility and fewer assumptions, they are not universally superior. Parametric tests can be more powerful when their assumptions are met, and they are generally less computationally intensive.

Software Implementation[edit | edit source]

Several statistical software packages and programming languages, including R and Python, offer functions or libraries for conducting permutation tests, making them accessible for researchers and data analysts.

This statistics-related article is a stub. You can help WikiMD by expanding it.

Navigation: Wellness - Encyclopedia - Health topics - Disease Index‏‎ - Drugs - World Directory - Gray's Anatomy - Keto diet - Recipes

Search WikiMD

Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro) available.
Advertise on WikiMD

WikiMD is not a substitute for professional medical advice. See full disclaimer.

Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.