arrow_drop_up arrow_drop_down

ANOVA (Analyses of variance)

ANOVA is a statistical technique to determine whether the averages of three or more groups differ from each other.

Quite a lot has to be explained to understand ANOVA in detail. Below a short outline can be found in which the essence of the ANOVA-test is explained.

Examples of research questions for an ANOVA

The ANOVA is applied with research questions like:

  • Differ blackbirds, starlings and seagulls in size?
  • Do people in Italy, the United Kingdom, Germany and Spain earn different amounts of money for the same type of work?
  • Is there a difference between, Africa, America, Asia, Australia and Europe in the amount of waste that they produce?

The hypothesis for testing the ANOVA

ANOVA is short for analysis of variances. Now this looks weird at first sight, because the test is comparing means and not comparing variances of variables. Well, not the means itself are analyzed, but the variance between the means.

In essence it is the ratio between the variance between the means (the groups) and the variance within the groups. In formula this test is used:

The ANOVA formula

Now suppose there is no difference between the means, the score above the line will be zero. Consequently the F-value is zero. When there are only small differences, the F-value is close to zero.

If the differences between the means increase, the F-value gets larger and on a certain point it will exceed a critical value (on the F-distribution). Then the means differ statistically significant from each other.

In essence this statistical hypothesis is tested:
        H0:    μ1    =    μ2    =    μ3    =    ……
        Ha:    μ1    =    μ2    =    μ3    =    ……

Again, when the means are almost equal, the variance between the means is low, that is near to zero. If so, the groups are said to be equal. When the means differ substantially, the variance will be large. If so, the groups are said to be different (on the feature examined).

Requirements for conducting an ANOVA

Characteristic for an ANOVA is the comparison of the mean of three or more groups. A mean can only be calculated from continuous data, that is, from data measured at an interval or ratio scale. The number of groups is a variable measured at a nominal scale.

The number of groups (the independent variable) should be at least three. If there are only two groups a t-test should be performed. The dependent variable should be a continuous variable. If it is measured on ordinal level, it is better to perform a Kruskal-Wallis test, and if the dependent variable is measured on nominal level it is better to perform a chi square test. Read more about this topic in our paper which test should be performed?

Related topics to ANOVA:

  • Mean
  • Standard deviation
  • Hypothesis
  • t-test
  • Kruskal-Wallis test
  • Chi square test

Deepen your knowledge and read our manual about ...

  • The statistical test procedure
  • How to find the correct statistical test
Place comment