Kruskal-Wallis Test

The Kruskal–Wallis Non Parametric Hypothesis Test (1952) is a nonparametric analog of the one-way analysis of variance. It is generally used when the measurement variable does not meet the normality assumptions of one-way ANOVA. It is also a popular nonparametric test to compare outcomes among three or more independent (unmatched) groups.  

Consider the Mann–Whitney test for just two groups instead of the Kruskal–Wallis test. Like the Mann-Whitney test, this test may also evaluate the differences between the groups by estimating the differences in ranks among the groups.

Generally in the ANOVA test, the assumption is that the dependent variable is drawn from a normally distributed population and also assumes that common variance across groups.  But, in Kruskal-Wallis Test, there is no necessity for these assumptions. Therefore, this test is the best option for both continuous as well as ordinal types of data.

Assumptions of the Kruskal-Wallis Test

Uses of Kruskal-Wallis Non Parametric Hypothesis Test Test

The Kruskal-Wallis test can be used for any industry to understand the dependent variable when it has three or more independent groups. For example, this test helps to understand the student’s performance in exams. While the scores are measured on a scale from 0-100, the scores may vary based on the exam anxiety levels (low, medium, high, and severe -in this case, four different groups) of the students.

Procedure to conduct Kruskal-Wallis Test

  • First pool all the data across the groups.
  • Rank the data from 1 for the smallest value of the dependent variable and the next smallest variable rank 2 and so on… (if any value ties, in that case, it is advised to use mid-point), N being the highest variable.
  • Compute the test statistic
  • Determine critical value from the Chi-Square distribution table
  • Finally, formulate a decision and conclusion

Most of the teams lose track when they exercise the ranks for the original variables. Hence this can make Kruskal–Wallis test a bit less powerful than a one-way ANOVA test.

Calculation of the Kruskal-Wallis Non Parametric Hypothesis Test

The Kruskal–Wallis Non Parametric Hypothesis Test compares medians among k groups (k > 2). The null and alternative hypotheses for the Kruskal-Wallis test are as follows:

  • Null Hypothesis H0: Population medians are equal
  • Alternative Hypothesis H1: Population medians are not all equal

As explained above, the procedure for the Kruskal-Wallis test pools the observations from the k groups into one combined sample, and then ranks from lowest to highest value (1 to N), where N is the total number of values in all the groups.

The test statistic for the Kruskal Wallis test (mostly denoted as H) is defined as follows: 

Kruskal-Wallis Non Parametric Hypothesis Test

Where Ti = rank sum for the ith sample i = 1, 2,…,k

In the Kruskal-Wallis test, the H value will not have any impact on any two groups in which the data values have the same ranks. Either increasing the largest value or decreasing the smallest value will have zero effect on H.  Hence, the extreme outliers (higher and lower sides) will not impact this test.

Example of Kruskal-Wallis Non Parametric Hypothesis Test

In a manufacturing unit, four teams of operators were randomly selected and sent to four different facilities for machining techniques training. After the training, the supervisor conducted the exam and recorded the test scores. At 95% confidence level does the scores are same in all four facilities?

  • Null Hypothesis H0: The distribution of operator scores are same
  • Alternative Hypothesis H1: The scores may vary in four facilities

Rank the score in all the facilities

N=16

While for a right-tailed chi-square test with a 95% confidence level, and df =3, the critical χ2 value is 7.81

Critical values of Chi-Square Distribution

The calculated χ2 value is greater than the critical value of χ2for a 0.05 significance level. χ2calculated 2critical hence, you reject the null hypotheses

So, there is enough evidence to conclude that difference in test scores exists for four teaching methods at different facilities.

Six Sigma Black Belt Certification Kruskal-Wallis Test Questions:

Question 1: In an organization, management conducted a study comparing Purchase, Marketing, Quality, and Production groups on a measure of leadership skills. Which of the following test would an organization choose?

(A) Mood’Median test
(B) Kruskal-Wallis test
(C) Mann-Whitney U test
(D) Friedman Rank Test

Unlock Additional Members-only Content!

To unlock additional content, please upgrade now to a full membership.
Upgrade to a Full Membership
If you are a member, you can log in here.

Thank You for being a Member!

Here’s some of the bonus content that is only available to you as a paying member.

Answer B: It is independent data and there are more than two conditions, hence Kruskal-Wallis test is the best option.

Question 2: Which of the following nonparametric test use the rank sum?

(A) Runs test
(B) Mood’Median test
(C) Sign test
(D) Kruskal-Wallis test

Answer D: Kruskal-Wallis test pools the observations from the k groups into one combined sample, and then ranks from lowest to the highest value.

Authors

Comments (12)

The Kruskal-Wallis test is not about the equality of medians. It’s about the stochastic dominance. If the distributions within groups are IID, then indeed (and ONLY then) such interpretation holds. Otherwise, the difference may be caused either difference in locations or scales. Kindly please correct it, as people then use the KW or MW(W) tests to detect shift in locations and get totally surprised, how that’s possible to have numerically equal medians and H0 rejected. Both tests fail in general (appropriate sources available over the internet).

what is the difference between the kruksal walistest and themood median test ,
Both of them treat more than 2 non parametric variable

Hi Youssef Boudoudouh,

When the data are non normal or the data points are very few to check if the data are normal or not and have more than two populations then we have to use Moods Median or Kruskal-Wallis test , the key difference is Moods median handles the outliers but Kruskal-Wallis test is more powerful than Moods Median.

Thanks

In the example that is worked out on this page, why is it considered a right-tailed test and not a two-tailed test?

I’m having trouble understanding why the X2 critical is 7.815 and not 9.348.

Critical values for the Kruskal-Wallis test follow χ2 . The χ2 test is one-sided tests because we never have negative values of χ2. For χ2, the sum of the difference of observed and expected squared is divided by the expected ( a proportion), thus chi-square is always a positive number or it may be close to zero on the right side when there is no difference. Thus, this test is always a right-sided one-sided test.

Thanks

Hi Laki,

its reverse in case of only mann whitney..always remember that and author mentioned same in the video.

There are two versions of the Mann-Whitney U test, one for small samples (i.e., when n < 20 for each group) and one for large samples. It is important to remember the null hypothesis for this test, and to differentiate it from the nulls for the t-test and the median test. Please find below links for better clarity https://psych.unl.edu/psycrs/handcomp/hcmann.PDF

https://sphweb.bumc.bu.edu/otlt/mph-modules/bs/bs704_nonparametric/bs704_nonparametric4.html

Thanks

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.