The lettuce dataset
In a previous tutorial, we analysed the dataset on lettuce plants using ANOVA. However, it was not clear if all the assumptions of ANOVA were met. Indeed, with only 7 datapoints per group, it is very hard to assess the assumptions of normality and equal variances.
Therefore, we will re-analyse the dataset by using the non-parametric alternative to ANOVA, the Kruskal-Wallis test
. We will first give a concise overview of what we saw in the ANOVA analysis, which can be found in the ANOVA_lettuce_plants.Rmd
file.
The researchers want to find out if biochar, compost and a combination of both biochar and compost have an influence on the growth of lettuce plants. To this end, they grew up lettuce plants in a greenhouse. The pots were filled with one of four soil types;
- Soil only (control)
- Soil supplemented with biochar (refoak)
- Soil supplemented with compost (compost)
- Soil supplemented with both biochar and compost (cobc)
The dataset freshweight_lettuce.txt
contains the freshweight (in grams) for 28 lettuce plants (7 per condition).
Load the required libraries
Data import
lettuce <- read_csv("https://raw.githubusercontent.com/statOmics/PSLS21/data/freshweight_lettuce.txt")
## Rows: 28 Columns: 3
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (1): treatment
## dbl (2): id, freshweight
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Take a glimpse at the data
## Rows: 28
## Columns: 3
## $ id <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,…
## $ treatment <chr> "control", "control", "control", "control", "control", "co…
## $ freshweight <dbl> 38, 34, 41, 43, 43, 29, 38, 59, 64, 57, 56, 50, 64, 62, 38…
Data tidying
## set treatment to factor
## ...
Data exploration
## Count the number of observations per treatment
Now let’s make a boxplot displaying the freshweight of each treatment condition:
Interpret the visualization!
In the analysis in chapter 7 (ANOVA_lettuce_plants_half.rmd
file), we accepted the assumptions for analyzing the data with an ANOVA. However, it was not clear if all the assumptions of ANOVA were met. Indeed, with only 7 values per group, it is very hard to assess the assumptions of normality and equal variances.
Therefore, we will re-analyse the dataset by using the non-parametric alternative to ANOVA: the Kruskal-Wallis test.
Kruskal-Wallis rank test
Hypotheses
Formulate a correct null and alternative hypothesis for the Kruskal-Wallis test in this analysis.
Analysis
#set.seed(1)
#kw <- kruskal_test(...)
#kw
Interpret the results!
Post-hoc analysis
We will perform a post-hoc analysis with pairwise Wilcoxon rank sum test. As we did not want to assume the location shift, we will interpret the outcome in terms of probabilistic indices. Note that after the analysis, we will need to correct the acquired p-values for multiple testing.
Hypotheses
Formulate a correct null and alternative hypothesis for the Wilcoxon test post-hoc analysis.
Analysis
## pairwise.wilcox.test(...)
What do you observe?
## Alternative: caluculate the p-value for each treatment combination with wilcoxon_test
treatments <- levels(lettuce$treatment)
freshweight <- lettuce$freshweight
pvalues <- combn(treatments,2,function(x){
## Pairwise Wilcoxon test
test = wilcox_test(freshweight~treatment,subset(lettuce,treatment%in%x), distribution = 'exact')
## Get and store p-value of test
pvalue(test)
})
## Adjust for multiple testing
pvalues_bonf = p.adjust(pvalues,method = 'bonferroni')
## link the p-value with the correct pairwise test
names(pvalues_bonf) <- combn(levels(lettuce$treatment),2,paste,collapse="_VS_")
pvalues_bonf
Interpret.
Based on the chunk of code above, can extract the point estimates for the probabilistic indices? Interpret those as well.
Conclusion
Formulate a proper conclusion that answers the research hypothesis.
LS0tCnRpdGxlOiAiRXhlcmNpc2UgOS4yOiBOb24tcGFyYW1ldHJpYyB0ZXN0IG9uIHRoZSBsZXR0dWNlIGRhdGFzZXQiICAgCmF1dGhvcjogIkxpZXZlbiBDbGVtZW50IGFuZCBKZXJvZW4gR2lsaXMiCmRhdGU6ICJzdGF0T21pY3MsIEdoZW50IFVuaXZlcnNpdHkgKGh0dHBzOi8vc3RhdG9taWNzLmdpdGh1Yi5pbykiICAKb3V0cHV0OgogICAgaHRtbF9kb2N1bWVudDoKICAgICAgY29kZV9kb3dubG9hZDogdHJ1ZSAgICAKICAgICAgdGhlbWU6IGNvc21vCiAgICAgIHRvYzogdHJ1ZQogICAgICB0b2NfZmxvYXQ6IHRydWUKICAgICAgaGlnaGxpZ2h0OiB0YW5nbwogICAgICBudW1iZXJfc2VjdGlvbnM6IHRydWUKLS0tCgojIFRoZSBsZXR0dWNlIGRhdGFzZXQKCkluIGEgcHJldmlvdXMgdHV0b3JpYWwsIHdlIGFuYWx5c2VkIHRoZSBkYXRhc2V0IG9uCmxldHR1Y2UgcGxhbnRzIHVzaW5nIEFOT1ZBLiBIb3dldmVyLCBpdCB3YXMgbm90IGNsZWFyCmlmIGFsbCB0aGUgYXNzdW1wdGlvbnMgb2YgQU5PVkEgd2VyZSBtZXQuIEluZGVlZCwgd2l0aApvbmx5IDcgZGF0YXBvaW50cyBwZXIgZ3JvdXAsIGl0IGlzIHZlcnkgaGFyZCB0byBhc3Nlc3MKdGhlIGFzc3VtcHRpb25zIG9mIG5vcm1hbGl0eSBhbmQgZXF1YWwgdmFyaWFuY2VzLgoKVGhlcmVmb3JlLCB3ZSB3aWxsIHJlLWFuYWx5c2UgdGhlIGRhdGFzZXQgYnkgdXNpbmcgdGhlCm5vbi1wYXJhbWV0cmljIGFsdGVybmF0aXZlIHRvIEFOT1ZBLCB0aGUgYEtydXNrYWwtV2FsbGlzIHRlc3RgLgpXZSB3aWxsIGZpcnN0IGdpdmUgYSBjb25jaXNlIG92ZXJ2aWV3IG9mIHdoYXQgd2Ugc2F3IGluIHRoZQpBTk9WQSBhbmFseXNpcywgd2hpY2ggY2FuIGJlIGZvdW5kIGluIHRoZSAKYEFOT1ZBX2xldHR1Y2VfcGxhbnRzLlJtZGAgZmlsZS4KClRoZSByZXNlYXJjaGVycyB3YW50IHRvIGZpbmQgb3V0IGlmIGJpb2NoYXIsIGNvbXBvc3QgYW5kCmEgY29tYmluYXRpb24gb2YgYm90aCBiaW9jaGFyIGFuZCBjb21wb3N0IGhhdmUgYW4gaW5mbHVlbmNlCm9uIHRoZSBncm93dGggb2YgbGV0dHVjZSBwbGFudHMuIFRvIHRoaXMgZW5kLCB0aGV5IGdyZXcgdXAKbGV0dHVjZSBwbGFudHMgaW4gYSBncmVlbmhvdXNlLiBUaGUgcG90cyB3ZXJlIGZpbGxlZCB3aXRoCm9uZSBvZiBmb3VyIHNvaWwgdHlwZXM7CgoxLiBTb2lsIG9ubHkgKGNvbnRyb2wpCjIuIFNvaWwgc3VwcGxlbWVudGVkIHdpdGggYmlvY2hhciAocmVmb2FrKQozLiBTb2lsIHN1cHBsZW1lbnRlZCB3aXRoIGNvbXBvc3QgKGNvbXBvc3QpCjQuIFNvaWwgc3VwcGxlbWVudGVkIHdpdGggYm90aCBiaW9jaGFyIGFuZCBjb21wb3N0IChjb2JjKQoKVGhlIGRhdGFzZXQgYGZyZXNod2VpZ2h0X2xldHR1Y2UudHh0YCBjb250YWlucyB0aGUgZnJlc2h3ZWlnaHQKKGluIGdyYW1zKSBmb3IgMjggbGV0dHVjZSBwbGFudHMgKDcgcGVyIGNvbmRpdGlvbikuCgpMb2FkIHRoZSByZXF1aXJlZCBsaWJyYXJpZXMKCmBgYHtyLCBtZXNzYWdlID0gRkFMU0V9CmxpYnJhcnkodGlkeXZlcnNlKQpgYGAKCiMgRGF0YSBpbXBvcnQKCmBgYHtyfQpsZXR0dWNlIDwtIHJlYWRfY3N2KCJodHRwczovL3Jhdy5naXRodWJ1c2VyY29udGVudC5jb20vc3RhdE9taWNzL1BTTFMyMS9kYXRhL2ZyZXNod2VpZ2h0X2xldHR1Y2UudHh0IikKYGBgCgpUYWtlIGEgZ2xpbXBzZSBhdCB0aGUgZGF0YQoKYGBge3J9CmdsaW1wc2UobGV0dHVjZSkKYGBgCgojIERhdGEgdGlkeWluZwoKYGBge3J9CiMjIHNldCB0cmVhdG1lbnQgdG8gZmFjdG9yCiMjIC4uLgpgYGAKCgojIERhdGEgZXhwbG9yYXRpb24KCmBgYHtyfQojIyBDb3VudCB0aGUgbnVtYmVyIG9mIG9ic2VydmF0aW9ucyBwZXIgdHJlYXRtZW50CgpgYGAKCk5vdyBsZXQncyBtYWtlIGEgYm94cGxvdCBkaXNwbGF5aW5nIHRoZSBmcmVzaHdlaWdodApvZiBlYWNoIHRyZWF0bWVudCBjb25kaXRpb246CgpgYGB7cn0KIyAuLi4KYGBgCgpJbnRlcnByZXQgdGhlIHZpc3VhbGl6YXRpb24hCgpJbiB0aGUgYW5hbHlzaXMgaW4gY2hhcHRlciA3IChgQU5PVkFfbGV0dHVjZV9wbGFudHNfaGFsZi5ybWRgIGZpbGUpLAp3ZSBhY2NlcHRlZCB0aGUgYXNzdW1wdGlvbnMgZm9yIGFuYWx5emluZyB0aGUgZGF0YSB3aXRoIGFuIEFOT1ZBLgpIb3dldmVyLCBpdCB3YXMgbm90IGNsZWFyIGlmIGFsbCB0aGUgYXNzdW1wdGlvbnMgb2YgQU5PVkEgd2VyZSBtZXQuIApJbmRlZWQsIHdpdGggb25seSA3IHZhbHVlcyBwZXIgZ3JvdXAsIGl0IGlzIHZlcnkgaGFyZCB0byBhc3Nlc3MKdGhlIGFzc3VtcHRpb25zIG9mIG5vcm1hbGl0eSBhbmQgZXF1YWwgdmFyaWFuY2VzLgoKVGhlcmVmb3JlLCB3ZSB3aWxsIHJlLWFuYWx5c2UgdGhlIGRhdGFzZXQgYnkgdXNpbmcgdGhlCm5vbi1wYXJhbWV0cmljIGFsdGVybmF0aXZlIHRvIEFOT1ZBOiB0aGUgS3J1c2thbC1XYWxsaXMgdGVzdC4KCiMgS3J1c2thbC1XYWxsaXMgcmFuayB0ZXN0CgojIyBIeXBvdGhlc2VzCgpGb3JtdWxhdGUgYSBjb3JyZWN0IG51bGwgYW5kIGFsdGVybmF0aXZlIGh5cG90aGVzaXMgZm9yIHRoZSBLcnVza2FsLVdhbGxpcyB0ZXN0IGluIHRoaXMgYW5hbHlzaXMuCgojIyBBbmFseXNpcwoKYGBge3J9CiNzZXQuc2VlZCgxKQoja3cgPC0ga3J1c2thbF90ZXN0KC4uLikKI2t3CmBgYAoKSW50ZXJwcmV0IHRoZSByZXN1bHRzIQoKIyBQb3N0LWhvYyBhbmFseXNpcwoKV2Ugd2lsbCBwZXJmb3JtIGEgcG9zdC1ob2MgYW5hbHlzaXMgd2l0aCBwYWlyd2lzZSBXaWxjb3hvbiByYW5rCnN1bSB0ZXN0LiBBcyB3ZSBkaWQgbm90IHdhbnQgdG8gYXNzdW1lIHRoZSBsb2NhdGlvbiBzaGlmdCwgd2UgCndpbGwgaW50ZXJwcmV0IHRoZSBvdXRjb21lIGluIHRlcm1zIG9mIHByb2JhYmlsaXN0aWMgaW5kaWNlcy4KTm90ZSB0aGF0IGFmdGVyIHRoZSBhbmFseXNpcywgd2Ugd2lsbCBuZWVkIHRvIGNvcnJlY3QgdGhlIGFjcXVpcmVkCnAtdmFsdWVzIGZvciBtdWx0aXBsZSB0ZXN0aW5nLgoKIyMgSHlwb3RoZXNlcwoKRm9ybXVsYXRlIGEgY29ycmVjdCBudWxsIGFuZCBhbHRlcm5hdGl2ZSBoeXBvdGhlc2lzIGZvciB0aGUgV2lsY294b24gdGVzdCBwb3N0LWhvYyBhbmFseXNpcy4KCiMjIEFuYWx5c2lzCgpgYGB7cn0KIyMgcGFpcndpc2Uud2lsY294LnRlc3QoLi4uKQpgYGAKCldoYXQgZG8geW91IG9ic2VydmU/CgpgYGAKIyMgQWx0ZXJuYXRpdmU6IGNhbHVjdWxhdGUgdGhlIHAtdmFsdWUgZm9yIGVhY2ggdHJlYXRtZW50IGNvbWJpbmF0aW9uIHdpdGggd2lsY294b25fdGVzdAoKdHJlYXRtZW50cyA8LSBsZXZlbHMobGV0dHVjZSR0cmVhdG1lbnQpCmZyZXNod2VpZ2h0IDwtIGxldHR1Y2UkZnJlc2h3ZWlnaHQKCnB2YWx1ZXMgPC0gY29tYm4odHJlYXRtZW50cywyLGZ1bmN0aW9uKHgpewogIAogICMjIFBhaXJ3aXNlIFdpbGNveG9uIHRlc3QKICB0ZXN0ID0gd2lsY294X3Rlc3QoZnJlc2h3ZWlnaHR+dHJlYXRtZW50LHN1YnNldChsZXR0dWNlLHRyZWF0bWVudCVpbiV4KSwgZGlzdHJpYnV0aW9uID0gJ2V4YWN0JykKICAKICAjIyBHZXQgYW5kIHN0b3JlIHAtdmFsdWUgb2YgdGVzdAogIHB2YWx1ZSh0ZXN0KQp9KQoKIyMgQWRqdXN0IGZvciBtdWx0aXBsZSB0ZXN0aW5nCnB2YWx1ZXNfYm9uZiA9IHAuYWRqdXN0KHB2YWx1ZXMsbWV0aG9kID0gJ2JvbmZlcnJvbmknKSAKCiMjIGxpbmsgdGhlIHAtdmFsdWUgd2l0aCB0aGUgY29ycmVjdCBwYWlyd2lzZSB0ZXN0Cm5hbWVzKHB2YWx1ZXNfYm9uZikgPC0gY29tYm4obGV2ZWxzKGxldHR1Y2UkdHJlYXRtZW50KSwyLHBhc3RlLGNvbGxhcHNlPSJfVlNfIikKcHZhbHVlc19ib25mCmBgYAoKSW50ZXJwcmV0LgoKQmFzZWQgb24gdGhlIGNodW5rIG9mIGNvZGUgYWJvdmUsIGNhbiBleHRyYWN0IHRoZSBwb2ludCBlc3RpbWF0ZXMKZm9yIHRoZSBwcm9iYWJpbGlzdGljIGluZGljZXM/IEludGVycHJldCB0aG9zZSBhcyB3ZWxsLgoKIyBDb25jbHVzaW9uCgpGb3JtdWxhdGUgYSBwcm9wZXIgY29uY2x1c2lvbiB0aGF0IGFuc3dlcnMgdGhlIHJlc2VhcmNoIGh5cG90aGVzaXMuCgoKCgoKCgoKCgoKCg==