Fish tank dataset
In this experiment, 96 fish (dojofish, goldfish and zebrafish) were placed separately in a tank with two litres of water and a certain dose (in mg) of a certain poison EI-43,064. The resistance of the fish a against the poison was measured as the amount of minutes the fish survived upon adding the poison (Surv_time, in minutes). Additionally, the weight of each fish was measured.
Goal
The research goal is to study the association between the dose of the poison that was administered to the fish and their survival time by using a linear regression model.
Read the required libraries
Import the data
poison <- read_csv("https://raw.githubusercontent.com/statOmics/PSLSData/main/poison.csv")
Data tidying
We can see a couple of things in the data that can be improved:
Capitalise the fist column name
Set the Species column as a factor
Change the species factor levels from 0, 1 and 2 to Dojofish, Goldfish and Zebrafish. Hint: use the fct_recode
function.
Data Exploration and Descriptive Statistics
How many fish do we have per species?
Which variables might influence survival? Make a suitable visualisation of the association between the dose and the survival time.
Important note on the dataset
In this dataset, there are multiple variables can have an effect on the survival time of the fish. The most obvious one is the dose of poison that was administered (as displayed above). However, we could also imagine that heavier fish are less prone to the poison than light fish. Additionally, one fish species may be more resistant to the poison.
To correctly analyse this data, all these factors should be taken into account. However, modeling the response based on multiple predictors will only be discussed later in this course. For now, we will simply ignore the potential effect of weigth and species on the survival time of the fish. Hence, we only consider the effect of the poison dosage
. This allows us to analyze the data using simple linear regression, but please bear in mind that not taking into account thesee other factors will invalidate our analysis. Later in the course, we will come back to this dataset and perform a correct analysis that takes into acount all relevant predictors.
Linear regression
In order to get familiar with simple linear regression
Check the assumptions
Interpret the model parameters of the linear model
Interpret the results, both for the intercept as well as for the slope
Write a conclusion that answers the research hypothesis.
LS0tCnRpdGxlOiAiRXhlcmNpc2UgNi4xOiBMaW5lYXIgcmVncmVzc2lvbiBvbiB0aGUgZmlzaCB0YW5rIGRhdGFzZXQiCmF1dGhvcjogIkxpZXZlbiBDbGVtZW50LCBKZXJvZW4gR2lsaXMgYW5kIE1pbGFuIE1hbGZhaXQiCmRhdGU6ICJzdGF0T21pY3MsIEdoZW50IFVuaXZlcnNpdHkgKGh0dHBzOi8vc3RhdG9taWNzLmdpdGh1Yi5pbykiCi0tLQoKIyBGaXNoIHRhbmsgZGF0YXNldAoKSW4gdGhpcyBleHBlcmltZW50LCA5NiBmaXNoIChkb2pvZmlzaCwgZ29sZGZpc2ggYW5kIHplYnJhZmlzaCkKd2VyZSBwbGFjZWQgc2VwYXJhdGVseSBpbiBhIHRhbmsgd2l0aCB0d28gbGl0cmVzIG9mIHdhdGVyIGFuZAphIGNlcnRhaW4gZG9zZSAoaW4gbWcpIG9mIGEgY2VydGFpbiBwb2lzb24gRUktNDMsMDY0LiBUaGUgcmVzaXN0YW5jZQpvZiB0aGUgZmlzaCBhIGFnYWluc3QgdGhlIHBvaXNvbiB3YXMgbWVhc3VyZWQgYXMgdGhlIGFtb3VudCBvZgptaW51dGVzIHRoZSBmaXNoIHN1cnZpdmVkIHVwb24gYWRkaW5nIHRoZSBwb2lzb24gKFN1cnZfdGltZSwgaW4KbWludXRlcykuIEFkZGl0aW9uYWxseSwgdGhlIHdlaWdodCBvZiBlYWNoIGZpc2ggd2FzIG1lYXN1cmVkLgoKIyBHb2FsCgpUaGUgcmVzZWFyY2ggZ29hbCBpcyB0byBzdHVkeSB0aGUgYXNzb2NpYXRpb24gYmV0d2VlbiB0aGUgZG9zZSBvZgp0aGUgcG9pc29uIHRoYXQgd2FzIGFkbWluaXN0ZXJlZCB0byB0aGUgZmlzaCBhbmQgdGhlaXIKc3Vydml2YWwgdGltZSBieSB1c2luZyBhIGxpbmVhciByZWdyZXNzaW9uIG1vZGVsLgoKClJlYWQgdGhlIHJlcXVpcmVkIGxpYnJhcmllcwoKYGBge3IsIG1lc3NhZ2UgPSBGQUxTRX0KbGlicmFyeSh0aWR5dmVyc2UpCmBgYAoKIyBJbXBvcnQgdGhlIGRhdGEKCmBgYHtyLCBtZXNzYWdlPUZBTFNFfQpwb2lzb24gPC0gcmVhZF9jc3YoImh0dHBzOi8vcmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbS9zdGF0T21pY3MvUFNMU0RhdGEvbWFpbi9wb2lzb24uY3N2IikKYGBgCgojIERhdGEgdGlkeWluZwoKV2UgY2FuIHNlZSBhIGNvdXBsZSBvZiB0aGluZ3MgaW4gdGhlIGRhdGEgdGhhdCBjYW4gYmUgaW1wcm92ZWQ6CgoxLiBDYXBpdGFsaXNlIHRoZSBmaXN0IGNvbHVtbiBuYW1lCgoyLiBTZXQgdGhlIFNwZWNpZXMgY29sdW1uIGFzIGEgZmFjdG9yCgozLiBDaGFuZ2UgdGhlIHNwZWNpZXMgZmFjdG9yIGxldmVscyBmcm9tIDAsIDEgYW5kIDIgdG8KRG9qb2Zpc2gsIEdvbGRmaXNoIGFuZCBaZWJyYWZpc2guICpIaW50KjogdXNlIHRoZSBgZmN0X3JlY29kZWAgZnVuY3Rpb24uCgpgYGB7cn0KCmBgYAoKIyBEYXRhIEV4cGxvcmF0aW9uIGFuZCBEZXNjcmlwdGl2ZSBTdGF0aXN0aWNzCgpIb3cgbWFueSBmaXNoIGRvIHdlIGhhdmUgcGVyIHNwZWNpZXM/CgpgYGB7cn0KCmBgYAoKV2hpY2ggdmFyaWFibGVzIG1pZ2h0IGluZmx1ZW5jZSBzdXJ2aXZhbD8gTWFrZSBhIHN1aXRhYmxlIHZpc3VhbGlzYXRpb24gb2YgdGhlCmFzc29jaWF0aW9uIGJldHdlZW4gdGhlIGRvc2UgYW5kIHRoZSBzdXJ2aXZhbCB0aW1lLgoKYGBge3J9CgpgYGAKCiMgSW1wb3J0YW50IG5vdGUgb24gdGhlIGRhdGFzZXQKCkluIHRoaXMgZGF0YXNldCwgdGhlcmUgYXJlIG11bHRpcGxlIHZhcmlhYmxlcyBjYW4gaGF2ZSBhbiBlZmZlY3Qgb24gdGhlIHN1cnZpdmFsCnRpbWUgb2YgdGhlIGZpc2guIFRoZSBtb3N0IG9idmlvdXMgb25lIGlzIHRoZSBkb3NlIG9mIHBvaXNvbiB0aGF0IHdhcwphZG1pbmlzdGVyZWQgKGFzIGRpc3BsYXllZCBhYm92ZSkuIEhvd2V2ZXIsIHdlIGNvdWxkIGFsc28gaW1hZ2luZSB0aGF0IGhlYXZpZXIKZmlzaCBhcmUgbGVzcyBwcm9uZSB0byB0aGUgcG9pc29uIHRoYW4gbGlnaHQgZmlzaC4gQWRkaXRpb25hbGx5LCBvbmUgZmlzaApzcGVjaWVzIG1heSBiZSBtb3JlIHJlc2lzdGFudCB0byB0aGUgcG9pc29uLgoKVG8gY29ycmVjdGx5IGFuYWx5c2UgdGhpcyBkYXRhLCBhbGwgdGhlc2UgZmFjdG9ycyBzaG91bGQgYmUgdGFrZW4gaW50byBhY2NvdW50LgpIb3dldmVyLCBtb2RlbGluZyB0aGUgcmVzcG9uc2UgYmFzZWQgb24gbXVsdGlwbGUgcHJlZGljdG9ycyB3aWxsIG9ubHkgYmUKZGlzY3Vzc2VkIGxhdGVyIGluIHRoaXMgY291cnNlLiAqKkZvciBub3csIHdlIHdpbGwgc2ltcGx5IGlnbm9yZSB0aGUgcG90ZW50aWFsKioKKiplZmZlY3Qgb2Ygd2VpZ3RoIGFuZCBzcGVjaWVzIG9uIHRoZSBzdXJ2aXZhbCB0aW1lIG9mIHRoZSBmaXNoLioqIEhlbmNlLCB3ZQpvbmx5IGNvbnNpZGVyIHRoZSBlZmZlY3Qgb2YgdGhlIHBvaXNvbiBgZG9zYWdlYC4gVGhpcyBhbGxvd3MgdXMgdG8gYW5hbHl6ZQp0aGUgZGF0YSB1c2luZyBzaW1wbGUgbGluZWFyIHJlZ3Jlc3Npb24sICoqYnV0IHBsZWFzZSBiZWFyIGluIG1pbmQgdGhhdCoqCioqbm90IHRha2luZyBpbnRvIGFjY291bnQgdGhlc2VlIG90aGVyIGZhY3RvcnMgd2lsbCBpbnZhbGlkYXRlIG91ciBhbmFseXNpcy4qKgpMYXRlciBpbiB0aGUgY291cnNlLCB3ZSB3aWxsIGNvbWUgYmFjayB0byB0aGlzIGRhdGFzZXQgYW5kIHBlcmZvcm0gYSBjb3JyZWN0CmFuYWx5c2lzIHRoYXQgdGFrZXMgaW50byBhY291bnQgYWxsIHJlbGV2YW50IHByZWRpY3RvcnMuCgojIExpbmVhciByZWdyZXNzaW9uCgpJbiBvcmRlciB0byBnZXQgZmFtaWxpYXIgd2l0aCBzaW1wbGUgbGluZWFyIHJlZ3Jlc3Npb24KCjEuIENoZWNrIHRoZSBhc3N1bXB0aW9ucwoKMi4gSW50ZXJwcmV0IHRoZSBtb2RlbCBwYXJhbWV0ZXJzIG9mIHRoZSBsaW5lYXIgbW9kZWwKCjMuIEludGVycHJldCB0aGUgcmVzdWx0cywgYm90aCBmb3IgdGhlIGludGVyY2VwdCBhcyB3ZWxsIGFzIGZvciB0aGUgc2xvcGUKCjQuIFdyaXRlIGEgY29uY2x1c2lvbiB0aGF0IGFuc3dlcnMgdGhlIHJlc2VhcmNoIGh5cG90aGVzaXMuCg==