Linear regression tutorial
On the third day of the “Practical Statistics for the Life Sciences (2020)” course, we will have three tutorials on linear regression, each based on a different dataset:
1) The breast cancer dataset:
- Exercise: Exercise_1
2) The fish dataset:
In this experiment, 96 fish (dojofish, goldfish and zebrafish) were each placed in a separate tank containing two liters of water and a certain dose (in mg) of the poison EI-43,064. The resistance of each fish to the poison was measured as the number of minutes the fish survived after the poison was added (Surv_time, in minutes). Additionally, the weight of each fish was measured.
In this tutorial, we will study the association between dose and survival time by using a linear regression model.
- Exercise: Exercise_2
- Data: “https://raw.githubusercontent.com/GTPB/PSLS20/master/data/poison.csv”
- Solution: Solution_2
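As a rough illustration of what fitting such a model involves, the sketch below computes the least squares intercept and slope for survival time as a function of dose. It is written in plain Python rather than in the course's own materials, the function name `fit_simple_linear_regression` is hypothetical, and the six data points are made-up values chosen for easy arithmetic, not rows from poison.csv.

```python
from statistics import mean


def fit_simple_linear_regression(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    # Sum of squares of x around its mean, and cross-products of x and y.
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Made-up illustrative values, NOT taken from poison.csv.
dose = [1.0, 1.0, 2.0, 2.0, 3.0, 3.0]            # dose in mg
surv_time = [10.0, 12.0, 8.0, 8.0, 5.0, 5.0]      # survival time in minutes

intercept, slope = fit_simple_linear_regression(dose, surv_time)
print(f"intercept = {intercept:.2f}, slope = {slope:.2f}")
```

In this toy example the slope is the estimated change in survival time (in minutes) per additional mg of poison, so a negative slope corresponds to shorter survival at higher doses.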
3) The FEV dataset:
The FEV, which is an acronym for forced expiratory volume, is a measure of how much air a person can exhale during a forced breath.
In this dataset, the FEV of 606 children between the ages of 6 and 17 was measured. The dataset also provides additional information on these children: their age, their height, their gender and, most importantly, whether the child is a smoker or a non-smoker.
The goal of this experiment was to find out whether or not smoking has an effect on the FEV of children.
- Exercise: Exercise_3
- Data: “https://raw.githubusercontent.com/GTPB/PSLS20/master/data/fev.txt”
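One way to approach this question is a linear model with a smoking indicator and, for example, age as predictors. The sketch below is a minimal plain-Python illustration under stated assumptions: the helper names `solve_linear_system` and `fit_linear_model` are hypothetical, the choice of age as the only extra covariate is an assumption, and the data are made-up values constructed from a known formula, not rows from fev.txt.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b], so row operations act on both sides.
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Swap in the row with the largest entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        # Eliminate the entries below the pivot.
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper triangular system.
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_linear_model(rows, y):
    """Least squares coefficients (intercept first) via the normal equations."""
    design = [[1.0] + list(row) for row in rows]  # prepend intercept column
    p = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(p)]
    return solve_linear_system(xtx, xty)


# Made-up illustrative values, NOT taken from fev.txt.
# Each row: (smoker indicator, 1 = smoker and 0 = non-smoker; age in years).
predictors = [(0, 8), (0, 10), (0, 12), (1, 10), (1, 12), (1, 14)]
# Constructed exactly as fev = 1.0 - 0.3 * smoker + 0.2 * age.
fev = [2.6, 3.0, 3.4, 2.7, 3.1, 3.5]

coefficients = fit_linear_model(predictors, fev)
b0, b_smoker, b_age = coefficients
print(f"intercept = {b0:.2f}, smoker = {b_smoker:.2f}, age = {b_age:.2f}")
```

In this toy data the smokers are older on average, so their raw mean FEV is slightly higher than that of the non-smokers even though the smoking coefficient is negative once age is in the model. Whether anything similar holds in the real dataset is for the exercise to establish.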