
Course Description

This intermediate-level course is one of our Foundations courses. It covers essential statistical concepts and methods for extracting insights from empirical data in the life sciences. The course introduces applied statistics, starting from important aspects of experimental design and data exploration, and then moves into statistical modeling and data analysis. We focus on the link between linear regression and analysis of variance. Together, these methods belong to the study of General Linear Models. The course also introduces the basics of non-parametric testing, and addresses categorical data analysis and logistic regression.

The concepts and methods are introduced exclusively via case studies in the life sciences. For every case study, we elaborate on a concrete research question and then provide a study design, followed by data exploration. Next, we focus on how to model the data and on the link between the model parameters and the subject-matter research question.


Learning objectives

The case studies will enable the participants to build self-confidence in:

  • understanding, reading and communicating about data and data analysis;

  • selecting appropriate statistical methods and software tools for analysing different types of data;

  • interpreting the results of a statistical data analysis in terms of subject-matter research questions and reporting them appropriately.

There is a strong emphasis on reproducible research through the extensive use of R/Rmarkdown scripts. This approach enables the participants to weave statistical analyses, code, results and interpretation into webpages and PDF documents, so that their entire data analysis workflow is transparent and reproducible. The course materials are themselves written in R/Rmarkdown, which gives participants a starting point for developing their own scripts.


Practical Statistics for the Life Sciences

Software

  • The learning materials are developed for R version 3.6.0 or later. We also recommend installing the latest version of RStudio.

  • To install all requirements, please copy and paste this line of code in your R console.

source("https://raw.githubusercontent.com/GTPB/PSLS20/master/install.R")
  • Alternatively, learners can launch an RStudio interface in the cloud.

Binder

  • To get familiar with the most important concepts for working with R and Rmarkdown, you may download the respective cheat sheets: https://raw.githubusercontent.com/GTPB/PSLS20/master/background_material/r-cheatsheet.pdf and https://raw.githubusercontent.com/GTPB/PSLS20/master/background_material/rmarkdown-cheatsheet.pdf

  • All data can be downloaded locally through data
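
Before running the install script, it can be useful to confirm that the local R installation meets the stated minimum version. The snippet below is a minimal sketch of such a check and is not part of the course's own install script. The variable names `required` and `installed` are illustrative, and the threshold is taken from the requirement stated above.

```r
# Minimum R version stated in the course requirements (assumed threshold).
required <- "3.6.0"

# getRversion() returns the running R version as a comparable version object.
installed <- getRversion()

if (installed < required) {
  stop("R ", required, " or later is required; found ", as.character(installed))
}

message("R ", as.character(installed), " meets the course requirement.")
```

If the check stops with an error, update R first and then rerun the install line in a fresh R console.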

Learning materials

  1. Introduction
  2. Concepts
  3. Experimental Design
  4. Data Exploration
  5. Statistical Inference
  6. Linear Regression
  7. Analysis of Variance
  8. Multiple Regression
  9. Nonparametric Statistics
  10. Categorical Data Analysis

Instructors


The source for this course webpage is on GitHub.


Creative Commons License
PSLS20 by GTPB is licensed under a Creative Commons Attribution 4.0 International License.