Think Statistics with R

11 Sep 2023 at 09:00 - 13 Sep 2023 at 13:00

Think Statistics with R

The main aim of this course is to teach you to how to approach data analysis problems
with classical statistics. We focus on the intuition behind statistical methodologies
rather than on “how to run a t-test with R” (which we will also learn, by the way).

First we review the foundations (sampling theory, discrete and continuous distributions),
then we continue with hypothesis testing. The technology itself is introduced using “Student”‘s t-test
as an example, with a strong emphasis on errors (false positives, p-value distributions,
test power calculations). Finally a short “cookbook of tests” is offered.

Instructor: András Aszódi.

Topics

This course teaches the same statistical concepts as the Basic statistics with Python
training but uses the R programming language.

Sampling theory: obtaining information about a population via sampling.
Sample characteristics (location, dispersion, skewness), estimation of the mean, standard error of the mean.
Discrete and continuous probability distributions. Central limit theorem.
Hypothesis testing. Basic principles, one- and two-sided testing.
Type I and Type II errors. P-value distributions. Power calculations.
“Cookbook of tests”: location testing, normality, variance comparisons,
counting statistics, contingency tables, regression tests.

Out of scope

We cannot go into the specific data analysis problems of your particular project.

Furthermore, this course will not teach you bioinformatics.
In particular, no high-throughput sequencing data will be used because they are impractically large,
and not everyone on campus is working with sequencing.

Prerequisites

Basic familiarity with R is required. In particular the following skills are necessary:

Using the R interpreter, either the command-line program or in R Studio
How to invoke R functions, pass optional/named parameters
Some familiarity with simple plotting commands

If you have attended our R as a programming language training
then you are well equipped to take this course.

“Bring Your Own Data”

You can bring your own data to this course and run
a “Student”‘s t-test on it.

Please prepare
a comma-separated-values (CSV) file with UNIX line endings (n) that
contains two columns corresponding to the two groups of data whose means you would
like to compare. The size of the two groups need not be the same.

Practical information

Number of participants: minimum 5, maximum 10.

Length: The course takes two half-days,
from 09:00 to 13:00 with 2 breaks.

Details

Start:: 11 Sep 2023 at 09:00
End:: 13 Sep 2023 at 13:00
Event Category:: Trainings
Event Tags:: HPC
Website:: https://training.vbcf.ac.at/

Think Statistics with R

Topics

Out of scope

Prerequisites

“Bring Your Own Data”

Practical information

Details

Details

Contact us

FIND US

CONTACT US

Service Desk

QUICK LINKS

ABOUT