12th Summer Institute in Statistics and Modeling in Infectious Diseases (SISMID)

Module 15: Microbiome Data Analysis

Mon Jul 27 to Wed Jul 29

Module Schedule: Monday, July 27; Tuesday, July 28, and Wednesday, July 29. 

Prerequisites: Programming will be done in R and fluency at the level of the module on Introduction to R, though not necessarily from taking that module, will be expected. This module assumes knowledge of the material in Module 1: Probability and Statistical Inference, though not necessarily from taking that module.

This course is concerned with multivariate statistical analysis of microbiome data. We will briefly cover foundational concepts in microbial ecology, molecular biology, bioinformatics, and DNA sequencing.

The main focus of the course will be on developing an understanding of multivariate analysis of microbiome data. Practical skills to be developed in this course include managing high-dimensional and structured data in metagenomics, visualization and representation of high-dimensional data, normalization, filtering, and mixture-model noise modeling of count data, as well as clustering and predictive model building.