SISBID 2026 Module 4 Unsupervised Methods for Statistical Learning

Module info

  • Location Online
  • Meeting Times Mon, Jul 27, 8am-2:30 pm PDT Tue, Jul 28, 8am-2:30 pm PDT Wed, Jul 29, 8-11am PDT
  • Registration Fees
    Regular Price$575
    Acad/Gov't/Non-Profit$475
    Early Pricing in Effect
  • Instructors Genevera Allen Genevera Allen Yufeng Liu Yufeng Liu
BD2604: Unsupervised Methods for Statistical Learning

In this module, we will present a number of unsupervised learning techniques for finding patterns and associations in Biomedical Big Data. These include dimension reduction techniques such as principal components analysis and non-negative matrix factorization, clustering analysis, and network analysis with graphical models.

We will also discuss large-scale inference issues, such as multiple testing, that arise when mining for associations in Biomedical Big Data. As in Module 4 on supervised learning, the main emphasis will be on the analysis of real high-dimensional data sets from various scientific fields, including genomics and biomedical imaging. The techniques discussed will be demonstrated in R.

This course assumes some previous exposure to linear regression and statistical hypothesis testing, as well as some familiarity with R or another programming language (see previous year’s materials as reference).

Recommended ReadingJames et al. (2013) Introduction to Statistical Learning. Springer Series in Statistics. Available for free download at www.statlearning.com.

Genevera Allen

Professor of Statistics
Columbia University
Member, Center for Theoretical Neuroscience
Columbia University
Member Zuckerman Institute for Mind, Brain, and Behavior
Columbia University
Member, Irving Institute for Cancer Dynamics
Columbia University

Yufeng Liu

John D. MacArthur Professor of Statistics
University of Michigan