SISBID 2025 Module 3 Supervised Methods for Statistical Machine Learning

Module info

  • Location Online
  • Meeting Times Mon, Jul 28, 8am-2:30 pm PDT Tue, Jul 29, 8am-2:30 pm PDT Wed, Jul 30, 8-11am PDT
  • Registration Fees
    Regular Price$645
    Acad/Gov't/Non-Profit$545
  • Instructors Jean Feng Jean Feng Ali Shojaie Ali Shojaie
BD2503: Supervised Methods for Statistical Machine Learning

Registration for this module is now closed.

In this module, we will present a number of supervised learning techniques for the analysis of Biomedical Big Data. These techniques include penalized approaches for performing regression, classification, and survival analysis with Big Data. Support vector machines, decision trees, and random forests will also be covered.

The main emphasis will be on the analysis of “high-dimensional” data sets from genomics, transcriptomics, metabolomics, proteomics, and other fields. These data are typically characterized by a huge number of molecular measurements (such as genes) and a relatively small number of samples (such as patients). We will also consider electronic health record data sets, which often contain many missing measurements.

Throughout the course, we will focus on common pitfalls in the supervised analysis of Biomedical Big Data and how to avoid them. The techniques discussed will be demonstrated in R.

This course assumes some previous exposure to linear regression and statistical hypothesis testing, as well as some familiarity with R or another programming language (see previous years materials for reference).

Recommended ReadingJames et al. (2013) Introduction to Statistical Learning. Springer Series in Statistics. Available for free download.

Jean Feng

Associate Professor, Department of Epidemiology and Biostatistics
University of California, San Francisco

Ali Shojaie

Director of Summer Institute in Statistics for Big Data (SISBID)
Department of Biostatistics, University of Washington
Professor and Associate Chair, Norm Breslow Endowed Faculty
Department of Biostatistics, University of Washington