Introduction to Biological Data Analysis

Instructors: Christopher Hemme, PhD, University of Rhode Island and Jillian Wise, PhD, Salve Regina University

Location: URI   
Session 1: June 23-25 ( Location 205 Avedisian Hall)
Session 2: July 14-16 ( Location 205 Avedisian Hall)

Course Overview

This WDT Module aims to train students in basic methods of biological data science.  It will cover concepts in using the Unix command line in a high-performance computing environment, introduction to the R programming language, design of biomedical experiments, and basic statistical analyses (e.g. t-test, ANOVA, regression).  Finally, students will get a basic introduction to bioinformatics and omics methods. The module will be completed over 2.5 days and participants who complete the module will receive an RI-INBRE Certificate of completion.

Learning Outcomes

  • Learn how to use the Unix command line on URI’s Unity High-Performance Computing Cluster
  • Learn basic concepts in biomedical experimental design, such as selection of controls, determining proper sample sizes, and different experimental setups
  • Process different types of biomedical data using common statistical concepts such as t-test, ANOVA, and linear regression
  • Learn basic concepts in bioinformatics including omics data analysis
  • Analyze bioinformatics data using established methods such as differential expression analysis

Lab Report

Students will be expected to maintain detailed laboratory notes to include:

  • Data analysis protocols (e.g., workflows, software used, etc.)
  • Quality control metrics for sequencing and data analysis
  • Results of data analysis

Resources Used in this Module:

  • Jupyter Notebooks for Hands-On Bioinformatics Data Analysis
  • Slide Decks for Lectures

Proposed Timeline (subject to change)

Day 1Day 2Day 3
9:00 AM-10:00 AMIntroduction to Best Practices in Biomedical Data Science (Lecture)Experimental Design and Power Analysis for Determining Sample Size for Biomedical Experiments (Lecture and Hands-On)Single Cell Identification (Hands-On)
10:00 AM-11:00 AMUsing the Unix Command Line and Interactive Tools (Jupyter Notebooks and RStudio) on High-Performance Computing Clusters (Hands-On)Methods for Exploratory Analysis of Biomedical Data (Lecture and Hands-On)Exploratory Analysis of Single Cell Data (Hands-On)
11:00 AM-12:00 PMR Programming I – Basic Data Structures and Functions (Hands-On)Statistical Methods for Analysis of Biomedical Data (ANOVA, t-Test, linear regression, logistic regression) (Lecture/Hands-On)Artificial Intelligence in Biomedical Data Analysis (Optional, if time allows)
12:00 - 1:00 PMBreakBreakWDT Survey and
Certificate Distribution
1:00 PM-2:00 PMR Programming II – Tidy Data and Tidyverse (Hands On)Introduction to Bioinformatics and Single Cell Omics (Lecture)
2:00 PM-3:00 PMR Programming III – Data Visualization (Hands On)Processing of Single Cell Data (Hands-On)
3:00 PM-4:00 PMBasic Statistical Concepts for Biomedical Research Processing of Single Cell Data Cont. (Hands-On)