Data Science in Infectious Disease Modeling using R
Online Lab Meeting Times:
- Monday, July 7, 1:00 - 2:30 PM ET and 3:00 - 4:30 PM ET
- Tuesday July 8, 1:00 - 2:30 PM ET and 3:00 - 4:30 PM ET
Classroom: Virtual
Module Summary:
This course will foster a problem solving mindset while exploring more advanced concepts in R and is ideal for someone who has some familiarity with R (base and Tidyverse). There are multiple approaches to coding an analysis; contents covered in this course will serve as a guide to help you decide which approach is best for your particular situation.
The course will draw concepts from R for Data Science, Advanced R, and R for Epidemiology books along with the instructors’ experiences with infectious disease modeling. We will work in both base R and the tidyverse to wrangle messy data and build analytic workflows.
Prerequisites:
Familiarity with R base and Tidyverse, including:
- R Studio/Post Studio
- R Markdown
- Pipes
Module Content:
This course will consist of 3 themes:
- Foundations in data science that are not often covered in an Intro R programming course.
- Advanced data wrangling tailored to different types of public health data.
- Special considerations for SISMID and public health data.
Asynchronous content:
-
Foundation Theme
-
Problem solving using advanced data wrangling
-
Advanced Coding Methods
-
Special considerations
Synchronous content:
-
Foundations
-
Working with public health survey data
-
Working with simulation/time series data
-
Working with line list/PII data
Instructors

Sarah Bowden, PhD
Lead Data Scientist, Division of Global Migration Health at CDC
Dr. Sarah Bowden is a Data Scientist in the Division of Global Migration Health at CDC. She has been coding in R since 2007 and has enjoyed seeing the Tidyverse develop and grow over time. Dr. Bowden uses Tidyverse tools and best practices in her day-to-day coding activities and has trained and mentored 20+ undergraduate, graduate, and postdoctoral fellows in data science and public health analytics over the past 8 years.

Raj Reni Kaul, PhD
Health Scientist (Data Scientist), Immunization Services Division at CDC
Dr. Reni Kaul is a Health Scientist in the Immunization Services Division at the CDC. She is a certified Carpentries Instructor and is committed to creating an inclusive learning environment. She has previously designed and taught coding courses in R for undergraduate and graduate students.
Required Software:
-
R Statistical Programming Language
-
RStudio/Posit Studio (desktop or cloud)
Recommended Reading: