Mathematical models are an essential tool in epidemiology. Models are used in a number of ways, including: to predict the course of an epidemic; to estimate key epidemiological parameters; to determine an optimal vaccination strategy; or to gain a general, qualitative understanding of the salient features of the dynamics of a disease. This course will focus on differential equations, a class of mathematical models used across the sciences to model quantities that change continuously in time. The course will begin with a short introduction to differential equations. We will then step through the full “pipeline” of mathematical modeling using epidemiology as a case study. We will begin with a first-principles derivation of several models and will consider how the natural history of an infectious disease influences modeling choices. We will then see how to solve and analyze differential equations computationally. We will then cover how to calibrate a model to epidemiological data and estimate key model parameters in a principled way. Finally, we will see how to create model projections for different intervention scenarios and will explore the strengths and limitations of a mathematical model in relation to given research and policy questions. Through this process, students will gain skills and experiences that will apply to mathematical modeling in a range of settings across the natural and social sciences.
In addition to learning the mathematics of differential equations applied to epidemiology, a central goal of this course is to gain skills necessary for research in the mathematical, natural, and social sciences. This includes conceptualizing and framing a research question, formulating a model to explore that question, engaging with peer-reviewed research publications, and giving a research presentation and preparing a short technical report.
Evaluation will be based on class participation, problem sets and programming exercises, and a term-long project culminating in a research presentation and short technical report.
Class Tuesday, Wednesday, Friday, Help Sessions: Thursday and Friday.
Class - 14:35-16:00
Help Sessions: 3:30-4:30 Thursday CHE 103, After class on request
Mathematical models are an essential tool in epidemiology. Models are used in a number of ways, including: to predict the course of an epidemic; to estimate key epidemiological parameters; to determine an optimal vaccination strategy; or to gain a general, qualitative understanding of the salient features of the dynamics of a disease. This course will focus on differential equations, a class of mathematical models used across the sciences to model quantities that change continuously in time. The course will begin with a short introduction to differential equations. We will then step through the full “pipeline” of mathematical modeling using epidemiology as a case study. We will begin with a first-principles derivation of several models and will consider how the natural history of an infectious disease influences modeling choices. We will then see how to solve and analyze differential equations computationally. We will then cover how to calibrate a model to epidemiological data and estimate key model parameters in a principled way. Finally, we will see how to create model projections for different intervention scenarios and will explore the strengths and limitations of a mathematical model in relation to given research and policy questions. Through this process, students will gain skills and experiences that will apply to mathematical modeling in a range of settings across the natural and social sciences.
In addition to learning the mathematics of differential equations applied to epidemiology, a central goal of this course is to gain skills necessary for research in the mathematical, natural, and social sciences. This includes conceptualizing and framing a research question, formulating a model to explore that question, engaging with peer-reviewed research publications, and giving a research presentation and preparing a short technical report.
Evaluation will be based on class participation, problem sets and programming exercises, and a term-long project culminating in a research presentation and short technical report.
This course is designed as a community learning journey. Together, we will:
Gain a basic understanding of ordinary differential equations (ODEs), including systems of coupled ODEs. Learn what ODEs are, what it means to find a solution to an ODE, and different ways of visualizing solutions, including phase space methods. Learn and implement (by hand and in R) Euler’s method, a numerical approach to solving ODEs. Learn to use ODE integrators that are part of standard packages in R.
Gain an understanding of the different ways mathematical and computational models are used in the sciences. Gain experience deriving a model from first principles, using models to explore and test ideas, and developing your own models.
Gain skills in identifying key characteristics of a disease including who, what, when, where, and how the disease is spread. Develop skills in translating and incorporating these characteristics into epidemiological models.
It is also our hope that we will:
This course has several distinct components, each of which may have a somewhat different feel.
Introduction to Differential Equations: We’ll talk about what differential equations are and what it means to solve a differential equation. We’ll learn about some techniques for determining the qualitative behavior of solutions to a differential equation. We’ll then learn Euler’s method, a simple but powerful numerical technique for solving differential equations. This part of the course will feel sorta like a math class. There will be a few problem sets and you’ll be doing some math.
R Basics: In parallel with item 1, we’ll cover some of the basics of the R environment for statistical computing and visualization. You’ll learn how to work with data files and plot data. We’ll then see how to use R to plot numerical solutions differential equations.
Basic Epidemiological Models: We then turn our attention to SIR models, a simple compartmental model commonly used in epidemiology (and elsewhere). We will build up this model carefully, starting with first principles and discussing the assumptions embedded in the model. This part of the course will seem less like a math class, since we’ll be relying almost exclusively on computational solutions to models. We’ll also consider a number of variants of the basic SIR model.
More Epidemiological Modeling Topics: Our attention will turn more toward modeling. We’ll read and think about the different sorts of ways that models are used in epidemiology and in the sciences more generally. We’ll also learn about how to estimate model parameters using data. AS time permits, we will cover additional topics of interest to students.
The class meets on Tuesday, Wednesday and Friday from 14:35-16:00.
In addition to the 4.5 hours of scheduled class time every week, we expect that between readings, going over notes, and doing assignments you will spend at least an additional 10.5 hours a week on this course, for a total of at least 150 hours over the term devoted to this class.
The question forum on google classroom will be used for online Q&A and collaborative discussion. You can post questions and answers can be posted by the instructor, TA, and/or other students. Please check google classroom to see whether a question has already been asked/answered before starting a new post. However, instead of sharing code on google classroom, describe the issue or post the error message - if we need to troubleshoot outside google classroom, I will let you know!
We will make an effort to check the question forum daily during the week to answer any questions. We will check the board on Monday to answer any questions posted over the weekend.
Online resources (e.g., StackOverflow) can be a good source for understanding and inspiration. These should not be an immediate go-to without your having struggled with the problem first. You should also take any such online posting with a grain of salt, as they can sometimes be misleading or in a different context than your own or, in some cases, simply wrong. Moreover, take special note of the “Sharing and Reusing Code” section below.
The Teaching Assistant and us will have a handful of help sessions every week. You are warmly invited and encouraged to attend these sessions. Help sessions are relaxed, informal, and hopefully fun. Things that happen at help sessions:
Everyone is welcome at help sessions! Attending these sessions help students do well in class and get as much out of it as possible.
A growing body of research indicates that traditional approaches to grading fail to produce the sorts of meaningful learning desired by both teachers and students. Such approaches often reinforce inequitable power dynamics between teachers and students, promote faulty reward systems that disincentive creativity and risk-taking, and devalue important aspects of learning (including revision and feedback). Given this context, instead of a traditional approach to grading in which you do work that is evaluated singularly by me, this course assumes that you opt to take ownership and responsibility over your performance and engagement with the class. To make this happen, this course uses a “contract grading” scheme, which gives you a voice in the grading process, provides you with the agency to specify your intended course performance, and also share in the responsibility for evaluating whether or not you fulfilled your intended obligations. Please see the contract grading document (on Google Classroom) for a more-fleshed-out explanation of this approach and how it will operate in the course.
We will meet with each of you individually to set goals at the start of the course.
The work in this course will be comprised of the following components and their weights:
Unless indicated otherwise on a specific assignment, your HW and lab assignments may, and should, be discussed with others in this course. Each assignment is subject to the “empty-hands policy:”
Any manifestation of copying another student’s work for your own (whether digital, hand-written, oral, etc.) is not permitted. This includes, for example, looking at or screen-capturing another student’s implementation and then writing “your own” version of that implementation.
I am well aware that a huge volume of code is available on the web to solve any number of problems. You can, and should, consult online resources (e.g., StackOverflow) when you get stuck. Recommendations there are often useful for helping you to get unstuck. However, you are not permitted to copy-and-paste any solutions directly from online resources. Rather, use them to guide your own solution and implementation. Caveat: answers on StackOverflow are sometimes wrong and/or can be misleading.
By enrolling in an academic institution, a student is subscribing to common standards of academic honesty. Any cheating, plagiarism, falsifying or fabricating of data is a breach of such standards. A student must make it their responsibility to not use words or works of others without proper acknowledgement. Plagiarism is unacceptable and evidence of such activity is reported to the provost or their designee. Two violations of academic integrity are grounds for dismissal from the college. Students would request in-class discussions of such questions when complex issues of ethical scholarship arise.
Most often, violations of academic integrity are due to perceived timeline pressures and a corresponding feeling of desperation. Please understand that, if you find yourself in such a situation, I want you to come talk to me, rather than making a rash (and poor) decision that can have negative consequences. We can discuss appropriate accommodations and alternative timelines for an assignment.
Many of us learn in different ways. For example, you may process information by speaking and listening, so while lectures are quite helpful for you, some of the written material may be difficult to absorb. You might have difficulty following lectures, but are able to quickly assimilate written information. You may need to fidget to focus in class. You might take notes best when you can draw a concept. For some of you, speaking in class can be a stressful or daunting experience. For some of you, certain topics or themes might be so traumatic as to be disruptive to learning. The principle of Universal Design for Learning calls for our classrooms, our virtual spaces, our practices and our interactions to be designed to include as many different modes of learning as possible, and is a principle I take seriously in this class.
It is also my goal to create an inclusive classroom, which depends on community building, and which requires everyone to come to class with mutual respect, civility, and a willingness to listen to and observe others. As such the syllabus serves as a contract of some expectations between all members of the class, including myself.
If you anticipate or experience any barriers to learning in this course, please reach out to me and your student support advisor. If you have a disability, or think you may have a disability, COA’s Disability Support Services located within the Office of Student Life in Deering Commons to develop a plan for your academic accommodations. You can find out more information in the course catalog under Accommodating students with disabilities. If you have already been approved for accommodations through the Disability Support Services please let me know! We can meet 1-1 to explore concerns and potential options.
All work is due on the stated due date. Due dates are there to help guide your pace through the course and they also allow me to return feedback to you in a timely manner. However, sometimes life gets in the way and you might not be able to turn in your work on time.
If you intend to submit work late for an assignment or project, you must notify me before the original deadline and as soon as the completed work is submitted. This allows me to return feedback to you and let’s me know when to check your work. Lab work cannot be submitted late.
We want to make sure that you learn everything you were hoping to learn from this class. If this requires flexibility, please don’t hesitate to ask.
You never owe us personal information about your health (mental or physical) but you’re always welcome to talk to us. If we can’t help, we likely know someone who can.
We want you to learn lots of things from this class, but we primarily want you to stay healthy, balanced, and grounded.
Most of you will need help at some point and we want to make sure you can identify when that is without getting too frustrated and feel comfortable seeking help.
Showcase your inner epidemiologist
First, select a disease. Your goal is to build a simple compartmental model for a particular outbreak or scenario (i.e. simulation) of your chosen disease, defining the scope of your model, estimating your model’s parameters as best you can, and showing your model’s results during the time span of the outbreak. Or you may try to reproduce a result from a peer-reviewed paper. You may select an outbreak from the articles shared on Google Classroom or find your own disease papers (we will cover google scholar and the datasets used be helpful: epimdr vignette. Use a compartmental model and explain your reasoning for the model you choose.
In most cases, your model’s results will not be very close to the timeline of the data from the actual outbreak. This is not just acceptable, but is expected!. The goal of this project is to learn about a disease, how to create a model for that disease, and identify ways in which the model is, and is not enlightening about your chosen disease outbreak. During the term, we will learn more approaches that will help us improve the fit of our models to data.
For your final project you will present your findings in a 10-minute presentation to the class on Tuesday or Wednesday of Week 10 (June 6 and 7) and a copy of your documented code and figures. We will discuss in class guidelines for effective scientific presentations and writing reproducible and readable code.
It is inevitable that some of you will “get farther” in your projects than others, since some of you are farther along in your studies. Also, some research topics are harder to make progress on than others. This diversity is fine. The point of the projects is not to come up with publishable original research or even have an amazing and polished final result. Rather, the idea is to strive toward as good a result as possible, and learn a lot (and challenge yourself and have fun) while doing so.
We will have a project scoping workshop on Friday, May 12 where we will meet in small groups to discuss our project ideas:
Come with a disease and an initial sketch (or slide) of a compartment model you are considering to model the disease
The outbreak dataset if you have identified one (if you are still searching for a dataset, that is fine).
Two or more papers that take a modeling approach to your disease that you find interesting
What you find interesting about your disease and what questions about the disease your model may shed light on
Thorndike Library offers many resources and services that can assist you in your academic endeavors, including individualized research support and access to resources beyond COA. Study spaces are also available. The library is open 7 days/week. Remote access to the research databases is available 24/7. Contact library@coa.edu or visit the library website for details.
If you are in need of a term long loaner laptop, please contact the IT department at helpdesk@coa.edu. Mention that you are taking a data science class, and pick up the laptop in A&S right by the whale skull.
The grading contract used in this course draws on the work of Ethan Miller, Misty Beck, Francis Eanes at Bates as well as scholars beyond these institutions.
The course website design draws inspiration from Introductory Statistics by Mine Çetinkaya-Rundel.