Course Information
Teaching team
Class Time and Location
Monday through Friday 12:45PM (EST) on zoom
Communication
All communication will take place via Piazza:
- All official announcements and communication will happen over Piazza.
- Any questions regarding course content and course organization should be posted on Piazza. You are strongly encouraged to answer other students' questions when you know the answer.
- If there are private matters specific to you (e.g special accommodations, requesting alternative arrangements etc.), please create a private post on Piazza.
- For longer discussions (e.g., help on R project), please come to office hours.
- I will respond to Piazza posts sent during the week within 24h. I will respond to Piazza posts sent during the weekend at my own discretion.
Course materials
Textbook
Finding a good book is always a problem—none of the available texts is fully satisfactory. I will mostly follow the chapters in
Introduction to Statistics and Data Science: A moderndive into R and the tidyverse
by Chester Ismay, Albert Y. Kim, Arend M. Kuyper, Elizabeth Tipton, and Kaitlyn G. Fitzgerald. But we shall also read widely from various sources. I will occasionally post additional reading materials. Please check the daily schedule for reading assignments each day.
Computer and zoom attendance
You are not required to purchase any books, hardware, or software for this course. While not required, having one’s own laptop (and webcam, if not built-in) is helpful, particularly for in-class activities and office hours. If you do not have a laptop, please reach out to me at the beginning of the module to discuss possibilities.
Students should ideally be attending the class via zoom with webcam enabled and microphone capability, unless simply not possible, to maximize opportunities for participation. If you do not have stable internet or webcam, please let me know at the beginning of the module.
R and Rstudio
We will be using the programming language R and Rstudio. R is one of the most commonly used programming languages in academic research and industry. Knowing R is a marketable skill. In this class, you will learn enough about R to conduct basic data analysis tasks. Please install R and Rstudio at your earliest convenience.
Datacamp
In the first two weeks, you will be asked to complete 4 DataCamp assignment. You will get an email by the first day of class inviting you to join our class organization with an assignment pointing you to the specific chapters to do. Available for free.
Navigate the course website
- How is your grade calculated? Check out the Syllabus tab for course policies.
- You can find all course materials (readings, slides, videos, assignments) about this course on the daily Schedule.
- Got a question? Come to the teaching staff's Office Hours. We are ready to help!
- All communcations about this course should take place on Piazza.
- You will submit all your assignments on Gradescope.
- Browse through the Tips tab for some quick answers to FAQs.
Course goals
Statistical Thinking
Data visualization and summary, statistical inferenceComputational Skills
Working with data in R and RstudioCommunication Skills
Writing and speaking about statisticsTypical workflow
Weekly
-
Monday
Q&A (15 min) + Lecture (30min) + Tutorial exercise (30min)
-
Tuesday
Lecture (30min) + Tutorial exercise (30min)
-
Wednesday: HW due
Lecture (15min) + Tutorial exercise (1hr 30min)
-
Thursday
Lecture (30min) + Tutorial exercise (30min)
-
Friday: Project due
Lecture (15min) + Tutorial exercise (1hr 30min)
Daily
-
Before class (1hr)
Finish reading/video assignments
-
During class (1.5hr)
Some lecture and lots of learning by doing
-
After class (2hr)
Complete a problem set and project assignment everyweek
In general, this course should take you about 20-22 hours per week.
Acknowledgement
Some of the slides, exercises and language of syllabus come from Mine Çetinkaya-Rundel, David Banks, Evan Ray, Kari Lock Morgan, Ben Baumer, Albert Kim, Drew Hilton, Teaching Statistics: A Bag of Tricks by Andrew Gelman and Deborah Nolan, and Asking good questions blog by Allan Rossman. Illustrations in the course goal sections come from the Openscapes blog Tidy Data for reproducibility, efficiency, and collaboration by Julia Lowndes and Allison Horst (second) and the Luminousmen blog luminousmen.com (first and third). Some of the website design come from Mine Çetinkaya-Rundel and Mine Dogucu.