Styling and infrastructure for this page are inspired by related syllabi produced by Ben Baumer and R. Jordan Crouser.
All readings for this course will be available in our course Perusall, which is linked in Moodle. I encourage you to complete the readings there so that you can leave comments and questions as they come up.
No Readings
No Readings
If you wish, you may fill out the Trigger Warnings Questionnaire in Moodle.
Be sure to configure Slack notifications for our course. I also encourage you to download and install a Desktop version of Slack.
Class slides are here
No Readings
No Readings
Trigger warning: Monday’s reading opens with a case study on gun homicide in the U.S.
The CS Liaisons will be hosting a software install session from 2:30-4 today. More information soon.
Today’s lab is posted here.
I will remain on the call for one hour past our scheduled class time to help anyone who would like to finish the lab. Please note that I won’t be able to do this every week.
2. R Basics , Irizarry, Rafael A. (2022). Introduction to Data Science. Data Analysis and Prediction Algorithms with R. URL: https://rafalab.github.io/dsbook/ (visited on Jan. 14, 2022).
Finish RStudio/GitHub Set-up
No Readings
You do not need to follow along with exercises in the course texts but may choose to do so if you wish.
Please thread responses to messages in Slack.
Class slides are here
Today’s worksheet is in Moodle. Download it and then open it in RStudio.
2. Data Visualization , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
11. Data Visualization Principles , Irizarry, Rafael A. (2022). Introduction to Data Science. Data Analysis and Prediction Algorithms with R. URL: https://rafalab.github.io/dsbook/ (visited on Jan. 14, 2022).
Tufte, Edward R. (2001). The Visual Display of Quantitative Information. 2nd edition. Cheshire, Conn: Graphics Press. ISBN: 978-1-930824-13-3.
D’Ignazio, Catherine and Lauren Klein (2020). “3. On Rational, Scientific, Objective Viewpoints from Mythical, Imaginary, Impossible Standpoints”. In: Data Feminism. Publisher: PubPub. MIT Press. URL: https://data-feminism.mitpress.mit.edu/pub/5evfe9yd/release/1 (visited on Aug. 24, 2021).
2. Data Visualization , Ismay, Chester and Albert Y. Kim (2021). Modern Dive: Statistical Inference via Data Science. CRC Press. URL: https://moderndive.com/ (visited on Jan. 14, 2022).
No Readings
No Readings
Quiz 1
No Readings
Class slides are here
Quiz 2 posted today.
Mini-project 1 will be posted on Friday.
Bryan, Jennifer (2018). “Excuse Me, Do You Have a Moment to Talk About Version Control?” In: The American Statistician 72.1, pp. 20-27. Publisher: Taylor & Francis. DOI: 10.1080/00031305.2017.1399928. URL: https://doi.org/10.1080/00031305.2017.1399928 (visited on Jan. 14, 2022).
Brennan, Stephen (2022). GitHub for Non-Coders - Stephen Brennan. URL: https://brennan.io/2015/08/07/github-noncoders/ (visited on Jan. 14, 2022).
Class slides are here
No Readings
No Readings
Snow Day!
3. Data Wrangling (3.1-3.3) , Ismay, Chester and Albert Y. Kim (2021). Modern Dive: Statistical Inference via Data Science. CRC Press. URL: https://moderndive.com/ (visited on Jan. 14, 2022).
No Readings
Day 14 solutions are here
Class slides are here
MP1 due Friday!
If you struggled on quiz 2, be sure to study the y-axis labels on all of our frequency plots and pay attention to units of observation!
Trigger warning: This week’s lab will review data that demonstrate racial profiling in policing. We will be reproducing the NYCLU’s data analysis of stop-and-frisk in NYC in 2011.
3. Data Wrangling (3.4-3.6) , Ismay, Chester and Albert Y. Kim (2021). Modern Dive: Statistical Inference via Data Science. CRC Press. URL: https://moderndive.com/ (visited on Jan. 14, 2022).
No Readings
Class slides are here
No Readings
Mini-Project 1: Profile a dataset
No Readings
5. Importing Data , Irizarry, Rafael A. (2022). Introduction to Data Science. Data Analysis and Prediction Algorithms with R. URL: https://rafalab.github.io/dsbook/ (visited on Jan. 14, 2022).
26. Parsing dates and times , Irizarry, Rafael A. (2022). Introduction to Data Science. Data Analysis and Prediction Algorithms with R. URL: https://rafalab.github.io/dsbook/ (visited on Jan. 14, 2022).
25. String processing , Irizarry, Rafael A. (2022). Introduction to Data Science. Data Analysis and Prediction Algorithms with R. URL: https://rafalab.github.io/dsbook/ (visited on Jan. 14, 2022).
Class slides are here
Topics and due dates have been updated since Friday’s discussion
Today’s office hours are in-person
The first 15 minutes of office hours on Wednesday will be devoted to quiz 2 review.
Solutions for lab 18 will be posted by end of day.
No Readings
No Readings
No Readings
Quiz 3
Wickham, Hadley (2014). “Tidy Data”. In: Journal of Statistical Software 59, pp. 1-23. DOI: 10.18637/jss.v059.i10. URL: https://doi.org/10.18637/jss.v059.i10 (visited on Jan. 14, 2022).
6. Tidy Data , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
No Readings
Class slides are here
5. Data wrangling on multiple tables , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
No Readings
Class slides are here
No Readings
No Readings
Class slides are here
No Readings
No Readings
Class slides are here
MP2 due Wednesday; last day to request an extension (by 5PM)
Quiz 4 due Wednesday (5PM)
No Readings
Mini-Project 2: Wrangle a dataset
Quiz 4
No Readings
Class slides are here
No Readings
No Readings
17. Working with geospatial data (17.1-17.3) , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
No Readings
No Readings
No Readings
Class slides are here
No Readings
No Readings
17. Working with geospatial data (17.4-17.8) , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
No Readings
No Readings
Quiz 5
No Readings
Class slides are here
Quiz 5 due at 5PM.
MP3 soft deadline this Friday.
Class will be in-person on Friday.
No Readings
Mini-Project 3: Acquire
No Readings
Class slides are here
7. Iteration , Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton (2021). Modern Data Science with R. 2nd. CRC Press. URL: https://mdsr-book.github.io/mdsr2e/ (visited on Jan. 14, 2022).
No Readings
Class slides are here
No Readings
No Readings
Class slides are here
No Readings
No Readings
Class slides are here
No Readings
No Readings
No Readings
Quiz 6
No Readings
No Readings
Mini-Project 4: Mapping Census Data
Quiz 7 due by the last day of finals
No Readings