General Notes:

  1. Please bring a notebook and pencil to every class
  2. The principal documents for this course are ModernDive: An Introduction to Statistical and Data Sciences via R (MD), Data Science with R (DSWR), and An Introduction to Statistical Learning with Applications in R (ISLR)

Grading Rubric for Assignments

Field Excellent (3) Competent (2) Needs Work (1)
Reproducible All graphs, code, and answers are created from text files. Answers are never hard-coded but instead are inserted using inline R code. An automatically generated references section with properly formatted citations when appropriate and sessionInfo() are provided at the end of the document. All graphs, code, and answers are created from text files. Answers are hard coded. No sessionInfo() is provided at the end of the document. References are present but not cited properly or not automatically generated. Document uses copy and paste with graphs or code. Answers are hard coded; and references, when appropriate are hard coded.
Statistical Understanding Answers to questions demonstrate clear statistical understanding by comparing theoretical answers to simulated answers. When hypotheses are tested, classical methods are compared and contrasted to randomization methods. When confidence intervals are constructed, classical approaches are compared and contrasted with bootstrap procedures. The scope of inferential conclusions made is appropriate for the sampling method. Theoretical and simulated answers are computed but no discussion is present comparing and contrasting the results. When hypotheses are tested, results for classical and randomization methods are presented but are not compared and contrasted. When confidence intervals are constructed, classical and bootstrap approaches are computed but the results are not compared and contrasted. The scope of inferential conclusions made is appropriate for the sampling method. Theoretical and simulated answers are not computed correctly. No comparison between classical and randomization approaches is present when testing hypotheses. When confidence intervals are constructed, there is no comparison between classical and bootstrap confidence intervals .
Graphics Graphs for categorical data (barplot, mosaic plot, etc.) have appropriately labeled axes and titles. Graphs for quantitative data (histograms, density plots, violin plots, etc.) have appropriately labeled axes and titles. Multivariate graphs use appropriate legends and labels. Computer variable names are replaced with descriptive variable names. Appropriate graphs for the type of data are used. Not all axes have appropriate labels or computer variable names are used in the graphs. Inappropriate graphs are used for the type of data. Axes are not labeled and computer variable names appear in the graphs.
Coding Code (primarily R) produces correct answers. Non-standard or complex functions are commented. Code is formatted using a consistent standard. Code produces correct answers. Commenting is not used with non-standard and complex functions. No consistent code formatting is used. Code does not produce correct answers. Code has no comments and is not formatted.
Clarity Few errors of grammar and usage; any minor errors do not interfere with meaning. Language style and word choice are highly effective and enhance meaning. Style and word choice are appropriate for the assignment. Some errors of grammar and usage; errors do not interfere with meaning. Language style and word choice are, for the most part, effective and appropriate for the assignment. Major errors of grammar and usage make meaning unclear. Language style and word choice are ineffective and/or inappropriate.

Week 1: (Aug 17, 19)

Optional


Week 2: (Aug 24, 26)


Week 3: (Aug 31, Sep 2)


Week 4: (Sep 7, 9)

Optional

  • Add an avatar or picture on your GitHub account

  • Read the Git and GitHub chapter from Hadley Wickham’s book R Packages

  • Brian Caffo’s take on R IDEs


Week 5: (Sep 14, 16)

Optional

  • Complete Data Manipulation in R with dplyr (DataCamp)

Week 6: (Sep 21, 23)

Optional


Week 7: (Sep 28, 30)


Week 8: (Oct 5, 7)


Week 9: (Oct 12—Fall Break, Oct 14)

Optional


Week 10: (Oct 19, 21)


Week 11: (Oct 26, 28)


Week 12: (Nov 2, 4)


Week 13: (Nov 9, 11)


Week 14: (Nov 16, 18)


Week 15: (Nov 23)


Week 16: (Nov 30)

Work on kaggle assignment.


Final Exam: (Dec 7—11:00 am-1:30 pm)