This course is an introduction to statistical thinking and concepts, beginning with basic probability theory. The course concludes with selected statistical methods useful for data exploration and description of vector-valued data, a common setup in modern data analysis applications. Python and/or R will be used for practical implementation of all numerical and graphical procedures, including simulations.
Common requirements for the Semester in Mathematical Tools for Data Science.
On completion of the course, students will:
2 hours a week with a teaching assistant
Two midterm exams (25% each), homework (20%) and a final project (30%)