Data Science 2: Introduction to Data Science 2

Course Description: 

 

20 NOVEMBER 2021 -- UPDATE: Some students might need an add code. If you are unable to sign up for Data Science 2 in Winter 2021 on GOLD, then please complete this Google Form. You will be sent an approval code via email. 

New for Fall 2021: Find this course in the General Catalog as CMPSC 5B.

 

Overview and Learning Objectives

This course continues the themes of CMPSC 90DA / CMPSC 5A / Data Science 1.  Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focus is on languages for transforming and analyzing data; machine learning methods including regression, classification and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.

Syllabus

  • Introduction
  • Data lifecycle
  • Pandas and EDA
  • Correlation and causality
  • Regression
  • Clustering
  • Testing hypotheses (confidence interval and p-value)
  • Comparing two samples (A/B testing)
  • Estimation
  • Sampling
  • Prediction and Regression
  • Classification
  • Privacy and Fairness
  • As time permits
    • Bayesian learning (Naive Bayesian and Maximum likelihood)
    • Advanced classification methods
    • Algorithms for learning models (Gradient descent and Computation complexity)

 

 

Additional Information: 

Description (from General Catalog): Course continues the themes of {CMPSC 90DA / Data Science 1 / CMPSC 5A}. Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focus is on transforming and analyzing data; machine learning methods including regression, classification and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.

Pre-requisite: Data Science 1 (CMPSC 90DA / CMPSC 5A with a grade of C or better)

College: Engineering

Units: 4

Course Level: 

  • Undergraduate

Course Number: 

2

Course Time: 

Offered every quarter.
Currently listed as CMPSC 5B.

 

Winter 2022

Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00 pm, SH 1430
Lab 2: Weds at 2:00 pm, Phelps 2532

 

Fall 2021

Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00  pm, Bldg 387, rm. 1011
Lab 2: Weds at 2:00 pm, Girvetz 2116

 

Spring 2021

Tues/Thurs: 3:30 - 4:45 pm
Labs: Wed at 1:00 and 2:00 pm