Data Science 2: Introduction to Data Science 2

Course Description: 

 

Overview and Learning Objectives

This course continues the themes of CMPSC 90DA / CMPSC 5A / Data Science 1.  Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focus is on languages for transforming and analyzing data; machine learning methods including regression, classification and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.

Syllabus

  • Introduction
  • Data lifecycle
  • Pandas and EDA
  • Correlation and causality
  • Regression
  • Clustering
  • Testing hypotheses (confidence interval and p-value)
  • Comparing two samples (A/B testing)
  • Estimation
  • Sampling
  • Prediction and Regression
  • Classification
  • Privacy and Fairness
  • As time permits
    • Bayesian learning (Naive Bayesian and Maximum likelihood)
    • Advanced classification methods
    • Algorithms for learning models (Gradient descent and Computation complexity)

 

 

Additional Information: 

Description (from General Catalog): Course continues the themes of {CMPSC 90DA / Data Science 1 / CMPSC 5A}. Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focus is on transforming and analyzing data; machine learning methods including regression, classification and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.

Pre-requisite: Data Science 1 (CMPSC 90DA / CMPSC 5A with a grade of C or better)

College: Engineering

Units: 4

As of Fall 2022, CMPSC 5B fulfills the General Education Requirements for Area C and Quantitative Relationships. 

Course Level: 

  • Undergraduate

Course Number: 

2

Course Time: 

Offered every quarter:

Spring 2024

Tues/Thurs: 3:30 - 4:45 pm, ILP 2211
Lab1: Fridays at 9:00 am, PHELP 2514
Lab2: Weds at 1:00 pm, PHELP 2514
Lab3: Weds at 2:00 pm, PHELP 2514

 

Winter 2024

Tues/Thurs: 12:30 am - 1:45 pm, ILP 2211
Lab1: Weds at 11:00 am, GIRV 1119
Lab2: Weds at 12:00 pm, PHELP 1448
Lab3: Weds at 1:00 pm, PHELP 2514

 

Fall 2023

Tues/Thurs: 3:30 - 4:45 pm, IV THEA2
Lab1: Weds at 3:00 pm, PHELP 1444
Lab2: Weds at 4:00 pm, PHELP 1444
Lab3: Weds at 5:00 pm, PHELP 1440

 

Spring 2023

Tues/Thurs: 12:30 - 1:45 pm, ELLSN 2626
Lab1: Weds at 1:00 pm, HSSB 1223
Lab2: Weds at 2:00 pm, HSSB 1237

 

Winter 2023

Mon/Weds: 11:00 - 12:15 pm, GIRV 2110
Lab1: Thurs at 6:00 pm, GIRV 1112

 

Fall 2022

Tues/Thurs: 3:30 - 4:45 pm, PSYCH 1902
Lab1: Weds at 3:00 pm, PHELPS 1526

 

Spring 2022

Tues/Thurs: 12:30 - 1:45 pm, SH 1431
Lab1: Weds at 1:00 pm, NH 1111
Lab 2: Weds at 2:00 pm, PHELPS 1445
Lab 3: Weds at 3:00 pm, GIRV 2116

 

Winter 2022

Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00 pm, SH 1430
Lab 2: Weds at 2:00 pm, Phelps 2532

 

Fall 2021

Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00  pm, Bldg 387, rm. 1011
Lab 2: Weds at 2:00 pm, Girvetz 2116

 

Spring 2021

Tues/Thurs: 3:30 - 4:45 pm
Labs: Wed at 1:00 and 2:00 pm