Course Description:
Overview and Learning Objectives
This course continues the themes of CMPSC 90DA / CMPSC 5A / Data Science 1. Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focuses on languages for transforming and analyzing data; machine learning methods, including regression, classification, and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.
Syllabus
- Introduction
- Data lifecycle
- Pandas and EDA
- Correlation and causality
- Regression
- Clustering
- Testing hypotheses (confidence interval and p-value)
- Comparing two samples (A/B testing)
- Estimation
- Sampling
- Prediction and Regression
- Classification
- Privacy and Fairness
- As time permits:
  - Bayesian learning (Naive Bayes and maximum likelihood)
  - Advanced classification methods
  - Algorithms for learning models (gradient descent and computational complexity)
Additional Information:
Description (from General Catalog): Course continues the themes of CMPSC 90DA / Data Science 1 / CMPSC 5A. Students explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. The course focus is on transforming and analyzing data; machine learning methods including regression, classification and clustering; principles behind data visualizations; concepts of measurement error and prediction; and techniques for scalable data processing.
Prerequisite: Data Science 1 (CMPSC 90DA / CMPSC 5A) with a grade of C or better
College: Engineering
Units: 4
As of Fall 2022, CMPSC 5B fulfills the General Education Requirements for Area C and Quantitative Relationships.
Course Level:
- Undergraduate
Course Number:
Course Time:
Offered every quarter:
Spring 2024
Tues/Thurs: 3:30 - 4:45 pm, ILP 2211
Lab1: Fridays at 9:00 am, PHELP 2514
Lab2: Weds at 1:00 pm, PHELP 2514
Lab3: Weds at 2:00 pm, PHELP 2514
Winter 2024
Tues/Thurs: 12:30 - 1:45 pm, ILP 2211
Lab1: Weds at 11:00 am, GIRV 1119
Lab2: Weds at 12:00 pm, PHELP 1448
Lab3: Weds at 1:00 pm, PHELP 2514
Fall 2023
Tues/Thurs: 3:30 - 4:45 pm, IV THEA2
Lab1: Weds at 3:00 pm, PHELP 1444
Lab2: Weds at 4:00 pm, PHELP 1444
Lab3: Weds at 5:00 pm, PHELP 1440
Spring 2023
Tues/Thurs: 12:30 - 1:45 pm, ELLSN 2626
Lab1: Weds at 1:00 pm, HSSB 1223
Lab2: Weds at 2:00 pm, HSSB 1237
Winter 2023
Mon/Weds: 11:00 - 12:15 pm, GIRV 2110
Lab1: Thurs at 6:00 pm, GIRV 1112
Fall 2022
Tues/Thurs: 3:30 - 4:45 pm, PSYCH 1902
Lab1: Weds at 3:00 pm, PHELPS 1526
Spring 2022
Tues/Thurs: 12:30 - 1:45 pm, SH 1431
Lab1: Weds at 1:00 pm, NH 1111
Lab2: Weds at 2:00 pm, PHELPS 1445
Lab3: Weds at 3:00 pm, GIRV 2116
Winter 2022
Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00 pm, SH 1430
Lab2: Weds at 2:00 pm, Phelps 2532
Fall 2021
Tues/Thurs: 3:30 - 4:45 pm, Phelps 1260
Lab1: Weds at 1:00 pm, Bldg 387, rm. 1011
Lab2: Weds at 2:00 pm, Girvetz 2116
Spring 2021
Tues/Thurs: 3:30 - 4:45 pm
Labs: Wed at 1:00 and 2:00 pm