Exploratory data analysis (colab notebook)
notebook about data transformation and model building
2025-11-16 16:40
// updated 2025-12-21 15:05
// updated 2025-12-21 15:05
Proceedings from a (2025-11-16) lecture about exploratory data analysis (EDA):
(accessible via link since Google Colab does not allow posting notebooks in an iframe)
Topics covered
- EDA
- business understanding
- the concept of "feature"
- data understanding (+ data collection)
- the concept of "casting"
- data cleaning (i.e. data transformation)
- removing irrelevant data
- removing outliers
- handling missing values
- the concept of "imputing"
- handling invalid data
- feature scaling (standardizing scores)
- data visualization
- model building
- value prediction
- model testing
- univariate analysis
- bivariate analysis
- multivariate analysis
- business understanding