Exploratory data analysis (colab notebook)

notebook about data transformation and model building
2025-11-16 16:40
// updated 2025-12-21 15:05

Proceedings from a (2025-11-16) lecture about exploratory data analysis (EDA):

(accessible via link since Google Colab does not allow posting notebooks in an iframe)

Topics covered

  • EDA
    • business understanding
      • the concept of "feature"
    • data understanding (+ data collection)
      • the concept of "casting"
    • data cleaning (i.e. data transformation)
      • removing irrelevant data
      • removing outliers
      • handling missing values
        • the concept of "imputing"
      • handling invalid data
      • feature scaling (standardizing scores)
    • data visualization
    • model building
    • value prediction
    • model testing
      • univariate analysis
      • bivariate analysis
      • multivariate analysis
⬅️ older (in snippets)
📚 Python libraries (a quick how-to)
newer (in snippets) ➡️
NumPy essentials 🔢
⬅️ older (posts)
📚 Python libraries (a quick how-to)
newer (posts) ➡️
NumPy essentials 🔢