- Missing Data (Imputation Technique) - Missing Data within Features of the dataset for both Numerical and Categorical Features
- Categorical Variables - Machine Learning Model requires numbers instead of String or Object
- Variable Transformation - Linear Model assumes Features follow Normal or Gaussian Distribution
- Discretisation - Converting Continuous numbers into Discrete numbers.
- Outliers - outliers have huge impact on Linear Models mainly looks for continous features.
- Feature Scaling - All Features need to be scaled to one scale
DataSet: https://www.kaggle.com/c/house-prices-advanced-regression-techniques