Build software better, together

awslabs / datawig

Star

Imputation of missing values in tables.

imputation missing-value-handling

Updated Jan 14, 2026

DivyaKrishnani / Data-Preprocessing-with-Python

Star

Implementation of Data Preprocessing techniques such as handling missing values, noise smoothing, PCA, etc.

data-mining smoothing dispersion binning data-preprocessing normalization missing-value-handling

Updated Jan 23, 2019
Jupyter Notebook

gjorshoskaivana / MIDA-in-FCDBs

Star

Repository containing the implementation of the models and experiments in the paper "Missing value imputation in Food Composition Data with Denoising Autoencoders"

deep-learning missing-values missing-value-handling food-composition

Updated Nov 1, 2021
Jupyter Notebook

anikch / Telecom-churn-analysis-and-prediction

Star

Analyze customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn (usage-based churn) and identify the main indicators of churn.

machine-learning random-forest eda xgboost pca logistic-regression rfe missing-value-handling telecom-churn-prediction telecom-churn-analysis

Updated Nov 22, 2021
Jupyter Notebook

rahulvictor12 / German-Bank-Loan-Defaulter-Prediction

Star

A machine learning project to predict loan defaults in a German bank's customer base. Using the German Credit Risk dataset, it explores key factors contributing to defaults and trains models like Random Forest, GBM, and XGBoost. Includes EDA, data processing, hyperparameter tuning, and model evaluation.

machine-learning random-forest exploratory-data-analysis xgboost accuracy gbm recall data-processing hyperparameter-tuning precision f1-score bagging ada-boost-classifier gridsearchcv missing-value-handling categorical-encoding modelevaluation randomsearch-cv

Updated Nov 24, 2024
Jupyter Notebook

MouliSirigiri / Electric-Power-Consumption-Analysis

Star

I visualized electric power consumption (kWh/capita) for 8 countries (2001–2014) using Pandas/Matplotlib. Line plots show China's +200% surge (1,200→3,600); bar (2008) ranks Canada #1 (16k); pies highlight China's share rise (16%→23%).

pandas matplotlib missing-value-handling

Updated Jan 8, 2026
Python

grahman20 / kDMI

Star

kDMI employs two levels of horizontal partitioning (based on a decision tree and k-NN algorithm) of a data set, in order to find the records that are very similar to the one with missing value/s. Additionally, it uses a novel approach to automatically find the value of k for each record.

data-science machine-learning data-mining linear-regression data-analytics classification data-analysis missing-data preprocessing decision-tree data-cleansing missing-values missing-value-handling missing-data-imputation missing-value-imputation missing-data-treatment

Updated Mar 25, 2023
Java

grahman20 / DMI

Star

DMI Class implements the DMI imputation algorithm for imputing missing values in a dataset from Rahman, M. G., and Islam, M. Z. (2013): Missing Value Imputation Using Decision Trees and Decision Forests by Splitting and Merging Records: Two Novel Techniques

java data-science data data-mining analysis linear-regression weka imputation missing-data preprocessing missing expectation-maximization-algorithm data-cleaning decision-tree imputation-algorithm missing-value-treatment missing-value-handling missing-value-imputation

Updated Mar 24, 2023
Java

kajakgupta / Missing-Value-Treatment

Star

Prevention and handling of missing data

missing-data missing-values missing-value-treatment missing-value-handling

Updated Aug 16, 2018
Jupyter Notebook

ANikhilAgarwal / Analysis-Of-Google-Play-Store-Data

Star

visualization python exploratory-data-analysis statistical-analysis missing-value-treatment missing-value-handling missing-value-imputation analysis-of-google-play-store

Updated Jun 11, 2022
Jupyter Notebook

grahman20 / SiMI

Star

SiMI imputes numerical and categorical missing values by making an educated guess based on records that are similar to the record having a missing value. Using the similarity and correlations, missing values are then imputed. To achieve a higher quality of imputation some segments are merged together using a novel approach.

data-science linear-regression dataset missing-data preprocessing data-cleaning decision-tree decision-tree-classifier missing-values decision-forest decision-forest-algorithm missing-value-handling missing-data-imputation missing-value-imputation numerical-missing-value categorical-missing-value

Updated Mar 24, 2023
Java

katerinaharana / Team-2-Project

Star

Predicting the City Cycle Fuel Consumption in MPG of a Car. A Classification Problem

encoding eda neural-networks pca classification outlier-detection data-cleaning ensemble-model kmeans-clustering ensemble-classifier isolation-forest missing-value-handling

Updated Apr 19, 2025
Jupyter Notebook

priya-aggarwal27 / Telecom-Churn-Case-Study

Star

This project is based on the Indian and Southeast Asian market. Analyse customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn and identify the main indicators of churn.

random-forest eda supervised-learning xgboost-algorithm rescaling missing-value-handling

Updated Mar 30, 2021
Jupyter Notebook

Mehnaz2004 / Data-Cleaning-CaseStudy

Star

This repository demonstrates data cleaning with a layoffs dataset. It covers handling missing values, detecting outliers, and encoding categorical data, using visualizations like boxplots and distplots to enhance data quality. Check out the code to see these techniques in action.

sklearn pandas data-visualization seaborn data-cleaning data-integrity missing-value-handling outlier-detection-and-removal categorical-data-encoding

Updated Sep 11, 2024
Python

Gaurabh007 / Feature_Engineering

Star

This repository focuses on practical feature engineering techniques for machine learning. Learn to handle missing values, balance datasets, perform interpolation, encode variables, and explore data relationships using summary statistics and visualizations. Perfect for boosting model performance with smarter data prep.

sampling feature-engineering missing-value-handling missing-value-imputation data-encoder

Updated Oct 1, 2023
Jupyter Notebook

jodiambra / ICE-Retail-EDA

Star

Exploratory data analysis on ICE retail gaming store.

data-science exploratory-data-analysis pivot-tables t-test scipy feature-engineering filtering hypothesis-testing profitability missing-value-handling cleaning-data

Updated May 14, 2023
Jupyter Notebook

souravsuvarna / MissNoMore

Star

MissNoMore is a Python-based missing value imputation tool designed to handle CSV datasets with missing data.

python data-science csv-parser datacleaning missing-value-handling streamlit missing-value-imputation

Updated Aug 13, 2023
Python

sajjad425 / missingValue

Star

This repository provides a guide on handling missing values in Python, covering identification methods, imputation techniques (mean, median, mode, fill, interpolation), advanced methods (KNN, multiple imputation), and best practices. It includes practical examples for both numerical and categorical data.

data-science data data-analysis-python missing-value-handling missing-value-imputation

Updated Dec 11, 2024
Jupyter Notebook

AMRHiwa / Hotel_booking_Data_Exploration

Star

In this repository, we intend to extract data from the mentioned dataset and display everything that seems interesting.

data-science data-mining data-visualization data-analysis missing-values missing-value-handling

Updated Mar 19, 2024
Jupyter Notebook

grahman20 / FIMUS

Star

FIMUS imputes numerical and categorical missing values by using a data set’s existing patterns including co-appearances of attribute values, correlations among the attributes and similarity of values belonging to an attribute.

data-science data-mining correlation missing-data similarity-measures preprocessing data-cleaning data-quality data-cleansing missing-values missing-value-handling missing-data-imputation missing-value-imputation co-appearance

Updated Mar 24, 2023
HTML

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

missing-value-handling

Here are 26 public repositories matching this topic...

awslabs / datawig

DivyaKrishnani / Data-Preprocessing-with-Python

gjorshoskaivana / MIDA-in-FCDBs

anikch / Telecom-churn-analysis-and-prediction

rahulvictor12 / German-Bank-Loan-Defaulter-Prediction

MouliSirigiri / Electric-Power-Consumption-Analysis

grahman20 / kDMI

grahman20 / DMI

kajakgupta / Missing-Value-Treatment

ANikhilAgarwal / Analysis-Of-Google-Play-Store-Data

grahman20 / SiMI

katerinaharana / Team-2-Project

priya-aggarwal27 / Telecom-Churn-Case-Study

Mehnaz2004 / Data-Cleaning-CaseStudy

Gaurabh007 / Feature_Engineering

jodiambra / ICE-Retail-EDA

souravsuvarna / MissNoMore

sajjad425 / missingValue

AMRHiwa / Hotel_booking_Data_Exploration

grahman20 / FIMUS

Improve this page

Add this topic to your repo