
Data Cleaning Projects

A Summary of Data Cleansing Projects

Projects

Basic Operations

Missing Values

Visualizing Missing Values

  • Direct counting
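import pandas as pd

# Load the data (the file is space-separated)
train_data = pd.read_csv("used_car_train_20200313.csv", sep=" ")

# Count the missing values in each column
missing = train_data.isnull().sum()
# Keep only the columns that have missing values, sorted in ascending order
missing = missing[missing > 0].sort_values()
missing.plot.bar()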
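
The same counting step can be tried without the competition file. What follows is only a minimal sketch on a small made-up DataFrame: the column names and the helper `missing_summary` are hypothetical and exist purely for illustration. It reports the missing count and the missing fraction for each column that has gaps.

```python
import numpy as np
import pandas as pd


def missing_summary(df: pd.DataFrame) -> pd.DataFrame:
    """Missing count and fraction per column, for columns with gaps only.

    Hypothetical helper for illustration; not part of pandas.
    """
    counts = df.isnull().sum()
    counts = counts[counts > 0].sort_values()
    summary = counts.to_frame(name="n_missing")
    summary["fraction"] = summary["n_missing"] / len(df)
    return summary


# Made-up sample data (not taken from the used-car dataset).
demo = pd.DataFrame(
    {
        "col_a": [1000, 2000, 1500, 1800],
        "col_b": [1.0, np.nan, 2.0, np.nan],
        "col_c": [0.0, 1.0, np.nan, 0.0],
    }
)

summary = missing_summary(demo)
print(summary)
```

Reporting the fraction next to the raw count makes it easier to judge whether a column is worth imputing or should simply be dropped.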
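import missingno as msno

# Plot the nullity matrix for a random sample of 250 rows
# (train_data is the DataFrame loaded above and must have at least 250 rows)
msno.matrix(train_data.sample(250))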
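
msno.matrix draws one row per sampled record and shows which cells are filled. As a rough sketch of the data behind such a plot (this is not missingno's actual implementation), the present/missing mask can be built with pandas alone. The DataFrame and its column names below are made up for illustration.

```python
import numpy as np
import pandas as pd

# Made-up sample data (not taken from the used-car dataset).
demo = pd.DataFrame(
    {
        "col_a": [1.0, np.nan, 3.0],
        "col_b": [np.nan, np.nan, 6.0],
    }
)

# True where a value is present, False where it is missing.
present = demo.notnull()

# Number of filled cells in each row.
row_completeness = present.sum(axis=1)

print(present)
print(row_completeness.tolist())
```

Rows with a low completeness count are the ones that would show up as mostly blank in the matrix plot.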

