I was trying the house-price-dataset from kaggle. I was trying to clear the NaN values from column named 'Alley'
for column in missing_data.columns.values.tolist():
print(column)
print(missing_data[column].value_counts())
print("")
Id
False 1460
Name: Id, dtype: int64
MSSubClass
False 1460
Name: MSSubClass, dtype: int64
MSZoning
False 1460
Name: MSZoning, dtype: int64
LotFrontage
False 1201
True 259
Name: LotFrontage, dtype: int64
LotArea
False 1460
Name: LotArea, dtype: int64
Street
False 1460
Name: Street, dtype: int64
Alley
True 1369
False 91
Name: Alley, dtype: int64
LotShape
False 1460
Name: LotShape, dtype: int64
LandContour
False 1460
Name: LandContour, dtype: int64
Utilities
False 1460
Name: Utilities, dtype: int64
LotConfig
False 1460
Name: LotConfig, dtype: int64
These are some of the values i got running the above code. I replaced LotFrontage's NaN value with it's mean and wanted to replace the NaN values in 'Alley' with it's frequency. But when I write this code I get an error.
train['Alley'].value_counts()
KeyError Traceback (most recent call last)
~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
2645 try:
-> 2646 return self._engine.get_loc(key)
2647 except KeyError:
pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()
pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()
KeyError: 'Alley'
During handling of the above exception, another exception occurred:
KeyError Traceback (most recent call last)
<ipython-input-72-8e1e57b44782> in <module>
1 #replace nan values in Alley with frequency
----> 2 train['Alley'].value_counts()
~\anaconda3\lib\site-packages\pandas\core\frame.py in __getitem__(self, key)
2798 if self.columns.nlevels > 1:
2799 return self._getitem_multilevel(key)
-> 2800 indexer = self.columns.get_loc(key)
2801 if is_integer(indexer):
2802 indexer = [indexer]
~\anaconda3\lib\site-packages\pandas\core\indexes\base.py in get_loc(self, key, method, tolerance)
2646 return self._engine.get_loc(key)
2647 except KeyError:
-> 2648 return self._engine.get_loc(self._maybe_cast_indexer(key))
2649 indexer = self.get_indexer([key], method=method, tolerance=tolerance)
2650 if indexer.ndim > 1 or indexer.size > 1:
pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()
pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()
pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()
KeyError: 'Alley'
Why am I getting this error? When the column named Alley exists?
似乎您可能为数据帧分配了不同的值,因此在下一次迭代中它会抛出错误,因为数据帧可能已分配给熊猫系列
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.