如何從dtype為列表的Pandas系列中刪除NaN？

Question

我有一個pandas.Series ，每行的pandas.Series是一個列表對象。 例如

>>> import numpy as np
>>> import pandas as pd
>>> x = pd.Series([[1,2,3], [2,np.nan], [3,4,5,np.nan], [np.nan]])
>>> x
0         [1, 2, 3]
1          [2, nan]
2    [3, 4, 5, nan]
3             [nan]
dtype: object

如何刪除每行列表中的nan ？

期望的輸出是：

>>> x
0         [1, 2, 3]
1               [2]
2         [3, 4, 5]
3                []
dtype: object

這有效：

>>> x.apply(lambda y: pd.Series(y).dropna().values.tolist())
0          [1, 2, 3]
1              [2.0]
2    [3.0, 4.0, 5.0]
3                 []
dtype: object

是否有比使用lambda更簡單的方法，將列表轉換為Series，刪除NaN然后再將值提取回列表？

Answer 1

您可以將list comprehension與pandas.notnull一起pandas.notnull以刪除NaN值：

print (x.apply(lambda y: [a  for a in y if pd.notnull(a)]))
0    [1, 2, 3]
1          [2]
2    [3, 4, 5]
3           []
dtype: object

帶filter另一種解決方案，其中v!=v僅適用於NaN ：

print (x.apply(lambda a: list(filter(lambda v: v==v, a))))
0    [1, 2, 3]
1          [2]
2    [3, 4, 5]
3           []
dtype: object

謝謝DYZ的另一個解決方案：

print (x.apply(lambda y: list(filter(np.isfinite, y))))
0    [1, 2, 3]
1          [2]
2    [3, 4, 5]
3           []
dtype: object

Answer 2

一個簡單的numpy解決方案，列表理解：

pd.Series([np.array(e)[~np.isnan(e)] for e in x.values])

如何從dtype為列表的Pandas系列中刪除NaN？

問題描述

2 個解決方案

解決方案1
6 已采納 2017-01-04 06:26:16

解決方案2
1 2017-01-04 06:31:44

如何從dtype為列表的Pandas系列中刪除NaN？

問題描述

2 個解決方案

解決方案1 6 已采納 2017-01-04 06:26:16

解決方案2 1 2017-01-04 06:31:44

解決方案1
6 已采納 2017-01-04 06:26:16

解決方案2
1 2017-01-04 06:31:44