Pandas: convert a Unicode column into a column of lists of strings

Question

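A column in my pandas DataFrame holds values such as u'asd,abc,tre,der34,whatever'. The end result should be a column of lists of strings: ['asd','abc','tre','der34','whatever']. A list of Unicode strings would also do: [u'asd',u'abc',u'tre',u'der34',u'whatever'].

By the way, nan or u'' may also appear in the Unicode column.

Any suggestions? I know I can do str(df['column'].iloc[0]).split(',') and add a new column by hand, or do something tricky, but I am looking for something more pythonic.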

Answer 1

This solution seems to work:

df['Column'] = df['Column'].astype(str).str.split(',')
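
As a rough sketch of how splitting behaves on the nan and empty-string rows the question mentions, the snippet below calls .str.split directly on the column, without the astype(str) step. The sample data and the column name as_list are made up for illustration, and Python 3 with a recent pandas is assumed.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: a normal value, a missing value, an empty string.
df = pd.DataFrame({"Column": ["asd,abc,tre,der34,whatever", np.nan, ""]})

# .str.split operates on the string values and leaves missing values missing.
df["as_list"] = df["Column"].str.split(",")

first = df["as_list"].iloc[0]                               # list of the five parts
missing_stays_missing = bool(pd.isna(df["as_list"].iloc[1]))  # nan is not split
empty = df["as_list"].iloc[2]                               # '' splits into ['']
```

Whether a missing row should stay missing or become an empty list depends on what the rest of the code expects, so treat this as one option rather than the required approach.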

Answer 2

This should work. If there are nan values or empty strings, you will have to handle them however you see fit.

In [1]: [str(col) for col in u'asd,abc,tre,der34,whatever'.split(',')]

Out[1]: ['asd', 'abc', 'tre', 'der34', 'whatever']
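
One possible way to handle the nan and empty-string cases is a small helper that maps both to an empty list. The name to_str_list and the choice of [] for missing values are assumptions made for this sketch, not part of the answer, and Python 3 is assumed (where u'...' literals are ordinary str).

```python
def to_str_list(value):
    """Split a comma-separated string; map nan, None and '' to an empty list."""
    # nan is a float and None is NoneType, so any non-string counts as missing here.
    if not isinstance(value, str) or value == "":
        return []
    return value.split(",")


# Hypothetical values mirroring the cases named in the question.
values = [u"asd,abc,tre,der34,whatever", float("nan"), u""]
result = [to_str_list(v) for v in values]
```

With pandas, the same helper could be applied to a whole column via df['Column'].apply(to_str_list).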

Answer 1: 3, 2014-08-07 11:12:12
Answer 2: 0, 2016-12-21 18:33:36
