Creating a column from a string slice in Pandas

Question

Does anybody know why this displays NaN values in the column "2_stars"? Thanks in advance.

data['1_star']=data['Sentiment'].str.slice(31,40)
data['start'] = data['Sentiment'].str.find("'2 stars', 'score': ") + len("'2 stars', 'score': ")
data['end'] = data['Sentiment'].str.find("}, {'label': '3 stars'")
data['2_stars']=data['Sentiment'].str.slice(data['start'],data['end'])

Answer 1

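Pandas str.slice works with scalar numbers for start and stop, applied identically to every row; it does not take per-row values from other columns, which is why passing data['start'] and data['end'] gives NaN. So each row needs to be processed separately, for example with DataFrame.apply: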

data['2_stars']= data.apply(lambda x: x['Sentiment'][slice(x['start'], x['end'])], axis=1)

Another idea, using a list comprehension:

zipped = zip(data['Sentiment'], data['start'], data['end'])
data['2_stars'] = [a[slice(s, e)] for a, s, e in zipped]
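To see both approaches end to end, here is a minimal, self-contained sketch. The two sample rows are made up for illustration; the shape of the 'Sentiment' strings is only an assumption inferred from the search strings in the question, so the real data may look different.

```python
import pandas as pd

# Hypothetical sample rows; the real 'Sentiment' format is assumed, not known.
data = pd.DataFrame({
    "Sentiment": [
        "[{'label': '1 star', 'score': 0.10}, {'label': '2 stars', 'score': 0.25}, {'label': '3 stars', 'score': 0.65}]",
        "[{'label': '1 star', 'score': 0.5}, {'label': '2 stars', 'score': 0.125}, {'label': '3 stars', 'score': 0.375}]",
    ]
})

start_marker = "'2 stars', 'score': "
end_marker = "}, {'label': '3 stars'"

# Per-row slice bounds, computed the same way as in the question.
data["start"] = data["Sentiment"].str.find(start_marker) + len(start_marker)
data["end"] = data["Sentiment"].str.find(end_marker)

# Approach 1: slice each row's string with that row's own bounds.
via_apply = data.apply(
    lambda row: row["Sentiment"][row["start"]:row["end"]], axis=1
)

# Approach 2: the same slicing with zip and a list comprehension.
via_zip = [
    text[s:e]
    for text, s, e in zip(data["Sentiment"], data["start"], data["end"])
]

data["2_stars"] = via_zip
print(data["2_stars"].tolist())
```

Note that str.find returns -1 when a marker is missing, so rows without the expected markers would produce wrong slices and may need to be filtered out first. The extracted values are still strings; something like data['2_stars'].astype(float) would be needed to use them as numbers.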

Solution 1
1 Accepted 2020-09-09 06:05:24