使用str.split在熊猫中拆分列并保留值

Question

So I am stuck with a problem here: 所以我在这里遇到了一个问题：

I have a pandas dataframe which looks like the following: 我有一个熊猫数据框，如下所示：

ID Name    Value
0  Peter   21,2
1  Frank   24
2  Tom     23,21/23,60 
3  Ismael  21,2/ 21,54
4  Joe     23,1

and so on...

What I am trying to is to split the "Value" column by the slash forward (/) but keep all the values, which do not have this kind of pattern. 我要尝试的是用斜杠（/）分隔“值”列，但保留所有不具有这种模式的值。

Like here: 像这儿：

ID Name    Value
0  Peter   21,2
1  Frank   24
2  Tom     23,21
3  Ismael  21,2
4  Joe     23,1

How can I achieve this? 我该如何实现？ I tried the str.split method but it's not giving me the solution I want. 我尝试了str.split方法，但没有给我想要的解决方案。 Instead, it returns NaN as can be seen in the following. 而是返回NaN，如下所示。

My Code: df['Value']=df['value'].str.split('/', expand=True)[0]

Returns:

ID Name    Value
0  Peter   NaN
1  Frank   NaN
2  Tom     23,21
3  Ismael  21,2
4  Joe     Nan

All I need is the very first Value before the '/' is coming. 我需要的只是在'/'出现之前的第一个值。

Appreciate any kind of help! 感谢任何帮助！

Answer 1

Remove expand=True for return lists and add str[0] for select first value: 删除expand=True返回列表，并添加str[0]选择第一个值：

df['Value'] = df['Value'].str.split('/').str[0]
print (df)
   ID    Name  Value
0   0   Peter   21,2
1   1   Frank     24
2   2     Tom  23,21
3   3  Ismael   21,2
4   4     Joe   23,1

If performance is important use list comprehension: 如果性能很重要，请使用清单理解：

df['Value'] = [x.split('/')[0] for x in df['Value']]

Answer 2

`pandas.Series.str.replace` with regex 用正则表达式`pandas.Series.str.replace`

df.assign(Value=df.Value.str.replace('/.*', ''))

   ID    Name  Value
0   0   Peter   21,2
1   1   Frank     24
2   2     Tom  23,21
3   3  Ismael   21,2
4   4     Joe   23,1

Optionally, you can assign results directly back to dataframe （可选）您可以将结果直接分配回数据框

df['Value'] = df.Value.str.replace('/.*', '')

使用str.split在熊猫中拆分列并保留值

问题描述

2 个解决方案

解决方案1
3 已采纳 2018-09-25 13:02:16

解决方案2
1 2018-09-25 13:07:54

`pandas.Series.str.replace` with regex 用正则表达式`pandas.Series.str.replace`

使用str.split在熊猫中拆分列并保留值

问题描述

2 个解决方案

解决方案1 3 已采纳 2018-09-25 13:02:16

解决方案2 1 2018-09-25 13:07:54

pandas.Series.str.replace with regex 用正则表达式pandas.Series.str.replace

解决方案1
3 已采纳 2018-09-25 13:02:16

解决方案2
1 2018-09-25 13:07:54

`pandas.Series.str.replace` with regex 用正则表达式`pandas.Series.str.replace`