Remove everything after the first space in a pandas dataframe

Question

Here is the dataframe:

     State  RegionName            
0    NY     New York             
1    CA     Los Angeles      
2    IL     Chicago 865         
3    PA     Philadelphia Wrin   
4    AZ     Phoenix City

I want the output to look like this:

     State   RegionName           
0    NY      New             
1    CA      Los         
2    IL      Chicago            
3    PA      Philadelphia 
4    AZ      Phoenix

How can I do this without using for loops?

Answer 1

Use Series.str.split and select the first element of each resulting list by indexing with .str[0]:

print (df['RegionName'].str.split())
0             [New, York]
1          [Los, Angeles]
2          [Chicago, 865]
3    [Philadelphia, Wrin]
4         [Phoenix, City]
Name: RegionName, dtype: object

df['RegionName'] = df['RegionName'].str.split().str[0]
print (df)
  State    RegionName
0    NY           New
1    CA           Los
2    IL       Chicago
3    PA  Philadelphia
4    AZ       Phoenix
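For anyone who wants to run this end to end, here is a minimal, self-contained sketch. The DataFrame is rebuilt from the values shown in the question, and passing n=1 is an optional tweak (not part of the answer above) that stops splitting after the first whitespace run, which avoids tokenizing the rest of each string.

```python
import pandas as pd

# Sample data reconstructed from the question.
df = pd.DataFrame({
    "State": ["NY", "CA", "IL", "PA", "AZ"],
    "RegionName": ["New York", "Los Angeles", "Chicago 865",
                   "Philadelphia Wrin", "Phoenix City"],
})

# Split on whitespace at most once, then keep the first piece of each list.
df["RegionName"] = df["RegionName"].str.split(n=1).str[0]
print(df)
```

Both calls are vectorized string methods, so no explicit for loop is needed.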

Answer 2

Here's an alternative using pd.Series.str.extract. The lazy quantifier stops at the first whitespace character; note that rows containing no whitespace do not match and become NaN:

df['RegionName'] = df['RegionName'].str.extract(r'^(.*?)\s', expand=False)

But my first instinct would be to use what @jezrael mentioned.
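The choice of quantifier matters once a value contains more than one space. The sketch below uses a small made-up Series (the values "Salt Lake City" and "Chicago" are illustrative, not from the question) to compare a greedy capture with a lazy one.

```python
import pandas as pd

# Illustrative values: one space, two spaces, and no space at all.
s = pd.Series(["New York", "Salt Lake City", "Chicago"])

# Greedy: .* backtracks to the LAST whitespace in the string.
greedy = s.str.extract(r'(.*)\s', expand=False)

# Lazy: .*? stops at the FIRST whitespace in the string.
lazy = s.str.extract(r'^(.*?)\s', expand=False)

print(pd.DataFrame({"original": s, "greedy": greedy, "lazy": lazy}))
```

In both variants the value without any whitespace fails to match and comes back as NaN, so a fillna with the original column may be needed if such rows can occur.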


Answer 3

You could also use str.extract to capture the start of the string up to, but not including, the first whitespace character, using the regex ^[^\s]+:

df['RegionName'] = df['RegionName'].str.extract(r'(^[^\s]+)', expand=False)
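As a small sketch of how this pattern behaves on edge cases, the example below uses a made-up Series with a value that has no space and a missing value (both are assumptions added for illustration). It also shows the effect of the expand parameter: with a single capture group, the default returns a one-column DataFrame, while expand=False returns a Series.

```python
import pandas as pd

# Illustrative values: normal case, no whitespace, and a missing entry.
s = pd.Series(["Phoenix City", "Chicago", None], name="RegionName")

# Default expand=True: a DataFrame with one column per capture group.
as_frame = s.str.extract(r'(^[^\s]+)')

# expand=False with one capture group: a Series.
as_series = s.str.extract(r'(^[^\s]+)', expand=False)

print(type(as_frame).__name__, type(as_series).__name__)
print(as_series)
```

Because the pattern only requires one or more non-whitespace characters at the start, a value without spaces is kept whole rather than turned into NaN; only the missing entry stays missing.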

Answer 4

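You can replace everything from the first whitespace character onward with '' using str.replace. Pass regex=True explicitly, because newer pandas versions treat the pattern as a literal string by default:

df["RegionName"] = df["RegionName"].str.replace(r'\s.*', '', regex=True)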
df
     RegionName state
0           New    NY
1           Los    CA
2       Chicago    IL
3  Philadelphia    PA
4       Phoenix    AZ
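To illustrate why the regex flag matters, here is a short sketch on a made-up Series (the values are illustrative). With regex=False the pattern is searched for as literal text, so nothing is replaced.

```python
import pandas as pd

s = pd.Series(["New York", "Chicago 865", "Phoenix"])

# Pattern interpreted as a regular expression: strip from the first whitespace on.
with_regex = s.str.replace(r'\s.*', '', regex=True)

# Pattern interpreted literally: the text "\s.*" never occurs, so values are unchanged.
literal = s.str.replace(r'\s.*', '', regex=False)

print(with_regex.tolist())
print(literal.tolist())
```

Unlike the extract-based approaches that require whitespace to match, values with no whitespace pass through unchanged here instead of becoming NaN.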

4 answers

Answer 1: 2, accepted, 2020-05-31 08:24:03
Answer 2: 0, 2020-05-31 08:35:49
Answer 3: 0, 2020-05-31 08:38:58
Answer 4: 0, 2020-05-31 09:25:59
