熊猫从另一列的字符串切片创建新列

Question

I want to create a new column in Pandas using a string sliced for another column in the dataframe.我想使用为数据框中另一列切片的字符串在 Pandas 中创建一个新列。

For example.例如。

Sample  Value  New_sample
AAB     23     A
BAB     25     B

Where New_sample is a new column formed from a simple [:1] slice of Sample其中New_sample是由简单的[:1] Sample切片形成的新列

I've tried a number of things to no avail - I feel I'm missing something simple.我尝试了很多方法都无济于事 - 我觉得我错过了一些简单的东西。

What's the most efficient way of doing this?这样做的最有效方法是什么？

Answer 1

You can call the str method and apply a slice, this will be much quicker than the other method as this is vectorised (thanks @unutbu):您可以调用str方法并应用切片，这将比其他方法快得多，因为这是矢量化的（感谢@unutbu）：

df['New_Sample'] = df.Sample.str[:1]

You can also call a lambda function on the df but this will be slower on larger dataframes:您还可以在 df 上调用 lambda 函数，但这在较大的数据帧上会变慢：

In [187]:

df['New_Sample'] = df.Sample.apply(lambda x: x[:1])
df
Out[187]:
  Sample  Value New_Sample
0    AAB     23          A
1    BAB     25          B

Answer 2

You can also use slice() to slice string of Series as following:您还可以使用slice()对Series字符串进行切片，如下所示：

df['New_sample'] = df['Sample'].str.slice(0,1)

From pandas documentation :来自熊猫文档：

Series.str.slice(start=None, stop=None, step=None)系列.str.slice（开始=无，停止=无，步骤=无）

Slice substrings from each element in the Series/Index从系列/索引中的每个元素切片子字符串

For slicing index ( if index is of type string ), you can try:对于切片索引（如果索引是字符串类型），您可以尝试：

df.index = df.index.str.slice(0,1)

Answer 3

Adding solution to a common variation when the slice width varies across DataFrame Rows:当切片宽度跨 DataFrame Rows 变化时，为常见变化添加解决方案：

#--Here i am extracting the ID part from the Email (i.e. the part before @)

#--First finding the position of @ in Email
d['pos'] = d['Email'].str.find('@')

#--Using position to slice Email using a lambda function
d['new_var'] = d.apply(lambda x: x['Email'][0:x['pos']],axis=1)

#--Imagine x['Email'] as a string on which, slicing is applied

Hope this Helps !希望这可以帮助！

熊猫从另一列的字符串切片创建新列

问题描述

3 个解决方案

解决方案1
99 已采纳 2014-09-11 14:02:02

解决方案2
10 2018-07-29 16:33:03

解决方案3
8 2020-07-03 08:03:32

熊猫从另一列的字符串切片创建新列

问题描述

3 个解决方案

解决方案1 99 已采纳 2014-09-11 14:02:02

解决方案2 10 2018-07-29 16:33:03

解决方案3 8 2020-07-03 08:03:32

解决方案1
99 已采纳 2014-09-11 14:02:02

解决方案2
10 2018-07-29 16:33:03

解决方案3
8 2020-07-03 08:03:32