如何删除df中整个列的某个字符后的所有内容？

Question

I have a df with one column (SKUID) where I want to remove all the characters that are not numerical.我有一个带有一列 (SKUID) 的 df，我想在其中删除所有非数字字符。 Here is an sample of the column:这是该列的示例：

Essentially I want to remove the underscore and the letter for each row.基本上我想删除每一行的下划线和字母。 I have tried using following code:我尝试使用以下代码：

sku_data.split('_', 1)[0]

This gives me an error of 'DataFrame' object has no attribute 'split'.这给了我“DataFrame”对象没有属性“split”的错误。 Where am I going wrong?我哪里错了？

Answer 1

This should do for number extraction:这应该用于数字提取：

sku_data.SKUID = sku_data.SKUID.str.extract('(\d+)')

Note : don't forget to add the str operator if you want to perform string operations on a DataFrame column注意：如果要对DataFrame列执行字符串操作，请不要忘记添加str运算符

如何删除df中整个列的某个字符后的所有内容？

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-11-18 18:03:55

如何删除df中整个列的某个字符后的所有内容？

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-11-18 18:03:55

解决方案1
1 已采纳 2020-11-18 18:03:55