I would like a simple mehtod to delete parts of a string after a specified character inside a dataframe. Here is a simplified example:
df:
obs a b c d
0 1 1-23-12 1 2 3
1 2 12-23-13 4 5 5
2 3 21-23-14 4 5 5
I would like to remove the parts in the a column after the first - sign, my expected output is:
newdf:
obs a b c d
0 1 1 1 2 3
1 2 12 4 5 5
2 3 21 4 5 5
You can reformat the values by passing a reformatting function into the apply
method as follows:
from StringIO import StringIO
import pandas as pd
data = """ obs a b c d
1 1-23-12 1 2 3
2 12-23-13 4 5 5
3 21-23-14 4 5 5"""
# Build dataframe from data
df = pd.read_table(StringIO(data), sep=' ')
# Reformat values for column a using an unnamed lambda function
df['a'] = df['a'].apply(lambda x: x.split('-')[0])
This gives you your desired result:
obs a b c d
0 1 1 1 2 3
1 2 12 4 5 5
2 3 21 4 5 5
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.