簡體   English   中英

Pandas:刪除特定字符重復4次的dataframe列中特定字符(最后一個特定字符)之前的所有字符

[英]Pandas: Remove all characters before a specific character (last specific character) in a dataframe column that specific character is repeated 4 times

這是用 header 列“ParentPath”構建的 dataframe

data = {'ParentPath': ['Hi \ All \ First Name \ Last Name \ A \ 200', 'Hi \ All \ First Name \ Middle Name \ Last Name \ B \ 33', 'Hi \ All \ First Name \ C \ 199', 'Hi \ All \ First Name \ D \ 333', 'Hi \ All \ First Name \ E \ 12', 'Hi \ All \ F \ 88']}
df = pd.DataFrame(data)

     ParentPath        
0   Hi \ All \ First Name \ Last Name \ A \ 200        
1   Hi \ All \ First Name \ Middle Name \ Last Name \ B \ 33        
2   Hi \ All \ First Name \ C \ 199        
3   Hi \ All \ First Name \ D \ 333        
4   Hi \ All \ First Name \ E \ 12        
5   Hi \ All \ F \ 88  

Output 在刪除最后一個“\”之后的所有字符后需要如下所示,請記住每個“反斜杠”前后都有一個空格

    ParentPath        
0   Hi \ All \ First Name \ Last Name \ A        
1   Hi \ All \ First Name \ Middle Name \ Last Name \ B        
2   Hi \ All \ First Name \ C        
3   Hi \ All \ First Name \ D        
4   Hi \ All \ First Name \ E        
5   Hi \ All \ F  

嘗試拆分然后加入:

df['ParentPath'] = df['ParentPath'].str.split(' \\\\ ').str[:-1].str.join(' \\ ')

Output:

                                          ParentPath
0                Hi \ All \ First Name \ Last Name \ A
1  Hi \ All \ First Name \ Middle Name \ Last Name \ B
2                            Hi \ All \ First Name \ C
3                            Hi \ All \ First Name \ D
4                            Hi \ All \ First Name \ E
5                                         Hi \ All \ F

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM