簡體   English   中英

從數據框列中刪除字符

[英]Removing characters from a dataframe column

以下是用於從“類別列表”和數據集中提取匹配值的代碼。

matches= token.apply(lambda x: pd.Series(x).str.extractall("|".join(["({})".format(cat) for cat in Categories.HealthCare])))
match_list= [[m for m in match.values.ravel() if isinstance(m, str)] for match in matches]
match_df = pd.DataFrame({"Hc1":match_list})

def match_health(row):
    categories = []

    for bigram in row.bigram:
        joined = ' '.join(bigram)
        if joined in HealthCare:
            categories.append(joined)
    for trigram in row.trigram:
        joined = ' '.join(trigram)
        if joined in HealthCare:
            categories.append(joined)

    return categories

match_df['Hc2'] = df.apply(match_health, axis=1)
match_df['HealthCare'] = match_df[match_df.columns[[0,1]]].apply(lambda x: ','.join(x.dropna().astype(str)),axis=1)

結果如下:

    Hc1                 Hc2                         HealthCare
0   []                  []                          [],[]
1   [Sauna, Jacuzzi]    [Health Club, Steam Room]   ['Sauna', 'Jacuzzi'],['Health Club', 'Steam Ro...
2   [Sauna, Jacuzzi]    [Health Club, Steam Room]   ['Sauna', 'Jacuzzi'],['Health Club', 'Steam Ro...
3   [Sauna, Jacuzzi]    [Health Club, Steam Room]   ['Sauna', 'Jacuzzi'],['Health Club', 'Steam Ro...

類型(match_df)

pandas.core.frame.DataFrame

但是我的輸出應該沒有'[]'-方括號和像這樣的字符串周圍的單引號:

    Hc1                 Hc2                         HealthCare
0                                                   
1   Sauna, Jacuzzi   Health Club, Steam Room    Sauna,Jacuzzi,Health Club,Steam Ro...
2   Sauna, Jacuzzi   Health Club, Steam Room    Sauna,Jacuzzi,Health Club,Steam Ro...
3   Sauna, Jacuzzi   Health Club, Steam Room    Sauna,Jacuzzi,Health Club,Steam Ro...

需要幫忙。

您可以致電.str.replace

match_df['HealthCare'] = match_df['HealthCare']\
                          .astype(str).str.replace(r"[\[\]']", '')
match_df['HealthCare'] = match_df['HealthCare'].map(lambda x: x.replace('[','').replace(']','').replace("'",''))

這可以替換所有方括號和單引號。

O / P:

      HealthCare
0                                                   
1   Sauna,Jacuzzi,Health Club,Steam Ro...
2   Sauna,Jacuzzi,Health Club,Steam Ro...
3   Sauna,Jacuzzi,Health Club,Steam Ro...

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM