簡體   English   中英

提取除第一個單詞以外的大寫單詞的句子

[英]Extract sentences with capitalized words other than first word

假設我有一個 dataframe

df = pd.DataFrame({'col1': ['Hello, world. Good day','My name is Bob. Call Me','good evening','yep. stack Overflow.',"Ain't McDonald Yo"]})

                      col1
0   Hello, world. Good day
1  My name is Bob. Call Me
2             good evening
3     yep. stack Overflow.
4        Ain't McDonald Yo

我正在嘗試從每行中提取包含除第一個單詞以外的大寫單詞的句子。 句子用句號分隔。

Output:

                      col1                     col2
0   Hello, world. Good day                      NaN
1  My name is Bob. Call Me  My name is Bob. Call Me
2             good evening                      NaN
3     yep. stack Overflow.           stack Overflow
4        Ain't McDonald Yo        Ain't McDonald Yo

嘗試:

df["col2"] = df["col1"].apply(
    lambda x: ".".join(
        [
            sentence
            for sentence in x.split(".")
            if any(word[0].isupper() for word in sentence.split()[1:])
        ]
    )
    or np.nan
)
print(df)

印刷:

                      col1                     col2
0   Hello, world. Good day                      NaN
1  My name is Bob. Call Me  My name is Bob. Call Me
2             good evening                      NaN
3     yep. stack Overflow.           stack Overflow
4        Ain't McDonald Yo        Ain't McDonald Yo

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM