
Reading a large string and splitting each word takes too much time in Python

I am reading a dataframe column that contains comments. With the code below, the data takes forever to process. Is there a way to make this faster?

comment = ''
for val in df.Description:
    tokens = str(val).split()            # tokenize each comment
    for i in range(len(tokens)):
        tokens[i] = tokens[i].lower()    # lowercase each token in place
    for words in tokens:
        comment = comment + words + ' '  # append each word to one big string
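Note that comment = comment + words copies the entire accumulated string on every iteration, so the total work grows quadratically with the text size. A minimal sketch of the linear-time alternative, assuming the goal is one lowercased string built from every comment (the small example DataFrame is made up for illustration):

    import pandas as pd

    # Hypothetical sample data standing in for the real comment column.
    df = pd.DataFrame({"Description": ["First comment", "Second COMMENT"]})

    parts = []
    for val in df.Description:
        parts.extend(str(val).lower().split())  # lowercase and tokenize each row
    comment = " ".join(parts)                   # one linear-time concatenation
    print(comment)  # first comment second comment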

df.Description is a column of comments (basically email text).

Update: Assuming df.Description is your column, this might be helpful:

arr_string = df.Description.astype(str).values.tolist()
comment = ''
for val in arr_string:
    comment = ''.join([comment, val])  # append each comment string
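If the end goal is just one big lowercased string, pandas can also do the whole concatenation in a single vectorized call. A sketch under the same assumption about df.Description, using Series.str.cat:

    # Lowercase the column with vectorized string ops, then concatenate the
    # whole Series into one string in a single call (no Python-level loop).
    comment = df.Description.astype(str).str.lower().str.cat(sep=" ")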

Take a look at this.
