删除停用词和标点符号时出现错误

Question

def transform_text(text):
    text = text.lower()
    text = nltk.word_tokenize(text)
    
    y = []
    for i in text:
        if i.isalnum():
            y.append(i)
    
    text = y[:]
    y.clear()
    
    for i in text:
        if i not in stopwords.words('english') and i not in string.punctuation:
            y.append(i)
            
    text = y[:]
    y.clear()
    
    for i in text:
        y.append(ps.stem(i))
    
            
    return " ".join(y)

gives给

<ipython-input-47-c84ab809613a> in <module>
----> 1 transform_text("I'm gonna be home soon and i don't want to talk about this stuff anymore tonight, k? I've cried enough today.")

<ipython-input-46-fed03b80da62> in transform_text(text)
     12 
     13     for i in text:
---> 14         if i not in stopwords.words('english') and i not in string.punctuation:
     15             y.append(i)
     16 

NameError: name 'stopwords' is not defined

Answer 1

You need to include the following at the top of your module:您需要在模块顶部包含以下内容：

from nltk.corpus import stopwords

NameError: name 'stopwords' is not defined means exactly that - you haven't imported or defined what stopwords is yet. NameError: name 'stopwords' is not defined确切的意思是——你还没有导入或定义什么是stopwords 。

Read the docs for more details.阅读文档以获取更多详细信息。

删除停用词和标点符号时出现错误

问题描述

1 个解决方案

解决方案1
2 2021-11-30 09:12:21

删除停用词和标点符号时出现错误

问题描述

1 个解决方案

解决方案1 2 2021-11-30 09:12:21

解决方案1
2 2021-11-30 09:12:21