Python NLTK：Stanford NER标记器错误消息：NLTK无法找到Java文件

Question

试图让Stanford NER使用Python。 按照网上的一些说明进行操作，但收到错误消息：“ NLTK无法找到Java文件！使用软件特定的配置参数或设置JAVAHOME环境变量。” 什么问题？ 谢谢！

from nltk.tag.stanford import StanfordNERTagger
from nltk.tokenize import word_tokenize

model = r'C:\Stanford\NER\classifiers\english.muc.7class.distsim.crf.ser.gz'
jar = r'C:\Stanford\NER\stanford-ner-3.9.1.jar'

ner_tagger = StanfordNERTagger(model, jar, encoding = 'utf-8')

text = 'While in France, Christine Lagarde discussed short-term stimulus ' \
       'efforts in a recent interview with the Wall Street Journal.'

words = word_tokenize(text)
classified_words = ner_tagger.tag(words)

Answer 1

在网上找到了解决方案。 用您自己的路径替换。

  import os java_path = "C:/../../jdk1.8.0_101/bin/java.exe" os.environ['JAVAHOME'] = java_path

要么：

 import nltk nltk.internals.config_java('C:/../../jdk1.8.0_101/bin/java.exe')

资料来源： https : //tianyouhu.wordpress.com/2016/09/01/problem-of-nltk-with-stanfordtokenizer/

Python NLTK：Stanford NER标记器错误消息：NLTK无法找到Java文件

问题描述

1 个解决方案

解决方案1
0 已采纳 2018-10-18 15:08:36

Python NLTK：Stanford NER标记器错误消息：NLTK无法找到Java文件

问题描述

1 个解决方案

解决方案1 0 已采纳 2018-10-18 15:08:36

解决方案1
0 已采纳 2018-10-18 15:08:36