使用Stanford CoreNLP Python解析器获取特定输出

Question

I'm using SCP to get the parse CFG tree for English sentences. 我正在使用SCP获取英语句子的解析CFG树。

from corenlp import *
corenlp = StanfordCoreNLP()
corenlp.parse("Every cat loves a dog")

My expected output is a tree like this: 我的预期输出是这样的树：

(S (NP (DET Every) (NN cat)) (VP (VT loves) (NP (DET a) (NN dog))))

But what i got is: 但是我得到的是：

(ROOT (S (NP (DT Every) (NN cat)) (VP (VBZ loves) (NP (DT a) (NN dog)))))

How to change the POS tag as expected and remove the ROOT node? 如何按预期更改POS标签并删除ROOT节点？

Thanks 谢谢

Answer 1

You can use nltk.tree module from NLTK . 您可以使用NLTK中的 nltk.tree模块。

from nltk.tree import *

def traverse(t):
    try:
        # Replace Labels
        if t.label() == "DT":
            t.set_label("DET")
        elif t.label() == "VBZ":
            t.set_label("VT")   
    except AttributeError:
        return

    for child in t:
        traverse(child)

output_tree= "(ROOT (S (NP (DT Every) (NN cat)) (VP (VBZ loves) (NP (DT a) (NN dog)))))"
tree = ParentedTree.fromstring(output_tree)

# Remove ROOT Element
if tree.label() == "ROOT":  
    tree = tree[0]

traverse(tree)
print tree  
# (S (NP (DET Every) (NN cat)) (VP (VT loves) (NP (DET a) (NN dog))))

使用Stanford CoreNLP Python解析器获取特定输出

问题描述

1 个解决方案

解决方案1
1 已采纳 2016-07-25 15:30:47

使用Stanford CoreNLP Python解析器获取特定输出

问题描述

1 个解决方案

解决方案1 1 已采纳 2016-07-25 15:30:47

解决方案1
1 已采纳 2016-07-25 15:30:47