简体   繁体   English

SentiWordNet 3的内部结构

[英]Internal structure of SentiWordNet 3

I am working on a project of Sentiment analysis that uses SentiwordNet3 lexicon. 我正在研究一个使用SentiwordNet3词典的情感分析项目。 The problem is, I am unable to understand the structure of lexicon. 问题是,我无法理解词典的结构。 One line of this Lexicon is: 这个词典的一行是:

a 00002730 0 0 acroscopic#1 facing or on the side toward the apex a 00002730 0 0 acroscopic#1面向或朝向顶点的一侧

1. a =? 1. a =? Purpose of a ? 目的a什么?

2. POS-ID=00002730 . 2. POS-ID=00002730 What is the purpose of POS-ID ? POS-ID的目的是什么?

3. SynsetTerms Gloss = acroscopic#1 facing or on the side toward the apex 3. SynsetTerms Gloss = acroscopic#1 facing or on the side toward the apex

  1. What is acroscopic ? 什么是acroscopic Is it word that we want the score of? 这是我们想要得分的词吗?
  2. What is #1 ? 什么是#1 Purpose? 目的?
  3. Why is the sentence after #1 ? 为什么#1之后的句子? facing or on the side toward the apex . facing or on the side toward the apex

As far as the accessing of score is concerned, we accessed it as 就得分的访问而言,我们将其作为访问

`list(SWN.senti_synsets(Tagged[i][0],'a')`

a is Adjective whereas v , r and n are also in the lexicon. aAdjectivevrn也在词典中。

The corpus has a header containing the definition of the columns: 语料库有一个包含列定义的标题:

POS ID PosScore NegScore SynsetTerms Gloss POS ID PosScore NegScore SynsetTerms Gloss

  1. POS 'a' is the part of speech (adjective) POS'a'是演讲的一部分(形容词)
  2. ID is the unique identifier of the lemma in Wordnet ID是Wordnet中引理的唯一标识符
  3. PosScore => The positive score PosScore =>正分数
  4. NegScore => The negative score NegScore =>负分
  5. SynsetTerms => Wordnet synset names (#1 = the first sense, #2 = the second sense) SynsetTerms => Wordnet同义词名称(#1 =第一感,#2 =第二感)
  6. Explanatory sentence 解释性的句子

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM