简体繁体 English

如何从gensim的word2vec中提取词汇向量？

[英]How extract vocabulary vectors from gensim's word2vec?

原文 2017-05-10 23:09:25 0 1 python/ machine-learning/ gensim/ word2vec/ text-classification

I want to analyze the vectors looking for patterns and stuff, and use SVM on them to complete a classification task between class A and B, the task should be supervised. 我想分析向量以查找样式和内容，并在它们上使用SVM完成A类和B类之间的分类任务，该任务应受到监督。 (I know it may sound odd but it's our homework.) so as a result I really need to know: （我知道这听起来可能很奇怪，但这是我们的作业。）因此，我真的需要知道：

1- how to extract the coded vectors of a document using a trained model? 1-如何使用经过训练的模型提取文档的编码矢量？

2- how to interpret them and how does word2vec code them? 2-如何解释它们以及word2vec如何编码它们？

I'm using gensim's word2vec. 我正在使用gensim的word2vec。

1 个解决方案

If you have trained word2vec model, you can get word-vector by __getitem__ method 如果您已经训练过word2vec模型，则可以通过__getitem__方法获得单词向量
model = gensim.models.Word2Vec(sentences) print(model["some_word_from_dictionary"])
Unfortunately, embeddings from word2vec/doc2vec not interpreted by a person (in contrast to topic vectors from LdaModel) 不幸的是，word2vec / doc2vec中的嵌入没有被人解释（与LdaModel的主题向量相反）

P/S If you have texts at the object in your tasks, then you should use Doc2Vec model P / S如果您在任务中的对象处有文本，则应使用Doc2Vec模型

Gensim word2vec-从不同于0的索引开始词汇表 - Gensim word2vec - start vocabulary from index different than 0

Python Gensim word2vec词汇密钥 - Python Gensim word2vec vocabulary key

词汇表中的单词数 gensim word2vec - Number of words in vocabulary gensim word2vec

Gensim Word2Vec词汇：输出不清楚 - Gensim Word2Vec Vocabulary: Unclear output

Gensim的word2vec返回尴尬的向量 - Gensim's word2vec returning awkward vectors

有没有办法遍历 Gensim 的 Word2Vec 的向量？ - Is there a way to iterate through the vectors of Gensim's Word2Vec?

如何从Word2Vec模型中提取向量进行聚类 - How to extract vectors from a Word2Vec Model for clustering

Python Gensim从向量创建Word2Vec模型（在ndarray中） - Python gensim create word2vec model from vectors (in ndarray)

gensim word2vec访问进/出向量 - gensim word2vec accessing in/out vectors

训练gensim word2vec模型后，词汇不在词汇表中，为什么？ - word not in vocabulary after training gensim word2vec model, why?

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Gensim word2vec-从不同于0的索引开始词汇表 - Gensim word2vec - start vocabulary from index different than 0 Python Gensim word2vec词汇密钥 - Python Gensim word2vec vocabulary key 词汇表中的单词数 gensim word2vec - Number of words in vocabulary gensim word2vec Gensim Word2Vec词汇：输出不清楚 - Gensim Word2Vec Vocabulary: Unclear output Gensim的word2vec返回尴尬的向量 - Gensim's word2vec returning awkward vectors 有没有办法遍历 Gensim 的 Word2Vec 的向量？ - Is there a way to iterate through the vectors of Gensim's Word2Vec? 如何从Word2Vec模型中提取向量进行聚类 - How to extract vectors from a Word2Vec Model for clustering Python Gensim从向量创建Word2Vec模型（在ndarray中） - Python gensim create word2vec model from vectors (in ndarray) gensim word2vec访问进/出向量 - gensim word2vec accessing in/out vectors 训练gensim word2vec模型后，词汇不在词汇表中，为什么？ - word not in vocabulary after training gensim word2vec model, why?

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM