word2vec deeplearning4j ArrayIndexOutOfBoundsException in WordVectorSerializer.loadTxtVectors()
I successfully trained the word2vec model from http://deeplearning4j.org/word2vec, and now I get this exception when I call wordsNearest:
Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 99
    at org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.loadTxt(WordVectorSerializer.java:1107)
    at org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.loadTxtVectors(WordVectorSerializer.java:1033)
    at org.deeplearning4j.examples.nlp.word2vec.NearestWords.main(NearestWords.java:13)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at com.intellij.rt.execution.application.AppMain.main(AppMain.java:144)
This is my code:
package org.deeplearning4j.examples.nlp.word2vec;

import java.io.File;
import java.util.Collection;

import org.deeplearning4j.models.embeddings.loader.WordVectorSerializer;
import org.deeplearning4j.models.embeddings.wordvectors.WordVectors;

public class NearestWords {

    public static void main(String[] args) throws Exception {
        // Text file written when the model was trained
        File file = new File("pathToWriteto.txt");

        // This call throws the ArrayIndexOutOfBoundsException shown above
        WordVectors vec = WordVectorSerializer.loadTxtVectors(file);

        // Look up the 10 words nearest to "day"
        Collection<String> similar = vec.wordsNearest("day", 10);
        System.out.println(similar);
    }
}
The current release (0.4-rc3.10) has a bug in the loadTxt function. It is fixed in the master repo, and the fix will be included in the next release. See this GitHub issue for details on your problem: https://github.com/deeplearning4j/deeplearning4j/issues/1721
An easy fix for now is to copy the latest WordVectorSerializer.java into your project:
https://github.com/deeplearning4j/deeplearning4j/blob/master/deeplearning4j-scaleout/deeplearning4j-nlp/src/main/java/org/deeplearning4j/models/embeddings/loader/WordVectorSerializer.java
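If you want to look at the saved file before swapping in the newer serializer, a rough diagnostic like the one below may help. It is only a sketch in plain Java and does not use or reproduce anything from deeplearning4j: VectorFileCheck, tokenCount and histogram are hypothetical names, and it assumes the file is whitespace-separated text with one entry per line. It prints how many tokens the first line has and how many lines share each token count, so a line that differs from the rest stands out.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper for inspecting a text vector file; not part of deeplearning4j.
class VectorFileCheck {

    // Number of whitespace-separated tokens on one line (0 for a blank line).
    static int tokenCount(String line) {
        String trimmed = line.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    // Maps each token count to the number of lines that have that count.
    static Map<Integer, Integer> histogram(List<String> lines) {
        Map<Integer, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            counts.merge(tokenCount(line), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) throws IOException {
        // Defaults to the file name used in the question; assumed to be UTF-8 text.
        String path = args.length > 0 ? args[0] : "pathToWriteto.txt";
        List<String> lines = Files.readAllLines(Paths.get(path), StandardCharsets.UTF_8);

        int first = lines.isEmpty() ? 0 : tokenCount(lines.get(0));
        System.out.println("tokens on first line: " + first);
        System.out.println("tokens per line -> number of lines: " + histogram(lines));
    }
}
```

This only reports the shape of the file. Whether that shape is what triggers the exception in 0.4-rc3.10 is something to confirm against the linked issue.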