[英]Mahout CVB topics file
我听不懂输出结果,也许有人可以帮忙
cvb命令
bin/mahout cvb \
-i /work/matrix \
-o /work/cvb -k 10 -ow -x 20 \
-dict /work/sparseVectors/dictionary.file-* \
-dt /work/topics \
-mt /work/models
cvb和矢量转储后
bin/mahout vectordump -i /work/topics \
-d /work/sparseVectors/dictionary.file-* \
-o /work/cvb-topic \
-dt sequencefile --vectorSize 10 \
-sort /work/topics \
-p TRUE
文件cvb-topic具有类似
0 {0:0.5380152557598438,09:0.4619846630645179,10:1.541304295897372E-8,08:1.5405183424669223E-8,04:1.4964621316424798E-8,01:1.0302427985842305E-8,00:7.566567734425231E-9,1:7.394593812516846E-9,1.180:6.745071922943742E-9,07:3.3841292288528656E-9}
接下来我该怎么办?
我今天遇到了同样的问题,要获取单词的概率,请改用-i /work/cvb
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.