簡體 English 中英

Lucene獲取最高頻率條款和原始文件

[英]Lucene Get Highest Frequency Terms and Origin Document

原文 2013-12-19 16:05:45 5 1 java/ lucene

我已經通過Lucene 4.0.0實現了一個詞雲，調用方法getHighFreqTerms（）如下

TermStats[] termStats = HighFreqTerms.getHighFreqTerms(ir, HITS, "content");

我正在嘗試找到一種方法來獲取每個術語的由來。 這可能嗎？ 我需要做什么？ 我想到了一個解決方案，即獲取每個文檔每個術語的頻率值，同時將術語和每個文檔的ArrayList值存儲在HashMap中，但是我堅信這效率低下。

你有什么建議嗎？

非常感謝你，

1 個解決方案

HighFreqTerms僅為您提供有關索引的信息。 如果需要文檔，則必須使用查詢。

從Lucene索引中獲取最高頻率項

[英]Get highest frequency terms from Lucene index

如何在Lucene中獲得多單詞詞的頻率？

[英]How to get frequency of multi-word terms in Lucene?

為什么在Lucene的同一文檔中得到不同的術語？

[英]Why do I get different terms in same document in Lucene?

如何獲得Lucene 4中Lucene場的所有術語

[英]How to get all terms for a Lucene field in Lucene 4

如何在Lucene中索引文檔中的所有術語？

[英]How to index all the terms in the document in Lucene?

Java Lucene從Document對象獲取條款

[英]Java Lucene Obtain Terms from Document object

分析后如何獲取Lucene文檔字段令牌的條款？

[英]How can I get the terms of a Lucene document field tokens after they are analyzed?

需要計算文檔中每個術語的頻率

[英]need to count the frequency of each terms inside a document

如何計算lucene索引中每個文檔的術語數？

[英]How to count the number of terms for each document in lucene index?

Lucene：獲取類別的最新文檔

[英]Lucene: get newest document for category

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 從Lucene索引中獲取最高頻率項如何在Lucene中獲得多單詞詞的頻率？為什么在Lucene的同一文檔中得到不同的術語？如何獲得Lucene 4中Lucene場的所有術語如何在Lucene中索引文檔中的所有術語？ Java Lucene從Document對象獲取條款分析后如何獲取Lucene文檔字段令牌的條款？需要計算文檔中每個術語的頻率如何計算lucene索引中每個文檔的術語數？ Lucene：獲取類別的最新文檔

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM