Latent Semantic Analysis and Stemming

Original 2017-03-14 23:30:00 9 1 nlp/ svd/ lemmatization/ lsa/ latent-semantic-analysis

Assume a very large corpus in any inflective language. Does the following make sense? When LSA is applied to such a corpus, words expressing similar concepts converge in the vector space, so inflected word forms referring to the same concept should ideally coincide with their lemma in that space. Under this assumption, no lemmatization or stemming of the queries or the corpus is necessary. Or am I totally wrong?

1 answer

According to the founders of LSA, stemming is not necessary. However, I think there is general disagreement in the literature about this: I have read a few papers in which stemming was found to improve results for a given information retrieval task.

More generally, there is recent research showing that stemming does not help in topic modeling and may even hurt topic coherence.
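
The assumption in the question can be checked empirically on your own corpus. Below is a minimal sketch, not a definitive experiment: it builds a term-document count matrix, takes a truncated SVD with NumPy, and compares the cosine similarity of two inflected forms before and after the reduction. The toy corpus, the rank `k = 2`, and the helper `cosine` are all made up for illustration. The corpus is deliberately constructed so that "cat" and "cats" never share a document but do share context words, which a real inflective corpus need not do.

```python
import numpy as np

# Toy corpus (hypothetical): "cat"/"cats" and "engine"/"engines" never
# co-occur, but each pair shares the same context words.
docs = [
    "cat fur purr",
    "cats fur purr",
    "engine fuel piston",
    "engines fuel piston",
]

vocab = sorted({w for d in docs for w in d.split()})
index = {w: i for i, w in enumerate(vocab)}

# Term-document count matrix: rows are terms, columns are documents.
A = np.zeros((len(vocab), len(docs)))
for j, d in enumerate(docs):
    for w in d.split():
        A[index[w], j] += 1

# Truncated SVD: keep the k largest singular values (k = 2 is an assumption).
k = 2
U, s, Vt = np.linalg.svd(A, full_matrices=False)
term_vecs = U[:, :k] * s[:k]  # one k-dimensional vector per term


def cosine(M, a, b):
    """Cosine similarity between the rows of M for terms a and b."""
    u, v = M[index[a]], M[index[b]]
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


raw_sim = cosine(A, "cat", "cats")              # similarity before LSA
lsa_sim = cosine(term_vecs, "cat", "cats")      # similarity in the latent space
cross_sim = cosine(term_vecs, "cat", "engine")  # unrelated concept
print(raw_sim, lsa_sim, cross_sim)
```

In this toy setup the raw similarity of "cat" and "cats" is 0, because the two forms never appear in the same document, while their similarity in the latent space is close to 1 and their similarity to "engine" stays close to 0. That only shows LSA can merge inflected forms when their contexts overlap strongly; whether it does so on a real corpus, where the context words are inflected too, is what the same comparison would have to establish.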

Latent Semantic Analysis concepts

Scalability of Latent Semantic Analysis in WEKA

Latent Semantic Analysis in Python discrepancy

Probabilistic latent semantic analysis/Indexing - Introduction

Using Latent Semantic Analysis to measure passage similarity

How Latent Semantic Analysis Handle Semantics

Latent Semantic Analysis/Indexing Library for C++

difference between Latent and Explicit Semantic Analysis

Taking a latent semantic analysis (lsa) object and scoring on new data in R

How to do Latent Semantic Analysis on a very large dataset

The technical posts on this site follow the CC BY-SA 4.0 license. If you reprint, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1 1 ACCEPTED 2019-05-22 15:17:59
