
How can we use artificial neural networks to find similar documents?

Original post 2018-10-17 05:46:31 4 3 python/ machine-learning/ nlp/ artificial-intelligence/ word-embedding

How can we use an ANN to find similar documents? I know it's a silly question, but I am new to the NLP field. I have built a model using kNN and a bag-of-words approach to solve my problem. With it I can retrieve the n documents most similar to the input, along with their closeness scores. Now I want to implement the same thing using an ANN, and I have no idea where to start.

Thanks in advance for any help or suggestions.

3 answers

You can use word embeddings, a technique that represents words as dense vectors. To find similar documents, represent each document as a vector (for example, by averaging its word vectors) and compare the vectors using cosine similarity.
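As a rough illustration of the idea above, here is a minimal sketch in plain Python. The tiny `EMBEDDINGS` table is made up for the example, and the helper names (`document_vector`, `cosine_similarity`, `most_similar`) are hypothetical. Averaging word vectors is only one simple way to get a document vector, assumed here for brevity.

```python
import math

# Hypothetical 3-dimensional embeddings, invented for illustration only.
# Real vectors would come from a trained word2vec, GloVe or FastText model.
EMBEDDINGS = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.0],
    "pet": [0.7, 0.3, 0.1],
    "stock": [0.0, 0.1, 0.9],
    "market": [0.1, 0.0, 0.8],
}


def document_vector(text):
    """Average the vectors of the known words in a document."""
    vectors = [EMBEDDINGS[w] for w in text.lower().split() if w in EMBEDDINGS]
    if not vectors:
        return None  # no known words, so no vector can be built
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, documents, n=2):
    """Return the n documents closest to the query as (score, document) pairs."""
    query_vec = document_vector(query)
    if query_vec is None:
        return []
    scored = []
    for doc in documents:
        doc_vec = document_vector(doc)
        if doc_vec is not None:
            scored.append((cosine_similarity(query_vec, doc_vec), doc))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:n]


documents = ["my dog is a pet", "the stock market fell", "cat and dog"]
results = most_similar("pet cat", documents, n=2)
for score, doc in results:
    print(round(score, 3), doc)
```

This mirrors the kNN setup from the question: only the document representation changes, from bag-of-words counts to averaged dense vectors.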

See, for instance, an example of how to build a word2vec model using TensorFlow, and another of how to use the embedding layer from Keras.

To obtain embeddings for your language, either train them yourself on a corpus of your choice (it should be large enough, e.g. Wikipedia) or download pretrained embeddings. For Python there are plenty of sources of embeddings trained with, or loadable by, the gensim module, which is a de facto standard for word2vec in Python.

You can also use GloVe (using glove-python) or FastText word embeddings.
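Pretrained GloVe vectors are commonly distributed as plain text, with one word per line followed by its space-separated values. The sketch below reads that layout using only the standard library. The function name `load_text_embeddings` and the sample data are hypothetical, and the parser assumes vectors with at least two dimensions, so that a word2vec-style "count dim" header line can be skipped.

```python
import io


def load_text_embeddings(lines):
    """Parse lines of the form 'word v1 v2 ... vN' into a dict of float lists."""
    embeddings = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 3:
            # Skip blank lines and a word2vec-style "count dim" header line.
            continue
        word, values = parts[0], parts[1:]
        embeddings[word] = [float(v) for v in values]
    return embeddings


# Made-up sample standing in for a real embeddings file.
sample = io.StringIO("2 3\nking 0.1 0.2 0.3\nqueen 0.1 0.25 0.3\n")
vectors = load_text_embeddings(sample)
print(sorted(vectors))
```

With a real file, the same function can be given an open file handle, for example `with open(path, encoding="utf-8") as f: vectors = load_text_embeddings(f)`, where `path` points to the downloaded embeddings.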

If you are interested, you can find more detailed descriptions of embeddings with code examples and source papers.

Have a look at the paper https://arxiv.org/pdf/1805.10685.pdf, which gives an overall idea. Check this link for more references: https://github.com/Hironsan/awesome-embedding-models


The technical posts on this site follow the CC BY-SA 4.0 license. If you need to reprint, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.
