简体繁体中英

Shared memory among processes for pre-trained word2vec model?

原文 2021-01-26 11:30:54 0 2 python/ multiprocessing/ word2vec

I have a look-up object, specifically a pre-trained word2vec model from gensim.models.keyedvectors.Word2VecKeyedVectors . I need to do some data pre-processing and I am using multi-processing for the same. Is there a way in which all of my processes can use the object from the same memory location instead of each process loading the object into its own memory?

2 answers

Yes, here are two options:

you can use multiprocessing
or you can use Ray

Yes, if:

the files were saved using Gensim's internal .save() method, and the relevant large-arrays of vectors are clearly separate .npy files
the files are loaded using Gensim's internal .load() method, with the mmap option
you avoid doing any operations which inadvertently cause each process's object to reallocate the backing array completely (breaking the mmap-sharing).

See this prior answer for an overview of the steps/concerns of a similar need.

(The concern & extra steps listed there to avoid breaking the mmap-sharing – by performing manual patch-ups of the norm properties – should no longer be necessary in Gensim 4.0.0, currently available only as a prerelease version.)

How to load a pre-trained Word2vec MODEL File?

How to access/use Google's pre-trained Word2Vec model without manually downloading the model?

How to extract a word vector from the Google pre-trained model for word2vec?

How to initialize a new word2vec model with pre-trained model weights?

Gensim word2vec augment or merge pre-trained vectors

How to load a pre-trained Word2vec MODEL File and reuse it?

word2vec: user-level, document-level embeddings with pre-trained model

Word2Vec: Error received at uploading a pre-trained word2vec file using Gensim

Gensim's Doc2Vec - How to use pre-trained word2vec (word similarities)

pre-trained Word2Vec with LSTM, predict next word in sentence

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to load a pre-trained Word2vec MODEL File? How to access/use Google's pre-trained Word2Vec model without manually downloading the model? How to extract a word vector from the Google pre-trained model for word2vec? How to initialize a new word2vec model with pre-trained model weights? Gensim word2vec augment or merge pre-trained vectors How to load a pre-trained Word2vec MODEL File and reuse it? word2vec: user-level, document-level embeddings with pre-trained model Word2Vec: Error received at uploading a pre-trained word2vec file using Gensim Gensim's Doc2Vec - How to use pre-trained word2vec (word similarities) pre-trained Word2Vec with LSTM, predict next word in sentence

Related Tags

Shared memory among processes for pre-trained word2vec model?

Question

2 answers

solution1
1 2021-01-26 13:01:58

solution2
1 ACCPTED 2021-01-26 16:31:10

Shared memory among processes for pre-trained word2vec model?

Question

2 answers

solution1 1 2021-01-26 13:01:58

solution2 1 ACCPTED 2021-01-26 16:31:10

solution1
1 2021-01-26 13:01:58

solution2
1 ACCPTED 2021-01-26 16:31:10