標簽[pytextrank] - 堆棧內存溢出

[英]Feed large text to PyTextRank

我想使用PyTextRank進行關鍵短語提取。如何將 500 萬份文件（每個文件由幾個段落組成）送入包裹？這是我在官方教程中看到的示例。我的選擇是僅將幾百萬個文檔連接到一個字符串並將其傳遞給nlp(text)嗎？我不認為我可以使用nlp.pipe(texts)因為我想通過計算所有文檔中 ...

讀取 pyTextRank 文件

[英]read pyTextRank file

我有一段文本，我希望使用 pytextrank 將其呈現為圖表。代碼（從源代碼復制）是 import spacy nlp = spacy.load("en_core_web_sm") import pytextrank import graphviz tr = ...

Pytextrank - 避免將小寫標簽轉換為關鍵短語提取

[英]Pytextrank - avoid lowercasing tags into key phrases extraction

我想避免在 pytextrank 中使用小寫標簽。關於如何實現的任何建議？ ...

帶有 pytextrank 的 spacy-udpipe 從非英文文本中提取關鍵字

[英]spacy-udpipe with pytextrank to extract keywords from non-English text

我一直在使用 pytextrank ( https://github.com/DerwenAI/pytextrank/ ) 和 spacy 和英語模型來提取關鍵字 - 效果很好！現在我需要處理非英文文本，我找到了 udpipe ( https://github.com/TakeLab/spacy ...

模塊“pytextrank”沒有屬性“parse_doc”

[英]Module 'pytextrank' has no attribute 'parse_doc'

我正在執行 nlp 任務。我已經編寫了以下代碼。在執行時，它顯示以下錯誤。任何解決錯誤的建議都會有所幫助。我在 google colab 中有 python 3 env。我收到以下錯誤： ...

OSError: [E050] 找不到 model 'en'

[英]OSError: [E050] Can't find model 'en'

我正在嘗試使用 python -https://github.com/DerwenAI/pytextrank/blob/master/example.ipynb的這個 pytextrank 庫，但我無法解決這個錯誤，之前我收到了一個錯誤，即 Z957B527BCFBAD2E80F58D206839 ...

Google colab：找不到名為 pytextrank 的模塊（以前使用同一筆記本工作過）

[英]Google colab: No module named pytextrank can be found (worked previously with the same notebook)

從最近開始，每當我運行筆記本時，我都會收到此錯誤： ModuleNotFoundError: No module named 'pytextrank' 這是我的筆記本的鏈接： https : //colab.research.google.com/github/neomatrix369/awes ...

使用 pytextrank 時 Spacy 中的值錯誤（textrank 的 Python 實現）

[英]Value error in Spacy when using pytextrank(Python implementation of textrank)

我已經使用pytextrank來提取關鍵字。我使用以下命令安裝了 pytextrank 和 spacy。這是我的代碼當我嘗試執行此操作時出現以下錯誤我正在使用 python 2.7、anaconda 4.3、jupyter notebook 和 ubuntu 14.04 ...

spaCy 需要一個不存在的文件：strings.json

[英]spaCy needs a file that is not there: strings.json

我正在運行 pytextrank 處於第二階段，我從 spaCy 收到此錯誤：我尋找strings.json，但沒有這樣的東西。有趣的是，當我使用以下錯誤代碼安裝 spaCy 時，存在與 pathlib.py 類似的錯誤：你們有什么想法嗎？謝謝 ...