簡體 English 中英

Web抓取大數據時，進程以退出代碼-1073740791（0xc0000409）完成

[英]Process finished with exit code -1073740791 (0xc0000409) when web scraping large data

原文 2018-10-27 19:34:55 7 1 python/ web-scraping

我編寫了腳本來對網頁進行一些網絡抓取。 網頁上有JavaScript，因此在使用BeautifulSoup抓取所需內容之前，我使用PyQT5渲染了頁面。

但是，我要抓取許多頁面（超過10,000個），並且試圖將內容存儲在dict中，然后將其轉換為json文件。 我已嘗試定期周期性地寫入json文件，因為我認為由於刮擦次數而導致字典變得太大。 仍收到退出代碼。

在另一個線程上，有人提出了有關更新視頻卡驅動程序的建議（不知道為什么這會影響我的Python腳本，但是我試了一下。沒有任何進展。）

1 個解決方案

問題（至少在這種情況下）是字典太大了。 我解決問題的方法是每隔1000次抓取，通過在文件名后附加迭代器，將日期轉儲為硬盤上的json格式，清除dict，增加迭代器，然后繼續抓取。

... while/for loop iterating over all web pages
    data_table = soup.find('table', attrs={'class', 'dataTable'})
    ... process data into dict d
    data[id] = d
    if id % 1000 == 0:
        with open(r'datafile-{num}.json'.format(num=id//1000)) as file:
            json.dump(data, file)
        data.clear()
    id += 1  # increment the key for dict data and counter for file separation

由於我現在有很多文件，但至少有我想要的數據，這並不理想。 萬一其他人在Windows上獲得退出代碼-1073740791（0xc0000409），如果將大量數據轉儲到詞典中，這很可能就是原因。

使用XGBoost在PyCharm上以退出代碼-1073740791（0xC0000409）退出的過程完成

[英]Process finished with exit code -1073740791 (0xC0000409) on PyCharm with XGBoost

進程完成，退出代碼 -1073740791 (0xC0000409) pycharm 錯誤

[英]Process finished with exit code -1073740791 (0xC0000409) pycharm error

Pycharm 錯誤“進程以退出代碼 -1073740791 (0xC0000409) 完成”

[英]Pycharm error "Process finished with exit code -1073740791 (0xC0000409)"

進程以退出代碼 -1073740791 (0xC0000409) Tensorflow 錯誤結束

[英]Process finished with exit code -1073740791 (0xC0000409) Tensorflow error

tensorflow 進程完成，退出代碼為 -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN

[英]tensorflow process finished with exit code -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN

過程以退出代碼 -1073740791 (0xC0000409) PyQt5 和 Firebase 身份驗證完成

[英]Process finished with exit code -1073740791 (0xC0000409) PyQt5 and Firebase authentication

這是什么錯誤，我該如何解決？進程以退出代碼 -1073740791 (0xC0000409) 結束

[英]What is this error and how do I fix it? Process finished with exit code -1073740791 (0xC0000409)

進程完成，退出代碼 -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN

[英]Process finished with exit code -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN

進程已完成，退出代碼為 -1073740791 (0xC0000409) 錯誤，無法打開網站

[英]Process finished with exit code -1073740791 (0xC0000409) error not opening a website

一旦我單擊 signinButton 並退出，程序就停止運行，進程完成，退出代碼為 -1073740791 (0xC0000409)

[英]The program just stops running once i click the signinButton and exits with Process finished with exit code -1073740791 (0xC0000409)

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 使用XGBoost在PyCharm上以退出代碼-1073740791（0xC0000409）退出的過程完成進程完成，退出代碼 -1073740791 (0xC0000409) pycharm 錯誤 Pycharm 錯誤“進程以退出代碼 -1073740791 (0xC0000409) 完成” 進程以退出代碼 -1073740791 (0xC0000409) Tensorflow 錯誤結束 tensorflow 進程完成，退出代碼為 -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN 過程以退出代碼 -1073740791 (0xC0000409) PyQt5 和 Firebase 身份驗證完成這是什么錯誤，我該如何解決？進程以退出代碼 -1073740791 (0xC0000409) 結束進程完成，退出代碼 -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN 進程已完成，退出代碼為 -1073740791 (0xC0000409) 錯誤，無法打開網站一旦我單擊 signinButton 並退出，程序就停止運行，進程完成，退出代碼為 -1073740791 (0xC0000409)

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM