我如何從帶有美麗湯的湯中得到特定的單詞短語？

Question

我已經使用BeautifulSoup對代碼進行了排序，結果如下：

<bound method Tag.prettify of <script type="text/javascript">var LifeTimeStats = [{"Key":"Top 3","Value":"31"},{"Key":"Top 5s","Value":"36"},{"Key":"Top 3s","Value":"13"},{"Key":"Top 6s","Value":"27"},{"Key":"Top 12s","Value":"76"},{"Key":"Top 25s","Value":"58"},{"Key":"Score","Value":"99,788"},{"Key":"Matches Played","Value":"502"},{"Key":"Wins","Value":"9"},{"Key":"Win%","Value":"2%"},{"Key":"Kills","Value":"730"},{"Key":"K/d","Value":"1.48"}];</script>>

我試圖從中獲取特定的值“ 730” ：

{"Key":"Kills","Value":"730"}

由於沒有可以分類的HTML標記。 我不知道如何獲得此特定值。 你有什么主意嗎？

也許還有另一種解決方法……這是完整的代碼：

#----WEB INPUT BASIC----

    #import bs4
    from urllib.request import urlopen as uReq
    from urllib.request import Request, urlopen
    from  bs4 import BeautifulSoup as soup


    #setting my url
    url = 'https://fortnitetracker.com/profile/psn/Rehgum'

    #making my https page work
    req = Request(url, headers={'User-Agent': 'Mozilla/5.0'})

    web_byte = urlopen(req).read()
    webpage = web_byte.decode('utf-8')
    urlopen(req).close()

    #html parsing
    page_soup = soup(webpage, "html.parser")

    lifetime = page_soup.findAll("script",{"type":"text/javascript"})

    stats = lifetime[3]

    specific = stats.prettify

    value = specific.text

    #from here there is just code to put that value in a .txt file

Answer 1

這只是您可以做什么的一個想法：

將JS代碼提取到Python變量中。
進行正則表達式操作以提取變量的值。
“ JSONify”這樣的變量值。
提取所需的數據。

作為摘錄：

a = '''var LifeTimeStats = [{"Key":"Top 3","Value":"31"},{"Key":"Top 5s","Value":"36"},{"Key":"Top 3s","Value":"13"},{"Key":"Top 6s","Value":"27"},{"Key":"Top 12s","Value":"76"},{"Key":"Top 25s","Value":"58"},{"Key":"Score","Value":"99,788"},{"Key":"Matches Played","Value":"502"},{"Key":"Wins","Value":"9"},{"Key":"Win%","Value":"2%"},{"Key":"Kills","Value":"730"},{"Key":"K/d","Value":"1.48"}];'''

b = re.findall(r'var.*?=\s*(.*?);', a)[0]
c = json.loads(b)

請參閱我編寫的虛擬完整代碼。

UPDATE

看到完整的代碼后... 這可能是您的問題的解決方案。

Answer 2

我終於開始工作了！ 產生我的錯誤的是“ def loop（）：”部分。

這是最終的工作代碼：

def loop():
    from urllib.request import Request, urlopen
    from  bs4 import BeautifulSoup as soup
    import json
    import re
    import time


    #setting my url
    url = 'https://fortnitetracker.com/profile/psn/Rehgum'

    #making my https page work
    req = Request(url, headers={'User-Agent': 'Mozilla/5.0'})

    web_byte = urlopen(req).read()
    webpage = web_byte.decode('utf-8')
    urlopen(req).close()

    #html parsing
    page_soup = soup(webpage, "html.parser")

    lifetime = page_soup.findAll("script",{"type":"text/javascript"})

    stats = lifetime[3]

    stats_var = re.findall(r'var.*?=\s*(.*?);', stats.text)[0]

    vals = json.loads(stats_var)

    for val in vals:
        if val['Key'] == 'Kills':
            num_kills = val['Value']
            break

    print('Num kills = {}'.format(num_kills))

    with open('lifetime_wins.txt', 'w') as fd:
        fd.write(str(num_kills))

    time.sleep(30)

    loop()


for i in range(1,2):
        loop()

while i<1:
    print ("Ende")

大聲“謝謝” @kazbeel。 你救了我的一天！ +代表

我如何從帶有美麗湯的湯中得到特定的單詞短語？

問題描述

2 個解決方案

解決方案1
1 已采納 2018-04-25 09:50:02

解決方案2
0 2018-04-25 14:02:16

我如何從帶有美麗湯的湯中得到特定的單詞短語？

問題描述

2 個解決方案

解決方案1 1 已采納 2018-04-25 09:50:02

解決方案2 0 2018-04-25 14:02:16

解決方案1
1 已采納 2018-04-25 09:50:02

解決方案2
0 2018-04-25 14:02:16