[英]How to parse a JSON file from HTML into Python using BeautifulSoup?
我有一個包含各種信息的HTML頁面,例如:
<input class="json-data" id="init-data" type="hidden" value='{"keyboardShortcuts":[{"name":"xxxx","description":"yyyyy", > etc...
我如何獲得類似的東西
name = xxxx
到目前為止,這就是我的代碼,但是每次只打印None時:
content = page.read()
html = BeautifulSoup(content, "html.parser")
element = html.find("input", class_="json-data", value_="keyboardShortcuts")
拉出value
屬性,使用json
讀取,然后即可獲取name
鍵的值:
從bs4導入BeautifulSoup
import json
content = '''
<input class="json-data" id="init-data" type="hidden" value='{"keyboardShortcuts":[{"name":"xxxx","description":"yyyyy"}]}
'''
html = BeautifulSoup(content, "lxml")
jsonData = json.loads(html.find('input', {'id':'init-data'})['value'])
print (jsonData['keyboardShortcuts'][0]['name'])
輸出:
print (jsonData['keyboardShortcuts'][0]['name'])
xxxx
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.