BeautifulSoup does not fully parse the page

Question

    import requests
    from bs4 import BeautifulSoup as bs

    url1 = 'https://school.karelia.ru/auth/login'
    url2 = 'https://school.karelia.ru/personal-area/#diary'

    # Form fields sent to the login endpoint
    payload = {
        'login_login': 'КлочковМ',
        'login_password': 'КлочковМ7'
    }

    def getHW():
        # A session keeps the login cookies for the second request
        with requests.Session() as s:
            s.post(url1, data=payload)
            r = s.get(url2)
            soup = bs(r.content, 'html.parser')
            # Print every <div> found in the returned HTML
            print(soup.find_all("div"))

    getHW()

I am trying to parse a site, but this code does not parse it fully. The site's HTML contains many more nested elements than the result I get from this code:

<div class="right" id="main-region"></div>

For some reason, the div with class "right" just ends there, even though on the site it contains a lot more content. Why could this be?

Answer 1

It is because you called soup.find_all("div"). The div ends there with </div>, and you told BS to look only for divs, so BS stops there. To actually search by class, see for example this answer.
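As a minimal sketch of searching by class, assuming a small static HTML string in place of the real page: `SAMPLE_HTML`, `find_by_class`, and `is_empty` are hypothetical names made up for this illustration, and the markup is not the site's actual markup. Note that a tag returned by `find_all` includes whatever children were present in the parsed HTML, so an empty result means the div was already empty in the text that was parsed.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a fetched page (not the real site's HTML).
SAMPLE_HTML = """
<div class="left" id="menu"><div class="item">Diary</div></div>
<div class="right" id="main-region"></div>
"""


def find_by_class(html, class_name):
    """Return every <div> whose class attribute contains class_name."""
    soup = BeautifulSoup(html, "html.parser")
    # class_ (with the trailing underscore) filters on the CSS class
    return soup.find_all("div", class_=class_name)


def is_empty(tag):
    """True when the tag has no child elements and no non-whitespace text."""
    return tag.find(True) is None and not tag.get_text(strip=True)


right_divs = find_by_class(SAMPLE_HTML, "right")
left_divs = find_by_class(SAMPLE_HTML, "left")

# A CSS selector is another way to reach the same element by its id.
main_region = BeautifulSoup(SAMPLE_HTML, "html.parser").select_one("#main-region")

for div in right_divs + left_divs:
    print(div.get("id"), "empty" if is_empty(div) else "has content")
```

Running a check like `is_empty` on the div taken from `r.content` shows whether the nested elements were in the response at all, which is worth knowing before changing the search itself.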

Solution 1
1 2022-03-21 17:05:02
