BeautifulSoup 如果找不到返回 0 而不是 None

Question

我有以下 Python 語法，使用 BeautifulSoup 進行 web 抓取。

page = soup.find('span', attrs={'class':'h-text-lg'})

對於一個特定頁面，這不會返回任何內容，因為不存在 class。 我應該如何修改代碼以使其返回[0]而不是None而不是將None作為返回值？

Answer 1

您可以使用 Python 的 boolean 表達式返回最后評估值的事實：

page = soup.find('span', attrs={'class':'h-text-lg'}) or [0]

但為什么？ 在這種情況下，擁有None會好得多。

如果page是找到的元素或None ，任何依賴page的代碼都可以簡單地檢查if page或if not page 。 如果page是[0]這將不起作用，因為bool([0])是True 。

Answer 2

如果您不希望它在 class 不存在的情況下返回None ，如果該值不是真值，您可以簡單地更改它

換句話說：

page = soup.find('span', attrs={'class':'h-text-lg'}) or [0]

Answer 3

使用if語句：

page = soup.find('span', attrs={'class':'h-text-lg'})

if not page:
    page = 0

Answer 4

page = soup.find('span', attrs={'class':'h-text-lg'}) or [0]

    def find(self, name=None, attrs={}, recursive=True, text=None,
             **kwargs):
        """Look in the children of this PageElement and find the first
        PageElement that matches the given criteria.

        All find_* methods take a common set of arguments. See the online
        documentation for detailed explanations.

        :param name: A filter on tag name.
        :param attrs: A dictionary of filters on attribute values.
        :param recursive: If this is True, find() will perform a
            recursive search of this PageElement's children. Otherwise,
            only the direct children will be considered.
        :param limit: Stop looking after finding this many results.
        :kwargs: A dictionary of filters on attribute values.
        :return: A PageElement.
        :rtype: bs4.element.Tag | bs4.element.NavigableString
        """
        r = None
        l = self.find_all(name, attrs, recursive, text, 1, **kwargs)
        if l:
            r = l[0]
        return r

這就是 find 方法的定義方式，因此您必須實際顯式處理None情況。 希望這能回答問題

BeautifulSoup 如果找不到返回 0 而不是 None

問題描述

4 個解決方案

解決方案1
1 2020-06-25 19:33:14

解決方案2
0 已采納 2020-06-25 19:33:49

解決方案3
0 2020-06-25 19:34:09

解決方案4
0 2020-06-25 19:36:59

BeautifulSoup 如果找不到返回 0 而不是 None

問題描述

4 個解決方案

解決方案1 1 2020-06-25 19:33:14

解決方案2 0 已采納 2020-06-25 19:33:49

解決方案3 0 2020-06-25 19:34:09

解決方案4 0 2020-06-25 19:36:59

解決方案1
1 2020-06-25 19:33:14

解決方案2
0 已采納 2020-06-25 19:33:49

解決方案3
0 2020-06-25 19:34:09

解決方案4
0 2020-06-25 19:36:59