我的脚本没有检索列表项。如果有人可以帮助我理解原因，我将不胜感激。

Question

我需要更改AtBS图书中的xkcd项目，才能从其他网站下载漫画。 这是我的剧本。

#! python3
# getwebcomic.py - Downloads every single smbc comic.

import requests, os, bs4

os.chdir('C:\\Users\\Bob\\Desktop\\')
url = 'https://smbc-comics.com' # starting url
os.makedirs('smbc', exist_ok=True) # store comics in ./smbc
noAbuse=0

for noAbuse in range(0, 5):
#while not url.endswith('#'):
    # Download the page.
    print('Downloading page %s...' % url)
    res = requests.get(url)
    res.raise_for_status()

    soup = bs4.BeautifulSoup(res.text, "html.parser")

    # Find the URL of the comic image.
    comicElem = soup.select('#cc-comicbody')
    print('I am finding it')
    print(comicElem)
    if comicElem == []:
        print('Could not find comic image.')
    else:
        print(comicElem[0].get('src'))
        print('I dont know why the .get is returning NONE!')
        print('It is there???')
        print('...and now it crashes')
        comicUrl = 'https//smbc-comics.com' + comicElem[0].get('src')
        print(comicUrl)
        # Download the image.

        print('Downloading image %s...' % (comicUrl))
        res = requests.get(comicUrl)
        res.raise_for_status()

        # Save the image to ./smbc
        imageFile = open(os.path.join('smbc', os.path.basename(comicUrl)), 
'wb')
       for chunk in res.iter_content(100000):
            imageFile.write(chunk)
        imageFile.close()

    # Get the Prev button's url.
   prevLink = soup.select('a[rel="prev"]')[0]
   url = 'http://smbc-comic.com' + prevLink.get('href')

print('Done.')

输出

Downloading page https://smbc-comics.com...

I am finding it
[<div id="cc-comicbody"><img border="0" id="cc-comic" 
 src="/comics/1524150658-20170419 (1).png" 
 title="You can also just use an infinite quantity of compasses as on-off switches."/><br/></div>]
None

我不知道为什么.get没有返回！ 在那里吗？？？ ...现在崩溃了

追溯（最近一次通话）：
<module>中的文件“ C：\\ Users \\ Bob \\ PythonScripts \\ getwebcomic.py”，第31行
comicUrl ='https // smbc-comics.com'+ comicElem [0] .get（'src'）
TypeError：必须为str，而不是NoneType

我似乎无法弄清楚为什么当'src'属性存在时.get方法不返回任何内容。 任何提示将不胜感激。 我添加了一些额外的print（）来帮助我查看脚本运行时发生的情况。

Answer 1

comicElem[0]是一个除法（ <div> ）。 它没有src属性，因此.get返回None 。 您应该改用comicElem[0].img.get("src") ，它返回"/comics/1524150658-20170419 (1).png" 。

我的脚本没有检索列表项。如果有人可以帮助我理解原因，我将不胜感激。

问题描述

1 个解决方案

解决方案1
0 2018-04-19 16:09:21

我的脚本没有检索列表项。 如果有人可以帮助我理解原因，我将不胜感激。

问题描述

1 个解决方案

解决方案1 0 2018-04-19 16:09:21

我的脚本没有检索列表项。如果有人可以帮助我理解原因，我将不胜感激。

解决方案1
0 2018-04-19 16:09:21