Problem parsing HTML with Beautiful Soup in Python

Question

I am trying to get the names of all organizations from https://www.devex.com/organizations/search using BeautifulSoup. However, I am getting an error. Can someone please help?

import requests
from requests import get
from bs4 import BeautifulSoup
import pandas as pd
import numpy as np

from time import sleep
from random import randint

headers = {"Accept-Language": "en-US,en;q=0.5"}

titles = []
pages = np.arange(1, 2, 1)

for page in pages:

    page = requests.get("https://www.devex.com/organizations/search?page%5Bnumber%5D=" + str(page) + "", headers=headers)

    soup = BeautifulSoup(page.text, 'html.parser')
    movie_div = soup.find_all('div', class_='info-container')

    sleep(randint(2,10))

    for container in movie_div:

        name = container.a.find('h3', class_= 'ng-binding').text
        titles.append(name)

movies = pd.DataFrame({
    'movie': titles,
})

# to see your dataframe
print(movies)

# to see the datatypes of your columns
print(movies.dtypes)

# to see where you're missing data and how much data is missing
print(movies.isnull().sum())

# to move all your scraped data to a CSV file
movies.to_csv('movies.csv')

Answer 1

You may try something like:

name = bs.find("h3", {"class": "ng-binding"})
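To make the suggestion concrete, here is a minimal sketch of that lookup run against a small inline HTML sample. The sample markup and the helper name `extract_names` are made up for illustration, and the real page's structure is an assumption. It also checks for `None`, since `find()` returns `None` when nothing matches and calling `.text` on that raises an `AttributeError`.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for two search-result cards; the real
# page's structure is an assumption, not something taken from the site.
SAMPLE_HTML = """
<div class="info-container">
  <a href="/organizations/example-org"><h3 class="ng-binding">Example Org</h3></a>
</div>
<div class="info-container">
  <a href="/organizations/no-heading"></a>
</div>
"""


def extract_names(html):
    """Return the text of each h3.ng-binding found inside an info-container div."""
    soup = BeautifulSoup(html, "html.parser")
    names = []
    for container in soup.find_all("div", class_="info-container"):
        heading = container.find("h3", {"class": "ng-binding"})
        # find() returns None when nothing matches, so check before reading text
        if heading is not None:
            names.append(heading.get_text(strip=True))
    return names


print(extract_names(SAMPLE_HTML))
```

If this returns an empty list for the live page, one possible cause (a guess, not checked against the site) is that the names are filled in by JavaScript after the page loads. `ng-binding` is a class that AngularJS adds to bound elements, so the HTML that `requests` receives may not contain those headings at all.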

Answer 1 posted 2021-01-23 13:11:16
