如何在python中編碼字符'\\ u0107'

Question

我正在嘗試從Wikipedia頁面（這是某些年份的前100首單曲）中抓取數據，同時將輸出保存到1951-1959年得到的csv中，然后給出了一個錯誤：

在writer.writerow（songs）文件“ C：\\ Python36_64 \\ lib \\ encodings \\ cp1252.py”中的第43行，

第19行，以編碼返回codecs.charmap_encode（input，self.errors，encoding_table）[0]

UnicodeEncodeError：'charmap'編解碼器無法在位置29編碼字符'\\ u0107'：字符映射到<undefined>

碼：


from bs4 import BeautifulSoup
import requests
import csv

data = []


def scrape_data(search_year):
    year_data = []
    url = f'https://en.wikipedia.org/wiki/Billboard_Year-End_Hot_100_singles_of_{str(search_year)}'
    # Get a source code from url
    r = requests.get(url).text
    soup = BeautifulSoup(r, 'html.parser')
    # Isolate the table part from the source code
    table = soup.find('table', attrs={'class': 'wikitable'})
    # Extract every row of the table
    rows = table.find_all('tr')

    # Iterate through every row
    for row in rows[1:]:
        # Extract cols (with tags td and th)
        cols = row.find_all(['td', 'th'])
        # List comprehension (create a list of lists, list of rows, in which every row is a list of table text)
        year_data.append([col.text.replace('\n', '') for col in cols])

    # Add the year, this data is from to the beginning of the list
    for n in year_data:
        n.insert(0, search_year)
    return year_data


for year in range(1951, 2019):
    try:
        data.append(scrape_data(year))
        print(f'Year {str(year)} Scrapped')
    except AttributeError as e:
        print(f'Year {str(year)} is not aviable')

writer = csv.writer(open('songs.csv', 'w'), delimiter=',', lineterminator='\n', quotechar='"')
for year_data in data:
    for songs in year_data:
        writer.writerow(songs)
        print(songs)

Answer 1

我認為您可以在編寫輸出時通過使用正確的unicode編碼來糾正此問題：

writer = csv.writer(open('songs.csv', 'w', encoding='utf-8'),
                    delimiter=',', lineterminator='\n', quotechar='"')

如何在python中編碼字符'\\ u0107'

問題描述

1 個解決方案

解決方案1
2 已采納 2019-01-09 19:33:54

如何在python中編碼字符&#39;\\ u0107&#39;

問題描述

1 個解決方案

解決方案1 2 已采納 2019-01-09 19:33:54

如何在python中編碼字符'\\ u0107'

解決方案1
2 已采納 2019-01-09 19:33:54