UnicodeDecodeError：'ascii'编解码器无法解码位置0的字节0xa0：序数不在范围内（128）

Question

I'm working on scraping Oregon Teacher License data for a project I'm doing. 我正在为正在执行的项目抓取俄勒冈州教师许可数据。 Here's my code: 这是我的代码：

educ_employ = tree.xpath('//tr[15]//td[@bgcolor="#A9EDFC"]//text()')
print educ_employ
#[u'Jefferson Middle School\xa0\xa0(2013 - 2014)']

I want to strip the the "\\xa0". 我要剥离“ \\ xa0”。 This is my code: 这是我的代码：

educ_employ = ([s.strip('\xa0') for s in educ_employ])
print educ_employ
UnicodeDecodeError: 'ascii' codec can't decode byte 0xa0 in position 0: ordinal not in range(128)

I tried this : 我尝试了这个：

educ_employ = ([s.decode('utf-8').strip('\xa0') for s in educ_employ])
print educ_employ
UnicodeDecodeError: 'ascii' codec can't decode byte 0xa0 in position 0: ordinal not in range(128)

And this : 这：

import sys

reload(sys)
sys.setdefaultencoding('utf-8')

educ_employ = tree.xpath('//tr[15]//td[@bgcolor="#A9EDFC"]//text()')
educ_employ = ([s.decode('utf-8').strip('\xa0') for s in educ_employ])
print educ_employ
>>>

I didn't get an error with the last one but I also didn't get an output. 我没有遇到最后一个错误，但是也没有得到输出。 I'm using Python 2.7. 我正在使用Python 2.7。 Does anyone know how to fix this? 有谁知道如何解决这一问题？

Answer 1

You are mixing up unicode objects and str objects. 您正在混合unicode对象和str对象。 educ_employ is a unicode , but '\\xa0' is a str . educ_employ是unicode ，但是'\\xa0'是str 。

Additionally, .strip() only removes characters from the beginning and end of the string, not the middle. 此外， .strip()仅从字符串的开头和结尾删除字符，而不从中间删除字符。 Try .replace() instead. 尝试使用.replace()代替。

Try: 尝试：

educ_employ = [u'Jefferson Middle School\xa0\xa0(2013 - 2014)']
educ_employ = [s.replace(u'\xa0', u'') for s in educ_employ]
print educ_employ

UnicodeDecodeError：'ascii'编解码器无法解码位置0的字节0xa0：序数不在范围内（128）

问题描述

1 个解决方案

解决方案1
3 已采纳 2016-03-18 14:46:10

UnicodeDecodeError：&#39;ascii&#39;编解码器无法解码位置0的字节0xa0：序数不在范围内（128）

问题描述

1 个解决方案

解决方案1 3 已采纳 2016-03-18 14:46:10

UnicodeDecodeError：'ascii'编解码器无法解码位置0的字节0xa0：序数不在范围内（128）

解决方案1
3 已采纳 2016-03-18 14:46:10