简体   繁体   English

UnicodeDecodeError:'ascii'编解码器无法解码位置0中的字节0xe2:序号不在范围内(128)

[英]UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128)

I have this code: 我有这个代码:

# -*- coding: utf-8 -*-

forbiddenWords=['for', 'and', 'nor', 'but', 'or', 'yet', 'so', 'not', 'a', 'the', 'an', 'of', 'in', 'to', 'for', 'with', 'on', 'at', 'from', 'by', 'about', 'as']


def IntoSentences(paragraph):
    paragraph = paragraph.replace("–", "-")
    import nltk.data
    sent_detector = nltk.data.load('tokenizers/punkt/english.pickle')
    sentenceList = sent_detector.tokenize(paragraph.strip())
    return sentenceList

from Tkinter import *

root = Tk()

var = StringVar()
label = Label( root, textvariable=var)
var.set("Fill in the caps: ")
label.pack()

text = Text(root)
text.pack()

button=Button(root, text ="Create text with caps.", command =lambda: IntoSentences(text.get(1.0,END)))
button.pack()

root.mainloop()

Everything works fine when I run the code. 当我运行代码时,一切正常。 Then I insert the text and press the button. 然后我插入文本并按下按钮。 But then I get this error: 但后来我得到了这个错误:

C:\Users\Indrek>caps_main.py
Exception in Tkinter callback
Traceback (most recent call last):
  File "C:\Python27\lib\lib-tk\Tkinter.py", line 1470, in __call__
    return self.func(*args)
  File "C:\Python27\Myprojects\caps_main.py", line 25, in <lambda>
    button=Button(root, text ="Create text with caps.", command =lambda: IntoSen
tences(text.get(1.0,END)))
  File "C:\Python27\Myprojects\caps_main.py", line 7, in IntoSentences
    paragraph = paragraph.replace("ŌĆō", "-")
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal
not in range(128)

How to fix this problem? 如何解决这个问题? At first I had the same error message, when I tried to run thecode, then I added the lambda: and now the problem appears when i click the button in my app. 起初我有相同的错误消息,当我尝试运行代码,然后我添加了lambda:现在当我点击我的应用程序中的按钮时出现问题。

You'll have to decode the string to utf-8(or some other encoding) and then replace the unicode string to someting else. 您必须将字符串解码为utf-8(或其他一些编码),然后将unicode字符串替换为其他字符串。 This piece of code does what you are trying to achieve: 这段代码完成了您要实现的目标:

paragraph = paragrah.decode('utf-8').replace(u'\u014c\u0106\u014d','-')
# '\u014c\u0106\u014d' is the unicode representation of characters ŌĆō

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 UnicodeDecodeError:“ascii”编解码器无法解码位置 13 中的字节 0xe2:序号不在范围内(128) - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 13: ordinal not in range(128) UnicodeDecodeError:&#39;ascii&#39; 编解码器无法解码位置 14 中的字节 0xe2:在 GAE python 中序号不在范围内(128)? - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 14: ordinal not in range(128) in GAE python? Python UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置0的字节0xe2:序数不在范围内(128) - Python UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128) 安装熊猫:UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置 72 中的字节 0xe2:序号不在范围内(128) - Installing pandas: UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 72: ordinal not in range(128) UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置139中的字节0xe2:序数不在范围内(128) - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 139: ordinal not in range(128) Python 3 UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置0中的字节0xe2:序数不在范围内(128) - Python 3 UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128) UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置35的字节0xe2:序数不在范围内(128) - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 35: ordinal not in range(128) UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置4中的字节0xe2:序数不在范围内(128) - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 4: ordinal not in range(128) Python UnicodeDecodeError:&#39;ascii&#39;编解码器不能解码不在范围内的字节0xe2序数(128) - Python UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 ordinal not in range(128) UnicodeDecodeError:&#39;ascii&#39;编解码器无法解码位置13的字节0xe2:关于读取文件的序数不在range(128)中 - UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 13: ordinal not in range(128), regarding reading in files
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM