re.sub with Japanese characters

Question

I have the following string:

s = u'アガサ・クリスティー　奥さまは名探偵　～パディントン発4時50分～（字幕版）'

However, when I try to remove the character （ and everything after it, the pattern doesn't match:

>>> print re.sub(r'\（.+$', '', s)
アガサ・クリスティー　奥さまは名探偵　～パディントン発4時50分～（字幕版）

How would I get the string to be just:

アガサ・クリスティー　奥さまは名探偵　～パディントン発4時50分～

?

Answer 1

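You should ensure that all of the arguments to re.sub() are the same type, either all str or all unicode. In the question, s is unicode but the pattern is a byte str, so the encoded bytes of （ in the pattern never match the single Unicode character in s. Try this: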

# encoding: utf-8

import re
s = u'アガサ・クリスティー　奥さまは名探偵　～パディントン発4時50分～（字幕版）'
# The pattern (ur'...') and the replacement (u'') are both unicode, like s.
print re.sub(ur'\（.+$', u'', s)
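
For readers on Python 3, the snippet above will not run as written (print is a function there, and the ur'' prefix is not accepted). The following is a minimal sketch of the same idea, assuming Python 3, where every str is already Unicode text, so the pattern and the subject share a type without any u prefix. The variable name result is only illustrative.

```python
import re

s = 'アガサ・クリスティー　奥さまは名探偵　～パディントン発4時50分～（字幕版）'

# '（' is the fullwidth left parenthesis (U+FF08), not the ASCII '(';
# it is not a regex metacharacter, so it needs no backslash.
result = re.sub(r'（.+$', '', s)
print(result)

# The same truncation without a regex: keep everything before the first '（'.
assert s.partition('（')[0] == result
```

If the fullwidth parenthesis may be absent, both forms leave the string unchanged.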

(Answer 1: score 2, accepted, 2016-04-27 18:28:39)