繁体   English   中英

用Python将每个句子的首字母大写

[英]Capitalizing the first letter of each sentence in Python

这是我的代码:

def fix_capitalization(usrStr):
    newString = ''
    wordList = []
    numLetters = 0
    for s in usrStr.split('. '):
        if s[0].isupper():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(s)
        if s.islower():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(s)
            numLetters += 1

        if s[0].islower():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(s)
            numLetters += 1 



    newString = '. '.join(wordList)
    return newString, numLetters

传入的字符串是:

i want some water. he has some. maybe he can give me some. i think I will ask.

请注意,在之前maybe有4个空格。 我想要的结果是:

I want some water. He has some. Maybe he can give me some. I think I will ask.

但我得到:

I want some water. He has some. maybe he can give me some. I think I will ask.

我知道maybe因为我分手而没有大写. 在句号之后,该句子有多个空格,但是我不确定如何解决这个问题,或者不确定是否有更好的方法来处理我正在做的事情。 任何帮助将不胜感激。

在for循环中:首先找到非空格字符的索引。 然后将s [0]替换为s [index]。

使用正则表达式子方法的解决方案

def fix_capitalization(mystr):
    numLettets = 0
    newstr = []
    for s in mystr.split('. '):
        tmp = re.sub('^(\s*\w+)', lambda x:x.group(1).title(), s)
        newstr.append(tmp)
        # num of letters
        if s.lstrip()[0] != tmp.lstrip()[0]:
            numLetters += 1          
    return '. '.join(newstr).replace(' i ', ' I '), numLetters

fix_capitalization( 'i want some water. he has some.    maybe he can give me some. i think I will ask.')
# return ('I want some water. He has some.    Maybe he can give me some. I think I will ask.', 4)

简单修复如下原始代码

def fix_capitalization(usrStr):
    newString = ''
    wordList = []
    numLetters = 0
    for s in usrStr.split('. '):
        # check for leading space
        lspace = len(s) - len(s.lstrip())
        s = s.lstrip()

        if s[0].isupper():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(' '*lspace + s)
        if s.islower():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(' '*lspace + s)
            numLetters += 1
        if s[0].islower():
            s = s.capitalize()
            s = s.replace(' i ', " I ") 
            wordList.append(' '*lspace + s)
            numLetters += 1 

    newString = '. '.join(wordList)
    return newString, numLetters

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM