Python: using re to split a string before a character

Question

How do I split a string at the positions just before a given character?

For example, splitting before 'a':
input: "fffagggahhh"
output: ["fff", "aggg", "ahhh"]

The obvious way doesn't work:

>>> import re
>>> h = re.compile("(?=a)")
>>> h.split("fffagggahhh")
['fffagggahhh']
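A note for readers on newer interpreters: since Python 3.7, re.split also splits on empty matches, so the lookahead from the question behaves as intended there. The session above reflects the older behaviour. A minimal sketch, assuming Python 3.7 or later; the helper name split_before_lookahead is made up for illustration:

```python
import re

def split_before_lookahead(char, text):
    """Split text at every position just before char (assumes Python 3.7+)."""
    # (?=...) is a zero-width lookahead: it matches the empty string
    # right before char, so the split consumes nothing.
    return re.split("(?=" + re.escape(char) + ")", text)

print(split_before_lookahead("a", "fffagggahhh"))  # ['fff', 'aggg', 'ahhh']
```

On older versions the same call returns the input unsplit, which is why the other answers avoid re.split altogether.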

Answer 1

OK, this is not exactly the solution you asked for, but I thought it would be a useful addition to the problem here.

Solution without re:

>>> x = "fffagggahhh"
>>> k = x.split('a')
>>> j = [k[0]] + ['a'+l for l in k[1:]]
>>> j
['fff', 'aggg', 'ahhh']
>>>

Answer 2

>>> rx = re.compile("(?:a|^)[^a]*")
>>> rx.findall("fffagggahhh")
['fff', 'aggg', 'ahhh']
>>> rx.findall("aaa")
['a', 'a', 'a']
>>> rx.findall("fgh")
['fgh']
>>> rx.findall("")
['']
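Read aloud, the pattern says: each piece starts either with an a or at the very beginning of the string, and then runs up to (but not including) the next a. A rough sketch of how this could be generalized to another single character; the function name split_before_findall is hypothetical, and it assumes char is exactly one character:

```python
import re

def split_before_findall(char, text):
    # Assumption: char is a single character. re.escape keeps regex
    # metacharacters such as '.' literal, both in the alternation and
    # inside the negated character class.
    c = re.escape(char)
    pattern = "(?:%s|^)[^%s]*" % (c, c)
    return re.findall(pattern, text)

print(split_before_findall("a", "fffagggahhh"))  # ['fff', 'aggg', 'ahhh']
print(split_before_findall(".", "ab.cd.ef"))     # ['ab', '.cd', '.ef']
```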

Answer 3

>>> r=re.compile("(a?[^a]+)")
>>> r.findall("fffagggahhh")
['fff', 'aggg', 'ahhh']

EDIT:

This doesn't correctly handle a doubled a in the string; one of the two is silently dropped:

>>> r.findall("fffagggaahhh")
['fff', 'aggg', 'ahhh']

KennyTM's regex seems better suited.
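One quick way to spot this kind of loss is to check that the pieces join back into the original string. The sketch below is only an illustration of that check; it compares the pattern from this answer with the "(?:a|^)[^a]*" pattern that appears elsewhere in this thread:

```python
import re

text = "fffagggaahhh"

# Pattern from this answer: needs at least one non-a after the optional a,
# so a lone a directly followed by another a cannot start a piece.
lossy = re.findall("(a?[^a]+)", text)

# Pattern that allows a piece consisting of a single a.
lossless = re.findall("(?:a|^)[^a]*", text)

print(lossy, "".join(lossy) == text)        # ['fff', 'aggg', 'ahhh'] False
print(lossless, "".join(lossless) == text)  # ['fff', 'aggg', 'a', 'ahhh'] True
```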

Answer 4

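import re

def split_before(pattern, text):
    # Yield the pieces of text, cutting just before each match of pattern.
    prev = 0
    for m in re.finditer(pattern, text):
        yield text[prev:m.start()]
        prev = m.start()
    # Whatever follows the last match (the whole text if nothing matched).
    yield text[prev:]


if __name__ == '__main__':
    print(list(split_before("a", "fffagggahhh")))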

re.split treats the pattern as a delimiter, so the matched text is dropped from the pieces.

>>> print(list(split_before("a", "afffagggahhhaab")))
['', 'afff', 'aggg', 'ahhh', 'a', 'ab']
>>> print(list(split_before("a", "ffaabcaaa")))
['ff', 'a', 'abc', 'a', 'a', 'a']
>>> print(list(split_before("a", "aaaaa")))
['', 'a', 'a', 'a', 'a', 'a']
>>> print(list(split_before("a", "bbbb")))
['bbbb']
>>> print(list(split_before("a", "")))
['']
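To make the remark about re.split concrete, here is a small sketch of what it returns for the question's input. With a plain pattern the a is consumed; with a capturing group the a is kept, but as a separate list item rather than glued to the following piece, which is why the generator above is used instead:

```python
import re

text = "fffagggahhh"

# Plain pattern: the delimiter is removed from the result.
plain = re.split("a", text)
print(plain)     # ['fff', 'ggg', 'hhh']

# Capturing group: the delimiter is returned, but as its own element.
captured = re.split("(a)", text)
print(captured)  # ['fff', 'a', 'ggg', 'a', 'hhh']
```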

Answer 5

This one works on repeated a's:

  >>> re.findall("a[^a]*|^[^a]*", "aaaaa")
  ['a', 'a', 'a', 'a', 'a']
  >>> re.findall("a[^a]*|[^a]+", "ffaabcaaa")
  ['ff', 'a', 'abc', 'a', 'a', 'a']

Approach: the main chunks you are looking for are an a followed by zero or more non-a characters. That covers every possibility except a run of non-a characters that is not preceded by an a, which can happen only at the start of the input string.
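The two sessions above use slightly different second alternatives ("^[^a]*" and "[^a]+"). As far as I can tell they agree on non-empty input and differ only on the empty string; a small sketch to check this, with the names anchored and unanchored chosen just for this example:

```python
import re

anchored = re.compile("a[^a]*|^[^a]*")   # leading chunk may be empty
unanchored = re.compile("a[^a]*|[^a]+")  # leading chunk needs one character

for s in ["aaaaa", "ffaabcaaa", "bbbb", ""]:
    # Print the input followed by what each pattern finds in it.
    print(repr(s), anchored.findall(s), unanchored.findall(s))
```

For the empty string the first pattern yields [''] and the second yields [], so pick whichever fits the caller's expectations.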

Answer 6

>>> foo = "abbcaaaabbbbcaaab"
>>> bar = foo.split("c")
>>> baz = [bar[0]] + ["c"+x for x in bar[1:]]
>>> baz
['abb', 'caaaabbbb', 'caaab']

Because of how slicing works, this behaves correctly even if there are no occurrences of c in foo: bar[1:] is then simply an empty list.
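The same idea wrapped in a function, mainly to show the no-occurrence case. This is only a sketch; the name split_keep_prefix is invented here:

```python
def split_keep_prefix(text, char):
    # str.split drops the delimiter, so put it back on every piece
    # except the first one.
    parts = text.split(char)
    # parts[1:] is [] when char never occurs, so nothing gets prefixed.
    return [parts[0]] + [char + p for p in parts[1:]]

print(split_keep_prefix("abbcaaaabbbbcaaab", "c"))  # ['abb', 'caaaabbbb', 'caaab']
print(split_keep_prefix("abb", "c"))                # ['abb']
```

Note that a string starting with the character yields a leading empty string, since str.split returns one before the first delimiter.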

Answer 7

split() takes the character to split on as an argument (note that the a itself is dropped from each piece):

>>> "fffagggahhh".split('a')
['fff', 'ggg', 'hhh']

7 answers

Answer 1
20 2010-11-04 06:39:06

Answer 2
4 2010-11-04 06:41:28

Answer 3
3 2010-11-04 06:38:46

Answer 4
2 2010-11-04 06:50:07

Answer 5
0 2010-11-04 07:02:43

Answer 6
-1 2010-11-04 06:41:41

Answer 7
-3 2010-11-04 06:40:15
