迭代拆分列表元素

Question

I want to split a list names element.我想拆分一个列表names元素。 More precicely i only want to split the strings with Oscar Muller更准确地说，我只想和Oscar Muller分手

names = ['Oscar Muller Some other Name', 'Oscar Muller', 'Peter Pan']
expected_names = ['Oscar Muller', 'Some other Name', 'Oscar Muller', 'Peter Pan']

d = "Oscar Muller "
for line in names:
    s = [e+d for e in line.split(d) if e]

That didnt do anything.那没有做任何事情。

[list(filter(None, re.split(r'Oscar\sMuller\s', i))) for i in names]

didnt do anything either.也没有做任何事情。

d1 = re.compile(r"Oscar\sMuller\s")
d = d1.search(names)
for line in names:
    if d:
        s = [e+d for e in line.split(d) if e]

but it caused issues with input .split() .但它导致输入.split()出现问题。 Error: TypeError: must be str or None, not re.Pattern .错误： TypeError: must be str or None, not re.Pattern 。 So i changed it to process each list element.所以我改变它来处理每个列表元素。

d1 = re.compile(r"Oscar\sMuller\s")
d = list(filter(d1.match, names))
for line in names:
    if d:
        s = [e+d for e in line.split(d) if e]

But it didnt work either, returning TypeError: must be str or None, not list但它也没有用，返回TypeError: must be str or None, not list

Question : What am i doing wrong?问题：我做错了什么？

Answer 1

You can also use list comprehension to make it one line:您还可以使用列表理解使其成为一行：

import re
[j for i in [re.split(r"(?<=Oscar Muller)", k) for k in names] for j in i if j]

Answer 2

Essentially, what you need to do is generate 1 or 2 item sublists for each item in the original list, and then flatten the list into a single iterable.本质上，您需要做的是为原始列表中的每个项目生成 1 或 2 个项目子列表，然后将列表展平为单个可迭代对象。

A couple ways you could do this.有几种方法可以做到这一点。 You could use a generator function, or some clever use of itertools您可以使用生成器 function，或者巧妙地使用itertools

import re

def my_generator(names):
    for name in names:
        sublist = re.split(r"(?<=Oscar Muller) ", name)
        yield from sublist

names = ['Oscar Muller Some other Name', 'Oscar Muller', 'Peter Pan']
expected_names = list(my_generator(names))

Or you could one-liner it with itertools :或者您可以使用itertools对其进行单线处理：

import itertools
import re

names = ['Oscar Muller Some other Name', 'Oscar Muller', 'Peter Pan']
expected_names = list(itertools.chain.from_iterable(re.split(r"(?<=Oscar Muller) ", s) for s in names))

迭代拆分列表元素

问题描述

2 个解决方案

解决方案1
1 2022-04-27 06:56:30

解决方案2
0 已采纳 2022-04-27 06:51:52

迭代拆分列表元素

问题描述

2 个解决方案

解决方案1 1 2022-04-27 06:56:30

解决方案2 0 已采纳 2022-04-27 06:51:52

解决方案1
1 2022-04-27 06:56:30

解决方案2
0 已采纳 2022-04-27 06:51:52