如何从 python 中的大量列表中拆分所有其他元素

Question

I have a huge list in python in a single column and i need to split all the fruits, colours etc from the list and make a dafaframe.我在 python 的单列中有一个巨大的列表，我需要从列表中拆分所有水果、颜色等并制作一个 dafaframe。

example
details=['banana', 
'type:', 
'fruit', 
'color:', 
'yellow', 
'orange', 
'type:', 
'fruit', 
'color:', 
'orange',
'blueberry', 
'type:', 
'fruit', 
'color:', 
'blue']

what I'm expecting to achieve is if I extract all color from above then the result should be a single column of list like below.我期望实现的是，如果我从上面提取所有颜色，那么结果应该是如下所示的单列列表。

Out[1]:

['yellow',
 'orange',
 'blue']

Answer 1

details=['banana',
         'type:',
         'fruit',
         'color:',
         'yellow',
         'orange',
         'type:',
         'fruit',
         'color:',
         'orange',
         'blueberry',
         'type:',
         'fruit',
         'color:',
        'blue']
# split each fruit as a list and index colors 
details = [details[i:i+5] for i in range(0,len(details),5)]
fruits = []
color = []
for i in details:
  fruits.append(i[0])
  color.append(i[4])

Answer 2

One possible approach if the data-structure doesn't change, is using list comprehension:如果数据结构没有改变，一种可能的方法是使用列表理解：

eg: [details[i+1] for i, x in enumerate(details) if x == 'color:']例如： [details[i+1] for i, x in enumerate(details) if x == 'color:']

Full code:完整代码：

details=['banana',
'type:',
'fruit',
'color:',
'yellow',
'orange',
'type:',
'fruit',
'color:',
'orange',
'blueberry',
'type:',
'fruit',
'color:',
'blue']

colors = [details[i+1] for i, x in enumerate(details) if x == 'color:']
fruits = [details[i-1] for i, x in enumerate(details) if x == 'type:']
types = [details[i+1] for i, x in enumerate(details) if x == 'type:']

print('fruits: ', fruits)
print('types: ', types)
print('colors: ', colors)

Output: Output：

fruits:  ['banana', 'orange', 'blueberry']
types:  ['fruit', 'fruit', 'fruit']
colors:  ['yellow', 'orange', 'blue']

Or as Dataframe或如 Dataframe

# to make datafame
import pandas as pd

df = pd.DataFrame()
df['fruits'] = [details[i-1] for i, x in enumerate(details) if x == 'type:']
df['types'] = [details[i+1] for i, x in enumerate(details) if x == 'type:']
df['colors'] = [details[i+1] for i, x in enumerate(details) if x == 'color:']

print(df)

Output: Output：

      fruits  types  colors
0     banana  fruit  yellow
1     orange  fruit  orange
2  blueberry  fruit    blue

Explanation:解释：

value of colour comes after the string color , so find the indices of that string in the list color 的值在字符串color之后，因此在列表中找到该字符串的索引
add 1 to this index to get the value of color将此索引加1以获取颜色的值
use this with list comprehension to get desired output array将此与列表理解一起使用以获得所需的 output 数组
repeat for other values重复其他值

Answer 3

If you want to extract only colors, you can use the package colour - https://github.com/vaab/colour如果只想提取 colors，可以使用 package colour - https://github.com/vaab/colour

In [7]: from colour import Color

In [8]: details
Out[8]:
['banana',
 'type:',
 'fruit',
 'color:',
 'yellow',
 'orange',
 'type:',
 'fruit',
 'color:',
 'orange',
 'blueberry',
 'type:',
 'fruit',
 'color:',
 'blue']

In [9]: colors_only = set()

In [10]: for i in details:
    ...:     try:
    ...:         Color(i)
    ...:         colors_only.add(i)
    ...:     except: pass
    ...:

In [11]: colors_only
Out[11]: {'yellow', 'orange', 'blue'}

如何从 python 中的大量列表中拆分所有其他元素

问题描述

3 个解决方案

解决方案1
1 2021-06-03 12:33:05

解决方案2
0 2021-06-03 12:25:08

解决方案3
0 已采纳 2021-06-03 12:39:23

如何从 python 中的大量列表中拆分所有其他元素

问题描述

3 个解决方案

解决方案1 1 2021-06-03 12:33:05

解决方案2 0 2021-06-03 12:25:08

解决方案3 0 已采纳 2021-06-03 12:39:23

解决方案1
1 2021-06-03 12:33:05

解决方案2
0 2021-06-03 12:25:08

解决方案3
0 已采纳 2021-06-03 12:39:23