Pandas - transposing lists of unequal length in the values of a DataFrame

Question

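This question is an extension of "Pandas: split lists in a column into multiple rows"; this time I want to merge more DataFrames, and I can't get it to work with more than two of them.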

I have this DataFrame:

  Index     Job positions   Job types   Locations
      0          [5]         [6]        [3, 4, 5]
      1          [1]         [2, 6]     [3, NaN] 
      2          [1,3]       [9, 43]    [1]
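
For anyone who wants to reproduce this, the frame above could be built roughly as follows. This is only a sketch: it assumes `index` is an ordinary column (the answer's code relies on that) and that the missing location is `numpy.nan`.

```python
import numpy as np
import pandas as pd

# Assumption: 'index' is a regular column, not the DataFrame index,
# and the missing location is represented by np.nan.
df = pd.DataFrame({
    "index": [0, 1, 2],
    "Job positions": [[5], [1], [1, 3]],
    "Job types": [[6], [2, 6], [9, 43]],
    "Locations": [[3, 4, 5], [3, np.nan], [1]],
})
print(df)
```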

I want every single combination of the numbers, so the final result would be:

index   Job position  Job type  Location
    0   5             6         3
    0   5             6         4
    0   5             6         5
    1   1             2         3
    1   1             2         NaN
    1   1             6         3
    1   1             6         NaN
    2   1             9         1
    2   1             43        1
    2   3             9         1
    2   3             43        1

So what I did was convert the columns to Series:

positions = df['Job positions'].apply(pd.Series).reset_index().melt(id_vars='index').dropna()[['index', 'value']].set_index('index')
types = df['Job types'].apply(pd.Series).reset_index().melt(id_vars='index').dropna()[['index', 'value']].set_index('index')
locations = df['Locations'].apply(pd.Series).reset_index().melt(id_vars='index').dropna()[['index', 'value']].set_index('index')

dfs = [positions, types, locations]

Then I tried to merge them like this:

from functools import reduce
df_final = reduce(lambda left, right: pd.merge(left, right, left_index=True, right_index=True, how="left"), dfs)

But it seems to skip the fields containing NaN. How can I prevent that?
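
A likely cause, sketched here under the assumption that the missing location is a real `NaN` inside the list: `apply(pd.Series)` pads the shorter lists with `NaN`, and the `.dropna()` that follows cannot tell that padding apart from the genuine `NaN`, so both are removed before the merge. The small Series below is a stand-in for `df['Locations']`.

```python
import numpy as np
import pandas as pd

# Stand-in for df['Locations'] from the question (assumed data).
locations_col = pd.Series([[3, 4, 5], [3, np.nan], [1]])

# Shorter lists are padded with NaN so every row has the same width.
wide = locations_col.apply(pd.Series)
long = wide.reset_index().melt(id_vars="index")

# Count of NaN values before dropna: the padding plus the genuine one.
print(long["value"].isna().sum())
# After dropna no NaN is left, so the genuine one is gone too.
print(long.dropna()["value"].isna().sum())
```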

Answer 1

A one-liner:

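import itertools
import pandas as pd

# Keep i[0] (the index value) and take every combination of the remaining list columns.
dfres = pd.DataFrame([(i[0],) + j for i in df.values for j in itertools.product(*i[1:])],
                     columns=df.columns).set_index('index')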


       Job positions  Job types  Locations
index                                     
0                  5          6        3
0                  5          6        4
0                  5          6        5
1                  1          2        3
1                  1          2        NaN
1                  1          6        3
1                  1          6        NaN
2                  1          9        1
2                  1         43        1
2                  3          9        1
2                  3         43        1
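
The one-liner can be hard to read, so here is the same idea unpacked into a loop. It is a sketch that assumes, as the answer does, that `index` is the first column and every other column holds a list; the function name `expand_combinations` and the sample data are made up for illustration.

```python
import itertools

import numpy as np
import pandas as pd


def expand_combinations(df):
    """Return one row per combination of the list items in each input row."""
    rows = []
    # Each plain tuple is (index value, list, list, ...).
    for idx, *lists in df.itertuples(index=False, name=None):
        # itertools.product yields every combination, one item per list.
        for combo in itertools.product(*lists):
            rows.append((idx, *combo))
    return pd.DataFrame(rows, columns=df.columns).set_index("index")


# Assumed reconstruction of the question's data, with 'index' as a column.
df = pd.DataFrame({
    "index": [0, 1, 2],
    "Job positions": [[5], [1], [1, 3]],
    "Job types": [[6], [2, 6], [9, 43]],
    "Locations": [[3, 4, 5], [3, np.nan], [1]],
})

dfres = expand_combinations(df)
print(dfres)
```

Because `itertools.product` treats `NaN` as an ordinary list element, the rows with a missing location are kept and nothing has to be dropped or merged.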

Solution 1
1 accepted 2018-05-12 14:47:43
