pandas dataframe: add rows with shuffled values of specific columns

Question

I have the dataframe:

df = b_150 h_200 b_250 h_300 b_350 h_400  c1  c2 q4
       1.    2.    3.     4    5.    6.   3.  4.  4

I want to add rows containing possible shuffles of the values of b_150, b_250, b_350 and of h_200, h_300, h_400.

So, for example:

df = add_shuffles(df, cols=['b_150', 'b_250', 'b_350'], n=1)
df = add_shuffles(df, cols=['h_200', 'h_300', 'h_400'], n=1)

I would add 2 combinations (one for each column list) to get:

df = b_150 h_200 b_250 h_300 b_350 h_400   c1  c2 q4
       1.    2.    3.     4    5.    6.    3.  4.  4
       3.    2.    5.     4    1.    6.    3.  4.  4 
       1.    2.    3.     6    5.    4.    3.  4.  4

What is the most efficient way to do it?

Answer 1

Try:

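import random
import numpy as np
import pandas as pd

n = 2  # number of shuffled rows to add (df is the one-row frame from the question)

def columns_shuffler():
    # cols has keys 0 (b columns) and 1 (h columns); y decides which group is shuffled
    x, y = random.sample(list(cols), 2)
    if y:
        # shuffle the order of the b columns, keep the h columns as they are
        return random.sample(cols[0], len(cols[0])) + cols[1]
    else:
        # shuffle the order of the h columns, keep the b columns as they are
        return cols[0] + random.sample(cols[1], len(cols[1]))

msk = df.columns.str.contains('b')   # b_150, b_250, b_350
msk1 = df.columns.str.contains('h')  # h_200, h_300, h_400
cols = dict(enumerate([df.columns[msk].tolist(), df.columns[msk1].tolist()]))
# Read the values in shuffled column order, append the untouched columns,
# then relabel everything with the original column names
out = pd.concat([df, pd.DataFrame(np.c_[np.r_[[df[columns_shuffler()]
                                         for _ in range(n)]].reshape(n, -1),
                                        np.tile(df.loc[:, ~(msk | msk1)], (n,1))],
                                  columns=cols[0]+cols[1]+df.columns[~(msk|msk1)].tolist())])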
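
If you would rather pass the column lists explicitly, as in the question, something along these lines might be easier to read. It is only a sketch: `shuffled_rows` is a hypothetical helper (not a pandas function), the `seed` parameter is my own addition for reproducibility, and I am assuming every shuffle should start from the original row, so the helper returns only the new rows and they are concatenated once at the end.

```python
import numpy as np
import pandas as pd


def shuffled_rows(df, cols, n=1, seed=None):
    """Return n copies of df in which the values of `cols` are permuted.

    Hypothetical helper: the original rows are NOT included in the result.
    """
    rng = np.random.default_rng(seed)
    new_rows = []
    for _ in range(n):
        shuffled = df.copy()
        # Draw a random ordering of the selected columns and move the
        # values accordingly; all other columns stay untouched.
        perm = rng.permutation(len(cols))
        shuffled[cols] = df[cols].to_numpy()[:, perm]
        new_rows.append(shuffled)
    return pd.concat(new_rows, ignore_index=True)


df = pd.DataFrame(
    [[1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 3.0, 4.0, 4]],
    columns=["b_150", "h_200", "b_250", "h_300", "b_350", "h_400",
             "c1", "c2", "q4"],
)

b_rows = shuffled_rows(df, cols=["b_150", "b_250", "b_350"], n=1)
h_rows = shuffled_rows(df, cols=["h_200", "h_300", "h_400"], n=1)
out = pd.concat([df, b_rows, h_rows], ignore_index=True)
```

Note that a random permutation can come out as the identity, so a "shuffled" row may occasionally equal the original one. If that matters, you could redraw the permutation until it differs.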
