如何在具有特定分隔符位置的熊猫中将列拆分为多列？

Question

here is my dataframe这是我的数据框

                    df_test
0   (-, 136), (-, 136), 1.0
1   (-, 136), (-, 438), 0.5
2   (-, 136), (-, 257), 0.5

I would like to see the result like this我想看到这样的结果

      df_t1   df_t2  df_val
0   (-, 136) (-, 136) 1.0
1   (-, 136) (-, 438) 0.5
2   (-, 136) (-, 257) 0.5

I have used this code but it is not working我已经使用了此代码，但它不起作用

new_df[['df_t1', 'df_t2', 'df_val']] = new_df['df_test'].str.split(',',expand=True)

any suggestion?有什么建议吗？

Answer 1

Specific to your format, you can use ast.literal_eval .具体到您的格式，您可以使用ast.literal_eval 。 Better, try and solve the issue upstream before your dataframe is constructed.更好的是，在构建数据帧之前尝试解决上游的问题。

from ast import literal_eval

df = pd.DataFrame({'df_test': ['(-, 136), (-, 136), 1.0',
                               '(-, 136), (-, 438), 0.5',
                               '(-, 136), (-, 257), 0.5']})

series = df.pop('df_test').str.replace('-', '"-"').apply(literal_eval)
df = df.join(pd.DataFrame(series.values.tolist(), columns=['df_t1', 'df_t2', 'df_val']))

print(df)

      df_t1     df_t2  df_val
0  (-, 136)  (-, 136)     1.0
1  (-, 136)  (-, 438)     0.5
2  (-, 136)  (-, 257)     0.5

Answer 2

Use:用：

new_df[['df_t1', 'df_t2', 'df_val']] = new_df['df_test'].str.rsplit('),', expand=True)
new_df[['df_t1', 'df_t2']] += ')' 
print (new_df)
                   df_test     df_t1      df_t2 df_val
0  (-, 136), (-, 136), 1.0  (-, 136)   (-, 136)    1.0
1  (-, 136), (-, 438), 0.5  (-, 136)   (-, 438)    0.5
2  (-, 136), (-, 257), 0.5  (-, 136)   (-, 257)    0.5

如何在具有特定分隔符位置的熊猫中将列拆分为多列？

问题描述

2 个解决方案

解决方案1
1 2019-01-16 11:45:54

解决方案2
0 2019-01-16 11:38:17

如何在具有特定分隔符位置的熊猫中将列拆分为多列？

问题描述

2 个解决方案

解决方案1 1 2019-01-16 11:45:54

解决方案2 0 2019-01-16 11:38:17

解决方案1
1 2019-01-16 11:45:54

解决方案2
0 2019-01-16 11:38:17