简体   繁体   中英

Slicing pandas dataframe on equal column values

I have a pandas df that looks like this:

import pandas as pd

df = pd.DataFrame({0:[1],5:[1],10:[1],15:[1],20:[0],25:[0],
                   30:[1],35:[1],40:[0],45:[0],50:[0]})
df

在此处输入图片说明

The column names reflect coordinates. I would like to retrieve the start and end coordinate of columns with consecutive equal numbers.

The output should be something like this:

# start,end
0,15 
20,25
30,35
40,50

IIUCusing groupby with diff and cumsum to split the group

s=df.T.reset_index()
s=s.groupby(s[0].diff().ne(0).cumsum())['index'].agg(['first','last'])
Out[241]: 
   first  last
0             
1      0    15
2     20    25
3     30    35
4     40    50

cumsum to identify group, and groupby :

s = df.iloc[0].diff().ne(0).cumsum()
(df.columns.to_series()
 .groupby(s).agg(['min','max'])
)

Output:

   min  max
0          
1    0   15
2   20   25
3   30   35
4   40   50

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM