
How to get the groups of consecutive 1s in a DataFrame's flag column and keep only groups with size greater than 2

Original post: 2022-06-21 01:24:18 | Tags: python-3.x, pandas, numpy, pandas-groupby

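I want to extract the groups of rows where flag is a run of consecutive 1s, and keep a group only if its size is greater than 2.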

Here is my df:
import pandas as pd
df2 = pd.DataFrame({'A': [1, 20, 40, 45, 56, 1, 20, 40, 45, 56], 'flag': [3, 2, 4, 1, 1, 3, 3, 1, 1, 1]})
print(df2)
    A  flag
0   1     3
1  20     2
2  40     4
3  45     1
4  56     1
5   1     3
6  20     3
7  40     1
8  45     1
9  56     1

Expected output:
7  40     1
8  45     1
9  56     1

2 Answers

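There are many ways to do this. You can use pandas built-ins such as groupby and where, or simply use the following.

print(df2[(df2['flag']==1) & (df2['A']>2)])

Note that this mask filters on the values in column A, not on the length of each run of 1s, so for the sample data it also returns rows 3 and 4.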
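The answer mentions groupby and where without showing them. The following is only a minimal sketch of how that route might look, not the answerer's code. The names is_one, run_id, run_len, keep and result are made up for illustration, and the final astype(int) assumes every column is an integer column, as in the sample data.

```python
import pandas as pd

df2 = pd.DataFrame({'A': [1, 20, 40, 45, 56, 1, 20, 40, 45, 56],
                    'flag': [3, 2, 4, 1, 1, 3, 3, 1, 1, 1]})

# True where flag equals 1
is_one = df2['flag'].eq(1)

# A new run starts wherever is_one changes; cumsum turns the change
# points into one id per run of equal values
run_id = is_one.astype(int).diff().ne(0).cumsum()

# Length of the run that each row belongs to
run_len = is_one.groupby(run_id).transform('size')

# Rows that are 1s and sit in a run longer than 2
keep = is_one & run_len.gt(2)

# where() replaces rejected rows with NaN, dropna() removes them,
# and astype(int) undoes the float upcast caused by the NaNs
result = df2.where(keep).dropna().astype(int)
print(result)
```

Plain boolean indexing, df2[keep], selects the same rows without the NaN round trip; where is used here only because the answer names it.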

One way using pandas.DataFrame.groupby:

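# True where flag equals 1
s = df2["flag"].eq(1)
# Give each run of consecutive equal values its own id
m = s.diff(1).ne(0).cumsum()
# Keep rows whose run contains more than two 1s
new_df = df2[s.groupby(m).transform(lambda x: x.sum()>2)]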

Output:

    A  flag
7  40     1
8  45     1
9  56     1
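For reuse with other values or thresholds, the same run-labelling idea can be wrapped in a function. This is a sketch under stated assumptions rather than part of either answer: keep_long_runs and its parameters column, value and min_size are hypothetical names, and it swaps transform for DataFrameGroupBy.filter, which keeps or drops each group as a whole.

```python
import pandas as pd


def keep_long_runs(df, column="flag", value=1, min_size=3):
    """Return the rows of df that lie in a run of `value` in `column`
    that is at least `min_size` rows long (hypothetical helper)."""
    hit = df[column].eq(value)
    # One id per run of consecutive equal values in `hit`
    run_id = hit.astype(int).diff().ne(0).cumsum()
    # filter() keeps a whole group only when the function returns True
    return df.groupby(run_id).filter(
        lambda g: g[column].eq(value).all() and len(g) >= min_size
    )


df2 = pd.DataFrame({'A': [1, 20, 40, 45, 56, 1, 20, 40, 45, 56],
                    'flag': [3, 2, 4, 1, 1, 3, 3, 1, 1, 1]})

print(keep_long_runs(df2))              # runs of at least three 1s
print(keep_long_runs(df2, min_size=2))  # runs of at least two 1s
```

"Size greater than 2" from the question corresponds to min_size=3 here. filter calls the Python function once per group, so on very large frames the transform-based mask is likely to be faster.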


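Notice: technical posts on this site are licensed under CC BY-SA 4.0. If you repost, please cite this site's URL or the original address. For any questions, contact yoyou2525@163.com.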

粤ICP备18138465号 © 2020-2024 STACKOOM.COM