
Looking for a way to remove subsets from columns in dataframe

Original post: 2020-06-12 19:13:55 6 1 python/ pandas/ dataframe

I have a dataframe in the following format -

'''
ids                        size
[A, B, C, D, E, F]         100
[C,D,E]                     50 
[C,D,E,F,G]                200
[D,E,F,G,H]                190
[E,F,G,H]                  100
[K, L, M, N]               200
'''

The dataframe has thousands of rows and numerous ID combinations. Working with the lists is a bit painful. I can use issubset to remove the [C, D, E] entry.
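For reference, the issubset step mentioned above might look roughly like the sketch below. It rebuilds the sample frame from the question; the helper name `drop_proper_subsets` is made up for illustration, and the pairwise comparison is quadratic in the number of rows, which is assumed to be acceptable for a few thousand rows.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "ids": [list("ABCDEF"), list("CDE"), list("CDEFG"),
            list("DEFGH"), list("EFGH"), list("KLMN")],
    "size": [100, 50, 200, 190, 100, 200],
})

def drop_proper_subsets(frame):
    """Hypothetical helper: drop rows whose ids are a proper subset of another row's ids."""
    sets = [frozenset(ids) for ids in frame["ids"]]
    keep = [
        # A row is dropped if some *different* id set fully contains it.
        not any(s != other and s.issubset(other) for other in sets)
        for s in sets
    ]
    return frame.loc[keep]

subset_free = drop_proper_subsets(df)
print(subset_free)
```

On the sample data this removes [C, D, E], and also [E, F, G, H], since the latter is contained in [D, E, F, G, H]. The overlapping groups that are not subsets of each other remain, which is the part still to be solved.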

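What I would like to do is keep only the unique id grouping with the largest size (in this case, C, D, E, F, G). The other entries share ids with that largest one, so I am not interested in them. The only entries that should survive are C, D, E, F, G and K, L, M, N. Is there a way to handle this in Pandas?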

1 Answer

I'm not sure exactly what you want, but you could filter on some minimum value of the size column:

    minimumVal = 195
    # Compare the numeric 'size' column; 'ids' holds lists and cannot be compared to a number
    df = df[df['size'] > minimumVal]
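A fixed threshold only works if the cut-off is known in advance. Another way to read the question is as a greedy selection: visit the rows from largest size to smallest and keep a row only if it shares no id with a row that was already kept. The sketch below follows that reading, which is an assumption about the intent rather than something the question confirms; the function name `keep_largest_disjoint` is made up for illustration.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "ids": [list("ABCDEF"), list("CDE"), list("CDEFG"),
            list("DEFGH"), list("EFGH"), list("KLMN")],
    "size": [100, 50, 200, 190, 100, 200],
})

def keep_largest_disjoint(frame):
    """Hypothetical helper: greedily keep the largest rows whose ids do not overlap."""
    kept_labels = []
    seen_ids = set()
    # Stable sort, so rows with equal size keep their original relative order.
    ordered = frame.sort_values("size", ascending=False, kind="mergesort")
    for label, ids in ordered["ids"].items():
        ids = set(ids)
        # Assumption: any shared id with an already-kept (larger) group disqualifies the row.
        if seen_ids.isdisjoint(ids):
            kept_labels.append(label)
            seen_ids |= ids
    # Return the survivors in their original row order.
    return frame.loc[sorted(kept_labels)]

result = keep_largest_disjoint(df)
print(result)
```

On the sample data this leaves only the [C, D, E, F, G] and [K, L, M, N] rows. Note that a row overlapping only with a discarded row would still be kept under this rule; if that is not wanted, the ids of discarded rows would need to be added to `seen_ids` as well.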


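Notice: technical posts on this site follow the CC BY-SA 4.0 license. If you reproduce this content, please cite this site's URL or the address of the original post. For any questions, contact: yoyou2525@163.com.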
