How to check whether a list of columns is absent from a pandas DataFrame

Question

I have a DataFrame (df) and a list (l) of column names. df:

Col_A   Col_B   Col_D   Col_G
AA      12      Q       no
BB      23      W       yes
WW      44              yes

l = ['Col_A', 'Col_B', 'Col_C', 'Col_D', 'Col_E', 'Col_F', 'Col_G']

I would like to print the column names in l that are not present in df.

Desired output:

['Col_C', 'Col_E', 'Col_F']

What I have tried so far:

if l not in df.columns:
    print(l)

I get the error TypeError: unhashable type: 'list'

Answer 1

You can use a list comprehension for this:

[i for i in l if i not in df.columns]

This iterates over every element i of l and adds it to a new list if it is not among the columns of df. Output:

['Col_C', 'Col_E', 'Col_F']
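For anyone who wants to run this end to end, here is a minimal self-contained sketch. It rebuilds the sample frame from the question; representing the blank Col_D cell as None is an assumption, and the variable name missing is only illustrative.

```python
import pandas as pd

# Sample data reconstructed from the question.
# Assumption: the blank Col_D cell in the third row is a missing value.
df = pd.DataFrame({
    "Col_A": ["AA", "BB", "WW"],
    "Col_B": [12, 23, 44],
    "Col_D": ["Q", "W", None],
    "Col_G": ["no", "yes", "yes"],
})
l = ["Col_A", "Col_B", "Col_C", "Col_D", "Col_E", "Col_F", "Col_G"]

# Keep each name from l that is not a column label of df.
# The result preserves the order in which names appear in l.
missing = [i for i in l if i not in df.columns]
print(missing)  # ['Col_C', 'Col_E', 'Col_F']
```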

Answer 2

Use numpy.setdiff1d:

L = np.setdiff1d(l, df.columns).tolist()

Or Index.difference:

L = pd.Index(l).difference(df.columns).tolist()

Or a list comprehension with not in:

L = [x for x in l if x not in df.columns]

print(L)
['Col_C', 'Col_E', 'Col_F']
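One difference worth knowing: np.setdiff1d returns sorted unique values, and Index.difference sorts by default too, while the list comprehension keeps the order of l. The sketch below uses a deliberately unsorted list (called shuffled here, a made-up name) to show this, along with the sort=False option of Index.difference.

```python
import numpy as np
import pandas as pd

# An empty frame with the question's column labels is enough for this check.
df = pd.DataFrame(columns=["Col_A", "Col_B", "Col_D", "Col_G"])

# Hypothetical, deliberately unsorted input to make ordering visible.
shuffled = ["Col_F", "Col_A", "Col_C", "Col_E"]

# setdiff1d returns the sorted unique values of the first argument
# that are not in the second.
via_numpy = np.setdiff1d(shuffled, df.columns).tolist()

# Index.difference sorts its result by default (sort=None).
via_index = pd.Index(shuffled).difference(df.columns).tolist()

# With sort=False the result is left in the order of the calling Index.
via_index_unsorted = pd.Index(shuffled).difference(df.columns, sort=False).tolist()

# The list comprehension also keeps the input order.
via_listcomp = [x for x in shuffled if x not in df.columns]

print(via_numpy)           # ['Col_C', 'Col_E', 'Col_F']
print(via_index)           # ['Col_C', 'Col_E', 'Col_F']
print(via_index_unsorted)  # ['Col_F', 'Col_C', 'Col_E']
print(via_listcomp)        # ['Col_F', 'Col_C', 'Col_E']
```

If the order of l matters, or l may contain duplicates that should be kept, the list comprehension is probably the safer choice.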

Answer 3

You have to loop over the list l.

For example:

for item in l:
  if item not in df.columns:
    print(item)
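As a sketch of why the loop is needed: the in operator on df.columns tests a single label, and that label has to be hashable, which a list is not. The snippet below reproduces the error from the question and then collects the missing names one at a time; error_message and missing are illustrative names only.

```python
import pandas as pd

# An empty frame with the question's column labels is enough for this check.
df = pd.DataFrame(columns=["Col_A", "Col_B", "Col_D", "Col_G"])
l = ["Col_A", "Col_B", "Col_C", "Col_D", "Col_E", "Col_F", "Col_G"]

# Testing the whole list at once fails, because membership in an Index
# is checked for one hashable label at a time.
error_message = None
try:
    l in df.columns
except TypeError as exc:
    error_message = str(exc)
print(error_message)

# Testing each name separately works.
missing = []
for item in l:
    if item not in df.columns:
        missing.append(item)
print(missing)  # ['Col_C', 'Col_E', 'Col_F']
```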

Answer 4

You can use a set difference:

list(set(l).difference(df.columns))

or

list(set(l) - set(df.columns))
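Note that Python sets are unordered, so the list produced this way can come back in any order. A minimal sketch of two ways to get a predictable order, assuming the column labels from the question (existing and the missing_* names are made up for illustration):

```python
import pandas as pd

# An empty frame with the question's column labels is enough for this check.
df = pd.DataFrame(columns=["Col_A", "Col_B", "Col_D", "Col_G"])
l = ["Col_A", "Col_B", "Col_C", "Col_D", "Col_E", "Col_F", "Col_G"]

# The raw set difference: correct members, but no guaranteed order.
missing_set = set(l) - set(df.columns)

# Option 1: sort the result for a deterministic (alphabetical) order.
missing_sorted = sorted(missing_set)

# Option 2: keep the order of l, using a set only for fast membership tests.
existing = set(df.columns)
missing_in_order = [name for name in l if name not in existing]

print(missing_sorted)    # ['Col_C', 'Col_E', 'Col_F']
print(missing_in_order)  # ['Col_C', 'Col_E', 'Col_F']
```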

Answer 1
3 2021-11-23 09:37:07

Answer 2
0 accepted 2021-11-23 09:34:33

Answer 3
0 2021-11-23 09:44:26

Answer 4
0 2021-11-23 15:04:14
