[英]remove a row from a dataframe if any row value is in another dataframe , with dataframes having multiple columns
I hava two data frames :我有两个数据框:
df1 = {0:[1,2,3,4,5,6,7,11],1:[100,20,7]}
df2 = {0:[100,4,6,7],1:[1,3,4,7]}
i have to remove rows from df1 that occurs in any row of df2我必须从 df1 中删除出现在 df2 的任何行中的行
result dataframe结果数据框
df3 = [2,5,11,20]
You can flatten values by np.ravel
and get difference by np.setdiff1d
:您可以通过扁平化值np.ravel
并获得通过差异np.setdiff1d
:
df1 = pd.DataFrame({0:[1,2,3,4,5,6,7,11],1:[100,20,7,1,2,3,4,5]})
df2 = pd.DataFrame({0:[100,4,6,7],1:[1,3,4,7]})
L = np.setdiff1d(np.ravel(df1), np.ravel(df2)).tolist()
print (L)
[2, 5, 11, 20]
Or difference of sets:或集的差异:
L = list(set(df1.stack()) - set(df2.stack()))
print (L)
[2, 11, 20, 5]
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.