[英]How to replace values in a dataFrame by checking for a condition?
我有以下數據框:
A | Date1 | Date2
10 | 2/2/2016 | 3/2/2016
11 | 1/5/2016 | 1/5/2016
12 | 2/3/2016 | 2/3/2016
13 | 1/5/2016 | 3/2/2013
如果Date1中的值等於Date2 ,我想將A列中的值設為0 。 最終結果:
A | Date1 | Date2
10 | 2/2/2016 | 3/1/2016
0 | 1/5/2016 | 1/5/2016
0 | 2/3/2016 | 2/3/2016
13 | 1/5/2016 | 3/2/2013
我想這樣做而不編寫for循環。 我可以使用申請嗎?
您可以重新創建我的df:
df = pd.DataFrame([[10, "2/2/2016", "3/2/2016" ] , [11, "1/5/2016", "1/5/2016"] , [12 , "2/3/2016" , "2/3/2016" ] , [13, "1/5/2016", "3/2/2013"]])
df.columns = ['A','B','C']
使用mask
:
import pandas as pd
df = pd.DataFrame([[10, "2/2/2016", "3/2/2016" ] ,
[11, "1/5/2016", "1/5/2016"] ,
[12 , "2/3/2016" , "2/3/2016" ] ,
[13, "1/5/2016", "3/2/2013"]])
df.columns = ['A','B','C']
print (df)
A B C
0 10 2/2/2016 3/2/2016
1 11 1/5/2016 1/5/2016
2 12 2/3/2016 2/3/2016
3 13 1/5/2016 3/2/2013
df['A'] = df.mask(df.B == df.C, 0)
print (df)
A B C
0 10 2/2/2016 3/2/2016
1 0 1/5/2016 1/5/2016
2 0 2/3/2016 2/3/2016
3 13 1/5/2016 3/2/2013
解where
:
df['A'] = df.where(df.B != df.C, 0)
print (df)
A B C
0 10 2/2/2016 3/2/2016
1 0 1/5/2016 1/5/2016
2 0 2/3/2016 2/3/2016
3 13 1/5/2016 3/2/2013
使用jezrael的設置
import pandas as pd
df = pd.DataFrame([[10, "2/2/2016", "3/2/2016" ] ,
[11, "1/5/2016", "1/5/2016"] ,
[12 , "2/3/2016" , "2/3/2016" ] ,
[13, "1/5/2016", "3/2/2013"]])
df.columns = ['A','B','C']
loc
df.loc[df.B == df.C, 'A'] = 0
print df
A B C
0 10 2/2/2016 3/2/2016
1 0 1/5/2016 1/5/2016
2 0 2/3/2016 2/3/2016
3 13 1/5/2016 3/2/2013
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.