通過遍歷熊貓中的連續行來創建新列

Question

我有一個數據框：

import pandas as pd
import numpy as np
df = pd.DataFrame({'a':[1,20,3,4,50,6],
               'b':[12,43,78,23,14,28],
               'c': [100,200,300,400,500,600]})`

我想遍歷連續的行，這樣，

如果下一行的'a'值-當前行的'a'值小於10 ，

然后檢查下一行的'c'值-當前行的'b' 小於400

return 0

else return Nan.

我想使用.apply編寫代碼。

def query(row,df):
    try:
        i = row.name
        curr = df.iloc[i]
        curr_a = curr['a']
        next = df.iloc[i+1]
       next_a = next['a']
        if (next_a-curr_a) < 10:
            print(next_a,curr_a)
            curr_b = curr['b']
            next_c = next['c']
            print(next_c,curr_b)
           if (next_c - curr_b) < 400:
                return 0
        else:
            diff = np.nan
        return diff
    except:
        pass

df['new_col'] = df.apply(lambda x: query(x,df),axis=1)

基本上，我獲取當前行的索引，即i ，並將其傳遞給一個函數，在其中使用df.iloc[i]定位當前行，然后使用df.iloc[i+1]定位下一行，然后檢查條件。 但我認為這不是最好的方法。

有一個更好的方法嗎？ 可能使用.shift或任何.shift方式？ 任何線索都將有所幫助。

Answer 1

在shift使用np.where

np.where(((df.a.shift(-1)-df.a)<10)&((df.c.shift(-1)-df.b)<400),0,np.NaN)
Out[85]: array([nan,  0.,  0., nan, nan, nan])

通過遍歷熊貓中的連續行來創建新列

問題描述

1 個解決方案

解決方案1
2 已采納 2018-08-16 15:21:38

通過遍歷熊貓中的連續行來創建新列

問題描述

1 個解決方案

解決方案1 2 已采納 2018-08-16 15:21:38

解決方案1
2 已采納 2018-08-16 15:21:38