熊貓：最后一個非相等行的索引

Question

我有一個帶有排序索引I的熊貓數據框F 我有興趣知道其中一列的最新變化，比如說A 特別是，我想構造一個具有與F相同的索引的序列，即I ，其在i值為j ，其中j是小於i的最大索引值，從而F[A][j] != F[A][i] 。 例如，考慮以下框架：

所需的序列為：

1 NaN
2 NaN
3   2
4   3
5   3

有沒有熊貓/ numpy慣用的方式來構造這個系列？

Answer 1

嘗試這個：

df['B'] = np.nan
last = np.nan
for index, row in df.iterrows():
    if index == 0:
        continue
    if df['A'].iloc[index] != df['A'].iloc[index - 1]:
        last = index
    df['B'].iloc[index] = last

這將使用結果創建一個新列。 我認為在行中更改行並不是一個好主意，在那之后，您可以簡單地替換一列並刪除另一行。

Answer 2

布爾數據上的np.argmax或pd.Series.argmax可以幫助您找到第一個（在本例中為最后一個） True值。 不過，您仍然必須在此解決方案中循環討論該系列。

# Initiate source data
F = pd.DataFrame({'A':[5,5,6,2,2]}, index=list('fobni'))

# Initiate resulting Series to NaN
result = pd.Series(np.nan, index=F.index)

for i in range(1, len(F)):
    value_at_i = F['A'].iloc[i]
    values_before_i = F['A'].iloc[:i]
    # Get differences as a Boolean Series
    # (keeping the original index)
    diffs = (values_before_i != value_at_i)
    if diffs.sum() == 0:
        continue
    # Reverse the Series of differences,
    # then find the index of the first True value
    j = diffs[::-1].argmax()
    result.iloc[i] = j

熊貓：最后一個非相等行的索引

問題描述

2 個解決方案

解決方案1
0 2015-10-22 02:04:54

解決方案2
0 2016-03-30 16:58:07

熊貓：最后一個非相等行的索引

問題描述

2 個解決方案

解決方案1 0 2015-10-22 02:04:54

解決方案2 0 2016-03-30 16:58:07

解決方案1
0 2015-10-22 02:04:54

解決方案2
0 2016-03-30 16:58:07