簡體   English   中英

Python pandas:按多列排序並添加帶有訂單號的列

[英]Python pandas: sort by multiple columns and add a column with the order numbers

假設如下 dataframe:

ID          Date1           Date2       Col1      Col2       DateDiff 
22000118    2019-11-06      2019-11-15  0.562231  0.470641   9 days
22000118    2019-11-06      2019-12-20  0.375000  0.319872   44 days
30          2019-11-06      2019-11-15  0.916047  0.730626   9 days
30          2019-11-06      2019-12-20  0.519936  0.423861   44 days
22000118    2020-11-05      2025-12-19  0.316772  0.301951   1870 days
30          2020-11-05      2026-12-18  0.256964  0.234729   2234 days
30          2020-11-05      2027-12-17  0.250835  0.230236   2598 days
22000118    2019-11-06      2020-01-17  0.330995  0.287567   72 days
22000118    2020-11-05      2026-12-18  0.310234  0.296930   2234 days
22000118    2020-11-05      2027-12-17  0.305502  0.293349   2598 days
30          2019-11-06      2020-01-17  0.443920  0.366206   72 days
30          2020-11-05      2025-12-19  0.264916  0.240628   1870 days

我想按列ID, Date1DateDiff (其中Date1datetime64類型, DateDeltatimedelta64類型)對其進行排序,並添加一個帶有訂單號的列,如下例所示:

ID          Date1           Date2       Col1      Col2       DateDiff    Order
30          2019-11-06      2019-11-15  0.916047  0.730626   9 days      1
30          2019-11-06      2019-12-20  0.519936  0.423861   44 days     2
30          2019-11-06      2020-01-17  0.443920  0.366206   72 days     3
30          2020-11-05      2025-12-19  0.264916  0.240628   1870 days   1
30          2020-11-05      2026-12-18  0.256964  0.234729   2234 days   2
30          2020-11-05      2027-12-17  0.250835  0.230236   2598 days   3
22000118    2019-11-06      2019-11-15  0.562231  0.470641   9 days      1
22000118    2019-11-06      2019-12-20  0.375000  0.319872   44 days     2
22000118    2019-11-06      2020-01-17  0.330995  0.287567   72 days     3
22000118    2020-11-05      2025-12-19  0.316772  0.301951   1870 days   1
22000118    2020-11-05      2026-12-18  0.310234  0.296930   2234 days   2
22000118    2020-11-05      2027-12-17  0.305502  0.293349   2598 days   3

我已經知道如何使用sort_values對其進行排序,但我不知道如何在最后一列中添加訂單號。

sort_values排序后

df['Order'] = df.groupby(['ID','Date1']).cumcount()+1

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM