Python：在 dataframe 中获取唯一日期

Question

I have a data frame that looks like this:我有一个看起来像这样的数据框：

                       price
Date
2022-01-01 19:20:00    100   
2022-01-01 19:27:00    100
2022-01-02 19:31:00    102

I want the dataframe to only have unique dates:我希望 dataframe 只有唯一日期：

                       price
Date
2022-01-01 19:20:00    100   
2022-01-02 19:31:00    102

How can I achieve that?我怎样才能做到这一点？

Answer 1

You can sort the dataframe with:您可以使用以下命令对dataframe进行排序：

df = df.sort_values('Date')

And than leave only the rows with a new date with:而不是只留下带有新日期的行：

df = df[df['Date'].dt.date != df['Date'].shift().dt.date]

Answer 2

You can extract the date from the datetime column using df.Date.dt.date , put that into a new column using assign , and after that use drop_duplicates based on only that column.您可以使用df.Date.dt.date从 datetime 列中提取日期，使用assign将其放入新列，然后仅基于该列使用drop_duplicates 。 Last, you might want to drop the newly create column that has only the date information.最后，您可能希望删除仅包含日期信息的新创建列。 In code that reads在读取的代码中

df = (
    df.assign(new_date=lambda df:df.Date.dt.date)
   .drop_duplicates(subset=["new_date"])
   .drop(columns=["new_date"])
)

Answer 3

You can simply use duplicated :您可以简单地使用duplicated ：

# pre-requisite
df['Date'] = pd.to_datetime(df['Date'])

df[~df['Date'].dt.date.duplicated()]

Or if working with the index:或者如果使用索引：

df[~df.index.to_series().dt.date.duplicated().values]

Output: Output：

                 Date  price
0 2022-01-01 19:20:00    100
2 2022-01-02 19:31:00    102

Python：在 dataframe 中获取唯一日期

问题描述

2 个解决方案

解决方案1
0 2022-01-20 19:53:07

解决方案2
0 2022-01-20 20:05:43

解决方案3
0 2022-01-20 20:31:30

Python：在 dataframe 中获取唯一日期

问题描述

2 个解决方案

解决方案1 0 2022-01-20 19:53:07

解决方案2 0 2022-01-20 20:05:43

解决方案3 0 2022-01-20 20:31:30

解决方案1
0 2022-01-20 19:53:07

解决方案2
0 2022-01-20 20:05:43

解决方案3
0 2022-01-20 20:31:30