在 pandas dataframe 中为另一个日期框列中的每个日期添加一行

Question

I have a dataframe that contains an entry for a symbol occasionally and then a count.我有一个 dataframe 偶尔包含一个符号条目，然后是一个计数。 I would like to expand the dataframe so that every symbol contains a row for the entire daterange in the dataframe.我想扩展 dataframe 以便每个符号包含 dataframe 中整个日期范围的一行。 I want to enter a value of '0' for the count where there is no entry for a symbol on a certain date.我想为在某个日期没有符号条目的计数输入一个值“0”。

My dataframe:我的 dataframe：

dates = ['2021-01-01','2021-01-02','2021-01-03']
symbol = ['a','b','a']
count = [1,2,3]
df = pd.DataFrame({'Mention Datetime': dates,
                'Symbol': symbol,
                'Count':count})


    Mention Datetime    Symbol  Count
0   2021-01-01  a   1
1   2021-01-02  b   2
2   2021-01-03  a   3

what I want it to look like:我希望它看起来像什么：

Mention Datetime    Symbol  Count
0   2021-01-01  a   1
1   2021-01-02  a   0
2   2021-01-03  a   3
3   2021-01-01  b   0
4   2021-01-02  b   2
5   2021-01-03  b   0

Answer 1

Use pivot_table then stack :使用pivot_table然后stack ：

df = df.pivot_table(index='Mention Datetime',
                    columns='Symbol', fill_value=0
                    ).stack().reset_index()

Output: Output：

  Mention Datetime Symbol  Count
0       2021-01-01      a      1
1       2021-01-01      b      0
2       2021-01-02      a      0
3       2021-01-02      b      2
4       2021-01-03      a      3
5       2021-01-03      b      0

Answer 2

You can reindex with a new multi index created from the unique values of the columns in question.您可以使用从相关列的唯一值创建的新多索引重新索引。

import pandas as pd
from io import StringIO

s = '''
Mention Datetime    Symbol  Count
2021-01-01          a       1
2021-01-02          b       2
2021-01-03          a       3
'''

df = pd.read_fwf(StringIO(s), header=1)
df = df.set_index(['Mention Datetime', 'Symbol'])
df
                            Count
Mention Datetime    Symbol  
2021-01-01          a       1
2021-01-02          b       2
2021-01-03          a       3

df = df.reindex(
    pd.MultiIndex.from_product(
        [
        df.index.get_level_values('Mention Datetime').unique(), 
        df.index.get_level_values('Symbol').unique()
        ]
    ) 
).fillna(0)

df
                            Count
Mention Datetime    Symbol  
2021-01-01          a       1.0
                    b       0.0
2021-01-02          a       0.0
                    b       2.0
2021-01-03          a       3.0
                    b       0.0

在 pandas dataframe 中为另一个日期框列中的每个日期添加一行

问题描述

2 个解决方案

解决方案1
1 已采纳 2021-02-01 20:48:11

解决方案2
1 2021-02-01 20:53:36

在 pandas dataframe 中为另一个日期框列中的每个日期添加一行

问题描述

2 个解决方案

解决方案1 1 已采纳 2021-02-01 20:48:11

解决方案2 1 2021-02-01 20:53:36

解决方案1
1 已采纳 2021-02-01 20:48:11

解决方案2
1 2021-02-01 20:53:36