creating a dataframe from a dictionary of tuples in pandas

Question

I have this data structure :

word1 [('date', freq) , ('date',freq) , ...]
word2 [('date',freq) , ('date',freq) , ...]

and so on.for analyzing the time series , I want to create a dataframe. I can not figure out the best way to do it as I'm quite new to python(and I apologize for that). should I use:

classmethod DataFrame.from_dict(data, orient='index', dtype=None)

Answer 1

There's a lot of possible ways to start, but assuming a structure of words as

words
Out[203]: 
[[('2000-01-01', 1), ('2000-01-02', 5)],
 [('2000-01-01', 2), ('2000-01-02', 4)]]

the following is a natural starting point.

df = pd.DataFrame(index=range(0), columns=['date', 'word', 'freq'])
i = 0
for j, word in enumerate(words):
    for d, f in word:
        df.loc[i] = [d, j, f]
        i += 1

df.loc[i] will append new rows. If you know the total number of entries from the start, you could change index=range(0) to the correct value. Next steps would probably be

df.date = pd.to_datetime(df.date)
df.set_index(['date', 'word'], drop=True)
                freq
date       word     
2000-01-01 0       1
2000-01-02 0       5
2000-01-01 1       2
2000-01-02 1       4

creating a dataframe from a dictionary of tuples in pandas

Question

1 answers

solution1
0 ACCPTED 2014-07-28 15:09:21

creating a dataframe from a dictionary of tuples in pandas

Question

1 answers

solution1 0 ACCPTED 2014-07-28 15:09:21

solution1
0 ACCPTED 2014-07-28 15:09:21