How to de-normalize data with pandas dataframe

Question

I have a pandas dataframe created out of CSV file. The dataframe looks like this

srvr_name log_type       hour  
server1   impressionWin  18:00:00 
server1   transactionWin 18:00:00 
server2   impressionWin  18:00:00 
server2   transactionWin 18:00:00

What I would like to get from this is:

srvr_name impressionWin transactionWin hour
server1   true          true           18:00:00
server2   true          true           18:00:00

What is the best way to achieve this in pandas?

Answer 1

Using join with get_dummies

df.join(pd.get_dummies(df.log_type)).groupby(['srvr_name', 'hour']).sum().astype(bool)

                    impressionWin  transactionWin
srvr_name hour
server1   18:00:00           True            True
server2   18:00:00           True            True

Answer 2

You can use this:

df = pd.crosstab([df.srvr_name, df.hour], df.log_type).astype(bool).rename_axis(None, 1).reset_index()

Output:

  srvr_name      hour  impressionWin  transactionWin
0   server1  18:00:00           True            True
1   server2  18:00:00           True            True

How to de-normalize data with pandas dataframe

Question

2 answers

solution1
2 ACCPTED 2018-06-14 19:34:37

solution2
1 2018-06-14 19:43:10

How to de-normalize data with pandas dataframe

Question

2 answers

solution1 2 ACCPTED 2018-06-14 19:34:37

solution2 1 2018-06-14 19:43:10

solution1
2 ACCPTED 2018-06-14 19:34:37

solution2
1 2018-06-14 19:43:10