Fill up a column in a loop

Question

I have a dataset like this:

import pandas as pd
df = pd.DataFrame([[0, 0], [2,2] ], columns=('feature1', 'feature2'))

Now I would like to add an extra column

df['c'] = ""

And then loop through the DataFrame to fill column c with the contents of both feature1 and feature2:

for index, row in df.iterrows():
    subject = row["feature1"]
    content = row["feature2"]
    row["C"] = subject, content

However, when I print the DataFrame afterwards, something seems to have gone wrong, because column c is still empty.

Answer 1

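If you want to build a tuple out of two columns, be explicit and keep it simple. Select the two columns first, because apply(tuple, axis=1) uses every column in the row, including a 'c' column that already exists:

df['c'] = df[['feature1', 'feature2']].apply(tuple, axis=1)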

df
Out[7]: 
   feature1  feature2       c
0         0         0  (0, 0)
1         2         2  (2, 2)

Answer 2

EdChum has you covered in the comments on how to fix your approach: you should be using .loc for indexing. However, you can achieve the same result much more simply, without resorting to row iteration, by using zip.

In[43]: df['c'] = list(zip(df.feature1, df.feature2))
In[44]: df
Out[44]: 
   feature1  feature2       c
0         0         0  (0, 0)
1         2         2  (2, 2)
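If the row loop has to stay (for example because each row needs extra logic), one option is to collect the tuples in a plain list and assign the whole column once after the loop. This is only a sketch of that idea, not the .loc fix from the comments; the name `pairs` is made up for illustration.

```python
import pandas as pd

df = pd.DataFrame([[0, 0], [2, 2]], columns=["feature1", "feature2"])

# Hypothetical helper list: one tuple per row, in row order.
pairs = []
for index, row in df.iterrows():
    subject = row["feature1"]
    content = row["feature2"]
    pairs.append((subject, content))

# Assign the finished list to the DataFrame itself, not to `row`.
df["c"] = pairs
print(df)
```

The list must have exactly one entry per row, which the loop guarantees here. For this particular case the zip one-liner above does the same work without iterating in Python.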

Answer 3

df.assign(c=df.set_index(['feature1', 'feature2']).index.to_series().values)

Answer 4

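You never updated the original column; you only updated a temporary variable named row. For code that is easy to remember (though obviously not the most efficient), use zip. On Python 3, wrap it in list(), because zip returns an iterator there:

df['C'] = list(zip(df.feature1, df.feature2))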
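To see why the loop in the question has no effect, the following sketch writes to `row` inside `iterrows()` and then inspects the DataFrame. It uses a plain string instead of a tuple to keep the demonstration minimal; the variable name `columns_after_loop` is just for illustration.

```python
import pandas as pd

df = pd.DataFrame([[0, 0], [2, 2]], columns=["feature1", "feature2"])
df["c"] = ""

for index, row in df.iterrows():
    # `row` is a separate Series built for this iteration, so this only
    # adds a "C" entry to that temporary object, never to `df`.
    row["C"] = "changed"

columns_after_loop = list(df.columns)
print(columns_after_loop)   # there is no "C" column
print(df["c"].tolist())     # column "c" still holds empty strings
```

Note also that the question writes to "C" while the column was created as "c"; labels are case-sensitive, so even a write that reached the DataFrame would have targeted a different column.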

4 answers

solution1
4 2017-01-31 14:03:54

solution2
2 2017-01-31 13:59:23

solution3
2 2017-01-31 14:14:51

solution4
0 2017-01-31 14:14:50
