I just started using pandas and I would like to reduce the amount of data I get by merging my DataFrames this way:
    def merge_df(in_df):
        alist = []
        for col in in_df.columns:
            if len(in_df[col].unique()) == 1:
                alist.append(col)
        return in_df[alist].T.squeeze()[1]
Is there a more elegant way to do this, e.g. without looping through all the columns?
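For comparison, the same constant-column selection can be written without a Python-level loop by using `DataFrame.nunique`, which counts distinct values per column. This is a sketch assuming a generic DataFrame, not the asker's actual data:

```python
import pandas as pd

# Sample DataFrame: columns "a" and "c" each hold a single repeated value.
df = pd.DataFrame({"a": [1, 1, 1], "b": [1, 2, 3], "c": ["x", "x", "x"]})

# Columns whose number of unique values is exactly 1.
constant_cols = df.columns[df.nunique() == 1]

# One representative value per constant column, as a Series.
result = df[constant_cols].iloc[0]
print(result)
```

Note that `nunique` excludes NaN by default; pass `dropna=False` if a column of all-NaN should also count as constant.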
Yeah, you can remove duplicate data with a simple pandas function: df.drop_duplicates()
You can refer to the documentation here.
To remove redundant data in particular columns, you can pass the column names via the "subset" parameter. It will drop the whole row for duplicate data.
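A minimal sketch of the `subset` behavior described above, with a hypothetical DataFrame:

```python
import pandas as pd

# Hypothetical DataFrame with a duplicated value in column "id".
df = pd.DataFrame({"id": [1, 1, 2], "val": ["a", "b", "c"]})

# Consider only "id" when detecting duplicates; by default the first
# occurrence is kept and the whole duplicate row is dropped.
deduped = df.drop_duplicates(subset=["id"])
print(deduped)
```

Note that the row `(1, "b")` is removed entirely, even though its "val" entry is unique.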