Pandas: Create dataframe from dataframe

Question

I have a function that iterates over the rows of a DataFrame, and then creates a new DataFrame with it. The number of rows of the created DataFrame is the same, but the number and name of colums is not, so I can't

def _mod_df(df: pd.DataFrame) -> pd.DataFrame:
    dff = pd.DataFrame(COL_NAMES)
    for _, row in df.iterrows():
        col1 = row.colname8 / row.colname3
        col2 = row.colname2 ** 2
        col3 = row.colname[:8]

        dff = dff.concat([dff, Series([col1, col2, col3])], ignore_index=True)

    return dff

Is there any way to optimize this?

Answer 1

Try:

def _mod_df(df: pd.DataFrame) -> pd.DataFrame:
    return pd.DataFrame([(df.colname8 / df.colname3).values,
                         (df.colname2 ** 2).values,
                         df.colname[:8].values],
                        columns=COL_NAMES)

Pandas: Create dataframe from dataframe

Question

1 answers

solution1
0 2022-02-02 09:51:52

Pandas: Create dataframe from dataframe

Question

1 answers

solution1 0 2022-02-02 09:51:52

solution1
0 2022-02-02 09:51:52