
I'm trying to sum multiple columns into a new sum column using a python pandas dataframe

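I'm trying to learn Python and have been trying to figure out how to create a sum column for my data. I want to sum all of the other columns. I create the new column, but all of the sum values are zero. The data can be found here. My code is below, thanks for your help: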

import pandas as pd
# Import the CSV file into the chinaimport_df dataframe
filename=r'C:\Users\Ing PC\Documents\Intro to Data Analysis\Final Project\CHINA_DOLLAR_IMPORTS.csv'
chinaimport_df = pd.read_csv(filename)

# Remove rows that contain only missing values; thresh=2 since the first column is text
chinaimport_df = chinaimport_df.dropna(how='all',axis=0, thresh=2) 

#Convert NANs to zeros
chinaimport_df=chinaimport_df.fillna(0)

#create a list of columns excluding the first column, to make sum func work later

col_list= list(chinaimport_df)
col_list.remove('Commodity')
print(col_list)

#adding column that sums 

chinaimport_df['Total'] = chinaimport_df[col_list].sum(axis=1)




chinaimport_df.to_csv("output.csv", index=False)

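IIUC, this should do it. The values appear to contain commas as thousands separators, so they are read in as strings; stripping the commas and converting to float before summing gives the totals.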

import pandas as pd

df = pd.read_csv('CHINA_DOLLAR_IMPORTS.csv')

df['Total'] = df.replace(r',',"", regex=True).iloc[:, 1:].astype(float).sum(axis=1)

df.to_csv('output.csv', index=False)
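
As a possible alternative, assuming the commas really are thousands separators, `pd.read_csv` accepts a `thousands` argument that parses such values as numbers at load time. The sketch below uses a small made-up sample in place of the real file; the year column names and all values are hypothetical, and only the `Commodity` column name comes from the question.

```python
import io
import pandas as pd

# Hypothetical sample standing in for CHINA_DOLLAR_IMPORTS.csv;
# the year columns and every value here are made up for illustration.
csv_text = '''Commodity,2015,2016
Widgets,"1,200","3,400"
Gadgets,"10,000",
'''

# Without thousands=',' the comma-formatted values are parsed as strings,
# not numbers, which is why a plain numeric sum does not work on them.
raw_df = pd.read_csv(io.StringIO(csv_text))
print(raw_df.dtypes)

# thousands=',' tells the parser to treat commas as digit-group separators.
df = pd.read_csv(io.StringIO(csv_text), thousands=',')

# Sum every column except 'Commodity'; missing values are skipped by default.
value_cols = df.columns.drop('Commodity')
df['Total'] = df[value_cols].sum(axis=1)
print(df)
```

Checking `df.dtypes` before summing is a quick way to confirm whether the columns were read as numbers or as text.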
