簡體   English   中英

在熊貓數據框中,如何在迭代的每一行的末尾添加一個值?

[英]In a pandas dataframe, how do you add a value to the end of each row in iterrows?

假設我有一個包含產品,當前價格和平均價格的簡單數據框。 我想使用當前價格和平均價格來計算零售價。 我的嘗試是:

import csv
from pandas import Series, DataFrame
import pandas as pd
import os

frame = pd.read_csv('/ ... /data.csv')

for index, row in frame.iterrows():    
  product = row['product']
  price = row['price']    
  ave_price = row['ave_price']
  weight_price = 2.0
  max_price = ave_price * weight_price

  retail_price = max_price / (1.0 + 2.0 * price / ave+price)
  retail_total = rs_price * 1.0875

frame.to_csv('/Users ... /output.csv', encoding ='utf-8')

如何獲得Retail_total並將其添加到可以打印產品,當前價格,平均價格和零售價格的整個數據框的方式?

當我嘗試這樣做時,它只是將所有產品的零售價格填充為產品列表中的最后一個:

在框架中添加一列:

frame['retail_price'] = Series(np.zeros(len(frame)), index=frame.index)

然后將每行的值存儲在for循環內

for index, row in frame.iterrows():    
  product = row['product']
  price = row['price']    
  ave_price = row['ave_price']
  weight_price = 2.0
  max_price = ave_price * weight_price

  retail_price = max_price / (1.0 + 2.0 * price / ave_price)
  retail_total = retail_price * 1.0875

  row['retail_price'] = retail_price
import pandas as pd
import numpy as np

# simulate some artificial data
# ===========================================
np.random.seed(0)
product = np.random.choice(list('ABCDE'), size=10)
price = np.random.randint(100, 200, size=10)
avg_price = np.random.randint(100, 200, size=10)
df = pd.DataFrame(dict(product=product, price=price, avg_price=avg_price))
df

   avg_price  price product
0        125    188       E
1        177    112       A
2        172    158       D
3        109    165       D
4        120    139       D
5        180    187       B
6        169    146       D
7        179    188       C
8        147    181       E
9        164    137       A


# processing
# ===========================================
# some constant parameters
weight_price = 2.0

df['retail_price'] = df['avg_price'] * weight_price / (1.0 + 2.0 * df['price'] / df['avg_price'])

df['retail_total'] = df['retail_price'] * 1.0875

df

   avg_price  price product  retail_price  retail_total
0        125    188       E       62.3752       67.8331
1        177    112       A      156.2544      169.9266
2        172    158       D      121.2459      131.8549
3        109    165       D       54.1276       58.8637
4        120    139       D       72.3618       78.6935
5        180    187       B      116.9675      127.2022
6        169    146       D      123.9089      134.7509
7        179    188       C      115.4631      125.5661
8        147    181       E       84.9077       92.3371
9        164    137       A      122.8128      133.5589

# df.to_csv()

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM