How to iterate over a pandas DataFrame

Question

I am trying to iterate over a pandas DataFrame. The list L contains the values that specify which row X or Y should start from, i.e. (1:, 2:, 3:).

list = [1,2,3]

for L in list:
    X = data.ix[L:, 'X':]
    Y = data.ix[L:, 'Y']     
    regressor = LinearRegression()
    regressor.fit(X, Y)
    prediction = regressor.predict([[Variable]])

The error I get when trying the above is:

TypeError: 'type' object is not iterable

Answer 1

IIUC, you can do the following:

l = [1,2,3]
results = []

for idx in l:
    # .ix was removed in pandas 1.0: slice rows by position, then columns by label
    X = data.iloc[idx:].loc[:, 'X':]
    Y = data.iloc[idx:]['Y']
    regressor = LinearRegression()
    regressor.fit(X, Y)
    results.append(regressor.predict([[Variable]]))
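As for the error message itself: this is what Python raises when a `for` loop is handed a class rather than an instance, for example the built-in `list` type when no value has been assigned to that name. Whether that is what happened in the question is only a guess, since the code as posted does assign `list = [1,2,3]`. The sketch below just reproduces the message and shows that a distinct variable name (here the hypothetical `start_rows`) avoids the problem.

```python
# Iterating over the built-in type `list` itself (not a list instance)
# reproduces the message from the question.
try:
    for item in list:   # `list` is the type here; nothing was assigned to it
        pass
except TypeError as exc:
    message = str(exc)

print(message)  # 'type' object is not iterable

# With a distinct name bound to an actual list, the loop works.
start_rows = [1, 2, 3]
visited = [start for start in start_rows]
print(visited)  # [1, 2, 3]
```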

I don't know what Variable is here, though. You could also do the following:

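for idx in l:
    # iterating a DataFrame directly yields column labels, so slice the rows explicitly
    df = data.iloc[idx:]
    regressor = LinearRegression()
    regressor.fit(df[['X']], df['Y'])    # fit() expects 2-D features
    results.append(regressor.predict([[Variable]]))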
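The loop can be exercised end to end on a small made-up frame. Note that `.ix`, used in the question, was deprecated in pandas 0.20 and removed in 1.0, so this sketch slices rows with `.iloc` instead. The data, the single feature column `X`, and the value `new_x` standing in for `Variable` are all assumptions for illustration, and scikit-learn is assumed to be installed.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical data: Y is exactly 2*X + 1, so every fit recovers the same line.
data = pd.DataFrame({"X": [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]})
data["Y"] = 2 * data["X"] + 1

start_rows = [1, 2, 3]   # first row position to include in each fit
new_x = 10.0             # stand-in for `Variable` in the original code
results = []

for start in start_rows:
    subset = data.iloc[start:]       # rows from position `start` to the end
    X = subset[["X"]]                # 2-D feature frame, as fit() expects
    y = subset["Y"]                  # 1-D target
    regressor = LinearRegression()
    regressor.fit(X, y)
    # Predict with a DataFrame so the feature name matches the training data.
    prediction = regressor.predict(pd.DataFrame({"X": [new_x]}))
    results.append(float(prediction[0]))

print(results)
```

Because the made-up data lie exactly on one line, each of the three fits should predict approximately 21 for `new_x = 10`, whichever starting row is used.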

Answer 2

You should try iterrows [http://pandas-docs.github.io/pandas-docs-travis/basics.html#iterrows]:

>>> df = pd.DataFrame([[1, 1.5]], columns=['int', 'float'])
>>> row = next(df.iterrows())[1]
>>> row
int      1.0
float    1.5
Name: 0, dtype: float64
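To tie this back to a loop: `iterrows()` yields `(index, Series)` pairs, so it can be used directly in a `for` statement. As the output above shows, each row comes back as a single Series, which is why the integer `1` became `1.0`. `itertuples()` is another built-in option that does not funnel each row through one Series. The frame below is made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({"int": [1, 2], "float": [1.5, 2.5]})

# iterrows(): each row is a Series, so mixed int/float columns are upcast to float.
row_sums = []
for index, row in df.iterrows():
    row_sums.append(float(row["int"] + row["float"]))

# itertuples(): each row is a namedtuple; by default its first field is the index.
# index=False drops it, leaving just the column values.
tuples = [tuple(t) for t in df.itertuples(index=False)]

print(row_sums)  # [2.5, 4.5]
print(tuples)
```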

Answer 3

You can also convert it to a list or dict first and then iterate over that as you normally would:

# DataFrame has no to_list(); .values.tolist() gives one list per row
list_from_df = df.values.tolist()
for item in list_from_df:
   print(item)

Or as a dict:

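# orient='index' keys the dict by index label; the default keys it by column
dict_from_df = df.to_dict('index')
for key, value in dict_from_df.items():
   print(key) # index
   print(value) # {column: value} for that row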
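Another shape that is often convenient for looping is one dict per row, which `to_dict('records')` produces. A small sketch with a made-up frame (the column names and values are assumptions, not taken from the question):

```python
import pandas as pd

df = pd.DataFrame({"X": [1, 2], "Y": [3.0, 4.0]}, index=["a", "b"])

# One dict per row, keyed by column name; the index is not included.
records = df.to_dict("records")

products = []
for record in records:
    products.append(record["X"] * record["Y"])

print(records)
print(products)
```

For large frames, converting to Python lists or dicts copies the data, so this is best kept for small inputs or for code that genuinely needs plain Python objects.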

Answer 1
1 2016-01-19 14:22:02

Answer 2
0 2016-01-19 13:59:12

Answer 3
0 2019-04-14 17:10:34
