簡體   English   中英

將 statsmodels 回歸匯總表導出為 csv

[英]Export summary table of statsmodels regressions as csv

假設我有三個要並排比較的 statsmodels OLS對象。 我可以使用summary_col創建一個匯總表,我可以將其打印為文本或導出到乳膠。

如何將此表導出為 csv?

這是我想要做的一個可復制的例子:

# Libraries
import pandas as pd
import statsmodels.api as sm
from statsmodels.iolib.summary2 import summary_col

# Load silly data and add constant
df = sm.datasets.stackloss.load_pandas().data
df['CONSTANT'] = 1

# Train three silly models
m0 = sm.OLS(df['STACKLOSS'], df[['CONSTANT','AIRFLOW']]).fit()
m1 = sm.OLS(df['STACKLOSS'], df[['CONSTANT','AIRFLOW','WATERTEMP']]).fit()
m2 = sm.OLS(df['STACKLOSS'], df[['CONSTANT','AIRFLOW','WATERTEMP','ACIDCONC']]).fit()

# Results table
res = summary_col([m0,m1,m2], regressor_order=m2.params.index.tolist())
print(res)

    ================================================
              STACKLOSS I STACKLOSS II STACKLOSS III
    ------------------------------------------------
    CONSTANT  -44.1320    -50.3588     -39.9197     
              (6.1059)    (5.1383)     (11.8960)    
    AIRFLOW   1.0203      0.6712       0.7156       
              (0.1000)    (0.1267)     (0.1349)     
    WATERTEMP             1.2954       1.2953       
                          (0.3675)     (0.3680)     
    ACIDCONC                           -0.1521      
                                       (0.1563)     
    ================================================
    Standard errors in parentheses.

有沒有辦法將res導出到 csv?

結果存儲為數據框列表:

res.tables
[               STACKLOSS I STACKLOSS II STACKLOSS III
 CONSTANT          -44.1320     -50.3588      -39.9197
                   (6.1059)     (5.1383)     (11.8960)
 AIRFLOW             1.0203       0.6712        0.7156
                   (0.1000)     (0.1267)      (0.1349)
 WATERTEMP                        1.2954        1.2953
                                (0.3675)      (0.3680)
 ACIDCONC                                      -0.1521
                                              (0.1563)
 R-squared           0.8458       0.9088        0.9136
 R-squared Adj.      0.8377       0.8986        0.8983]

這應該有效:

res.tables[0].to_csv("test.csv")

pd.read_csv("test.csv")

       Unnamed: 0 STACKLOSS I STACKLOSS II STACKLOSS III
0        CONSTANT    -44.1320     -50.3588      -39.9197
1             NaN    (6.1059)     (5.1383)     (11.8960)
2         AIRFLOW      1.0203       0.6712        0.7156
3             NaN    (0.1000)     (0.1267)      (0.1349)
4       WATERTEMP         NaN       1.2954        1.2953
5             NaN         NaN     (0.3675)      (0.3680)
6        ACIDCONC         NaN          NaN       -0.1521
7             NaN         NaN          NaN      (0.1563)
8       R-squared      0.8458       0.9088        0.9136
9  R-squared Adj.      0.8377       0.8986        0.8983

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM