簡體   English   中英

python (pandas) 中的 Dataframe 有問題

[英]have an issue with Dataframe in python(pandas)

我對 dataframe 有疑問,我需要幫助

圖1和代碼

代碼:

import pandas as pd
from pandas import Series, DataFrame

# example - Revenue of companies

revenue_df = pd.read_clipboard()
print revenue_df

錯誤信息和圖片 2

Traceback (most recent call last):
  File "C:/Users/Techie-Jife/PycharmProjects/Data Science/DataFrame.py", line 6, in <module>
    revenue_df = pd.read_clipboard()
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\clipboards.py", line 85, in read_clipboard
    return read_csv(StringIO(text), sep=sep, **kwargs)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 702, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 435, in _read
    data = parser.read(nrows)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 1139, in read
    ret = self._engine.read(nrows)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 2415, in read
    alldata = self._rows_to_cols(content)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 3066, in _rows_to_cols
    self._alert_malformed(msg, row_num + 1)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 2835, in _alert_malformed
    raise ParserError(msg)
pandas.errors.ParserError: Expected 4 fields in line 4, saw 5. Error could possibly be due to quotes being ignored when a multi-char delimiter is used.

Process finished with exit code 1

3 是我從維基百科獲得的數據

用你最新的錯誤編輯了這個

import pandas as pd
from io import StringIO

d = '''50   Lukoil  Oil and gas $119,145    Increase $9,863 102,500 Russia Russia   [54]
49  Fannie Mae  Financials  $120,101    Increase $15,959    7,400   United States United States [53]
48  General Electric    Conglomerate    $120,268    Decrease -$22,355   283,000 United States United States [52]
47  Kroger  Retail  $121,162    Increase $3,110 453,000 United States United States [51]'''

df = pd.read_csv(StringIO(d), sep='\t', error_bad_lines=False, header=None, lineterminator="\n")

df
0   1   2   3   4   5   6   7
0   50  Lukoil  Oil and gas $119,145    Increase $9,863 102,500 Russia Russia   [54]
1   49  Fannie Mae  Financials  $120,101    Increase $15,959    7,400   United States United States [53]
2   48  General Electric    Conglomerate    $120,268    Decrease -$22,355   283,000 United States United States [52]
3   47  Kroger  Retail  $121,162    Increase $3,110 453,000 United States United States [51]

在此處輸入圖像描述

Edit1:如果您不將 select 和 header 放到剪貼板,您就不會遇到問題。

import pandas as pd
from pandas import Series, DataFrame

# example - Revenue of companies

revenue_df = pd.read_clipboard(sep='\t', header=None)
print (revenue_df)

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM