简体   繁体   English

python (pandas) 中的 Dataframe 有问题

[英]have an issue with Dataframe in python(pandas)

i have issues with dataframe, please i need help我对 dataframe 有疑问,我需要帮助

图1和代码

Code:代码:

import pandas as pd
from pandas import Series, DataFrame

# example - Revenue of companies

revenue_df = pd.read_clipboard()
print revenue_df

错误信息和图片 2

Traceback (most recent call last):
  File "C:/Users/Techie-Jife/PycharmProjects/Data Science/DataFrame.py", line 6, in <module>
    revenue_df = pd.read_clipboard()
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\clipboards.py", line 85, in read_clipboard
    return read_csv(StringIO(text), sep=sep, **kwargs)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 702, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 435, in _read
    data = parser.read(nrows)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 1139, in read
    ret = self._engine.read(nrows)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 2415, in read
    alldata = self._rows_to_cols(content)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 3066, in _rows_to_cols
    self._alert_malformed(msg, row_num + 1)
  File "C:\Users\Techie-Jife\venv\lib\site-packages\pandas\io\parsers.py", line 2835, in _alert_malformed
    raise ParserError(msg)
pandas.errors.ParserError: Expected 4 fields in line 4, saw 5. Error could possibly be due to quotes being ignored when a multi-char delimiter is used.

Process finished with exit code 1

3 是我从维基百科获得的数据

edited this by your latest error用你最新的错误编辑了这个

import pandas as pd
from io import StringIO

d = '''50   Lukoil  Oil and gas $119,145    Increase $9,863 102,500 Russia Russia   [54]
49  Fannie Mae  Financials  $120,101    Increase $15,959    7,400   United States United States [53]
48  General Electric    Conglomerate    $120,268    Decrease -$22,355   283,000 United States United States [52]
47  Kroger  Retail  $121,162    Increase $3,110 453,000 United States United States [51]'''

df = pd.read_csv(StringIO(d), sep='\t', error_bad_lines=False, header=None, lineterminator="\n")

df
0   1   2   3   4   5   6   7
0   50  Lukoil  Oil and gas $119,145    Increase $9,863 102,500 Russia Russia   [54]
1   49  Fannie Mae  Financials  $120,101    Increase $15,959    7,400   United States United States [53]
2   48  General Electric    Conglomerate    $120,268    Decrease -$22,355   283,000 United States United States [52]
3   47  Kroger  Retail  $121,162    Increase $3,110 453,000 United States United States [51]

在此处输入图像描述

Edit1: you dont face a problem if you dont select the header to clipboard. Edit1:如果您不将 select 和 header 放到剪贴板,您就不会遇到问题。

import pandas as pd
from pandas import Series, DataFrame

# example - Revenue of companies

revenue_df = pd.read_clipboard(sep='\t', header=None)
print (revenue_df)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM