python csv DictReader类型

Question

我开始在python中编码，我现在遇到了问题，csv.DictReader让我得到了错误的数据类型。

csv文件看起来像：

Col1，Col2，Col3

1,2,3

90,2,3

pol = csv.DictReader(open('..\data\data.csv'),dialect='excel')

Col1 = []

for row in pol:
    if row["Col1"] < 90:
        Col1.append(row["Col1"] * 1.5)
    else:
        Col1.append("Col1")

我收到以下错误：

if row["Col1"] < 90:
TypeError: unorderable types: str() < int()

我不会转换每一个值。 是否可以定义列的值？

Answer 1

您可以使用像熊猫一样的库，它会为您推断出类型（这有点过分但它可以完成工作）。

import pandas
data = pandas.read_csv(r'..\data\data.csv')
# if you just want to retrieve the first column as a list of int do
list(data.Col1)
>>> [1, 90]

# to convert the whole CSV file to a list of dict use
data.transpose().to_dict().values()
>>> [{' Col2': 2, ' Col3': 3, 'Col1': 1}, {' Col2': 2, ' Col3': 3, 'Col1': 90}]

或者这里是一个类型化的DictReader的实现：

from csv import DictReader
from itertools import imap, izip

class TypedDictReader(DictReader):
  def __init__(self, f, fieldnames=None, restkey=None, restval=None, \
      dialect="excel", fieldtypes=None, *args, **kwds):

    DictReader.__init__(self, f, fieldnames, restkey, restval, dialect, *args, **kwds)
    self._fieldtypes = fieldtypes

  def next(self):
    d = DictReader.next(self)
    if len(self._fieldtypes) >= len(d) :
      # extract the values in the same order as the csv header
      ivalues = imap(d.get, self._fieldnames) 
      # apply type conversions
      iconverted = (x(y) for (x,y) in izip(self._fieldtypes, ivalues)) 
      # pass the field names and the converted values to the dict constructor
      d = dict(izip(self._fieldnames, iconverted)) 

    return d

以下是如何使用它：

reader = TypedDictReader(open('..\data\data.csv'), dialect='excel', \
  fieldtypes=[int, int, int])
list(reader)
>>> [{' Col2': 2, ' Col3': 3, 'Col1': 1}, {' Col2': 2, ' Col3': 3, 'Col1': 90}]

Answer 2

如果引用csv文件中的非数字值并初始化阅读器

pol = csv.DictReader(open('..\data\data.csv'),
    quoting=csv.QUOTE_NONNUMERIC, dialect="excel")

那么数值将自动转换为浮点数。

Answer 3

我以前没有使用过DictReader，但你可以这样做：

...
for row in pol:
    col1 = float(row["Col1"]) # or int()
    ...

然后使用col1，你可能也可以编辑字典：

row["Col1"] = float(row["Col1"])

但这取决于您以后想要使用该行的内容。

Answer 4

看起来你希望Col1是一个数字数组，所以你需要将row [“Col1”]转换为数字，无论你是否将它与数字进行比较。 所以转换吧！

python csv DictReader类型

问题描述

4 个解决方案

解决方案1
8 2013-06-26 17:22:53

解决方案2
2 2012-01-05 19:43:27

解决方案3
1 2012-01-05 19:49:08

解决方案4
0 2012-01-05 19:30:24

python csv DictReader类型

问题描述

4 个解决方案

解决方案1 8 2013-06-26 17:22:53

解决方案2 2 2012-01-05 19:43:27

解决方案3 1 2012-01-05 19:49:08

解决方案4 0 2012-01-05 19:30:24

解决方案1
8 2013-06-26 17:22:53

解决方案2
2 2012-01-05 19:43:27

解决方案3
1 2012-01-05 19:49:08

解决方案4
0 2012-01-05 19:30:24