繁体   English   中英

Python 3:“TypeError:无法将 datetimelike 从 [datetime64[ns]] 键入到 [int32]”- Jupyter Notebook

[英]Python 3: "TypeError: cannot astype a datetimelike from [datetime64[ns]] to [int32]" - Jupyter Notebook

jupyter notebook 项目遇到问题我正在尝试在我的 Windows 10 机器上工作,运行 Python 3。我从这个 function 得到提到的错误:

buy_per_min = (buy
               .groupby([pd.Grouper(key='timestamp', freq='Min'), 'price'])
               .shares
               .sum()
               .apply(np.log)
               .to_frame('shares')
               .reset_index('price')
               .between_time(market_open, market_close)
               .groupby(level='timestamp', as_index=False, group_keys=False)
               .apply(lambda x: x.nlargest(columns='price', n=depth))
               .reset_index())
buy_per_min.timestamp = buy_per_min.timestamp.add(utc_offset).astype(int)
buy_per_min.info()

问题出在buy_per_min.timestamp = buy_per_min.timestamp.add(utc_offset).astype(int)行中,但我不完全理解为什么会得到它。 这是完整的回溯:

TypeError                                 Traceback (most recent call last)
<ipython-input-28-396768b710c8> in <module>()
     10                .apply(lambda x: x.nlargest(columns='price', n=depth))
     11                .reset_index())
---> 12 buy_per_min.timestamp = buy_per_min.timestamp.add(utc_offset).astype(int)
     13 buy_per_min.info()

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\generic.py in astype(self, dtype, copy, errors, **kwargs)
   5689             # else, only a single dtype is given
   5690             new_data = self._data.astype(dtype=dtype, copy=copy, errors=errors,
-> 5691                                          **kwargs)
   5692             return self._constructor(new_data).__finalize__(self)
   5693 

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\internals\managers.py in astype(self, dtype, **kwargs)
    529 
    530     def astype(self, dtype, **kwargs):
--> 531         return self.apply('astype', dtype=dtype, **kwargs)
    532 
    533     def convert(self, **kwargs):

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\internals\managers.py in apply(self, f, axes, filter, do_integrity_check, consolidate, **kwargs)
    393                                             copy=align_copy)
    394 
--> 395             applied = getattr(b, f)(**kwargs)
    396             result_blocks = _extend_blocks(applied, result_blocks)
    397 

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\internals\blocks.py in astype(self, dtype, copy, errors, values, **kwargs)
    532     def astype(self, dtype, copy=False, errors='raise', values=None, **kwargs):
    533         return self._astype(dtype, copy=copy, errors=errors, values=values,
--> 534                             **kwargs)
    535 
    536     def _astype(self, dtype, copy=False, errors='raise', values=None,

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\internals\blocks.py in _astype(self, dtype, **kwargs)
   2137 
   2138         # delegate
-> 2139         return super(DatetimeBlock, self)._astype(dtype=dtype, **kwargs)
   2140 
   2141     def _can_hold_element(self, element):

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\internals\blocks.py in _astype(self, dtype, copy, errors, values, **kwargs)
    631 
    632                     # _astype_nansafe works fine with 1-d only
--> 633                     values = astype_nansafe(values.ravel(), dtype, copy=True)
    634 
    635                 # TODO(extension)

~\AppData\Local\Programs\Python\Python36\lib\site-packages\pandas\core\dtypes\cast.py in astype_nansafe(arr, dtype, copy, skipna)
    644         raise TypeError("cannot astype a datetimelike from [{from_dtype}] "
    645                         "to [{to_dtype}]".format(from_dtype=arr.dtype,
--> 646                                                  to_dtype=dtype))
    647 
    648     elif is_timedelta64_dtype(arr):

TypeError: cannot astype a datetimelike from [datetime64[ns]] to [int32]

我需要对时间戳信息进行某种转换吗?它看起来像什么? 谢谢!

更新

之前有一个类似的问题,我已经阅读过,但看不出如何将其应用于我的问题,如果其他人知道,我希望得到解释。 在这里能找到它:

Pandas DataFrame - 使用 ols/线性回归时,“无法将日期时间从 [datetime64[ns]] 键入到 [float64]”

.astype(int)更改为.astype('int64')也解决了这个问题。

来自 .astype() 方法的Pylance文档:

(method) astype: (dtype: Any | _str | Type[str] | Type[bytes] | Type[date] | Type[datetime] | Type[timedelta] | Type[bool] | Type[int] | Type[float] | Type[complex] | Type[Timestamp] | Type[Timedelta], copy: _bool = ..., errors: _str = ...) -> Series
Cast a pandas object to a specified dtype dtype.

Parameters
dtype : data type, or dict of column name -> data type
    Use a numpy.dtype or Python type to cast entire pandas object to
    the same type. Alternatively, use {col: dtype, ...}, where col is a column label and dtype is a numpy.dtype or Python type to cast one or more of the DataFrame's columns to column-specific types.
copy : bool, default True
    Return a copy when copy=True (be very careful setting copy=False as changes to values then may propagate to other pandas objects).
errors : {'raise', 'ignore'}, default 'raise'
    Control raising of exceptions on invalid data for provided dtype.

raise : allow exceptions to be raised
ignore : suppress exceptions. On error return original object.
Returns
casted : same type as caller

See Also
to_datetime : Convert argument to datetime.
to_timedelta : Convert argument to timedelta.
to_numeric : Convert argument to a numeric type.
numpy.ndarray.astype : Cast a numpy array to a specified type.

Notes
Examples
Create a DataFrame:

>>> d = {'col1': [1, 2], 'col2': [3, 4]}
>>> df = pd.DataFrame(data=d)
>>> df.dtypes
col1    int64
col2    int64
dtype: object
Cast all columns to int32:

>>> df.astype('int32').dtypes
col1    int32
col2    int32
dtype: object
Cast col1 to int32 using a dictionary:

>>> df.astype({'col1': 'int32'}).dtypes
col1    int32
col2    int64
dtype: object
Create a series:

>>> ser = pd.Series([1, 2], dtype='int32')
>>> ser
0    1
1    2
dtype: int32
>>> ser.astype('int64')
0    1
1    2
dtype: int64
Convert to categorical type:

>>> ser.astype('category')
0    1
1    2
dtype: category
Categories (2, int64): [1, 2]
Convert to ordered categorical type with custom ordering:

>>> from pandas.api.types import CategoricalDtype
>>> cat_dtype = CategoricalDtype(
...     categories=[2, 1], ordered=True)
>>> ser.astype(cat_dtype)
0    1
1    2
dtype: category
Categories (2, int64): [2 < 1]
Note that using copy=False and changing data on a new pandas object may propagate changes:

>>> s1 = pd.Series([1, 2])
>>> s2 = s1.astype('int64', copy=False)
>>> s2[0] = 10
>>> s1  # note that s1[0] has changed too
0    10
1     2
dtype: int64
Create a series of dates:

>>> ser_date = pd.Series(pd.date_range('20200101', periods=3))
>>> ser_date
0   2020-01-01
1   2020-01-02
2   2020-01-03
dtype: datetime64[ns]

Pandas 无法将日期时间转换为int32 ,因此引发了错误。 如果转换为np.int64它可以工作,也可以使用错误的值将 numpy 数组转换为int或转换为int64 - 然后以nanoseconds以本机格式获取日期时间:

rng = pd.date_range('2017-04-03 12:00:45', periods=3)
buy_per_min = pd.DataFrame({'timestamp': rng})  

from datetime import timedelta
utc_offset = timedelta(hours=4)

print (buy_per_min.timestamp.add(utc_offset))
0   2017-04-03 16:00:45
1   2017-04-04 16:00:45
2   2017-04-05 16:00:45
Name: timestamp, dtype: datetime64[ns]

print (buy_per_min.timestamp.add(utc_offset).values)
['2017-04-03T16:00:45.000000000' '2017-04-04T16:00:45.000000000'
 '2017-04-05T16:00:45.000000000']
print (buy_per_min.timestamp.add(utc_offset).values.astype(np.int64))
[1491235245000000000 1491321645000000000 1491408045000000000]

print (buy_per_min.timestamp.add(utc_offset).astype(np.int64))
0    1491235245000000000
1    1491321645000000000
2    1491408045000000000
Name: timestamp, dtype: int64

#https://stackoverflow.com/a/12716674
print (buy_per_min.timestamp.add(utc_offset).values.astype(int))
[ -289111552 -2146205184   291668480]

我刚刚遇到了一个非常相似的问题:

类型错误:不能从 [datetime64[ns]] 到 [bool] 输入类似日期时间的类型

在我的情况下,问题是通过添加大括号解决的。 比较一下:

df2 = df[
    (df['column1'] != df['column2']) &
    df['column3'] >= '03.02.2020'
].copy()

对此:

df2 = df[
    (df['column1'] != df['column2']) &
    (df['column3'] >= '03.02.2020')
].copy()

看起来在我的情况下,错误消息只是由&运算符应用于基于日期时间的列column3的事实触发的。

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM