Numpy isnat()返回日期时间对象的值错误

2024-06-09 03:18:23 发布

您现在位置:Python中文网/ 问答频道 /正文

一直在处理Converting between datetime, Timestamp and datetime64中列出的选项;但是,numpy的isnat()似乎无法识别datetime对象,或者我缺少函数输入所需的其他类型的datetime对象。在

以下是dataframe的概述:

>>> time_data.head()
     Date          Name               In AM              Out AM  \
0  2017-12-04  AUSTIN LEWIS 1900-01-01 07:03:11 1900-01-01 12:01:50   
1  2017-12-05  AUSTIN LEWIS 1900-01-01 05:24:07 1900-01-01 12:08:21   
2  2017-12-06  AUSTIN LEWIS 1900-01-01 11:58:32                 NaT   
3  2017-12-07  AUSTIN LEWIS 1900-01-01 08:31:23 1900-01-01 12:49:51   
4  2017-12-11  AUSTIN LEWIS 1900-01-01 06:55:21 1900-01-01 12:02:08   

            In PM              Out PM Sick Time  
0 1900-01-01 12:28:52 1900-01-01 17:34:53       NaT  
1 1900-01-01 12:35:12 1900-01-01 16:15:17       NaT  
2                 NaT 1900-01-01 23:59:01       NaT  
3 1900-01-01 13:18:34 1900-01-01 18:10:35       NaT  
4 1900-01-01 12:30:49 1900-01-01 17:39:54       NaT  

>>> time_data.dtypes
Date                 object
Name                 object
In AM        datetime64[ns]
Out AM       datetime64[ns]
In PM        datetime64[ns]
Out PM       datetime64[ns]
Sick Time    datetime64[ns]
dtype: object

>>> type(time_data['In AM'][3])
<class 'pandas._libs.tslib.Timestamp'>

>>> type(time_data['In AM'][3].to_datetime())
<type 'datetime.datetime'>

if np.isnat(time_data['Out AM'][row].to_datetime()) & np.isnat(time_data['In PM'][row].to_datetime()):

Throws“ValueError:ufunc'isnat'只为datetime和timedelta定义”

我错过了什么?!在


Tags: indatadatetimeobjecttimetypeoutam
2条回答

呃,这真是个糟糕的错误信息!np.isnat(“不是时间”)只适用于numpy的datetimes。ufunc的典型用法是使用np.datetime64np.timedelta64dtype数组:

>>> dt = datetime.now()
>>> np.isnat(np.array([dt], dtype=np.datetime64))
array([False])
>>> np.isnat(np.array([dt], dtype=object))
TypeError: ufunc 'isnat' is only defined for datetime and timedelta.

有关支持的输入类型,请参阅docs。在

您也可以在所需的datetime列上使用pd.to_datetime从头开始转换所有内容:

df = pd.DataFrame({
    'date' : [
        '2017-12-04',
        '2017-12-05',
        '2017-12-06',
        '2017-12-07',
        '2017-12-11'
    ],
    'name' : ['AUSTIN LEWIS'] * 5,
    'in_am' : [
        '1900-01-01 07:03:11',
        '1900-01-01 05:24:07',
        '1900-01-01 11:58:32',
        '1900-01-01 08:31:23',
        '1900-01-01 06:55:21'
    ],
    'out_am' : [
        '1900-01-01 12:01:50',
        '1900-01-01 12:08:21',
        '',
        '1900-01-01 12:49:51',
        '1900-01-01 12:02:08'
    ],
    'in_pm' : [
        '1900-01-01 12:28:52',
        '1900-01-01 12:35:12',
        '',
        '1900-01-01 13:18:34',
        '1900-01-01 12:30:49'
    ],
    'out_pm' : [
        '1900-01-01 17:34:53',
        '1900-01-01 16:15:17',
        '1900-01-01 23:59:01',
        '1900-01-01 18:10:35',
        '1900-01-01 17:39:54'
    ],
    'sick_time' : [''] * 5
})

input

^{pr2}$

dtypes

# convert to datetimes
for col in df.columns.drop('name').tolist():
    df[col] = pd.to_datetime(df[col])

# name should be only object
df.dtypes

{a3}

# np.isnat should now work
np.isnat(df.loc[:, df.dtypes == 'datetime64[ns]'])

output

相关问题 更多 >