且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

将"TimeStamp"列截断为pandas DataFrame中的小时精度

更新时间:2021-12-26 01:25:43

在pandas 0.18.0及更高版本中,日期时间为 ceil

In pandas 0.18.0 and later, there are datetime floor, ceil and round methods to round timestamps to a given fixed precision/frequency. To round down to hour precision, you can use:

>>> df['dt2'] = df['dt'].dt.floor('h')
>>> df
                      dt                     dt2
0    2014-10-01 10:02:45     2014-10-01 10:00:00
1    2014-10-01 13:08:17     2014-10-01 13:00:00
2    2014-10-01 17:39:24     2014-10-01 17:00:00


这是截断时间戳的另一种方法.与floor不同,它支持截断到年或月之类的精度.


Here's another alternative to truncate the timestamps. Unlike floor, it supports truncating to a precision such as year or month.

您可以临时调整基础NumPy datetime64数据类型的精度单位,将其从[ns]更改为[h]:

You can temporarily adjust the precision unit of the underlying NumPy datetime64 datatype, changing it from [ns] to [h]:

df['dt'].values.astype('<M8[h]')

这会将所有内容截断为小时精度.例如:

This truncates everything to hour precision. For example:

>>> df
                       dt
0     2014-10-01 10:02:45
1     2014-10-01 13:08:17
2     2014-10-01 17:39:24

>>> df['dt2'] = df['dt'].values.astype('<M8[h]')
>>> df
                      dt                     dt2
0    2014-10-01 10:02:45     2014-10-01 10:00:00
1    2014-10-01 13:08:17     2014-10-01 13:00:00
2    2014-10-01 17:39:24     2014-10-01 17:00:00

>>> df.dtypes
dt     datetime64[ns]
dt2    datetime64[ns]

对于其他任何单位,同样的方法也应适用:月份'M',分钟'm',依此类推:

The same method should work for any other unit: months 'M', minutes 'm', and so on:

  • 保留至一年:'<M8[Y]'
  • 保持月份:'<M8[M]'
  • 保持一天:'<M8[D]'
  • 保持最新:'<M8[m]'
  • 紧跟第二:'<M8[s]'
  • Keep up to year: '<M8[Y]'
  • Keep up to month: '<M8[M]'
  • Keep up to day: '<M8[D]'
  • Keep up to minute: '<M8[m]'
  • Keep up to second: '<M8[s]'