Pandas: resampling datetimes where a day spans 2 calendar days and the week starts on Sunday

def read_in_files(file_names):
    """
    1. Read the csv files to memory into a pandas dataframe with pd.read_csv
    2. separate the df into year, month, and date objects
    3. It also chunks the data by single day
    """
    import os
    import pandas as pd

    file1 = pd.read_csv(file_names, parse_dates=[['Date', 'Time']])
    df = pd.DataFrame(file1)

    # Week is defined as sunday 4pm to Friday 4pm --not working correctly
    # this is a timestamp obj
    df['year'], df['month'] = df['Date_Time'].dt.year, df['Date_Time'].dt.month
    df['date'] = df['Date_Time'].dt.day
    # .dt.week was removed in pandas 2.0; use .dt.isocalendar().week there
    df['week'] = df['Date_Time'].dt.week

    # the three loops below chunk the data by dates
    # (note: this first one groups by the full timestamp, not the calendar day)
    df_single_day = []
    for group in df.groupby(df.Date_Time, sort=False):
        df_single_day.append(group[1])

    df_single_week = []
    for group in df.groupby(['week', 'year'], sort=False):
        df_single_week.append(group[1])

    df_single_month = []
    for group in df.groupby(['month', 'year'], sort=False):
        df_single_month.append(group[1])

    return df, df_single_day, df_single_week, df_single_month

    Unnamed: 0  Symbol         Date_Time    Open    High     Low   Close  \
95          96  ABCDEF  2008-05-07 00:00  0.9478  0.9483  0.9475  0.9481
96          97  ABCDEF  2008-05-07 00:05  0.9481  0.9484  0.9479  0.9484
97          98  ABCDEF  2008-05-07 00:10  0.9482  0.9485  0.9480  0.9482
98          99  ABCDEF  2008-05-07 00:15  0.9482  0.9485  0.9478  0.9483
99         100  ABCDEF  2008-05-07 00:20  0.9483  0.9485  0.9480  0.9484

    year  month  date  week
95  2008      5     7    19
96  2008      5     7    19
97  2008      5     7    19
98  2008      5     7    19
99  2008      5     7    19

1 Answer

User

#1 · Posted 2024-04-26 13:57:24

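# Combine the separate Date and Time columns into one datetime column
df['temp'] = df['Date'].astype(str) + ' ' + df['Time']
df['temp'] = pd.to_datetime(df['temp'], infer_datetime_format=True)
# Shift by 8 hours so that 16:00 lands on midnight of the next day
df['temp'] = df['temp'] + pd.offsets.Hour(8)

# Group by the shifted calendar date: each group is one 16:00-to-16:00 day
g = df.groupby(df['temp'].dt.normalize())
df_single_day = []
for _, day_df in g:
    # Skip single-row groups (the lone 16:00 row at the weekend)
    if len(day_df) > 1:
        df_single_day.append(day_df)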

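The code above produces the correct answer. There is one minor issue (not an important one): the 16:00 row at the weekend ends up in a group of its own, so I drop those groups with the if statement.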
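To see why the 8-hour shift works, it can help to run it on a handful of rows. The sketch below uses made-up timestamps and prices (not the poster's data) and assumes a single `Date_Time` column already parsed as datetimes. Adding 8 hours moves 16:00 onto midnight of the next day, so `normalize()` then labels each 16:00-to-16:00 session with one calendar date.

```python
import pandas as pd

# Hypothetical sample rows around the 16:00 cut-off (not the original data)
df = pd.DataFrame({
    "Date_Time": pd.to_datetime([
        "2008-05-05 15:55",  # Monday, last row before the cut-off
        "2008-05-05 16:00",  # Monday, first row of the next session
        "2008-05-06 00:00",  # Tuesday, same session
        "2008-05-06 15:55",  # Tuesday, last row of that session
        "2008-05-06 16:00",  # Tuesday, a new session starts
    ]),
    "Close": [0.9481, 0.9484, 0.9482, 0.9483, 0.9484],
})

# Shift by 8 hours so 16:00 maps to 00:00 of the following day,
# then drop the time part to get one label per session
session = (df["Date_Time"] + pd.Timedelta(hours=8)).dt.normalize()

# One DataFrame per 16:00-to-16:00 session
df_single_day = [chunk for _, chunk in df.groupby(session)]
sizes = [len(chunk) for chunk in df_single_day]
print(sizes)
```

The original `Date_Time` values are left untouched here; only the grouping key is shifted, which avoids having to subtract the 8 hours again later.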

I am still trying to work out how to do the weekly grouping if my data runs Monday to Monday...
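One possible way to approach the weekly grouping is to reuse the same shift. This is only a sketch under assumptions: it takes the week to start on Sunday at 16:00 (as in the question), uses made-up timestamps, and may not be what "Monday to Monday" was meant to describe. After adding 8 hours, Sunday 16:00 becomes Monday 00:00, so an ordinary Monday-to-Sunday weekly period (`"W-SUN"`, weeks ending on Sunday) lines up with the shifted data.

```python
import pandas as pd

# Hypothetical timestamps around the Sunday 16:00 week boundary
df = pd.DataFrame({
    "Date_Time": pd.to_datetime([
        "2008-05-04 15:55",  # Sunday, still in the previous week
        "2008-05-04 16:00",  # Sunday, first row of the new week
        "2008-05-07 00:00",  # Wednesday
        "2008-05-09 15:55",  # Friday, last row before the Friday cut-off
        "2008-05-11 16:00",  # next Sunday, the following week starts
    ]),
})

# Same 8-hour shift as for the daily grouping
shifted = df["Date_Time"] + pd.Timedelta(hours=8)

# Weekly periods ending on Sunday, i.e. Monday 00:00 to Sunday 23:59
# in shifted time, which is Sunday 16:00 to Sunday 15:59 in real time
week = shifted.dt.to_period("W-SUN")

df_single_week = [chunk for _, chunk in df.groupby(week)]
sizes = [len(chunk) for chunk in df_single_week]
print(sizes)
```

If the data really does start its week on Monday at 00:00, the shift can be dropped and `df["Date_Time"].dt.to_period("W-SUN")` used directly as the grouping key.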
