Pandas drawdown duration

Posted 2024-04-18 23:24:34


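I am trying to calculate the duration of each drawdown in a stock price series, and the time it takes to recover. I can calculate the drawdowns themselves, but I am struggling to get the duration and the recovery time for each one. So far I have this code: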

import pandas as pd
import numpy as np

np.random.seed(0)
df = pd.Series(np.random.randn(2500)*0.7+0.05, index=pd.date_range('1/1/2000', periods=2500, freq='D'))
df= 100*(1+df/100).cumprod()
df=pd.DataFrame(df)
df.columns = ['close']
df['ret'] = df.close/df.close.iloc[0]
df['modMax'] = df.ret.cummax()
df['modDD'] = 1-df.ret.div(df['modMax'])
groups = df.groupby(df['modMax'])
dd = groups[['modMax','modDD']].apply(lambda g: g[g['modDD'] == g['modDD'].max()])
top10dd = dd.sort_values('modDD', ascending=False).head(10)
top10dd

This gives the 10 largest drawdowns in the series, but I would also like the duration of each drawdown and the time to recovery.


1 answer
User
#1 · Posted 2024-04-18 23:24:34

I solved the problem as follows:

def drawdown_group(df, index_list):
    # index_list is one (group_max, dd_date) index entry of top10dd
    group_max, dd_date = index_list
    # all rows sharing this running maximum, from the peak until a new high is set
    ddGroup = df[df['modMax'] == group_max]
    group_length = len(ddGroup)
    group_dd = ddGroup['modDD'].max()
    # number of rows from the peak up to and including the trough
    group_dd_length = len(ddGroup[ddGroup.index <= dd_date])
    group_start = ddGroup.index[0]
    group_end = ddGroup.index[-1]
    # number of rows after the trough until the group ends
    group_rec = group_length - group_dd_length
    return group_start, group_end, group_max, group_dd, dd_date, group_dd_length, group_rec, group_length

dd_col = ('start', 'end', 'peak', 'dd', 'dd_date', 'dd_length', 'dd_rec', 'tot_length')
df_dd = pd.DataFrame(columns=dd_col)
# one output row per entry of top10dd
for i, index_list in enumerate(top10dd.index):
    df_dd.loc[i] = drawdown_group(df, index_list)

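This produces a table with one row per drawdown and the columns start, end, peak, dd, dd_date, dd_length, dd_rec and tot_length: [image of the resulting table]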
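
For comparison, here is a minimal sketch of a different way to get the same kind of information. It is not the method from the answer above. It works directly on a price series instead of on top10dd, and it reports the recovery date explicitly. The function name drawdown_episodes and its output column names are made up for illustration. The sketch assumes the series has a DatetimeIndex and no missing prices, and it measures durations in calendar days rather than in row counts.

```python
import numpy as np
import pandas as pd


def drawdown_episodes(close: pd.Series) -> pd.DataFrame:
    """Summarise every drawdown episode in a price series (illustrative helper)."""
    running_max = close.cummax()
    drawdown = 1 - close / running_max      # 0 at a high, > 0 while below the high
    under_water = drawdown > 0
    # A new episode starts on each row that is under water while the previous row was not.
    starts = under_water & ~under_water.shift(1, fill_value=False)
    episode_id = starts.cumsum()

    rows = []
    for _, episode in drawdown[under_water].groupby(episode_id[under_water]):
        first_pos = close.index.get_loc(episode.index[0])
        last_pos = close.index.get_loc(episode.index[-1])
        peak_date = close.index[first_pos - 1]   # last high before the decline
        trough_date = episode.idxmax()           # date of the deepest drawdown
        # The first row after the episode is back at the running maximum, if it exists.
        recovered = last_pos + 1 < len(close)
        recovery_date = close.index[last_pos + 1] if recovered else pd.NaT
        rows.append({
            "peak_date": peak_date,
            "trough_date": trough_date,
            "recovery_date": recovery_date,
            "max_dd": episode.max(),
            "days_to_trough": (trough_date - peak_date).days,
            "days_to_recover": (recovery_date - trough_date).days if recovered else np.nan,
        })
    return pd.DataFrame(rows)
```

With the DataFrame from the question, something like drawdown_episodes(df['close']).nlargest(10, 'max_dd') should give the ten deepest episodes. An episode that has not recovered by the end of the data gets NaT and NaN in the recovery columns.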
