Pandas聚合分组数据，提供了某些标准为m

# Get the individual holding valuation data valuation = get_valuation(portfolio = portfolio, df = True) # Then next few lines retrieve the dates for which I have complete price data for the # assets that comprise this portflio # First get a list of the assets that this portfolio contains (or has contained). unique_assets = valuation['asset'].unique().tolist() # Then I get the price data for these assets ats = get_ats(assets = unique_assets, df = True )[['data_date','close_price']] # I mark those dates for which I have a 'close_price' for each asset: ats = ats.groupby('data_date')['close_price'].agg({'data_complete':lambda x: len(x) == len(unique_assets)} ).reset_index() # And extract the corresponding valid dates. valid_dates = ats['data_date'][ats['data_complete']] # Filter the valuation data for those dates for which I have complete data: valuation = valuation[valuation['data_date'].apply(lambda x: x in valid_dates.values)] # Group by date, and sum the individual hodling valuations by date, to get the Portfolio valuation portfolio_valuation = valuation[['data_date','valuation']].groupby('data_date').agg(lambda df: sum(df['valuation'])).reset_index()

1条回答

网友

1楼 · 发布于 2024-06-07 18:19:40

似乎再取样和/或fillna的结合会让你得到你想要的东西（意识到这来得有点晚了！）。在

像你现在这样去抓取你的数据。你把这些东西拿回来的时候有一些空隙。看看这个：

import pandas as pd
import numpy as np

dates = pd.DatetimeIndex(start='2012-01-01', periods=10, freq='2D')
df = pd.DataFrame(np.random.randn(20).reshape(10,2),index=dates)

所以现在你有了这些数据，其中有很多空白，但是你想要的是这些每日分辨率数据。在

就这么做吧：

^{pr2}$

这将用一堆丢失数据的nan填充数据帧。当你对它们进行聚合时，只需使用函数（例如。，南森np, np平均值)无视南斯！在

对你得到的数据的确切格式还是有点不清楚。希望有帮助。在

相关问题更多 >

编程相关推荐

热门问题

热门文章