如何限制pandas datafram中的NaN填充

2024-05-17 13:01:57 发布

您现在位置:Python中文网/ 问答频道 /正文

我有三个pandas dataframe,其中包含三种索引类型15分钟周期,1分15秒,我在后面的dataframes中添加了NaNs,并在同一个图中绘制了主题。在

图表: Graph1

现在我想替换一个dataframe NaN,我用了ffill(),它起作用了,但我需要限制填充{},我不需要我标记为红色的内容。在

图2:

Graph2

我的情节应该是这样的:

NOAA
(来源:noaa.gov

数据帧:

http://bayanbox.ir/id/1324113030042053806?download

http://bayanbox.ir/id/774076250887409862?download

http://bayanbox.ir/id/6217190851751601245?download

资料来源:

import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
# 1 minutes recorded data
data = pd.read_csv('1m.csv', parse_dates=True, index_col='time')
# 15 minutes recorded data
data2 = pd.read_csv('15m.csv', parse_dates=True, index_col='time')
# 15 seconds recorded data
data3 = pd.read_csv('15s.csv', parse_dates=True, index_col='time')

del data['Unnamed: 0'], data2['Unnamed: 0'], data3['Unnamed: 0']

def add_nan(DF, T):
    start = DF.time[len(DF)-1]
    stop = DF.time[0]
    rng = pd.date_range(start, stop, freq=T)
    DF = DF.drop_duplicates('time').set_index('time').reindex(rng)
    return DF

data = pd.DataFrame({"1-min":np.array(data.Height[:]), "time":data.index})
data2 = pd.DataFrame({"15-min":np.array(data2.Height[:]), "time":data2.index})
data3 = pd.DataFrame({"15-sec":np.array(data3.Height[:]), "time":data3.index}) 

data = add_nan(data, '1min')
data2 = add_nan(data2, '15min')
data3 = add_nan(data3, '1S')

ax = data.plot(color='g', figsize=(10, 6))
data2.plot(ax=ax, color='b')
data3.plot(ax=ax, style='.-r')

plt.savefig('plot.png')

Tags: csvaddhttpdfdataindextimeplot
1条回答
网友
1楼 · 发布于 2024-05-17 13:01:57

根据Pandasdocumentation,限制参数应该设置True

DataFrame.ffill(axis=0, inplace=False, limit=None, downcast=None)
Synonym for NDFrame.fillna(method=’ffill’)

enter image description here

此函数用于将NaN数据添加到dataframe,因为在dataframe中应设置限制NaN填充{}:

^{pr2}$

相关问题 更多 >