Reading and processing multiple netCDF files in Python

Question

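I need help reading multiple netCDF files. There are a few examples here, but none of them worked properly for me. I am using Python(x,y) version 2.7.5 along with several other packages: netcdf4 1.0.7-4, matplotlib 1.3.1-4, numpy 1.8, pandas 0.12, basemap 1.0.2...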

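There are a few things I am used to doing in GrADS that I now need to do in Python. I have 2 meter temperature data from ECMWF (four values per day, one file per year). Each file holds the 2 meter temperature with X size 480, Y size 241, Z size (levels) 1, and T size (time) 1460, or 1464 in leap years. The files are named like this: t2m.1981.nc, t2m.1982.nc, t2m.1983.nc, and so on.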
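The time sizes quoted above are consistent with four steps per day: 365 × 4 = 1460 and 366 × 4 = 1464. A minimal sketch of that check follows; `steps_per_year` is a hypothetical helper written for illustration, not part of any library.

```python
import calendar


def steps_per_year(year, steps_per_day=4):
    """Number of time steps in one yearly file, assuming a fixed number of steps per day."""
    days = 366 if calendar.isleap(year) else 365
    return days * steps_per_day


# 1981 is a common year, 1984 is a leap year
for year in (1981, 1984):
    print(year, steps_per_year(year))
```

If the first step of each year is 00Z, every fourth record along the time axis would be a 00Z record, but decoding the time variable is safer than relying on that assumption.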

Based on this page, "Loop through netcdf files and run calculations - Python or R", here is what I have so far:

from pylab import *
import netCDF4 as nc
from netCDF4 import *
import matplotlib.pyplot as plt
from mpl_toolkits.basemap import Basemap
import numpy as np

f = nc.MFDataset('d:/data/ecmwf/t2m.????.nc') # as '????' being the years
t2mtr = f.variables['t2m']

ntimes, ny, nx = shape(t2mtr)
temp2m = zeros((ny,nx),dtype=float64)
print ntimes
for i in xrange(ntimes):
    temp2m += t2mtr[i,:,:] #I'm not sure how to slice this, just wanted to get the 00Z values.
      # is it possible to assign to a new array,...
      #... (for eg.) the average values of  00z for January only from 1981-2000? 

#creating a NetCDF file
nco = nc.Dataset('d:/data/ecmwf/t2m.00zJan.nc','w',clobber=True)
nco.createDimension('x',nx)
nco.createDimension('y',ny)

temp2m_v = nco.createVariable('t2m', 'i4',  ( 'y', 'x'))
temp2m_v.units='Kelvin'
temp2m_v.long_name='2 meter Temperature'
temp2m_v.grid_mapping = 'Lambert_Conformal' # can it be something else or ..
#... eliminated?).This is straight from the solution on that webpage.

lono = nco.createVariable('longitude','f8')
lato = nco.createVariable('latitude','f8')
xo = nco.createVariable('x','f4',('x')) #not sure if this is important
yo = nco.createVariable('y','f4',('y')) #not sure if this is important
lco = nco.createVariable('Lambert_Conformal','i4') #not sure

#copy all the variable attributes from original file
for var in ['longitude','latitude']:
    for att in f.variables[var].ncattrs():
        setattr(nco.variables[var],att,getattr(f.variables[var],att))

# copy variable data for lon,lat,x and y
lono=f.variables['longitude'][:]
lato=f.variables['latitude'][:]
#xo[:]=f.variables['x']
#yo[:]=f.variables['y']

#  write the temp at 2 m data
temp2m_v[:,:]=temp2m

# copy Global attributes from original file
for att in f.ncattrs():
    setattr(nco,att,getattr(f,att))

nco.Conventions='CF-1.6' #not sure what is this.
nco.close()

#attempt to plot the 00zJan mean
file=nc.Dataset('d:/data/ecmwf/t2m.00zJan.nc','r')
t2mtr=file.variables['t2m'][:]
lon=file.variables['longitude'][:]
lat=file.variables['latitude'][:]
clevs=np.arange(0,500.,10.)
map =   Basemap(projection='cyl',llcrnrlat=0.,urcrnrlat=10.,llcrnrlon=97.,urcrnrlon=110.,resolution='i')
x,y=map(*np.meshgrid(lon,lat))
cs = map.contourf(x,y,t2mtr,clevs,extend='both')
map.drawcoastlines()
map.drawcountries()
plt.plot(cs)
plt.show()

The first problem is the line temp2m += t2mtr[i,:,:]. I am not sure how to slice the data to get only the 00Z values (say, for January only) across all the files.
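One possible way to pick out the 00Z January records is to build a boolean mask from the decoded time stamps and average only the selected steps. The sketch below uses synthetic data on a tiny grid so it runs without the real files; `jan_00z_mask` is a hypothetical helper. With the real files, the list of datetimes could probably be obtained with `netCDF4.num2date(f.variables['time'][:], f.variables['time'].units)`, assuming the files carry a `time` variable with a decodable `units` attribute, which the question does not show.

```python
from datetime import datetime, timedelta

import numpy as np


def jan_00z_mask(times):
    """Boolean mask that is True for time steps in January at 00Z."""
    return np.array([t.month == 1 and t.hour == 0 for t in times])


# Synthetic stand-in: two common years of 6-hourly data on a 2x3 grid
start = datetime(1981, 1, 1, 0)
times = [start + timedelta(hours=6 * i) for i in range(1460 + 1460)]
data = np.random.default_rng(0).normal(280.0, 5.0, size=(len(times), 2, 3))

mask = jan_00z_mask(times)
jan_00z_mean = data[mask].mean(axis=0)  # average over the selected time steps only

print(mask.sum(), jan_00z_mean.shape)
```

With the real variable, looping over `np.flatnonzero(mask)` and accumulating `t2mtr[i, :, :]` avoids loading every record at once. The accumulated sum then has to be divided by `mask.sum()` to become a mean; the loop in the question only sums.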

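The second problem: while running a test, the line cs = map.contourf(x,y,t2mtr,clevs,extend='both') raised an error saying "shape does not match that of z: found (1,1) instead of (241,480)". I know there may be something wrong with the output data, possibly in how the values were recorded, but I cannot figure out what is wrong or where.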
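A likely cause of the (1,1) shape is that longitude and latitude are created in the output file without dimensions, so they are scalars. In addition, `lono=f.variables['longitude'][:]` only rebinds the Python name and writes nothing to the file. `np.meshgrid` on two scalars returns (1,1) arrays, which matches the error. The sketch below reproduces that shape with plain NumPy and outlines a writer that uses 1-D coordinate variables. `write_mean` is a hypothetical helper, the coordinate values are placeholders assuming a regular 0.75 degree grid, and the dimension names are an assumption.

```python
import numpy as np

ny, nx = 241, 480  # grid size stated in the question

# Scalar coordinates, as read back from variables created without dimensions
x_bad, y_bad = np.meshgrid(np.float64(0.0), np.float64(0.0))

# 1-D coordinates of the right length (placeholder values, not the real grid)
lon = np.linspace(0.0, 359.25, nx)
lat = np.linspace(90.0, -90.0, ny)
x_ok, y_ok = np.meshgrid(lon, lat)

print(x_bad.shape, x_ok.shape)


def write_mean(path, mean2d, lon, lat):
    """Write a 2-D mean field with 1-D coordinate variables (requires netCDF4)."""
    import netCDF4 as nc  # imported here so the shape demo above runs without it

    with nc.Dataset(path, "w") as out:
        out.createDimension("latitude", len(lat))
        out.createDimension("longitude", len(lon))
        lat_v = out.createVariable("latitude", "f8", ("latitude",))
        lon_v = out.createVariable("longitude", "f8", ("longitude",))
        t2m_v = out.createVariable("t2m", "f4", ("latitude", "longitude"))
        t2m_v.units = "K"
        # Slice assignment writes into the file; `lon_v = lon` would only rebind the name
        lat_v[:] = lat
        lon_v[:] = lon
        t2m_v[:, :] = mean2d
```

Storing the field as 'f4' rather than 'i4' also keeps the fractional part of the Kelvin values, which an integer variable would discard.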

Thank you for your time. I hope this is not confusing.
