从输入文件数据中求最大值和最小值的问题
我刚开始学习Python,需要一点帮助,我有一个叫做search_max.py的脚本。
这个脚本会打开一个“xyz”格式的文件,然后查找每个坐标的最小值和最大值。问题是,当我用awk脚本做同样的事情时,结果却不一样!!!
我在想,是不是数据类型或者字符串操作出了问题,或者其他什么原因……有没有人能帮我解决这个问题?
Python脚本:
#!/usr/bin/python
# -*- coding: iso-8859-15 -*-
inputfile = "peamorphe.xyz"
outputfile = "result.txt"
# open the input file
infile = open(inputfile, "r")
# read line 1 : number of atoms
atomsno = infile.readline().rstrip('\n').split(" ")
# read line 2 : name of the system
systemname = infile.readline().rstrip('\n')
# read line 3 : initialisation for min and max
temp2 = infile.readline().rstrip('\n').split(" ")
zmin = temp2[3]
zmax = temp2[3]
ymax = temp2[2]
ymin = temp2[2]
xmax = temp2[1]
xmin = temp2[1]
lineno = 3
print zmax, ymin, xmin
# read other lines
for ligne in infile.readlines():
lineno = lineno + 1
# extraction and strip of data spaced by " "
data = ligne.rstrip('\n\r').split(" ")
# Conditions for min and max
if data[1] < xmin:
xmin = data[1]
wclxmin = lineno
if data[1] > xmax:
xmax = data[1]
wclxmax = lineno
if data[2] < ymin:
ymin = data[2]
wclymin = lineno
if data[2] > ymax:
ymax = data[2]
wclymax = lineno
if data[3] < zmin:
zmin = data[3]
wclzmin = lineno
if data[3] > zmax:
zmax = data[3]
wclzmax = lineno
# Evaluation of centers
zcenter = float(zmax)-float(zmin)
ycenter = float(ymax)-float(ymin)
xcenter = float(xmax)-float(xmin)
# open the input file
infile = open(inputfile, "r")
# read line 1 : number of atoms
atomsno = infile.readline().rstrip('\n').split(" ")
# read line 2 : name of the system
systemname = infile.readline().rstrip('\n')
# read line 3 : initialisation for min and max
temp2 = infile.readline().rstrip('\n').split(" ")
zmin = temp2[3]
zmax = temp2[3]
ymax = temp2[2]
ymin = temp2[2]
xmax = temp2[1]
xmin = temp2[1]
lineno = 3
print zmax, ymin, xmin
# read other lines
for ligne in infile.readlines():
lineno = lineno + 1
# extraction and strip of data spaced by " "
data = ligne.rstrip('\n\r').split(" ")
# Conditions for min and max
if data[1] < xmin:
xmin = data[1]
wclxmin = lineno
if data[1] > xmax:
xmax = data[1]
wclxmax = lineno
if data[2] < ymin:
ymin = data[2]
wclymin = lineno
if data[2] > ymax:
ymax = data[2]
wclymax = lineno
if data[3] < zmin:
zmin = data[3]
wclzmin = lineno
if data[3] > zmax:
zmax = data[3]
wclzmax = lineno
# Evaluation of centers
zcenter = float(zmax)-float(zmin)
ycenter = float(ymax)-float(ymin)
xcenter = float(xmax)-float(xmin)
awk脚本:
#!/usr/bin/awk -f
# from a xyz file
BEGIN{
xmax;xmin;
zmax;zmin;
ymax;ymin;
xcent;ycent;zcent;
xcent = (xmax-xmin)/2;
ycent = (ymax-ymin)/2;
zcent = (zmax-zmin)/2;
print "At the start of the script";
print "xmax = " xmax "; " "xmin = " xmin "; xcent = " xcent;
print "ymax = " ymax "; " "ymin = " ymin "; ycent = " ycent;
print "zmax = " zmax "; " "zmin = " zmin "; zcent = " zcent;
print "";
}
{
if (xmax<$2) xmax = $2
if (xmin>$2) xmin = $2
if (ymax<$3) ymax = $3
if (ymin>$3) ymin = $3
if (zmax<$4) zmax = $4
if (zmin>$4) zmin = $4
}
END{
xcent = (xmax-xmin)/2;
ycent = (ymax-ymin)/2;
zcent = (zmax-zmin)/2;
print "At the end of the script";
print "xmax = " xmax "; " "xmin = " xmin "; xcent = " xcent;
print "ymax = " ymax "; " "ymin = " ymin "; ycent = " ycent;
print "zmax = " zmax "; " "zmin = " zmin "; zcent = " zcent}
你可以在这里下载我的输入文件(14天):peamorphe.xyz
提前谢谢你们,
Exilien。
1 个回答
5
你的代码之所以没有按预期工作,是因为上面评论中的eumiro提到了原因。
不过在Python中,有一个更简单的方法:使用NumPy。下面是一个示例代码,用来找出你文件中每一列的最大值和最小值:
import numpy
a = numpy.loadtxt("peamorphe.xyz", skiprows=2, usecols=(1, 2, 3))
max_xyz = a.max(axis=0)
min_xyz = a.min(axis=0)
center = max_xyz - min_xyz
这几行代码就能完成你脚本所做的所有事情,包括解析输入文件。如果你还需要最大值和最小值的索引,可以使用a.argmax()
和a.argmin()
。
看起来简单多了,是吧?