在Python表格中使用for循环
我正在尝试制作一个表格,用来整理高票房电影和与它们最相似的电影。我的代码已经写好了,但在完成表格时遇到了一些麻烦。
我的代码:
#import the data from the csv file and use DictRead to interpret the information.
import csv
csv_file = open("moviestats_med.csv")
csv_data = csv.DictReader(csv_file)
#create dictionary of films and their directors.
direct = {}
#create dictionary of films and their genres.
genre = {}
#create dictionary of films and each main actor.
actor1 = {}
actor2 = {}
actor3 = {}
#create dictionary of films and their worldwide gross.
gross = {}
#create dictionary of films and the year they came out.
year = {}
name = {}
#iterate over the csv file to fill the dictionaries.
for c in csv_data:
direct[c['name']] = c['director']
genre[c['name']] = c['genre']
actor1[c['name']] = c['actor1']
actor2[c['name']] = c['actor2']
actor3[c['name']] = c['actor3']
gross[c['name']] = c['Worldwide Gross']
year[c['name']] = c['date']
name[c['name']] = c['name']
#create a two-variable function to deterime the FavActor Similarity score:
def FavActorFunction(film1,film2):
#set the result of the FavActor formula between two films to a default of 0.
FavActorScore = 0
#add 3 to the similarity score if the films have the same director.
if direct[film1] == direct[film2]:
FavActorScore += 3
#add 2 to the similarity score if the films are in the same genre.
if genre[film1] == genre[film2]:
FavActorScore += 2
#add 5 to the similarity score for each actor they have in common.
if actor1[film1] in (actor1[film2], actor2[film2], actor3[film2]):
FavActorScore += 5
if actor2[film1] in (actor1[film2], actor2[film2], actor3[film2]):
FavActorScore += 5
if actor3[film1] in (actor1[film2], actor2[film2], actor3[film2]):
FavActorScore += 5
#print the resulting score.
return FavActorScore
#create a function to find the film with the greatest Worldwide Gross per year.
def MaxGrossFinder(c):
#set the intial maximum gross to zero.
MaxGross = 0
#replace the MaxGross with any film in that year that has a greater gross.
for film in year:
f = int(gross[film])
if year[film] == c:
if f > MaxGross:
MaxGross = f
max = film
#print the year and the max value for that year.
return max
#create a dictionary for the max grossing films of each year from 2000-2007.
max_films = {}
#create a list of years from 2000-2007.
for c in ['2000', '2001', '2002', '2003', '2004', '2005', '2006', '2007']:
max_films[c] = MaxGrossFinder(c)
if 'a' == 'a':
max_list = []
MaxSimilarity = 0
for d in year:
f = FavActorFunction(max_films[c], d)
if d != MaxGrossFinder(c):
if year[d] == c:
if f > MaxSimilarity:
MaxSimilarity = f
max = d
max_list.append(max)
MaxSimilarity2 = 0
for d in year:
g = FavActorFunction(max_films[c], d)
if d != MaxGrossFinder(c):
if d != max:
if year[d] == c:
if g > MaxSimilarity2:
MaxSimilarity2 = g
max2 = d
max_list.append(max2)
MaxSimilarity3 = 0
for d in year:
h = FavActorFunction(max_films[c], d)
if d != MaxGrossFinder(c):
if d != max and d != max2:
if year[d] == c:
if h > MaxSimilarity3:
MaxSimilarity3 = h
max3 = d
max_list.append(max3)
MaxSimilarity4 = 0
for d in year:
i = FavActorFunction(max_films[c], d)
if d != MaxGrossFinder(c):
if d != max and d != max2 and d != max3:
if year[d] == c:
if i > MaxSimilarity4:
MaxSimilarity4 = i
max4 = d
max_list.append(max4)
print "Content-Type: text/html"
print ""
print "<html>"
print "<body>"
print "<table border=1>"
print "<tr>"
print "<th><font color=green>Year</font></th>"
print "<th><font color=blue>Highest Grossing Film</font></th>"
print "<th><font color=red>Most Similar</font></th>"
print "<th><font color=red>2nd Most Similar</font></th>"
print "<th><font color=red>3rd Most Similar</font></th>"
print "<th><font color=red>4th Most Similar</font></th>"
print "</tr>"
for c in sorted(max_films):
print "<tr><th>"
print c
print "<td>"
print max_films[c]
print "</td><td>"
print max_list[0]
print "</td><td>"
print max_list[1]
print "</td><td>"
print max_list[2]
print "</td><td>"
print max_list[3]
print "</td></tr></th>"
我做出来的表格大部分是正确的,但每一行的“最相似”电影都对应到第一个年份[2000]。我该如何修改我的代码,让“最相似”的电影能对应到正确的数据呢?
2 个回答
1
for c in sorted(max_films):
print "<tr><th>"
print c
print "<td>"
print max_films[c]
print "</td><td>"
print max_list[0]
print "</td><td>"
print max_list[1]
print "</td><td>"
print max_list[2]
print "</td><td>"
print max_list[3]
print "</td></tr></th>"
你在每次循环的时候都打印相同的变量 max_list[0:4]。因为这些变量没有改变,所以每次输出的结果自然是一样的。
你需要把判断哪个最相似的逻辑放到循环里面,或者创建一个新的循环,把最相似的结果存到一个列表里,这样在你的循环中就可以取出来使用了。
3
一些建议:
- 你为什么用“actor1 actor2 actor3”而不是
main_actors = []
呢?你可以把字典放里面! - 你也可以用
seq
来代替for c in [2000, …. ]
。 - 最后,你可以用类似 printf 的格式来处理你的字符串(在最后),这样可以写成:
print "<tr><th>%s<td>%s</td><td>%s</td><td>%s</td><td>%s</td><td>%s</td><td></td></tr></th>" % (c, max_films[c], max_list[0], max_list[1], max_list[2], max_list[3])
但我觉得你遇到的实际问题是 HTML:你不能在每一行都用 th。正确的写法是:
<table>
<tr><th>Colname1</th><th>Colname2</th><th>Colname3</th></tr>
<tr><td>value 11</td><td>value 12</td><td>value 13</td></tr>
<tr><td>value 21</td><td>value 22</td><td>value 23</td></tr>
<tr><td>value 31</td><td>value 32</td><td>value 33</td></tr>
</table>
不过我可能在你的脚本上漏掉了什么。