Implementing a popularity algorithm in Django

Question

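I'm building a site similar to reddit and hacker news, with a database of links and votes. I'm implementing hacker news' popularity algorithm and things are going fairly well, but I'm running into trouble when it comes to fetching these links and displaying them. The algorithm itself is very simple: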

Y Combinator's Hacker News:
Popularity = (p - 1) / (t + 2)^1.5

Votes divided by age factor.
Where:

p : votes (points) from users.
t : time since submission in hours.

p is subtracted by 1 to negate submitter's vote.
Age factor is (time since submission in hours plus two) to the power of 1.5.
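As a quick sanity check of the formula above, here is a minimal Python sketch. The function name `popularity` and the `gravity` parameter are illustrative assumptions and not part of the original post; only the formula itself comes from the description above.

```python
def popularity(points: int, hours_since_submission: float, gravity: float = 1.5) -> float:
    """Hacker News style score: (p - 1) / (t + 2) ** gravity.

    `gravity` is a hypothetical name for the exponent (1.5 in the formula above).
    """
    # Subtract 1 to discount the submitter's own vote.
    net_points = points - 1
    # The age factor grows with time, so older links sink.
    age_factor = (hours_since_submission + 2) ** gravity
    return net_points / age_factor


if __name__ == "__main__":
    # Same number of votes, different ages: the newer link scores higher.
    print(popularity(10, 2))
    print(popularity(10, 24))
```

With equal votes, the link submitted more recently gets the higher score, which is the behaviour the ordering in the view needs to reproduce.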

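I asked a very similar question elsewhere, about complex ordering in Django, but this time, instead of weighing too many options, I picked one approach and tried to make it work, because that's how I used to do it with PHP/MySQL. I now know that Django does things quite differently from what I imagined.

My models look roughly like this: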

class Link(models.Model):
    category = models.ForeignKey(Category)
    user = models.ForeignKey(User)
    created = models.DateTimeField(auto_now_add=True)
    modified = models.DateTimeField(auto_now=True)
    fame = models.PositiveIntegerField(default=1)
    title = models.CharField(max_length=256)
    url = models.URLField(max_length=2048)

    def __unicode__(self):
        return self.title

class Vote(models.Model):
    link = models.ForeignKey(Link)
    user = models.ForeignKey(User)
    created = models.DateTimeField(auto_now_add=True)
    modified = models.DateTimeField(auto_now=True)
    karma_delta = models.SmallIntegerField()

    def __unicode__(self):
        return str(self.karma_delta)

And my view:

def index(request):
    popular_links = Link.objects.select_related().annotate(karma_total=Sum('vote__karma_delta'))
    return render_to_response('links/index.html', {'links': popular_links})

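Following on from my previous question, I'd like to implement the algorithm through the ordering functionality. One answer there seemed to suggest that I should put the algorithm in the select and then order by it. I'm going to paginate these results, so I think sorting in Python would mean fetching every row, which isn't appropriate. Any suggestions on how I could do this more efficiently?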

EDIT

This isn't working yet, but I think it's a step in the right direction:

from django.shortcuts import render_to_response
from linkett.apps.links.models import *

def index(request):
    popular_links = Link.objects.select_related()
    popular_links = popular_links.extra(
        select={
            'karma_total': 'SUM(vote.karma_delta)',
            'popularity': '(karma_total - 1) / POW(2, 1.5)',
        },
        order_by=['-popularity']
    )
    return render_to_response('links/index.html', {'links': popular_links})

But it fails with this error:

Caught an exception while rendering: column "karma_total" does not exist
LINE 1: SELECT ((karma_total - 1) / POW(2, 1.5)) AS "popularity", (S...

EDIT 2

Is this error any better?

TemplateSyntaxError: Caught an exception while rendering: missing FROM-clause entry for table "vote"
LINE 1: SELECT ((vote.karma_total - 1) / POW(2, 1.5)) AS "popularity...

My index.html is simple:

{% block content %}

{% for link in links %}
 
  
   karma-up
   {{ link.karma_total }}
   karma-down
  
  {{ link.title }}
  Posted by {{ link.user }} to {{ link.category }} at {{ link.created }}
 
{% empty %}
 No Links
{% endfor %}

{% endblock content %}

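EDIT 3

One step closer to success! These answers are all great, but for now I'm focusing on one particular approach because I feel it fits my situation best.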

from django.db.models import Sum
from django.shortcuts import render_to_response
from linkett.apps.links.models import *

def index(request):
    popular_links = Link.objects.select_related().extra(
        select={
            'popularity': '(SUM(links_vote.karma_delta) - 1) / POW(2, 1.5)',
        },
        tables=['links_link', 'links_vote'],
        order_by=['-popularity'],
    )
    return render_to_response('links/test.html', {'links': popular_links})

When I run this code, I get an error saying I'm missing GROUP BY values. Specifically:

TemplateSyntaxError at /
Caught an exception while rendering: column "links_link.id" must appear in the GROUP BY clause or be used in an aggregate function
LINE 1: ...karma_delta) - 1) / POW(2, 1.5)) AS "popularity", "links_lin...

I don't understand why links_link.id isn't in my GROUP BY, and I'm not sure how to alter the grouping myself; Django usually takes care of that.
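For reference, the shape of query that the error message is asking for can be sketched outside Django. The example below uses Python's built-in sqlite3 module with made-up rows. It is not the SQL Django generates and not a confirmed fix for the view above. The `age_hours` column is a hypothetical stand-in for the time elapsed since `created`, and `build_demo_db` and `popular_page` are illustrative names. The point it shows is that every selected column outside the SUM aggregate is listed in GROUP BY, which is what PostgreSQL requires, and that LIMIT/OFFSET keep the pagination in the database.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a few links and votes (made-up data)."""
    conn = sqlite3.connect(":memory:")
    # SQLite is not guaranteed to ship POW, so register a Python version.
    conn.create_function("POW", 2, lambda base, exponent: base ** exponent)
    conn.executescript(
        """
        CREATE TABLE links_link (
            id INTEGER PRIMARY KEY,
            title TEXT NOT NULL,
            age_hours REAL NOT NULL  -- hypothetical stand-in for now() - created
        );
        CREATE TABLE links_vote (
            id INTEGER PRIMARY KEY,
            link_id INTEGER NOT NULL REFERENCES links_link (id),
            karma_delta INTEGER NOT NULL
        );
        """
    )
    conn.executemany(
        "INSERT INTO links_link (id, title, age_hours) VALUES (?, ?, ?)",
        [(1, "old but loved", 24.0), (2, "fresh", 1.0), (3, "no votes yet", 0.0)],
    )
    conn.executemany(
        "INSERT INTO links_vote (link_id, karma_delta) VALUES (?, ?)",
        [(1, 1)] * 50 + [(2, 1)] * 6,
    )
    return conn


def popular_page(conn: sqlite3.Connection, page: int, per_page: int = 2) -> list:
    """Return one page of (id, title, popularity) rows, most popular first."""
    query = """
        SELECT l.id, l.title,
               (COALESCE(SUM(v.karma_delta), 0) - 1)
                   / POW(l.age_hours + 2, 1.5) AS popularity
        FROM links_link AS l
        LEFT JOIN links_vote AS v ON v.link_id = l.id
        -- Every non-aggregated column in the SELECT list is grouped.
        GROUP BY l.id, l.title, l.age_hours
        ORDER BY popularity DESC
        LIMIT ? OFFSET ?
    """
    return conn.execute(query, (per_page, (page - 1) * per_page)).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    for row in popular_page(conn, page=1):
        print(row)
```

Because the ordering and the LIMIT/OFFSET happen in SQL, only one page of rows reaches Python. The sketch also uses the link's age inside POW, whereas the extra() calls above still use a constant POW(2, 1.5).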

