Django查询忽略预期结果

2024-06-01 06:34:01 发布

您现在位置:Python中文网/ 问答频道 /正文

我必须在一个有点复杂的建模的大型数据库上执行查询,我将尝试在下面对其进行简化:

class ScreeningItem(models.Model):
    # other fields
    receivedItem = models.OneToOneField(ReceivedItem, null=True, on_delete=models.SET_NULL)

class ReceivedItem(models.Model):
    # other fields
    dossier = models.ForeignKey(Dossier, null=True, on_delete=models.SET_NULL)

class Dossier(models.Model):
    # other fields
    subjects = models.ManyToManyField('SubjectTypes', through='Subjects',
                                      through_fields=('dossier', 'subjectType'))

class Subject(models.Model):
    main = models.BooleanField(null=True)
    dossier = models.ForeignKey(Dossier, null=True, on_delete=models.SET_NULL)
    subjectType =  models.ForeignKey(SubjectType, null=True, on_delete=models.SET_NULL)

class SubjectType(models.Model):
    # other fields
    name = models.CharField(max_length=255, null=True, blank=True)
    parent = models.ForeignKey('self', null=True, on_delete=models.SET_NULL)

现在的问题是,我必须在ScreeningItem表中找到远相关字段SubjectType.name包含特定单词时的项。不,更糟。正如您在下面看到的,在该模型中有一个父子自引用,我必须在相关的SujectType、它的父级和它的祖父母中查找这些特定的单词,如果它们存在的话

我的尝试:

exp = 'something'
queryset = ScreeningItem.objects.filter(
    Q(receivedItem__dossier__subjects__subjecttype__name__iregex=exp) |     
    Q(receivedItem__dossier__subjects__subjecttype__parent__name__iregex=exp) |     
    Q(receivedItem__dossier__subjects__subjecttype__parent__parent__name__iregex=exp))     

然而,当我收到一些远低于我预期的记录时,我检查了数据库,惊讶地发现有许多ScreeningItem有一个ReceivedItem有一个Dossier与我正在搜索的SubjectTypes相关

不幸的是,这里不允许我透露内容。因此,我在下面编写了测试例程:

def test():
    exp = 'something'  # valid and equal both for Python and MySQL regular expression engines
    re_exp = re.compile(exp, re.IGNORECASE)
    queryset_1 = ScreeningItem.objects.filter(
        Q(receivedItem__dossier__subjects__subjecttype__name__iregex=exp) |     
        Q(receivedItem__dossier__subjects__subjecttype__parent__name__iregex=exp) |     
        Q(receivedItem__dossier__subjects__subjecttype__parent__parent__name__iregex=exp))     
    set_1 = set(queryset_1.values_list('id', flat=True))
    print(len(set_1))

    queryset_2 = GnomoItemTriagem.objects.filter(receivedItem__dossier__isnull=False)
    set_2a = set()
    set_2b = set()
    for item in queryset_2:
        subjects = item.receivedItem.dossier.subjects
        if subjects.filter(
                Q(name__iregex=exp) |
                Q(parent__name__iregex=exp) |
                Q(parent__parent__name__iregex=exp)).count() > 0:
            set_2a.add(item.id)

        for subject in subjects.all():
            if re_exp.findall(subject.name) or\
                (subject.parent and re_exp.findall(subject.parent.name)) or \
                    (subject.parent and subject.parent.parent and re_exp.findall(subject.parent.parent.name)):
                set_2b.add(item.id)

    print(len(set_2a))
    print(len(set_2b))

然后我的结果是

1596
21223
21223

那么,我的第一个查询应该如何编写,才能同时返回所有21223个所需的项目呢?我做错了什么


1条回答
网友
1楼 · 发布于 2024-06-01 06:34:01

由于subjectsSubjectType的多对多字段,它已经“着陆”在该模型上。您可以查询另一个__subjecttype的原因是它正在“反向”中访问parentForeignKey

因此,您的查询应该如下所示:

queryset = ScreeningItem.objects.filter(
    Q(receivedItem__dossier__subjects__name__iregex=exp) |     
    Q(receivedItem__dossier__subjects__parent__name__iregex=exp) |     
    Q(receivedItem__dossier__subjects__parent__parent__name__iregex=exp)
)

它没有出错的原因是parent关系没有related_name。这意味着parent关系的默认related_name_querysubjecttype。因此,您可以进行一个查询,在其中查找ScreeningItemreceivedItem以及dossierSubjectType一个名为查询的{}子项,或子项的父项,等等。子项部分因此出错

相关问题 更多 >