如何优化我的SQLAlchemy实例？

Question

我正在使用SQLAlchemy来获取大量数据。简单来说，我们有一个“交付”表，这个表通过多个其他表相连，而这些表又可以连接到更多的表。我需要从这些相连的表中获取数据，以便创建一个交付的JSON文件。

目前，我访问这些数据的方式非常慢。有没有人能给我一些建议（无论是具体的还是一般性的）来提高性能？下面是我访问数据的代码，我不想把所有的表定义都贴出来，因为它们比较大。所有表的定义都是标准的，没有设置懒加载之类的。

# Get the deliveries for the specified date
    delivery_rs = session.query(Delivery).join(Order) \
        .filter(and_(Delivery.DespatchDateTime.between(start_date, end_date), Order.ProductionSite == site_map.get(site))).all()
    # Setup our rowcount for the metadata later
    rowcount = 0

    # Go through each delivery in the resultset, formulate the full job/delivery/client/customer data and add it to the data array
    for delivery in delivery_rs:
        # Add to our rowcount
        rowcount = rowcount + 1

        # Add the jobs to our job array
        job_deliveries = delivery.JobDeliveries
        jobs = []
        quantity = 0
        for job_delivery in job_deliveries:
            job = job_delivery.Job
            web_ref = job.ClientJobReference
            if web_ref and not re.match(r'^CCW_', web_ref):
                web_ref = "" 
            elif web_ref:
                web_ref = re.sub(r'^CCW_', '', web_ref)
            jobs.append({
                "web_ref":      "CCW_{}".format(web_ref) if web_ref else "",
                "name":         job.JobName,
                # The artwork is stored in S3, so provide a link
                "thumbnail":    "https://example.com/{}.png".format(web_ref) if web_ref else ""
            })

            quantity = quantity + job_delivery.Quantity

        # Format our delivery data
        if delivery.AddressContact:
            address_contact = delivery.AddressContact
            contact_data =  {
                        "title":    title_map.get(address_contact.Title),
                        "name":     address_contact.ContactName,
                        "email":    address_contact.ContactEmail,
                        "phone":    address_contact.ContactNumber
                    }
        else:
            contact_data = {}
        
        order = delivery.Order
        client = order.Client
        delivery_method = delivery.DeliveryMethod
        address = delivery.Address
        
        result["data"].append(
            {
                "order_number": order.OrderSequenceId,
                "quantity":     quantity,
                "method":       delivery_method.Name,
                "client":       client.Name,
                "end_client":   client.EndCustomer,
                "jobs":         jobs,
                "contact":      contact_data,
                "address": {
                    "business": address.BusinessName,
                    "postcode": address.PostCode,
                    "town":     address.Town,
                    "county":   address.County,
                    "country":  address.Country.Name,
                    "lines": [
                        address.AddressLine1,
                        address.AddressLine2
                    ]
                }
            }
        )

我还没有尝试太多，因为网上有很多关于优化SQLAlchemy的信息，但我不太确定在我的情况下什么方法会有效。收集大约100个交付的数据需要15秒（大多数交付都有多个工作）。我尝试过把每个表都连接到查询上，但结果没有返回任何数据。

sqlalchemy 数据提取数据库优化懒加载关系型数据库查询性能 json生成表连接

如何优化我的SQLAlchemy实例？

1 个回答

撰写回答