Converting a PDF to a binary file with Python (Django)

response = HttpResponse(content_type="application/pdf")
response["Content-Disposition"] = "inline; filename=a_test_document.pdf"
p = canvas.Canvas(response)
p.drawString(100, 500, "Hello world")
p.showPage()
p.save()
return response

2 answers

User

Answer 1 · edited 2024-04-29 07:34:27

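To generate a PDF, you can use the xhtml2pdf library.

The render_to_pdf_response function returns a response object; you only need to pass it the template name, the context data, and pdfname.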

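import os

from django.conf import settings
from django.http import HttpResponse
from django.template.loader import get_template
from xhtml2pdf import pisa


class UnsupportedMediaPathException(Exception):
    """Not defined in the original snippet; a minimal definition so the code runs."""


def fetch_resources(uri, rel):
    """
    Callback to allow xhtml2pdf/reportlab to retrieve images, stylesheets, etc.
    `uri` is the href attribute from the HTML link element.
    `rel` gives a relative path, but it's not used here.
    """
    if uri.startswith(settings.MEDIA_URL):
        path = os.path.join(settings.MEDIA_ROOT,
                            uri.replace(settings.MEDIA_URL, ""))
    elif uri.startswith(settings.STATIC_URL):
        path = os.path.join(settings.STATIC_ROOT,
                            uri.replace(settings.STATIC_URL, ""))
    else:
        # Neither prefix matched: try STATIC_ROOT first, then MEDIA_ROOT.
        path = os.path.join(settings.STATIC_ROOT,
                            uri.replace(settings.STATIC_URL, ""))

        if not os.path.isfile(path):
            path = os.path.join(settings.MEDIA_ROOT,
                                uri.replace(settings.MEDIA_URL, ""))

            if not os.path.isfile(path):
                # The checks above compare URL prefixes, so report those.
                raise UnsupportedMediaPathException(
                    'media urls must start with %s or %s' % (
                        settings.MEDIA_URL, settings.STATIC_URL))

    return path


def render_to_pdf_response(template_name, context=None, pdfname='test.pdf'):
    # `content_type` replaces the `mimetype` argument, which was removed in Django 1.7.
    file_object = HttpResponse(content_type='application/pdf')
    file_object['Content-Disposition'] = 'attachment; filename=%s' % pdfname
    template = get_template(template_name)
    # Templates returned by get_template() are rendered with a plain dict,
    # not a Context instance, in current Django.
    html = template.render(context or {})
    # Write the rendered PDF straight into the response object.
    pisa.CreatePDF(html.encode("UTF-8"), file_object, encoding='UTF-8',
                   link_callback=fetch_resources)
    return file_object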

Installation instructions are here: https://pypi.python.org/pypi/xhtml2pdf/

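User

Answer 2 · edited 2024-04-29 07:34:27

It looks like you are trying to update an existing PDF rather than simply create a new one. In that case, this answer may be what you are looking for. To summarize its solution: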

Read your PDF using PdfFileReader(); we'll call this input.
Create a new PDF containing the text to add using ReportLab, and save it as a string object.
Read the string object using PdfFileReader(); we'll call this text.
Create a new PDF object using PdfFileWriter(); we'll call this output.
Iterate through input and apply .mergePage(text.getPage(0)) to each page you want the text added to, then use output.addPage() to add the modified pages to a new document.

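On the other hand, if you are not sure of the file type of the binary you receive (unlikely in your example, but worth mentioning), you can use a library called python-magic. Here is an untested example: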

In [2]: import magic
In [3]: m = magic.Magic(mime=True)
In [4]: m.from_file('/home/culebron/Documents/chapter2.pdf')
Out[4]: 'application/pdf'

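Based on the final output, you can determine:

Whether the file is a PDF.
If it is, how to apply the changes you want or merge them into the current PDF document.
If it is not, how to write the content to a canvas.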
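
If installing python-magic is not an option, a rougher check is possible with the standard library alone, since PDF files begin with the marker %PDF-. The sketch below is only a quick heuristic under that assumption, not a validation of the file. The names PDF_SIGNATURE, looks_like_pdf and file_looks_like_pdf are hypothetical.

```python
PDF_SIGNATURE = b"%PDF-"


def looks_like_pdf(data):
    """Return True if `data` (bytes) starts with the PDF header marker."""
    return data[:len(PDF_SIGNATURE)] == PDF_SIGNATURE


def file_looks_like_pdf(path):
    """Read only the first few bytes of the file at `path` and check them."""
    with open(path, "rb") as f:
        return looks_like_pdf(f.read(len(PDF_SIGNATURE)))
```

For example, looks_like_pdf(request.FILES["upload"].read(5)) could guard an upload view, where "upload" is a placeholder field name.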
