<p>As an alternative to PyPDF2, I suggest <code>pdftotext</code>:</p>
<pre><code>#!/usr/bin/env python
"""Use pdftotext to extract text from PDFs."""
import pdftotext

# PDFs are binary files, so open in "rb" mode
with open("foobar.pdf", "rb") as f:
    pdf = pdftotext.PDF(f)

# Iterate over all the pages
for page in pdf:
    print(page)
</code></pre>
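<p>If you want the whole document as one string rather than page by page, something like the following might work. It is only a minimal sketch: <code>join_pages</code> and <code>extract_text</code> are hypothetical helper names of my own, not part of <code>pdftotext</code>, and it assumes the <code>PDF</code> object yields one string per page, as in the loop above.</p>

```python
"""Sketch: collect the text of a whole PDF into one string with pdftotext."""


def join_pages(pages, separator="\n\n"):
    """Join per-page strings into one string (hypothetical helper)."""
    return separator.join(pages)


def extract_text(path):
    """Read the PDF at `path` and return all of its text (hypothetical helper)."""
    # Imported lazily so join_pages stays usable without pdftotext installed.
    import pdftotext

    # PDFs are binary files, so open in "rb" mode.
    with open(path, "rb") as f:
        pdf = pdftotext.PDF(f)
        return join_pages(pdf)
```

<p>Calling <code>extract_text("foobar.pdf")</code> should then return the text of all pages separated by blank lines. Keeping the joining step in its own function makes it easy to change the separator or to try it on plain lists of strings.</p>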