Simple PDF operations in Python
I wanted to merge several PDFs into one. Doing it by hand is too tedious…
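[alert class=”success”]Laziness is the ladder of human progress, and Python happens to offer a step up.[/alert]
There is a library called pypdf, so this was a good chance to try it out.
[button url=”https://pypdf.readthedocs.io/en/latest/” types=”link”]Official documentation[/button]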
Reading a PDF: page count and page content
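from pypdf import PdfReader  # PyPDF2 is the older name of the same project

reader = PdfReader("c:\\1.pdf")
number_of_pages = len(reader.pages)  # total number of pages
page = reader.pages[0]               # first page (index is zero-based)
text = page.extract_text()           # text content of that page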
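To go through every page rather than only the first, a minimal sketch might look like this. The helper names `page_texts`, `print_summary` and `demo_read` are made up for illustration and are not part of pypdf; the sketch assumes pypdf is installed (`pip install pypdf`) and reuses the `c:\\1.pdf` path from above.

```python
def page_texts(reader):
    """Return one string per page of an already opened PDF.

    `reader` is assumed to be a pypdf PdfReader; any object with a
    `pages` sequence whose items offer `extract_text()` works too.
    """
    # Scanned pages without a text layer usually give an empty string;
    # `or ""` also normalises a None result to an empty string.
    return [page.extract_text() or "" for page in reader.pages]


def print_summary(reader):
    """Print the page count and the first 80 characters of each page."""
    texts = page_texts(reader)
    print(f"{len(texts)} pages")
    for number, text in enumerate(texts, start=1):
        print(f"page {number}: {text[:80]!r}")


def demo_read():
    """Hypothetical entry point; needs pypdf and an existing c:\\1.pdf."""
    # Imported here so the helpers above can be used without pypdf installed.
    from pypdf import PdfReader

    print_summary(PdfReader("c:\\1.pdf"))
```

Calling `demo_read()` prints the page count followed by a short preview of each page.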
Merging PDFs
[alert class=”success”]The official documentation is very brief, and the tutorials online are complicated and out of date.[/alert]
from pypdf import PdfWriter

merger = PdfWriter()
for pdf in ["file1.pdf", "file2.pdf", "file3.pdf"]:
    merger.append(pdf)
merger.write("merged-pdf.pdf")
merger.close()
Adding bookmarks
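# inside the merge loop; count is the zero-based page index where this file starts in the merged PDF
merger.add_outline_item(title=pdf.replace(".pdf",""), page_number=count)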
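The bookmark call needs a `count` that the line above does not define. One way to keep track of it, sketched here under assumptions: the helper names `append_with_bookmarks` and `demo_merge` are hypothetical, `Path(pdf).stem` is swapped in for the `replace(".pdf", "")` trick (it also drops any folder part of the path), and the file names are the ones from the merge example.

```python
from pathlib import Path


def append_with_bookmarks(merger, files):
    """Append each PDF to `merger` and bookmark its first page.

    `merger` is assumed to be a pypdf PdfWriter. Returns the list of
    (title, page_number) pairs that were added, which is handy for checking.
    """
    added = []
    count = 0  # zero-based index of the page where the next file will start
    for pdf in files:
        merger.append(pdf)
        title = Path(pdf).stem  # "file1.pdf" -> "file1"
        # The pages now exist in the writer, so the bookmark can point at them.
        merger.add_outline_item(title=title, page_number=count)
        added.append((title, count))
        count = len(merger.pages)  # total pages merged so far
    return added


def demo_merge():
    """Hypothetical entry point; needs pypdf and the three input files."""
    # Imported here so the helper above can be used without pypdf installed.
    from pypdf import PdfWriter

    merger = PdfWriter()
    append_with_bookmarks(merger, ["file1.pdf", "file2.pdf", "file3.pdf"])
    merger.write("merged-pdf.pdf")
    merger.close()
```

The bookmark is added after `append` so that the target page already exists. pypdf's `append` also accepts an `outline_item` argument that creates a bookmark for the appended file, which may make the manual counter unnecessary; it is worth checking against the installed version.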