I wanted to merge several PDFs into one. Doing it by hand is too tedious…

[alert class=”success”]Laziness is the ladder of human progress, and Python happens to provide a step.[/alert]

There is a PyPDF library, so this is a good chance to try it out.

[button url=”https://pypdf.readthedocs.io/en/latest/” types=”link”]Official documentation[/button]

Reading a PDF: page count and page content

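# PyPDF2 is the older, deprecated package name; the linked docs cover pypdf
from pypdf import PdfReader
reader = PdfReader("c:\\1.pdf")
number_of_pages = len(reader.pages)  # total page count
page = reader.pages[0]               # first page (index is 0-based)
text = page.extract_text()           # text content of that page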

Merging PDFs

[alert class=”success”]The official documentation is terse, and the tutorials online are complicated and out of date.[/alert]

from pypdf import PdfWriter
merger = PdfWriter()
# Append each input file in the order listed
for pdf in ["file1.pdf", "file2.pdf", "file3.pdf"]:
    merger.append(pdf)
# Write the combined document, then release resources
merger.write("merged-pdf.pdf")
merger.close()
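
Typing every file name by hand still counts as manual work, so here is a rough sketch that merges every PDF in a folder. The helper names `collect_pdfs` and `merge_folder` are made up for this example, and sorting by file name is only an assumption about the order you want.

```python
from pathlib import Path


def collect_pdfs(folder):
    """Return the PDF files directly inside `folder`, sorted by file name."""
    return sorted(
        (p for p in Path(folder).iterdir()
         if p.is_file() and p.suffix.lower() == ".pdf"),
        key=lambda p: p.name.lower(),  # case-insensitive name order (an assumption)
    )


def merge_folder(folder, output_path):
    """Merge every PDF found by collect_pdfs() into a single file."""
    # Imported here so collect_pdfs() works even without pypdf installed.
    from pypdf import PdfWriter

    merger = PdfWriter()
    for pdf in collect_pdfs(folder):
        merger.append(str(pdf))
    merger.write(str(output_path))
    merger.close()
```

Something like `merge_folder("c:\\pdfs", "c:\\merged-pdf.pdf")` would then do the whole job. It is safer to write the output outside the input folder, otherwise a second run would pick up the merged file as well.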

Adding bookmarks

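# Inside the merge loop: pdf is the current file name; count is assumed to be the 0-based index of that file's first page in the merged output
merger.add_outline_item(title=pdf.replace(".pdf",""), page_number=count)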
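
The one-liner above only makes sense inside the merge loop, so here is a fuller sketch under a couple of assumptions: `count` is taken to be the 0-based index of each file's first page in the merged output, and the helper names `bookmark_plan` and `merge_with_bookmarks` are invented for this example. It uses `Path(...).stem` instead of `replace(".pdf", "")` to drop the extension.

```python
from pathlib import Path


def bookmark_plan(page_counts):
    """Turn (file name, page count) pairs into (title, first page index) pairs."""
    plan = []
    count = 0  # 0-based index of the next file's first page in the merged PDF
    for filename, pages in page_counts:
        plan.append((Path(filename).stem, count))
        count += pages
    return plan


def merge_with_bookmarks(paths, output_path):
    """Merge PDFs in order and add one top-level bookmark per input file."""
    # Imported here so bookmark_plan() works even without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    page_counts = [(p, len(PdfReader(p).pages)) for p in paths]
    merger = PdfWriter()
    for pdf in paths:
        merger.append(pdf)
    # Each bookmark points at the first page of its source file.
    for title, count in bookmark_plan(page_counts):
        merger.add_outline_item(title=title, page_number=count)
    merger.write(output_path)
    merger.close()
```

For three files with 3, 2 and 5 pages, the bookmarks land on pages 0, 3 and 5 of the merged document. A call such as `merge_with_bookmarks(["file1.pdf", "file2.pdf", "file3.pdf"], "merged-pdf.pdf")` should then produce one bookmark per file.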