PDF - extract several pages out of a large pdf using python
- bdata3
- Jul 19, 2020
- 1 min read
Sometimes you get a very long PDF and in the end, you want only a couple of pages out of it.
us the following code to grab these pages :
from PyPDF2 import PdfFileWriter, PdfFileReader,PdfFileMerger
inputpdf = PdfFileReader(open("/Downloads/הד מטות מסעי תשפ.pdf", "rb"))
for i in range(inputpdf.numPages):
output = PdfFileWriter()
output.addPage(inputpdf.getPage(i))
with open("/Users/allcloud/Documents/tmp/doc-page%s.pdf" % i, "ab") as outputStream:
output.write(outputStream)
merger = PdfFileMerger()
#taking pages 2-4
for i in range(2,5):
merger.append(PdfFileReader("/Documents/tmp/doc-page{}.pdf".format(i), 'rb'))
merger.write("/Documents/tmp/document-page_all.pdf")

Comments