Frequently-Asked Questions

Which Python versions are supported?

pypdf 3.0+ supports Python 3.6 and later. PyPDF2 2.0+ supports Python 3.6 and later. PyPDF2 1.27.10 supported Python 2.7 to 3.10.

Who uses pypdf?

pyPdf is vendored into several projects. That means the code of pyPdf was copied into that project.

Projects that depend on pypdf:

  • Camelot: A Python library to extract tabular data from PDFs

  • edi: Electronic Data Interchange modules

  • amazon-textract-textractor: Analyze documents with Amazon Textract and generate output in multiple formats.

  • maigret: Collect a dossier on a person by username from thousands of sites

  • deda: tracking Dots Extraction, Decoding and Anonymisation toolkit

  • opencanary

  • Document Conversions

  • pdfalyzer: A PDF analysis tool for visualizing the inner tree-like data structure of a PDF in spectacularly large and colorful diagrams as well as scanning the binary streams embedded in the PDF for hidden potentially malicious content.

How do I cite pypdf?

In BibTeX format:

 title         = {The {pypdf} library},
 author        = {Mathieu Fenniak and
                  Matthew Stamy and
                  pubpub-zz and
                  Martin Thoma and
                  Matthew Peveler and
                  exiledkingcc and {pypdf Contributors}},
 year          = {2024},
 url           = {}
 note          = {See for all contributors}

Which License does pypdf use?

pypdf uses the BSD-3-Clause license, see the LICENSE file.