1
0
mirror of https://github.com/kanzure/pdfparanoia.git synced 2024-06-08 20:07:49 +02:00
Go to file
Bryan Bishop 5c8a194445 deflation tool to help with debugging
The deflate function expands some of the FlateDecode streams in a pdf
file. The output of the deflate function is not always correct and it is
very buggy. Still, this is a useful tool to poke around in foreign pdfs
under investigation.
2013-02-07 20:51:10 -06:00
pdfparanoia deflation tool to help with debugging 2013-02-07 20:51:10 -06:00
tests jstor watermark removal 2013-02-06 17:33:00 -06:00
.gitignore add certain pdfs to .gitignore 2013-02-06 17:34:38 -06:00
Makefile add swap files to make clean 2013-02-06 17:39:42 -06:00
MANIFEST.in include README.md via MANIFEST.in 2013-02-05 04:17:05 -06:00
README.md jstor watermark removal 2013-02-06 17:33:00 -06:00
requirements.txt create requirements.txt 2013-02-06 00:03:48 -06:00
setup.py remove pdfquery from requirements 2013-02-06 00:03:33 -06:00

pdfparanoia

pdfparanoia is a PDF watermark removal library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Supported

  • AIP
  • IEEE
  • JSTOR

Changelog

  • 0.0.10 - JSTOR
  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - IEEE

License

BSD.