Go to file
Bryan Bishop 25195f9b11 add swap files to make clean 2013-02-06 17:39:42 -06:00
pdfparanoia jstor watermark removal 2013-02-06 17:33:00 -06:00
tests jstor watermark removal 2013-02-06 17:33:00 -06:00
.gitignore add certain pdfs to .gitignore 2013-02-06 17:34:38 -06:00
MANIFEST.in include README.md via MANIFEST.in 2013-02-05 04:17:05 -06:00
Makefile add swap files to make clean 2013-02-06 17:39:42 -06:00
README.md jstor watermark removal 2013-02-06 17:33:00 -06:00
requirements.txt create requirements.txt 2013-02-06 00:03:48 -06:00
setup.py remove pdfquery from requirements 2013-02-06 00:03:33 -06:00

README.md

pdfparanoia

pdfparanoia is a PDF watermark removal library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Supported

  • AIP
  • IEEE
  • JSTOR

Changelog

  • 0.0.10 - JSTOR
  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - IEEE

License

BSD.