1
0
mirror of https://github.com/kanzure/pdfparanoia.git synced 2024-06-01 16:48:05 +02:00
Go to file
Bryan Bishop 47bc734318 replace_object_with - alternative removal method
Some publishers generate pdfs with the watermarks inside the text of a
page, in which case the object needs to be replaced. This deflates the
object and uses plaintext instead. While this increases the size of the
pdf, it is also effective for removing watermarks from the stream.
2013-02-06 17:27:12 -06:00
pdfparanoia replace_object_with - alternative removal method 2013-02-06 17:27:12 -06:00
tests ieee watermark removal 2013-02-05 04:49:56 -06:00
.gitignore initial commit 2013-02-05 03:10:14 -06:00
Makefile clean repo before uploading to pypi 2013-02-05 17:24:47 -06:00
MANIFEST.in include README.md via MANIFEST.in 2013-02-05 04:17:05 -06:00
README.md README: minor wording change 2013-02-05 23:44:00 -06:00
requirements.txt create requirements.txt 2013-02-06 00:03:48 -06:00
setup.py remove pdfquery from requirements 2013-02-06 00:03:33 -06:00

pdfparanoia

pdfparanoia is a PDF watermark removal library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Changelog

  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - ieee support
  • 0.0.1 - initial commit

License

BSD.