1
0
mirror of https://github.com/kanzure/pdfparanoia.git synced 2024-12-04 23:15:52 +01:00
Go to file
Bryan Bishop 8eb8797eeb support pdf formats with whitespace line endings
JSTOR pdfs have whitespace at the end of each line in their pdfs. Though
their watermarks are not yet removable, this supports parsing their
files in the future or any other publisher that does similar things.

see #1
2013-02-05 19:07:28 -06:00
pdfparanoia support pdf formats with whitespace line endings 2013-02-05 19:07:28 -06:00
tests ieee watermark removal 2013-02-05 04:49:56 -06:00
.gitignore initial commit 2013-02-05 03:10:14 -06:00
Makefile clean repo before uploading to pypi 2013-02-05 17:24:47 -06:00
MANIFEST.in include README.md via MANIFEST.in 2013-02-05 04:17:05 -06:00
README.md version bump to 0.0.9 2013-02-05 17:21:58 -06:00
setup.py more setup.py madness 2013-02-05 04:29:16 -06:00

pdfparanoia

pdfparanoia is a PDF watermark remover library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Changelog

  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - ieee support
  • 0.0.1 - initial commit

License

BSD.