1
0
mirror of https://github.com/kanzure/pdfparanoia.git synced 2025-02-11 13:13:10 +01:00
Bryan Bishop 8eb8797eeb support pdf formats with whitespace line endings
JSTOR pdfs have whitespace at the end of each line in their pdfs. Though
their watermarks are not yet removable, this supports parsing their
files in the future or any other publisher that does similar things.

see #1
2013-02-05 19:07:28 -06:00
2013-02-05 04:49:56 -06:00
2013-02-05 03:10:14 -06:00
2013-02-05 17:24:47 -06:00
2013-02-05 04:17:05 -06:00
2013-02-05 17:21:58 -06:00
2013-02-05 04:29:16 -06:00

pdfparanoia

pdfparanoia is a PDF watermark remover library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Changelog

  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - ieee support
  • 0.0.1 - initial commit

License

BSD.

Description
No description provided
Readme 2.3 MiB
Languages
Python 90.7%
Shell 8.5%
Makefile 0.8%