1
0
mirror of https://github.com/kanzure/pdfparanoia.git synced 2025-04-13 10:02:05 +02:00
Bryan Bishop 47bc734318 replace_object_with - alternative removal method
Some publishers generate pdfs with the watermarks inside the text of a
page, in which case the object needs to be replaced. This deflates the
object and uses plaintext instead. While this increases the size of the
pdf, it is also effective for removing watermarks from the stream.
2013-02-06 17:27:12 -06:00
2013-02-05 04:49:56 -06:00
2013-02-05 03:10:14 -06:00
2013-02-05 17:24:47 -06:00
2013-02-05 04:17:05 -06:00
2013-02-05 23:44:00 -06:00
2013-02-06 00:03:48 -06:00
2013-02-06 00:03:33 -06:00

pdfparanoia

pdfparanoia is a PDF watermark removal library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

Changelog

  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - ieee support
  • 0.0.1 - initial commit

License

BSD.

Description
No description provided
Readme 2.3 MiB
Languages
Python 90.7%
Shell 8.5%
Makefile 0.8%