Go to file
Zooko O'Whielacronx 9204b2e17e fix up verbose printouts, don't print out large data 2013-02-13 20:56:33 +00:00
bin add a "--verbose" option that writes to stderr if it finds anything to omit 2013-02-13 19:58:47 +00:00
pdfparanoia fix up verbose printouts, don't print out large data 2013-02-13 20:56:33 +00:00
tests SPIE watermark removal 2013-02-11 23:52:59 -06:00
.gitignore add certain pdfs to .gitignore 2013-02-06 17:34:38 -06:00
MANIFEST.in include README.md via MANIFEST.in 2013-02-05 04:17:05 -06:00
Makefile add swap files to make clean 2013-02-06 17:39:42 -06:00
README.md SPIE watermark removal 2013-02-11 23:52:59 -06:00
requirements.txt create requirements.txt 2013-02-06 00:03:48 -06:00
setup.py pdfparanoia command-line interface 2013-02-09 09:44:48 -06:00

README.md

pdfparanoia

pdfparanoia is a PDF watermark removal library for academic papers.

Installing

Simple.

sudo pip install pdfparanoia

or,

sudo python setup.py install

Usage

import pdfparanoia

pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))

file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()

or from the shell,

pdfparanoia input.pdf > output.pdf

and,

cat input.pdf | pdfparanoia > output.pdf

Supported

  • AIP
  • IEEE
  • JSTOR
  • SPIE (sort of)

Changelog

  • 0.0.12 - SPIE
  • 0.0.11 - pdfparanoia command-line interface. Use it by either piping in pdf data, or specifying a path to a pdf in the first argv slot.
  • 0.0.10 - JSTOR
  • 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
  • 0.0.8 - IEEE

License

BSD.