mirror of
https://github.com/kanzure/pdfparanoia.git
synced 2025-02-11 05:03:11 +01:00
![Zooko O'Whielacronx](/assets/img/avatar_default.png)
Also cleaned up some flakes noticed by pyflakes, and make the scrub() be @classmethod instead of @staticmethod so I could use the class for the verbose output. caveats: * there are no unit tests of this patch * now your logs of your stderr have potentially sensitive information in them * the implementation of arg parsing is very low-tech; (a *good* way to do arg parsing is the "argparse" module)
pdfparanoia
pdfparanoia is a PDF watermark removal library for academic papers.
Installing
Simple.
sudo pip install pdfparanoia
or,
sudo python setup.py install
Usage
import pdfparanoia
pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))
file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()
or from the shell,
pdfparanoia input.pdf > output.pdf
and,
cat input.pdf | pdfparanoia > output.pdf
Supported
- AIP
- IEEE
- JSTOR
- SPIE (sort of)
Changelog
- 0.0.12 - SPIE
- 0.0.11 - pdfparanoia command-line interface. Use it by either piping in pdf data, or specifying a path to a pdf in the first argv slot.
- 0.0.10 - JSTOR
- 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
- 0.0.8 - IEEE
License
BSD.
Description
Languages
Python
90.7%
Shell
8.5%
Makefile
0.8%