mirror of
https://github.com/kanzure/pdfparanoia.git
synced 2024-12-04 23:15:52 +01:00
8eb8797eeb
JSTOR pdfs have whitespace at the end of each line in their pdfs. Though their watermarks are not yet removable, this supports parsing their files in the future or any other publisher that does similar things. see #1 |
||
---|---|---|
pdfparanoia | ||
tests | ||
.gitignore | ||
Makefile | ||
MANIFEST.in | ||
README.md | ||
setup.py |
pdfparanoia
pdfparanoia is a PDF watermark remover library for academic papers.
Installing
Simple.
sudo pip install pdfparanoia
or,
sudo python setup.py install
Usage
import pdfparanoia
pdf = pdfparanoia.scrub(open("nmat91417.pdf", "rb"))
file_handler = open("output.pdf", "wb")
file_handler.write(pdf)
file_handler.close()
Changelog
- 0.0.9 - AIP: better checks for false-positives; IEEE: remove stdout garbage.
- 0.0.8 - ieee support
- 0.0.1 - initial commit
License
BSD.