pdfparanoia

Mirror of https://github.com/kanzure/pdfparanoia.git, synced 2025-03-25 17:10:56 +01:00

Author	SHA1	Message	Date
Zooko O'Whielacronx	503b8aead5	add -v -v mode which prints out the details (potentially sensitive, potentially bulky); remove spie, which appears to do nothing	2013-02-13 21:08:49 +00:00
Zooko O'Whielacronx	9204b2e17e	fix up verbose printouts, don't print out large data	2013-02-13 20:56:33 +00:00
Zooko O'Whielacronx	56cc7719da	add a "--verbose" option that writes to stderr if it finds anything to omit. Also cleaned up some flakes noticed by pyflakes, and make the scrub() be @classmethod instead of @staticmethod so I could use the class for the verbose output. Caveats: (1) there are no unit tests of this patch; (2) now your logs of your stderr have potentially sensitive information in them; (3) the implementation of arg parsing is very low-tech (a good way to do arg parsing is the "argparse" module).	2013-02-13 19:58:47 +00:00
Bryan Bishop	caed396870	SPIE watermark removal. This is slightly broken because the SPIE plugin removes more than just watermarks. For some reason it seems to also remove images and large blocks of text from the paper. However, the object that is being removed is tiny. In the unit testing sample, the removed object is pdf stream 55. For now, SPIE is partially disabled until this is fixed. The problem does not originate from the other plugins. Fixes #20.	2013-02-11 23:52:59 -06:00
Bryan Bishop	9d7fd1dbb6	README: add command-line usage	2013-02-10 01:29:58 -06:00
Bryan Bishop	775b927b42	pdfparanoia command-line interface	2013-02-09 09:44:48 -06:00
Bryan Bishop	5c8a194445	deflation tool to help with debugging. The deflate function expands some of the FlateDecode streams in a pdf file. The output of the deflate function is not always correct and it is very buggy. Still, this is a useful tool to poke around in foreign pdfs under investigation.	2013-02-07 20:51:10 -06:00
Bryan Bishop	e108a43e26	make eraser handle more pdf formats	2013-02-07 03:56:18 -06:00
Bryan Bishop	f3d8475c79	better import formatting for core.py	2013-02-07 03:55:57 -06:00
Bryan Bishop	25195f9b11	add swap files to make clean	2013-02-06 17:39:42 -06:00
Bryan Bishop	11abd551d7	add certain pdfs to .gitignore	2013-02-06 17:34:38 -06:00
Bryan Bishop	b7b5a4ef65	jstor watermark removal. Fixes #1.	2013-02-06 17:33:00 -06:00
Bryan Bishop	47bc734318	replace_object_with - alternative removal method. Some publishers generate pdfs with the watermarks inside the text of a page, in which case the object needs to be replaced. This deflates the object and uses plaintext instead. While this increases the size of the pdf, it is also effective for removing watermarks from the stream.	2013-02-06 17:27:12 -06:00
Bryan Bishop	57b6aba099	create requirements.txt	2013-02-06 00:03:48 -06:00
Bryan Bishop	4f0208963d	remove pdfquery from requirements	2013-02-06 00:03:33 -06:00
Bryan Bishop	2711fc174b	README: minor wording change	2013-02-05 23:44:00 -06:00
Bryan Bishop	8eb8797eeb	support pdf formats with whitespace line endings. JSTOR pdfs have whitespace at the end of each line in their pdfs. Though their watermarks are not yet removable, this supports parsing their files in the future or any other publisher that does similar things. See #1.	2013-02-05 19:07:28 -06:00
Bryan Bishop	bc89bc5335	clean repo before uploading to pypi	2013-02-05 17:24:47 -06:00
Bryan Bishop	30c6e30891	version bump to 0.0.9	2013-02-05 17:21:58 -06:00
Bryan Bishop	f78aad78ef	AIP: better false-positives check	2013-02-05 17:20:11 -06:00
Bryan Bishop	d276954bfa	IEEE: remove print statement (oops)	2013-02-05 17:19:37 -06:00
Bryan Bishop	6ef9a19ff6	README: update changelog	2013-02-05 04:56:18 -06:00
Bryan Bishop	14f1439c76	ieee watermark removal	2013-02-05 04:49:56 -06:00
Bryan Bishop	0adec6c74e	better plugin support	2013-02-05 04:34:45 -06:00
Bryan Bishop	aa3e188ef4	more setup.py madness	2013-02-05 04:29:16 -06:00
Bryan Bishop	302d2ff2e5	include plugins in package	2013-02-05 04:20:26 -06:00
Bryan Bishop	99285252fc	include README.md via MANIFEST.in	2013-02-05 04:17:05 -06:00
Bryan Bishop	011c10c5c4	more setup.py magic	2013-02-05 04:14:08 -06:00
Bryan Bishop	99c88643f7	fix README packaging	2013-02-05 03:56:07 -06:00
Bryan Bishop	d8fc6c1d8f	initial commit	2013-02-05 03:10:14 -06:00
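
Commit 5c8a194445 describes a debugging tool that expands FlateDecode streams. The following is only a minimal sketch of that general idea, not the project's actual code: the function name `inflate_streams` and the regular expression are assumptions made for illustration, and real PDFs have cases this does not handle (filter chains, predictors, stream lengths given by indirect objects).

```python
import re
import zlib

# Assumption: each stream body sits between a "stream" keyword followed by
# an end-of-line and an "endstream" keyword preceded by an end-of-line.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\nendstream", re.DOTALL)


def inflate_streams(pdf_bytes):
    """Return (offset, data, inflated) for every stream found in pdf_bytes.

    `inflated` is True when zlib could decompress the stream body; otherwise
    the raw bytes are returned unchanged and `inflated` is False.
    """
    results = []
    for match in STREAM_RE.finditer(pdf_bytes):
        raw = match.group(1)
        try:
            results.append((match.start(), zlib.decompress(raw), True))
        except zlib.error:
            # Not a plain zlib stream (or a damaged one): keep the raw bytes.
            results.append((match.start(), raw, False))
    return results
```

Streams that fail to decompress are reported rather than raised, which suits a tool meant for poking around in unfamiliar files.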

30 Commits
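
Commit 56cc7719da notes that the argument parsing is low-tech and suggests the "argparse" module, and commit 503b8aead5 adds a "-v -v" mode. A hedged sketch of how a counted verbosity flag could look with argparse follows. The helper names `build_parser` and `report` are hypothetical and are not taken from pdfparanoia.

```python
import argparse
import sys


def build_parser():
    """Build a parser with a repeatable -v/--verbose flag (hypothetical)."""
    parser = argparse.ArgumentParser(
        description="Sketch of a CLI with counted verbosity."
    )
    # action="count" increments the value each time the flag appears,
    # so "-v" yields 1 and "-v -v" (or "-vv") yields 2.
    parser.add_argument(
        "-v", "--verbose", action="count", default=0,
        help="report findings on stderr; repeat for full details",
    )
    return parser


def report(verbosity, summary, details):
    """Write diagnostics to stderr according to the verbosity level."""
    if verbosity >= 1:
        sys.stderr.write(summary + "\n")
    if verbosity >= 2:
        # Details may be sensitive or bulky, so they need the second -v.
        sys.stderr.write(details + "\n")
```

Keeping diagnostics on stderr leaves stdout free for program output, and requiring a second flag for details reflects the caveat in the commit message about sensitive information ending up in logs.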