Add a note about why we do clean PDF in a completely overkill way

2025-06-29 01:57:53 +02:00 · 2018-04-03 21:45:05 +02:00 · 2018-04-03 21:45:05 +02:00 · cd8f1a55b1
commit cd8f1a55b1
parent e8e3ab6c86
1 changed files with 4 additions and 0 deletions
--- a/doc/implementation_notes.md
+++ b/doc/implementation_notes.md
@ -25,6 +25,10 @@ handle PDF. But apparently, people are ok with [pdf redact
 tools](https://github.com/firstlookmedia/pdf-redact-tools), that simply
 transform the PDF into images. So this is what's MAT2 is doing too.

+Of course, it would be possible to detect images in PDf file, and process them
+with MAT2, but since a PDF can contain a lot of things, like images, videos,
+javascript, pdf, blobs, … this is the easiest and safest way to clean them.
+
 Images handling
 ---------------