1
0
mirror of synced 2024-12-22 20:59:58 +01:00

178 Commits

Author SHA1 Message Date
jvoisin
52a2c800b7 Bump coverage again 2018-07-08 21:50:52 +02:00
jvoisin
ad3e7ccee8 Bump coverage for office files and fix some related crashes 2018-07-08 21:35:45 +02:00
jvoisin
ca01484126 Silence a mypy's stupid warning 2018-07-08 17:12:17 +02:00
jvoisin
f9bc022c96 Add defusedxml as an (optional) way to prevent XML-based attacks
Those attacks are DoS-only.
2018-07-08 17:07:26 +02:00
jvoisin
72e1fda18d Remove a leftover print 2018-07-08 15:19:18 +02:00
jvoisin
3cd4f9111f Bump coverage for torrent handling 2018-07-08 15:13:03 +02:00
jvoisin
b5fcddd6a6 Simplify how torrent files are handled
- Rework the testsuite wrt. torrent
- fail at parser's instantiation on corrupted torrent,
  instead of during `get_meta` or `remove_all` call
2018-07-08 13:49:11 +02:00
jvoisin
7ea362d908 Bump the coverage for pdf 2018-07-07 18:12:33 +02:00
jvoisin
85455a4419 Fix a mistake in office file revisions handling 2018-07-07 18:05:54 +02:00
jvoisin
9f631a1bb1 Bump a bit the coverage 2018-07-07 18:02:53 +02:00
jvoisin
c2ef35d1f1 Bump the changelog 0.1.3 2018-07-06 01:00:14 +02:00
jvoisin
3d80f97524 Simplify BMP handling 2018-07-06 00:49:17 +02:00
jvoisin
53271495f7 Add support for .txt files 2018-07-06 00:42:09 +02:00
jvoisin
0638b9bbbb Document that we do like PEP8 2018-07-02 00:27:38 +02:00
jvoisin
893f58554a Improve a bit the formatting of the code thanks to pyflakes3 2018-07-02 00:22:05 +02:00
jvoisin
11008f8fd4 Improve a bit the README 2018-07-01 23:35:04 +02:00
jvoisin
a430403c7e Document in our implementation notes how revisions are handled and why 2018-07-01 23:27:24 +02:00
jvoisin
bee56a57ce Remove docx revisions 2018-07-01 23:16:14 +02:00
jvoisin
02f7605ac1 MAT2 is now cleaning revisions from odt files! 2018-07-01 21:09:20 +02:00
jvoisin
80fc4ffb40 Remove the thumbnails from libreoffice files 2018-07-01 17:29:05 +02:00
jvoisin
177184ac67 Massively simplify how we're cleaning office files 2018-06-27 21:48:46 +02:00
jvoisin
f44769df41 Ensure Poppler's minimal version
We're using methods that aren't available in Poppler
below 0.46, so we're checking for this upon import.

This commit is based on ideas from @LogicalDash ♥
2018-06-24 22:40:57 +02:00
jvoisin
1e9906de29 Document that we tests against corrupted files 2018-06-22 21:21:03 +02:00
jvoisin
63b19416ef Pyflakes should run on the testsuite too 2018-06-22 21:18:22 +02:00
jvoisin
74f2d50433 Split the testsuite a bit and add more tests 2018-06-22 21:16:55 +02:00
jvoisin
b4ef0c9622 Improve reliability against corrupted image files 2018-06-22 20:38:29 +02:00
jvoisin
dfccf79f22 Bump the changelog 0.1.2 2018-06-21 23:34:12 +02:00
jvoisin
8810564b8e Fix some deprecated directives in the COTNRIBUTING.md file 2018-06-21 23:33:56 +02:00
jvoisin
5b38bd7ccd Improve the reliability of the office parser 2018-06-21 23:18:59 +02:00
jvoisin
846a261465 Fix some linter warnings 2018-06-21 23:07:21 +02:00
jvoisin
09e748fa4c Refactor how offices files are handled
- xml files are no longer considered harmless
- Factorization of the `remove_all` method for office files
- Explicit whitelist are used
- Blacklist are used to skip files completely
  - Non-blacklisted files are _still cleaned_
  - Unsupported files are still triggering an error
2018-06-21 23:02:41 +02:00
jvoisin
a89dae054a Minor simplification of the office-related code 2018-06-21 21:24:53 +02:00
jvoisin
c1f4426612 Improve the threat-model again, thanks to @joe 2018-06-20 00:10:21 +02:00
jvoisin
120c3bf72f Improve a bit our threat model 2018-06-19 23:39:06 +02:00
jvoisin
84277740a9 Add fedora in the CI
refactor
2018-06-19 00:01:28 +02:00
Antoine Tenart
3a776ff1ca README: software is always singular
Fix on small typo. Cosmetic patch.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-18 23:49:54 +02:00
Antoine Tenart
cce5de82e5 libmat2: harmless: add the text/xml mime type
Fedora defines the 'text/xml' mime type for xml files. Adds this mime
type to the harmless parser.

Fixes #36.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 21:34:47 +02:00
Antoine Tenart
484e26dd9c libmat2: audio: add the audio/x-flac mime type
The FLAC parser looks for the 'audio/flac' mime type, but Fedora
defines 'audio/x-flac' in /etc/mime.types for FLAC files. Add this mime
type to the audio parser.

Fixes #36.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 21:34:47 +02:00
Antoine Tenart
3359f36b67 README: fix one typo
Fixes one small typo in the README.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:59:51 +02:00
Antoine Tenart
c19cbc48e2 README: fix the Jessie python3 URL
Fixes the scheme in Jessie's Python3 URL.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:59:03 +02:00
Antoine Tenart
6f8427abf9 README: remove extra empty spaces
This patch removes extra empty spaces at the end of some lines in
README.md. This is a cosmetic patch.

Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:55:22 +02:00
jvoisin
4600ce3490 Improve a bit the coverage 2018-06-10 20:20:45 +02:00
jvoisin
545887af98 Minor code simplification 2018-06-10 20:20:32 +02:00
jvoisin
7dad77a785 Make the parsing of office format's metadata more robust 2018-06-10 20:20:00 +02:00
jvoisin
8c7979aae3 Add some tests for non-supported embedded fileformats 2018-06-10 20:19:35 +02:00
jvoisin
b310a18e69 Add branch coverage to the CI 2018-06-10 01:02:21 +02:00
jvoisin
87bdcd1a95 Improve a bit our coverage wrt. torrent files handling 2018-06-10 00:56:55 +02:00
jvoisin
3c56fa3237 Improve a bit the performances wrt. image's metadata display 2018-06-10 00:43:38 +02:00
jvoisin
9c7aa34f50 Bump a bit the coverage 2018-06-10 00:43:25 +02:00
jvoisin
e81ce6cd1a Fix and add a test for explicitly non-supported formats 2018-06-10 00:28:43 +02:00