jvoisin
b4edd6d2a2
Document that MAT2 not being able to detect metadata doesn't mean that the file is clean
2018-07-09 00:17:59 +02:00
jvoisin
bd357b85f8
Remove a useless option that was never implemented anyway
2018-07-09 00:13:16 +02:00
jvoisin
8c21006e6c
Fix some pep8 issues spotted by pyflakes
2018-07-08 22:40:36 +02:00
jvoisin
f49aa5cab7
Achieve 100% coverage!
2018-07-08 22:27:37 +02:00
jvoisin
52a2c800b7
Bump coverage again
2018-07-08 21:50:52 +02:00
jvoisin
ad3e7ccee8
Bump coverage for office files and fix some related crashes
2018-07-08 21:35:45 +02:00
jvoisin
ca01484126
Silence a mypy's stupid warning
2018-07-08 17:12:17 +02:00
jvoisin
f9bc022c96
Add defusedxml as an (optional) way to prevent XML-based attacks
...
Those attacks are DoS-only.
2018-07-08 17:07:26 +02:00
jvoisin
72e1fda18d
Remove a leftover print
2018-07-08 15:19:18 +02:00
jvoisin
3cd4f9111f
Bump coverage for torrent handling
2018-07-08 15:13:03 +02:00
jvoisin
b5fcddd6a6
Simplify how torrent files are handled
...
- Rework the testsuite wrt. torrent
- fail at parser's instantiation on corrupted torrent,
instead of during `get_meta` or `remove_all` call
2018-07-08 13:49:11 +02:00
jvoisin
7ea362d908
Bump the coverage for pdf
2018-07-07 18:12:33 +02:00
jvoisin
85455a4419
Fix a mistake in office file revisions handling
2018-07-07 18:05:54 +02:00
jvoisin
9f631a1bb1
Bump a bit the coverage
2018-07-07 18:02:53 +02:00
jvoisin
c2ef35d1f1
Bump the changelog
2018-07-06 01:00:14 +02:00
jvoisin
3d80f97524
Simplify BMP handling
2018-07-06 00:49:17 +02:00
jvoisin
53271495f7
Add support for .txt files
2018-07-06 00:42:09 +02:00
jvoisin
0638b9bbbb
Document that we do like PEP8
2018-07-02 00:27:38 +02:00
jvoisin
893f58554a
Improve a bit the formatting of the code thanks to pyflakes3
2018-07-02 00:22:05 +02:00
jvoisin
11008f8fd4
Improve a bit the README
2018-07-01 23:35:04 +02:00
jvoisin
a430403c7e
Document in our implementation notes how revisions are handled and why
2018-07-01 23:27:24 +02:00
jvoisin
bee56a57ce
Remove docx revisions
2018-07-01 23:16:14 +02:00
jvoisin
02f7605ac1
MAT2 is now cleaning revisions from odt files!
2018-07-01 21:09:20 +02:00
jvoisin
80fc4ffb40
Remove the thumbnails from libreoffice files
2018-07-01 17:29:05 +02:00
jvoisin
177184ac67
Massively simplify how we're cleaning office files
2018-06-27 21:48:46 +02:00
jvoisin
f44769df41
Ensure Poppler's minimal version
...
We're using methods that aren't available in Poppler
below 0.46, so we're checking for this upon import.
This commit is based on ideas from @LogicalDash ♥
2018-06-24 22:40:57 +02:00
jvoisin
1e9906de29
Document that we tests against corrupted files
2018-06-22 21:21:03 +02:00
jvoisin
63b19416ef
Pyflakes should run on the testsuite too
2018-06-22 21:18:22 +02:00
jvoisin
74f2d50433
Split the testsuite a bit and add more tests
2018-06-22 21:16:55 +02:00
jvoisin
b4ef0c9622
Improve reliability against corrupted image files
2018-06-22 20:38:29 +02:00
jvoisin
dfccf79f22
Bump the changelog
2018-06-21 23:34:12 +02:00
jvoisin
8810564b8e
Fix some deprecated directives in the COTNRIBUTING.md file
2018-06-21 23:33:56 +02:00
jvoisin
5b38bd7ccd
Improve the reliability of the office parser
2018-06-21 23:18:59 +02:00
jvoisin
846a261465
Fix some linter warnings
2018-06-21 23:07:21 +02:00
jvoisin
09e748fa4c
Refactor how offices files are handled
...
- xml files are no longer considered harmless
- Factorization of the `remove_all` method for office files
- Explicit whitelist are used
- Blacklist are used to skip files completely
- Non-blacklisted files are _still cleaned_
- Unsupported files are still triggering an error
2018-06-21 23:02:41 +02:00
jvoisin
a89dae054a
Minor simplification of the office-related code
2018-06-21 21:24:53 +02:00
jvoisin
c1f4426612
Improve the threat-model again, thanks to @joe
2018-06-20 00:10:21 +02:00
jvoisin
120c3bf72f
Improve a bit our threat model
2018-06-19 23:39:06 +02:00
jvoisin
84277740a9
Add fedora in the CI
...
refactor
2018-06-19 00:01:28 +02:00
Antoine Tenart
3a776ff1ca
README: software is always singular
...
Fix on small typo. Cosmetic patch.
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-18 23:49:54 +02:00
Antoine Tenart
cce5de82e5
libmat2: harmless: add the text/xml mime type
...
Fedora defines the 'text/xml' mime type for xml files. Adds this mime
type to the harmless parser.
Fixes #36 .
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 21:34:47 +02:00
Antoine Tenart
484e26dd9c
libmat2: audio: add the audio/x-flac mime type
...
The FLAC parser looks for the 'audio/flac' mime type, but Fedora
defines 'audio/x-flac' in /etc/mime.types for FLAC files. Add this mime
type to the audio parser.
Fixes #36 .
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 21:34:47 +02:00
Antoine Tenart
3359f36b67
README: fix one typo
...
Fixes one small typo in the README.
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:59:51 +02:00
Antoine Tenart
c19cbc48e2
README: fix the Jessie python3 URL
...
Fixes the scheme in Jessie's Python3 URL.
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:59:03 +02:00
Antoine Tenart
6f8427abf9
README: remove extra empty spaces
...
This patch removes extra empty spaces at the end of some lines in
README.md. This is a cosmetic patch.
Signed-off-by: Antoine Tenart <antoine.tenart@ack.tf>
2018-06-12 18:55:22 +02:00
jvoisin
4600ce3490
Improve a bit the coverage
2018-06-10 20:20:45 +02:00
jvoisin
545887af98
Minor code simplification
2018-06-10 20:20:32 +02:00
jvoisin
7dad77a785
Make the parsing of office format's metadata more robust
2018-06-10 20:20:00 +02:00
jvoisin
8c7979aae3
Add some tests for non-supported embedded fileformats
2018-06-10 20:19:35 +02:00
jvoisin
b310a18e69
Add branch coverage to the CI
2018-06-10 01:02:21 +02:00