1
0
mirror of synced 2024-11-05 08:58:52 +01:00
mat2 is a metadata removal tool, supporting a wide range of commonly used file formats, written in python3: at its core, it's a library, used by an eponymous command-line interface, and a nautilus extension.
Go to file
2018-07-01 23:35:04 +02:00
data Add an svg version of the logo 2018-05-15 21:48:38 +02:00
doc Document in our implementation notes how revisions are handled and why 2018-07-01 23:27:24 +02:00
libmat2 Remove docx revisions 2018-07-01 23:16:14 +02:00
nautilus Add a prototype for the nautilus extension 2018-04-15 13:59:44 +02:00
tests Remove docx revisions 2018-07-01 23:16:14 +02:00
.gitignore Improve a bit the gitignore file 2018-05-21 22:50:36 +02:00
.gitlab-ci.yml MAT2 is now cleaning revisions from odt files! 2018-07-01 21:09:20 +02:00
CHANGELOG.md Bump the changelog 2018-06-21 23:34:12 +02:00
CONTRIBUTING.md Document that we tests against corrupted files 2018-06-22 21:21:03 +02:00
LICENSE Add LICENSE 2018-03-31 07:00:14 -07:00
mat2 Ensure Poppler's minimal version 2018-06-24 22:40:57 +02:00
README.md Improve a bit the README 2018-07-01 23:35:04 +02:00
setup.py Bump the changelog 2018-06-21 23:34:12 +02:00

 _____ _____ _____ ___
|     |  _  |_   _|_  |  Keep you data,
| | | |     | | | |  _|     trash your meta!
|_|_|_|__|__| |_| |___|

This software is currently in beta, please don't use it for anything critical.

Metadata and privacy

Metadata consist of information that characterizes data. Metadata are used to provide documentation for data products. In essence, metadata answer who, what, when, where, why, and how about every facet of the data that are being documented.

Metadata within a file can tell a lot about you. Cameras record data about when a picture was taken and what camera was used. Office documents like PDF or Office automatically adds author and company information to documents and spreadsheets. Maybe you don't want to disclose those information on the web.

This is precisely the job of MAT2: getting rid, as much as possible, of metadata.

Requirements

  • python3-mutagen for audio support
  • python3-gi-cairo and gir1.2-poppler-0.18 for PDF support
  • gir1.2-gdkpixbuf-2.0 for images support
  • libimage-exiftool-perl for everything else

Please note that MAT2 requires at least Python3.5, meaning that it doesn't run on Debian Jessie,

Running the test suite

$ python3 -m unittest discover -v

How to use MAT2

usage: mat2 [-h] [-v] [-l] [-c | -s | -L] [files [files ...]]

Metadata anonymisation toolkit 2

positional arguments:
  files

optional arguments:
  -h, --help         show this help message and exit
  -v, --version      show program's version number and exit
  -l, --list         list all supported fileformats
  -c, --check        check if a file is free of harmful metadatas
  -s, --show         list all the harmful metadata of a file without removing
                     them
  -L, --lightweight  remove SOME metadata

Related software

Contact

If possible, use the issues system. If you think that a more private contact is needed (eg. for reporting security issues), you can email Julien (jvoisin) Voisin at julien.voisin+mat@dustri.org, using the gpg key 9FCDEE9E1A381F311EA62A7404D041E8171901CC.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with this program. If not, see http://www.gnu.org/licenses/.

Copyright 2018 Julien (jvoisin) Voisin julien.voisin+mat2@dustri.org

Thanks

MAT2 wouldn't exist without:

Many thanks to them!