782 Commits

Author SHA1 Message Date
Marcus Holland-Moritz
ce9216f581 Add unreadable files as empty files (fixes gh #40) 2022-11-16 12:30:17 +01:00
Marcus Holland-Moritz
7af9e83ca6 Default FUSE driver debuglevel to warn in background mode (fixes gh #113) 2022-11-15 08:35:14 +01:00
Marcus Holland-Moritz
3b8bb3e7da Fix linking against compression libs (fixes github #112) 2022-11-11 23:37:17 +01:00
Marcus Holland-Moritz
fad64db3e2 Reorder mkdwarfs options 2022-11-11 23:12:38 +01:00
Marcus Holland-Moritz
dc247018a0 Fix usage 2022-11-08 23:10:01 +01:00
Marcus Holland-Moritz
6168c5dcb4 Update mkdwarfs usage 2022-11-08 23:08:39 +01:00
Marcus Holland-Moritz
03019e4a73 Document how to produce bit-identical images 2022-11-08 21:54:57 +01:00
Marcus Holland-Moritz
ee39c3eef7 Update internal operation section of mkdwarfs manpage 2022-11-08 21:54:36 +01:00
Marcus Holland-Moritz
f95844a35d Add more documentation details for --file-hash option 2022-11-08 21:52:53 +01:00
Marcus Holland-Moritz
0e488b958d Test image reproducibility for path and similarity ordering 2022-11-08 21:50:19 +01:00
Marcus Holland-Moritz
340e4d9ed2 Add wip branch to travis config 2022-11-08 14:18:03 +01:00
Marcus Holland-Moritz
c743feb3cd Update change log v0.7.0-RC1 2022-11-08 13:43:31 +01:00
Marcus Holland-Moritz
3a3dbc9e5f Update folly/fbthrift 2022-11-08 13:11:34 +01:00
Marcus Holland-Moritz
46dd7a2385 Fix libfmt handling in CMakeLists.txt 2022-11-08 12:56:33 +01:00
Marcus Holland-Moritz
2f65559407 Enable --with-devices and --with-specials when using --input-list 2022-11-08 12:45:02 +01:00
Marcus Holland-Moritz
43e5a3bcdc Add simple test for --input-list 2022-11-08 12:40:30 +01:00
Marcus Holland-Moritz
a326f533d5 Support extracting corrupted file systems (fixes github #51)
This adds two new options `--continue-on-error` and
`--disable-integrity-check` to `dwarfsextract` that allow
extracting data from corrupted images.
2022-11-08 11:56:10 +01:00
Marcus Holland-Moritz
8550c47873 Add offset to section description 2022-11-08 11:45:06 +01:00
Marcus Holland-Moritz
18a065bb88 Try to dump file system information even if errors were encountered 2022-11-08 11:44:06 +01:00
Marcus Holland-Moritz
f231ce0878 Add some filter tests 2022-11-08 10:08:02 +01:00
Marcus Holland-Moritz
87fd512df7 Prettier time formatting 2022-11-07 10:48:20 +00:00
Marcus Holland-Moritz
eb8803d6df Add --input-list option to pass in a list of files 2022-11-07 10:48:20 +00:00
Marcus Holland-Moritz
ff5f99f3d9 Add --max-similarity-size option 2022-11-06 14:32:14 +01:00
Marcus Holland-Moritz
21fc4c9524 Consolidate tool header code 2022-11-06 11:36:04 +01:00
Marcus Holland-Moritz
a14fa38a0d Reintroduce --num-scanner-workers 2022-11-06 10:52:02 +01:00
Marcus Holland-Moritz
c6a6ed4f8f Support lz4 compression levels 10..12 2022-11-06 10:20:53 +01:00
Marcus Holland-Moritz
50fa3c8374 Update .travis.yml 2022-11-06 09:57:59 +01:00
Marcus Holland-Moritz
841dcf17ac Static build tweaks 2022-11-06 07:49:40 +00:00
Marcus Holland-Moritz
12b2e35f05 Add support for Brotli compression (fixes github #76) 2022-11-06 07:49:40 +00:00
Marcus Holland-Moritz
87271666ac Autogenerate compression type code 2022-11-05 22:23:58 +00:00
Marcus Holland-Moritz
03829e6da4 First step at making compression code more modular 2022-11-05 22:23:58 +00:00
Marcus Holland-Moritz
c2e3cdfecb Factor out file_scanner 2022-11-05 22:23:58 +00:00
Marcus Holland-Moritz
b41a400e32 Update module versions 2022-11-05 22:23:58 +00:00
Marcus Holland-Moritz
5ab9cbb3c4 Simplify libfmt setup and add override to use system library 2022-10-30 10:35:09 +01:00
Marcus Holland-Moritz
356120c058 Doc fix 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
1215a30f78 Support for simple filter rules (potential fix for github #6) 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
c2f00d78c3 Log full path when skipping files in scanner 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
aeeddaecab Honour user locale when formatting numbers 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
0ed3c933bf Some new TODOs 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
6920df7334 Improved deduplication algorithm
Instead of hashing all files unconditionally, the new algorithm first
checks if there are multiple files of the same size. Files with a
unique size cannot have duplicates and so don't have to be hashed at
all.
2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
17567e009a Improved checks for openssl digest functions 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
b1db6470df Fix spelling 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
5256cf09ae Rename lookup tables in file scanner 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
6a6fe94228 Simplify original_size update in file::scan() 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
b001d9f28e Progress unit tests & fixes 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
148de5bf0d Add --file-hash option (fixes github #92)
This does not yet address the issue that uniquely sized files are
unnecessarily hashed, which is also mentioned in #92. This will be
addressed separately.
2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
482a40560e Add inode ordering test (for image reproducibility, see gh #91) 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
c5ac04347c Rename cache tidying functions to be less ambiguous 2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
b59a48c2b6 Replace use of boost posix_time with {fmt}
Only issue is that in order to properly format fractional seconds,
we need a bleeding edge version of {fmt}.
2022-10-29 18:54:31 +02:00
Marcus Holland-Moritz
3053140e5c boost::system::system_error -> std::system_error 2022-10-29 18:54:31 +02:00