libdeflate

mirror of https://github.com/cuberite/libdeflate.git synced 2025-09-12 13:58:35 -04:00

Eric Biggers 29dfcfd866 lib/matchfinder: support dynamic dispatch for init and rebase

Currently the optimized implementations of matchfinder_init() and
matchfinder_rebase() are chosen via static dispatch, i.e. at compile
time.  That means that the AVX2 implementations usually aren't used,
since most builds don't enable AVX2 in the compiler flags.

Fix this by using dynamic dispatch, like what libdeflate does for the
Adler-32 and CRC-32 checksums and for DEFLATE decompression.

Based on work by Andrew Steinborn <git@steinborn.me>
(https://github.com/ebiggers/libdeflate/pull/77).  He wrote:

"The main impact is on x86: the AVX2 matchfinder can now be properly
dynamically dispatched at runtime and if -mavx2 is included in CFLAGS
(or -march set to any platform with AVX2 support). On my Ryzen 9 3900X,
I got an approximately 1% boost in deflate time (measured with an
uncompressed tarball of the Silesia corpus) using just the changes in
this PR and the regular CFLAGS, and a 2.7% boost when specifying -mavx2
as CFLAGS. (I also tested with an Intel Xeon Skylake c5.large EC2
instance, and did not see any performance regression)."
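
The difference between static and dynamic dispatch described above can be
sketched roughly as follows.  This is a minimal illustration, not the code
from this commit: every name here (pos_t, INIT_VAL, init_generic,
init_avx2_standin, cpu_has_avx2, example_matchfinder_init) is hypothetical,
the "AVX2" variant is a plain-C stand-in, and the CPU check assumes the
GCC/Clang builtin __builtin_cpu_supports() on x86.

```c
#include <stddef.h>
#include <stdint.h>

typedef int16_t pos_t;                 /* hypothetical table entry type */
#define INIT_VAL ((pos_t)INT16_MIN)    /* hypothetical "empty" sentinel */

typedef void (*init_func_t)(pos_t *table, size_t count);

/* Portable fallback: always available on every CPU. */
static void init_generic(pos_t *table, size_t count)
{
	for (size_t i = 0; i < count; i++)
		table[i] = INIT_VAL;
}

/*
 * Stand-in for an AVX2 variant.  A real one would be compiled with AVX2
 * enabled for this function only and would use vector stores; here it
 * simply reuses the generic loop so the sketch builds anywhere.
 */
static void init_avx2_standin(pos_t *table, size_t count)
{
	init_generic(table, count);
}

/* Runtime CPU check (assumption: GCC/Clang builtin, x86 only). */
static int cpu_has_avx2(void)
{
#if (defined(__GNUC__) || defined(__clang__)) && \
	(defined(__x86_64__) || defined(__i386__))
	return __builtin_cpu_supports("avx2");
#else
	return 0;
#endif
}

static void init_dispatch(pos_t *table, size_t count);

/* Starts out pointing at the dispatcher; replaced on the first call. */
static init_func_t init_impl = init_dispatch;

/* First call: choose an implementation, cache it, then run it. */
static void init_dispatch(pos_t *table, size_t count)
{
	init_func_t f = cpu_has_avx2() ? init_avx2_standin : init_generic;

	init_impl = f;
	f(table, count);
}

/* Public entry point: always goes through the cached pointer. */
void example_matchfinder_init(pos_t *table, size_t count)
{
	init_impl(table, count);
}
```

With static dispatch, the choice between the two variants would instead be
made by the preprocessor (for example, on whether the compiler defines
__AVX2__), so a binary built with default flags would never take the AVX2
path even on a CPU that supports it.  The sketch does not address
concurrent first calls from multiple threads.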

2020-10-28 19:20:53 -07:00

arm

lib/matchfinder: support dynamic dispatch for init and rebase

2020-10-28 19:20:53 -07:00

x86

lib/matchfinder: support dynamic dispatch for init and rebase

2020-10-28 19:20:53 -07:00

adler32_vec_template.h

…

adler32.c

…

bt_matchfinder.h

lib/matchfinder: simplify init and rebase

2020-10-25 22:42:25 -07:00

cpu_features_common.h

…

crc32_table.h

…

crc32_vec_template.h

…

crc32.c

…

decompress_template.h

…

deflate_compress.c

lib/matchfinder: simplify init and rebase

2020-10-25 22:42:25 -07:00

deflate_compress.h

…

deflate_constants.h

…

deflate_decompress.c

…

gzip_compress.c

…

gzip_constants.h

…

gzip_decompress.c

…

hc_matchfinder.h

lib/matchfinder: simplify init and rebase

2020-10-25 22:42:25 -07:00

lib_common.h

…

matchfinder_common.h

lib/matchfinder: support dynamic dispatch for init and rebase

2020-10-28 19:20:53 -07:00

unaligned.h

…

utils.c

…

zlib_compress.c

…

zlib_constants.h

…

zlib_decompress.c

…