libdeflate

mirror of https://github.com/cuberite/libdeflate.git synced 2025-09-08 11:50:00 -04:00

Author	SHA1	Message	Date
cielavenir	9b565afd99	Fix ICC compilation - crc32: On ICC, __v2di is defined in immintrin.h - adler32.c: __v64qi etc are not available on ICC	2021-05-06 23:10:58 -07:00
Eric Biggers	83a1bbf1d3	lib: consistently use include guards A lot of the internal library headers don't have include guards because they aren't needed. It might look like a bug, though, and it doesn't hurt to add them. So do this. Update https://github.com/ebiggers/libdeflate/issues/117	2021-03-12 00:07:30 -08:00
Eric Biggers	ff8634427b	lib/matchfinder: simplify init and rebase Remove the ability of matchfinder_init() and matchfinder_rebase() to fail due to the matchfinder memory size being misaligned. Instead, require that the size always be 128-byte aligned -- which is already the case. Also, make the matchfinder memory always be 32-byte aligned -- which doesn't really have any downside.	2020-10-25 22:42:25 -07:00
Eric Biggers	ef936b6521	lib/x86/adler32: use unsigned vector types This is needed to avoid the following error when using -fsanitize=undefined with gcc: lib/x86/adler32_impl.h:214:2: runtime error: signed integer overflow: 1951294680 + 1956941400 cannot be represented in type 'int' Note that this isn't seen when using -fsanitize=undefined with clang. Old compilers don't have unsigned vector types, so work around that.	2020-10-18 15:14:15 -07:00
Eric Biggers	5729095d2d	lib/cpu_features: support disabling CPU features for testing Make test-only builds of libdeflate support an environmental variable LIBDEFLATE_DISABLE_CPU_FEATURES that contains a list of CPU features to disable like "avx512bw,avx2,sse2". This makes it possible to test all the variants of dynamically dispatched code without editing the source code. Note, this environmental variable is not a stable interface, so put the support for it behind a scary-looking option TEST_SUPPORT__DO_NOT_USE.	2020-10-05 00:35:19 -07:00
Eric Biggers	f23fd6ca7f	lib/x86/cpu_features: rename PCLMULQDQ feature bit to PCLMUL This is less unwieldy and is consistent with "DISPATCH_PCLMUL" and with the "-mno-pclmul" compiler flag.	2020-10-05 00:35:19 -07:00
Eric Biggers	82037908c7	lib/x86/cpu_features: add missing earlyclobber constraint for cpuid on i386 In cpuid() in the '__i386__ && __PIC__' case, the second output operand is written to before the input operands are used. So the second output operand needs the earlyclobber constraint.	2020-10-04 23:17:56 -07:00
Eric Biggers	a735fa830f	lib, programs: remove all unnecessary 'extern' keywords 'extern' on function declarations is redundant.	2020-04-17 21:27:56 -07:00
Eric Biggers	2a2e24dc8b	lib: fix some typos in comments	2019-08-24 17:38:50 -07:00
Eric Biggers	73017f08e5	lib/x86/adler32: add an AVX-512BW optimized Adler32 implementation	2018-12-24 17:36:07 -06:00
Eric Biggers	4548033845	lib/x86/cpu_features: detect AVX-512BW support	2018-12-24 17:36:07 -06:00
Eric Biggers	1fb34f86b5	lib: add template for vectorized CRC-32 implementations	2018-02-18 23:03:26 -08:00
Eric Biggers	1617206086	lib/x86: allow choosing adler32_sse2() at runtime Now that we detect CPU features on 32-bit x86, allow the SSE2 implementation of Adler-32 to be selected at runtime based on the presence of the SSE2 feature.	2018-02-18 23:03:26 -08:00
Eric Biggers	0d1260be99	lib/x86: allow CPU feature detection on 32-bit x86 The SSE2, AVX2, BMI2, etc. code actually works on 32-bit x86 if the CPU has those features. So there is no need to restrict it to x86_64-only.	2018-02-18 23:03:26 -08:00
Eric Biggers	58978af429	lib: make CPU feature masks and dispatch pointers volatile Use 'volatile' for the CPU feature masks and dispatched function pointers. We don't need memory barriers for them, so 'volatile' is good enough to stop the compiler from inserting bogus reads/writes.	2018-02-18 23:03:26 -08:00
Eric Biggers	4829a5add2	lib: refactor architecture-specific code Move the x86 and ARM-specific code into their own directories to prevent it from cluttering up the main library. This will make it a bit easier to add new architecture-specific code. But to avoid complicating things too much for people who aren't using the provided Makefile, we still just compile all .c files for all architectures (irrelevant ones end up #ifdef'ed out), and the headers are included explicitly for each architecture so that an architecture-specific include path isn't needed. So, now people just need to compile both lib/*.c and lib/*/*.c instead of only lib/*.c.	2018-02-18 23:03:26 -08:00

16 Commits
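
Several of the commits above (1617206086, 58978af429) concern choosing an Adler-32 implementation at runtime through a volatile function pointer. The following is a minimal sketch of that general pattern in portable C, not libdeflate's actual code. Every name in it (`adler32_func_t`, `adler32_generic`, `adler32_deferred`, `cpu_has_hypothetical_feature`, `adler32_example`) is hypothetical, and the feature probe is a stub standing in for real CPU detection such as cpuid.

```c
#include <stddef.h>
#include <stdint.h>

#define ADLER_MOD   65521u  /* largest prime below 2^16 */
#define ADLER_CHUNK 5552u   /* max bytes before the 32-bit sums could overflow */

typedef uint32_t (*adler32_func_t)(uint32_t adler,
                                   const unsigned char *p, size_t n);

/* Straightforward implementation: reduce after every byte. */
static uint32_t adler32_generic(uint32_t adler,
                                const unsigned char *p, size_t n)
{
    uint32_t s1 = adler & 0xFFFF;
    uint32_t s2 = adler >> 16;

    for (size_t i = 0; i < n; i++) {
        s1 = (s1 + p[i]) % ADLER_MOD;
        s2 = (s2 + s1) % ADLER_MOD;
    }
    return (s2 << 16) | s1;
}

/* Alternative implementation: defer the modulo to once per chunk.
 * It stands in for an optimized variant; the result is identical. */
static uint32_t adler32_deferred(uint32_t adler,
                                 const unsigned char *p, size_t n)
{
    uint32_t s1 = adler & 0xFFFF;
    uint32_t s2 = adler >> 16;

    while (n > 0) {
        size_t chunk = n < ADLER_CHUNK ? n : ADLER_CHUNK;

        n -= chunk;
        while (chunk--) {
            s1 += *p++;
            s2 += s1;
        }
        s1 %= ADLER_MOD;
        s2 %= ADLER_MOD;
    }
    return (s2 << 16) | s1;
}

/* Hypothetical feature probe (assumption): a real library would query
 * the CPU here. This stub always reports the feature as present. */
static int cpu_has_hypothetical_feature(void)
{
    return 1;
}

static uint32_t adler32_dispatch(uint32_t adler,
                                 const unsigned char *p, size_t n);

/* The pointer is volatile so the compiler performs exactly the reads
 * and writes written here. It starts out pointing at the dispatcher. */
static volatile adler32_func_t adler32_impl = adler32_dispatch;

/* First call: pick an implementation, cache it, then forward the call. */
static uint32_t adler32_dispatch(uint32_t adler,
                                 const unsigned char *p, size_t n)
{
    adler32_func_t f = adler32_generic;

    if (cpu_has_hypothetical_feature())
        f = adler32_deferred;
    adler32_impl = f;
    return f(adler, p, n);
}

/* Public entry point: every call goes through the cached pointer. */
uint32_t adler32_example(uint32_t adler, const unsigned char *p, size_t n)
{
    return adler32_impl(adler, p, n);
}
```

Calling `adler32_example(1, (const unsigned char *)"abc", 3)` returns 0x024D0127 regardless of which implementation the dispatcher selects. Only the first call pays for the selection, because later calls read the cached pointer directly.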