mirrors/zstd

mirror of https://github.com/facebook/zstd.git synced 2024-11-27 08:16:45 +08:00

Author	SHA1	Message	Date
Yann Collet	4e1a87976f	update regression results	2024-10-25 16:33:00 -07:00
Yann Collet	c63b5d2294	update regression results	2024-10-24 14:47:55 -07:00
Yann Collet	ea082692aa	added block splitter variant for greedy & lazy (levels 5 to 7)	2024-10-24 13:27:01 -07:00
Yann Collet	70c77d20d6	update regression results first block is no longer splitted since adding the @savings over-split protection	2024-10-24 11:43:16 -07:00
Yann Collet	bbda1acf85	update regression results	2024-10-23 15:56:56 -07:00
Yann Collet	d2eeed53dc	updated compression results due to integration of `sample5` strategy, leading to better compression ratios on a range of levels	2024-10-23 11:50:57 -07:00
Yann Collet	dac26eaeac	updated regression test results	2024-10-23 11:50:57 -07:00
Yann Collet	41d870fbbf	updated regression tests results	2024-10-17 11:06:26 -07:00
Yann Collet	de6cc98e07	fix incorrect pointer manipulation fix #4155	2024-10-01 09:25:26 -07:00
Alex Murkoff	1d5e9705db	fixed memory leaks and almost all undefined behaviour	2024-04-13 02:24:14 +07:00
Yann Collet	887f5b62ae	update compression results for regression tests	2024-02-05 01:27:22 -08:00
Yann Collet	9ae3bf5ee2	update compression results good news: there are only improvements	2024-02-03 17:52:50 -08:00
Nick Terrell	a3c3a38b9b	[lazy] Skip over incompressible data Every 256 bytes the lazy match finders process without finding a match, they will increase their step size by 1. So for bytes [0, 256) they search every position, for bytes [256, 512) they search every other position, and so on. However, they currently still insert every position into their hash tables. This is different from fast & dfast, which only insert the positions they search. This PR changes that, so now after we've searched 2KB without finding any matches, at which point we'll only be searching one in 9 positions, we'll stop inserting every position, and only insert the positions we search. The exact cutoff of 2KB isn't terribly important, I've just selected a cutoff that is reasonably large, to minimize the impact on "normal" data. This PR only adds skipping to greedy, lazy, and lazy2, but does not touch btlazy2. \| Dataset \| Level \| Compiler \| CSize ∆ \| Speed ∆ \| \|---------\|-------\|--------------\|---------\|---------\| \| Random \| 5 \| clang-14.0.6 \| 0.0% \| +704% \| \| Random \| 5 \| gcc-12.2.0 \| 0.0% \| +670% \| \| Random \| 7 \| clang-14.0.6 \| 0.0% \| +679% \| \| Random \| 7 \| gcc-12.2.0 \| 0.0% \| +657% \| \| Random \| 12 \| clang-14.0.6 \| 0.0% \| +1355% \| \| Random \| 12 \| gcc-12.2.0 \| 0.0% \| +1331% \| \| Silesia \| 5 \| clang-14.0.6 \| +0.002% \| +0.35% \| \| Silesia \| 5 \| gcc-12.2.0 \| +0.002% \| +2.45% \| \| Silesia \| 7 \| clang-14.0.6 \| +0.001% \| -1.40% \| \| Silesia \| 7 \| gcc-12.2.0 \| +0.007% \| +0.13% \| \| Silesia \| 12 \| clang-14.0.6 \| +0.011% \| +22.70% \| \| Silesia \| 12 \| gcc-12.2.0 \| +0.011% \| -6.68% \| \| Enwik8 \| 5 \| clang-14.0.6 \| 0.0% \| -1.02% \| \| Enwik8 \| 5 \| gcc-12.2.0 \| 0.0% \| +0.34% \| \| Enwik8 \| 7 \| clang-14.0.6 \| 0.0% \| -1.22% \| \| Enwik8 \| 7 \| gcc-12.2.0 \| 0.0% \| -0.72% \| \| Enwik8 \| 12 \| clang-14.0.6 \| 0.0% \| +26.19% \| \| Enwik8 \| 12 \| gcc-12.2.0 \| 0.0% \| -5.70% \| The speed difference for clang at level 12 is real, but is probably caused by some sort of alignment or codegen issues. clang is significantly slower than gcc before this PR, but gets up to parity with it. I also measured the ratio difference for the HC match finder, and it looks basically the same as the row-based match finder. The speedup on random data looks similar. And performance is about neutral, without the big difference at level 12 for either clang or gcc.	2023-03-20 11:18:29 -07:00
daniellerozenblit	53bad103ce	patch-from speed optimization (#3545 ) * patch-from speed optimization: only load portion of dictionary into normal matchfinders * test regression for x8 multiplier * fix off-by-one error for bit shift bound * restrict patchfrom speed optimization to strategy < ZSTD_btultra * update results.csv * update regression test	2023-03-14 20:36:56 -04:00
Yonatan Komornik	a91e91d614	[Bugfix] row hash tries to match position 0 (#3548 ) #3543 decreases the size of the tagTable by a factor of 2, which requires using the first tag position in each row for head position instead of a tag. Although position 0 stopped being a valid match, it still persisted in mask calculation resulting in the matches loops possibly terminating before it should have. The fix skips position 0 to solve this problem.	2023-03-13 10:00:03 -07:00
Yonatan Komornik	33e39094e7	Reduce RowHash's tag space size by x2 (#3543 ) Allocate half the memory for tag space, which means that we get one less slot for an actual tag (needs to be used for next position index). The results is a slight loss in compression ratio (up to 0.2%) and some regressions/improvements to speed depending on level and sample. In turn, we get to save 16% of the hash table's space (5 bytes per entry instead of 6 bytes per entry).	2023-03-10 14:15:04 -08:00
Yann Collet	ebba9ff425	update regression results	2023-01-03 14:04:23 -08:00
daniellerozenblit	1c818e3a0a	Merge pull request #3302 from daniellerozenblit/optimal-huff-depth-speed Optimal huff depth speed improvements	2023-01-03 12:51:51 -05:00
Danielle Rozenblit	87becc567d	update regression results.csv	2023-01-03 08:41:40 -08:00
Danielle Rozenblit	c26f348dc8	fix CI errors	2022-12-20 12:43:46 -08:00
Danielle Rozenblit	482689b995	huf log speed optimization: unidirectional scan of logs + break when regressing	2022-12-20 12:27:38 -08:00
W. Felix Handte	8927f985ff	Update Copyright Headers 'Facebook' -> 'Meta Platforms' ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora $ -prune -o -type f); do sed -i 's/Facebook, Inc\./Meta Platforms, Inc. and affiliates./' $f; done ```	2022-12-20 12:37:57 -05:00
Elliot Gorokhovsky	bb3c01c853	Migrate other test usages of boolean LDM flag to paramSwitch enum	2022-11-21 16:20:38 -05:00
Danielle Rozenblit	75cd42afd7	Update regression results and better variable naming for HUF_cardinality	2022-10-14 13:37:19 -07:00
Danielle Rozenblit	117fddcd81	Update regression results	2022-10-12 13:49:44 -07:00
W. Felix Handte	1bc8019e10	Update results.csv	2022-05-11 10:27:35 -07:00
Elliot Gorokhovsky	3be9a81e46	Update results.csv	2022-05-04 16:05:37 -04:00
Yann Collet	cc7d23bcec	Merge pull request #2965 from facebook/offbase Converge sumtype (offset \| repcode) numeric representation towards offBase	2022-01-24 15:47:42 -08:00
Yann Collet	5595aec629	updated regression results	2022-01-07 15:08:06 -08:00
Yann Collet	03903f5701	fixed minor compression difference in btlazy2 subtle dependency on sumtype numeric representation	2021-12-29 18:51:03 -08:00
Yann Collet	01adddc3e0	update regression results	2021-12-16 20:43:23 -08:00
Yann Collet	7c7b9244d6	update regression results	2021-12-16 16:07:54 -08:00
Yann Collet	9a32492730	updated regression results.csv	2021-12-16 14:39:30 -08:00
W. Felix Handte	450fca9704	Update Regression Tests w/ New Sizes	2021-12-13 17:29:32 -05:00
Felix Handte	23c1a2d260	Merge pull request #2774 from felixhandte/zstd-dfast-pipelined-single Pipelined Implementation of ZSTD_dfast	2021-10-13 16:38:43 -04:00
Nick Terrell	c6c482fe07	[binary-tree] Fix underflow of nbCompares Fix underflow of `nbCompares` by switching to an `int` and comparing `nbCompares > 0`. This is a minimal fix, because I don't want to change the logic. These loops seem to be doing `nbCompares + 1` comparisons. The bug was reported by Dan Carpenter and found by Smatch static checker. https://lore.kernel.org/all/20211008063704.GA5370@kili/	2021-10-08 13:22:55 -07:00
W. Felix Handte	c2c32839dc	Update results.csv	2021-10-05 16:18:00 -04:00
Sen Huang	9360367371	Update regression test	2021-09-28 08:29:11 -07:00
Sen Huang	b8fd6bf30c	Skip most long matches in lazy hash table update	2021-09-28 08:19:39 -07:00
senhuang42	b5c35d7ea3	Use new paramSwitch enum for LCM, row matchfinder, and block splitter	2021-09-21 14:22:02 -04:00
Nick Terrell	8bf699aa59	[build] Add support for ASM files in Make + CMake * Extract out common portion of `lib/Makefile` into `lib/libzstd.mk`. Most relevantly, the way we find library files. * Use `lib/libzstd.mk` in the other Makefiles instead of repeating the same code. * Add a test `tests/test-variants.sh` that checks that the builds of `make -C programs allVariants` are correct, and run it in Actions. * Adds support for ASM files in the CMake build. The Meson build is not updated because it lists every file in zstd, and supports ASM off the bat, so the Huffman ASM commit will just add the ASM file to the list. The Visual Studios build is not updated because I'm not adding ASM support to Visual Studios yet.	2021-09-17 14:13:53 -07:00
Yann Collet	fd94b9d1c9	Merge branch 'dev' into opt_investigation	2021-09-14 01:15:51 -07:00
Sen Huang	d45d0ad9d8	Update regression test	2021-09-13 12:41:02 -04:00
Yann Collet	b6b2855b80	updated regression tests	2021-09-12 10:22:35 -07:00
Yann Collet	f58e63bee7	Merge branch 'dev' into opt_investigation	2021-09-12 01:42:49 -07:00
Felix Handte	d68aa19a2f	Merge pull request #2749 from felixhandte/zstd-fast-pipelined Pipelined Implementation of ZSTD_fast (~+5% Speed)	2021-09-09 17:05:30 -04:00
Yann Collet	4f0b1b9ee5	update regression tests	2021-09-08 14:37:42 -07:00
Yann Collet	b096a5c626	updated regression tests	2021-09-07 09:55:14 -07:00
Yann Collet	27a8bbe265	new initializer for ll price	2021-09-03 16:07:31 -07:00
Yann Collet	40e44bd56d	updated regression tests	2021-09-01 13:26:39 -07:00

1 2

97 Commits