mirrors/zstd

mirror of https://github.com/facebook/zstd.git synced 2024-11-24 07:16:49 +08:00

Author	SHA1	Message	Date
Yann Collet	f98c69d77c	fix : huge (>4GB) stream of blocks experimental function ZSTD_compressBlock() is designed for very small data in mind, for situation where saving the ~12 bytes of frame header can actually make a difference. Some systems though may have to deal with small and large data entangled. If it's larger than a block (> 128KB), compressBlock() cannot compress them in one round. That's why it's possible to compress in multiple rounds. This is a chain of compressed blocks. Some users push this capability to the limit, encoding gigantic chain of blocks. On crossing the 4GB limit, some internal overflow occurs. This fix moves the overflow correction mechanism higher in the call chain, so that it's applied also to gigantic chains of blocks. Added a test case in fuzzer.c, which crashes before the fix, and pass now.	2018-09-26 14:24:28 -07:00
Yann Collet	8ff17a6a09	Merge pull request #1329 from facebook/v04isout Changed default legacy support to v0.5+	2018-09-26 13:39:05 -07:00
Yann Collet	08f68d83c5	fixed usage of grep in Makefile when terminal uses colors as suggested by @danielshir (#1294)	2018-09-25 16:56:53 -07:00
Yann Collet	04f47bbdd2	Merge branch 'dev' into adapt	2018-09-24 16:56:45 -07:00
Yann Collet	9bb6c15f79	Merge pull request #1332 from facebook/minclevel defined a minimum negative level	2018-09-24 16:01:13 -07:00
Yann Collet	292d8e4a83	added some tests based on limits.h in order to ensure proper type mapping when not using stdint.h	2018-09-23 23:57:30 -07:00
Yann Collet	71a5210617	avoid recompiling dll every time under mingw	2018-09-21 17:40:30 -07:00
Yann Collet	c484345a82	Merge branch 'mingw' into adapt	2018-09-21 16:00:46 -07:00
Yann Collet	bfff4f4809	ensure all writes to job->cSize are mutex protected even when reporting errors, using a macro for code brevity, as suggested by @terrelln,	2018-09-21 16:00:39 -07:00
Yann Collet	32b7cf1bcf	fixed tautological tests involving ZSTD_TARGETLENGTH_MIN (== 0)	2018-09-21 15:04:43 -07:00
Yann Collet	c044345f8f	Merge branch 'mingw' into minclevel	2018-09-21 14:56:57 -07:00
Yann Collet	de6c75e4e5	Merge pull request #1318 from felixhandte/shadow-dict-matches Don't Search Dictionary Context When Working Context Search Resulted in Mismatch	2018-09-21 12:15:33 -07:00
Yann Collet	a54c86cfc6	defined a minimum negative level which can be probed using new function ZSTD_minCLevel(). Also : redefined ZSTD_TARGETLENGTH_MIN/MAX for consistency used the opportunity to bump version number to v1.3.6	2018-09-20 16:52:03 -07:00
Yann Collet	b2939163e1	Changed default legacy support to v0.5+ thus dropping read support for v0.4. It's always possible to re-enable it, by changing build macro ZSTD_LEGACY_SUPPORT to 4.	2018-09-20 14:30:20 -07:00
Yann Collet	7992942d66	fixed complex tsan issue when job->consumed == job->src.size , compression job is presumed completed, so it must be the very last action done in worker thread.	2018-09-20 13:47:31 -07:00
Yann Collet	6b07a66aec	fixed minor reporting discrepancy in MT mode	2018-09-19 16:30:55 -07:00
Yann Collet	ca02ebee07	removed static variables so that --adapt can work on multiple input files too	2018-09-19 15:25:50 -07:00
Yann Collet	89bc309d90	error out when --adapt is associated with --single-thread since they are not compatible	2018-09-19 14:49:13 -07:00
Yann Collet	2f78228f65	Merge branch 'dev' into adapt	2018-09-19 12:43:42 -07:00
Yann Collet	005f000aed	updated documentation of *refPrefix() indicating the equivalence with `diff` operation.	2018-09-18 13:07:08 -07:00
ko-zu	18b4a1da61	Fix clang build Fix dixygen comment Fix clang binary path	2018-09-16 10:27:02 +09:00
Yann Collet	7269fe6cd3	minor code comment update	2018-09-14 16:06:35 -07:00
Yann Collet	0403148315	Merge pull request #1295 from felixhandte/hdr-intro-comment-negative-lvls Proposed Update to Zstd.h Introduction Comment	2018-09-14 15:29:19 -07:00
W. Felix Handte	b76c888497	ZSTD_dfast: Don't Search Dict Context When Mismatch Was Found	2018-09-14 15:24:25 -07:00
W. Felix Handte	b048af5999	ZSTD_fast: Don't Search Dict Context When Mismatch Was Found	2018-09-14 15:23:35 -07:00
Yann Collet	0e5b447aaa	Merge pull request #1316 from facebook/coldDict Cold dictionary mitigation	2018-09-14 10:37:46 -07:00
Yann Collet	5512400677	updated code comments, based on @terrelln review	2018-09-13 16:44:04 -07:00
Yann Collet	d195eec97e	fixed msan error cold dictionary is detected through a comparison with dictEnd, which was not initialized at the beginning of first DCtx usage.	2018-09-13 12:29:52 -07:00
Yann Collet	674dd21bd0	final parameter tuning	2018-09-12 17:25:34 -07:00
Yann Collet	419dfd4ea3	clean traces	2018-09-12 16:40:28 -07:00
Yann Collet	2618253da2	fixed PREFETCH() macro for corner cases and platforms without this instruction	2018-09-12 16:15:37 -07:00
Yann Collet	44d3b83bb1	conditional dict content prefetching based on nbSeq.	2018-09-12 15:35:21 -07:00
Yann Collet	5fb5ed3b31	adjust heuristic decisions	2018-09-12 12:32:09 -07:00
Nick Terrell	f6daddf2db	Also allow x86	2018-09-12 12:05:32 -07:00
Nick Terrell	1e0bac6a9c	[libzstd] Fix cpu for MSFT ARM The `__cpuid()` and `__cpuidex()` intrinsics are only available on x86 and x86_64.	2018-09-12 10:35:16 -07:00
Yann Collet	4de344d505	added conditional prefetch depending on amount of work to do.	2018-09-12 10:29:47 -07:00
Yann Collet	63a519dbf6	implemented first prefetch based on dictID. dictContent is prefetched up to 32 KB (no contentSize adaptation)	2018-09-11 17:23:44 -07:00
Yann Collet	3675ef4762	added comment about minimum size of FSE tables required for DDict creation, which use this space as workspace during Hufman table building stage.	2018-09-10 11:24:17 -07:00
Yann Collet	f97ca36eab	strengthened conditions for using workplace into fse table space ensure that the structure layout is as expected. will trigger an error if it changes in the future. Another solution would be to use a union, this would be cleaner and get rid of these static asserts. However, in order to keep the current code unmodified, it would be necessary to use an un-named unions. And apparently, un-named unions are only possible on "recent" compilers (C99+).	2018-09-06 17:54:13 -07:00
Yann Collet	87406548f0	reduced DDict size, by -2KB corresponding to the removal of workspace which is needed while building huffman table and is now either present in DCtx, or temporarily borrowed from available FSE table space.	2018-09-06 17:07:53 -07:00
Yann Collet	50b216146f	Merge pull request #1304 from facebook/largeNbDicts contrib/largeNbDicts	2018-09-06 09:50:56 -07:00
Jennifer Liu	21721b75a3	Change default f to 20	2018-09-04 17:15:14 -07:00
Jennifer Liu	944c9986e0	Update comment on default steps of cover and fastcover	2018-08-30 15:37:29 -07:00
Jennifer Liu	16db0337b1	Always use splitPoint=1.0 for non-optimize cover and fastcover	2018-08-30 14:59:22 -07:00
Yann Collet	31ebb26945	Merge pull request #1301 from terrelln/lit-size [zstd] Fix seqStore growth	2018-08-28 17:10:25 -07:00
Nick Terrell	5e580de6da	[zstd] Fix seqStore growth We could undersize the literals buffer by up to 11 bytes, due to a combination of 2 bugs: * The literals buffer didn't have `WILDCOPY_OVERLENGTH` extra space, like it is supposed to. * We didn't check the literals buffer size in `ZSTD_sufficientBuff()`.	2018-08-28 13:24:44 -07:00
Yann Collet	b37a0a6bde	Merge pull request #1298 from facebook/bench Refactored bench.c	2018-08-28 12:25:02 -07:00
modbw	d14edf259f	Fixed memory leak detected by cppcheck cppcheck (which is run regularly in our CI environment) detected a possible memory leak.	2018-08-28 07:25:05 +02:00
Yann Collet	6782725155	first sketch for largeNbDicts test program	2018-08-26 19:29:12 -07:00
Yann Collet	af23d39eb8	Merge pull request #1297 from felixhandte/check-offset-table Fix Missing Offset Table Check	2018-08-24 17:36:44 -07:00
W. Felix Handte	37f17ee237	Mark Repeated Offset Table as Needing Check	2018-08-24 14:33:34 -07:00
Nick Terrell	e34e917655	Fix compiler warning	2018-08-23 17:48:06 -07:00
Nick Terrell	5ee5e71be3	[zstd] Add note about empty ZSTD_CDict	2018-08-23 17:48:06 -07:00
Nick Terrell	924944e471	[zstd] Reuse the ZSTD_CCtx more often with small data.	2018-08-23 17:48:06 -07:00
Yann Collet	2e45badff4	refactored bench.c for clarity and safety, especially at interface level	2018-08-23 14:21:18 -07:00
Jennifer Liu	9d6ed9def3	Merge fastCover into DictBuilder (#1274 ) * Minor fix * Run non-optimize FASTCOVER 5 times in benchmark * Merge fastCover into dictBuilder * Fix mixed declaration issue * Add fastcover to symbol.c * Add fastCover.c and cover.h to build * Change fastCover.c to fastcover.c * Update benchmark to run FASTCOVER in dictBuilder * Undo spliting fastcover_param into cover_param and f * Remove convert param functions * Assign f to parameter * Add zdict.h to Makefile in lib * Add cover.h to BUCK * Cast 1 to U64 before shifting * Remove trimming of zero freq head and tail in selectSegment and rebenchmark * Remove f as a separate parameter of tryParam * Read 8 bytes when d is 6 * Add trimming off zero frequency head and tail * Use best functions from COVER and remove trimming part(which leads to worse compression ratio after previous bugs were fixed) * Add finalize= argument to FASTCOVER to specify percentage of training samples passed to ZDICT_finalizeDictionary * Change nbDmer to always read 8 bytes even when d=6 * Add skip=# argument to allow skipping dmers in computeFrequency in FASTCOVER * Update comments and benchmarking result * Change default method of ZDICT_trainFromBuffer to ZDICT_optimizeTrainFromBuffer_fastCover * Add dictType enum and fix bug about passing zParam when converting to coverParam * Combine finalize and skip into a single parameter * Update acceleration parameters and benchmark on 3 sample sets * Change default splitPoint of FASTCOVER to 0.75 and benchmark first 3 sample sets * Initialize variables outside of for loop in benchmark.c * Update benchmark result for hg-manifest * Remove cover.h from install-includes * Add explanation of f * Set default compression level for trainFromBuffer to 3 * Add assertion of fastCoverParams in DiB_trainFromFiles * Add checkTotalCompressedSize function + some minor fixes * Add test for multithreading fastCovr * Initialize segmentFreqs in every FASTCOVER_selectSegment and move mutex_unnlock to end of COVER_best_finish * Free segmentFreqs * Initialize segmentFreqs before calling FASTCOVER_buildDictionary instead of in FASTCOVER_selectSegment * Add FASTCOVER_MEMMULT * Minor fix * Update benchmarking result	2018-08-23 12:06:20 -07:00
W. Felix Handte	e589ac6276	Reformat Introduction Comment and Mention Negative Levels	2018-08-22 17:07:34 -07:00
Yann Collet	c71c4f23d7	fix "unused parameter" in single-thread mode within newly added ZSD_toFlushNow()	2018-08-20 11:40:10 -07:00
Yann Collet	105677c6db	created ZSTDMT_toFlushNow() tells in a non-blocking way if there is something ready to flush right now. only works with multi-threading for the time being. Useful to know if flush speed will be limited by lack of production.	2018-08-17 18:11:54 -07:00
Yann Collet	36d6165a2d	Makefile: added variable SCANBUILD so that a different version of scan-build can be selected	2018-08-16 16:44:13 -07:00
Yann Collet	1515f0bb0d	fixed more issues detected by recent version of scan-build test run on Linux	2018-08-16 15:20:25 -07:00
Yann Collet	5291d9ac31	fix scope of scan-build tests exclude zlib code	2018-08-15 17:41:44 -07:00
Yann Collet	42a02ab745	fixed minor warnings issued by scan-build	2018-08-15 14:36:02 -07:00
Yann Collet	3692c31598	Merge branch 'dev' into scanbuild	2018-08-15 13:50:49 -07:00
Yann Collet	6e66bbf5dd	fixed several minor issues detected by scan-build only notable one : writeNCount() resists better vs invalid distributions (though it should never happen within zstd anyway)	2018-08-14 16:55:35 -07:00
Yann Collet	3e4617ef54	frameProgression reports nbActiveWorkers and output flushed	2018-08-14 11:49:25 -07:00
Yann Collet	e7a49c6683	introduced command --adapt	2018-08-11 20:48:06 -07:00
Yann Collet	2dd76037be	zstd cli can increase level when input is too slow	2018-08-09 15:51:30 -07:00
Yann Collet	79a35ac20d	minor code comments improvements	2018-08-09 15:16:31 -07:00
W. Felix Handte	2ca7c69167	Fix CDict Attachment to Handle CDicts with Non-Zero Starts CDicts were previously guaranteed to be generated with `lowLimit=dictLimit=0`. This is no longer true, and so the old length and index calculations are no longer valid. This diff fixes them to handle non-zero start indices in CDicts.	2018-08-07 18:14:14 -07:00
Yann Collet	5808027abf	Merge branch 'dev' into fix1241	2018-08-03 16:08:33 -07:00
Yann Collet	5892dd5da4	Merge pull request #1255 from terrelln/norm-fix [FSE] Fix division by zero	2018-08-02 11:48:56 -07:00
Nick Terrell	dc5a67cb7b	Disallow tableLog == srcLog	2018-08-02 11:12:17 -07:00
Jennifer Liu	f5228f2c44	Refactoring	2018-07-31 13:58:54 -07:00
Jennifer Liu	4e29bc2469	Use CDict instead of CCtx in analyzeEntropy	2018-07-31 10:36:45 -07:00
cyan4973	3f535007e4	fix %zu support under minGW and relevant test on Appveyor	2018-07-30 16:56:18 +02:00
cyan4973	aade1e5904	Merge branch 'dev' into fix1241	2018-07-30 16:30:35 +02:00
Nick Terrell	9889bca530	[FSE] Fix division by zero When the primary normalization method fails, and `(1 << tableLog) == (maxSymbolValue + 1)`, and every symbol gets assigned normalized weight 1 or -1 in the first loop, then the next division can raise `SIGFPE`.	2018-07-27 17:30:03 -07:00
Yann Collet	6e490a2f09	Merge pull request #1237 from terrelln/init-cstream-adv Set requestedParams in ZSTD_initCStream*()	2018-07-18 16:33:30 +02:00
cyan4973	9597b438e9	fix #1241 Ensure that first input position is valid for a match even during first usage of context by starting reference at 1 (avoiding the problematic 0).	2018-07-17 18:52:57 +02:00
cyan4973	53e1f0504e	zstdmt debug traces compatibles with mingw since mingw does not have `sys/times.h`, remove this path when detecting mingw compilation.	2018-07-17 14:39:44 +02:00
Nick Terrell	45821fac0c	Merge pull request #1225 from jennifermliu/dev Split samples when building dictionary for COVER	2018-07-13 13:26:15 -07:00
Nick Terrell	6d222c437c	Set requestedParams in ZSTD_initCStream() The correct parameters are used once, but once `ZSTD_resetCStream()` is called the default parameters (level 3) are used. Fix this by setting `requestedParams` in the `ZSTD_initCStream()` functions. The added tests both fail before this patch and pass after.	2018-07-12 18:35:55 -07:00
Jennifer Liu	612b346ed5	Add explanation for split=100	2018-07-11 15:50:28 -07:00
Jennifer Liu	5021441d86	Change default splitPoint to 100	2018-07-10 11:19:33 -07:00
Jennifer Liu	456f290e31	Change back to splitPoint<=0	2018-07-09 13:53:25 -07:00
Jennifer Liu	7efabb2cf6	Only make 0.0 default splitPoint	2018-07-09 12:26:53 -07:00
Yann Collet	bbd78df59b	add build macro NO_PREFETCH prevent usage of prefetch intrinsic commands which are not supported by c2rust (see https://github.com/immunant/c2rust/issues/13)	2018-07-06 17:06:04 -07:00
Jennifer Liu	015a00af0f	Change cover_sum back to 2 parameters and fix splitPoint issues	2018-07-06 14:24:18 -07:00
Jennifer Liu	0bbff01211	Fix testing parameter	2018-07-05 22:40:32 -07:00
Jennifer Liu	a085d1aae1	Allow splitPoint==1.0 (using all samples for both training and testing)	2018-07-05 10:38:45 -07:00
Jennifer Liu	0881184c89	Some edits based on pull request comments	2018-07-03 17:53:27 -07:00
Jennifer Liu	16e75e8804	Update minimal training sample size	2018-07-03 12:07:06 -07:00
Jennifer Liu	348e5f77a9	Add split=# to cli	2018-06-29 17:54:41 -07:00
Jennifer Liu	52fbbbcb6b	Explicitly cast double to unsigned	2018-06-29 16:17:20 -07:00
Jennifer Liu	f9d19b83fb	Fix variable declaration problem	2018-06-29 15:46:56 -07:00
Jennifer Liu	e061d84016	Another fix to comparator	2018-06-29 15:38:08 -07:00
Jennifer Liu	59797d3328	Fix splitPoint floating point comparison problem	2018-06-29 12:47:03 -07:00
Jennifer Liu	0ef06f2e8a	Split samples into train and test sets	2018-06-29 12:33:34 -07:00
Yann Collet	121aa2c388	Merge pull request #1211 from facebook/staticAssert updated DEBUG_STATIC_ASSERT()	2018-06-27 12:19:17 -07:00
Yann Collet	4489daec09	slightly adjusted default-distribution threshold depending on strategy. fast favors faster compression and decompression speeds.	2018-06-26 20:10:45 -07:00
Yann Collet	ff773bfcde	zeroise freq table with memset() improves decoding speed by ~5% in github_users sample set	2018-06-26 17:24:41 -07:00
Yann Collet	7b9bbf77c9	switched to a sizeof() version avoid -Werror=unused-variable issue	2018-06-26 14:08:35 -07:00
Yann Collet	f98ec46979	updated DEBUG_STATIC_ASSERT() following suggestion from #1209	2018-06-26 12:04:59 -07:00
Nick Terrell	b426bcc097	[zstdmt] Fix jobsize bugs (#1205 ) [zstdmt] Fix jobsize bugs * `ZSTDMT_serialState_reset()` should use `targetSectionSize`, not `jobSize` when sizing the seqstore. Add an assert that checks that we sized the seqstore using the right job size. * `ZSTDMT_compressionJob()` should check if `rawSeqStore.seq == NULL`. * `ZSTDMT_initCStream_internal()` should not adjust `mtctx->params.jobSize` (clamping to MIN/MAX is okay).	2018-06-25 15:21:08 -07:00
Yann Collet	3b53bfe4f3	Merge pull request #1200 from felixhandte/zstd-attach-dict-pref Add CCtx Param Controlling Dict Attachment Behavior	2018-06-25 12:42:31 -07:00
Yann Collet	31769ce702	error on no forward progress streaming decoders, such as ZSTD_decompressStream() or ZSTD_decompress_generic(), may end up making no forward progress, (aka no byte read from input __and__ no byte written to output), due to unusual parameters conditions, such as providing an output buffer already full. In such case, the caller may be caught in an infinite loop, calling the streaming decompression function again and again, without making any progress. This version detects such situation, and generates an error instead : ZSTD_error_dstSize_tooSmall when output buffer is full, ZSTD_error_srcSize_wrong when input buffer is empty. The detection tolerates a number of attempts before triggering an error, controlled by ZSTD_NO_FORWARD_PROGRESS_MAX macro constant, which is set to 16 by default, and can be re-defined at compilation time. This behavior tolerates potentially existing implementations where such cases happen sporadically, like once or twice, which is not dangerous (only infinite loops are), without generating an error, hence without breaking these implementations.	2018-06-22 17:58:21 -07:00
Yann Collet	3934e010a2	Merge pull request #1197 from facebook/poolResize Thread Pool resize	2018-06-22 14:20:07 -07:00
Yann Collet	fbd5dfc1b1	changed POOL_resize() return type to int return is now just en error code. This guarantee that `ctx` remains valid after POOL_resize(). Gets rid of internal POOL_free() operation.	2018-06-22 12:14:59 -07:00
Yann Collet	1d5648ca10	Merge pull request #1196 from felixhandte/zstd-btopt-in-place-dict ZSTD_btopt: Support Searching the Dictionary Context In-Place	2018-06-22 11:53:23 -07:00
Yann Collet	f6242d30b7	Merge pull request #1202 from facebook/barelyCompressible Increase threshold detection of poorly compressible data	2018-06-22 11:52:52 -07:00
Yann Collet	698fd00afb	huf: increase threshold detection of poorly compressible data	2018-06-21 18:32:38 -07:00
Yann Collet	243cd9d8bb	add a cond_broadcast after resize to make sure all threads (notably newly available threads) get awaken to immediately process potential items in the queue.	2018-06-21 18:04:58 -07:00
Yann Collet	818e72b4d5	added extended POOL test abrupt end + downsizing with running jobs remaining in queue. also : POOL_resize() requires numThreads >= 1	2018-06-21 14:58:59 -07:00
W. Felix Handte	01bb1c1016	Add CCtx Param Controlling Dict Attachment Behavior	2018-06-21 17:29:25 -04:00
W. Felix Handte	3e91dc4d6a	Add Repcode Bounds Check	2018-06-21 15:54:41 -04:00
W. Felix Handte	5bd3d4b7d2	Add Debug Log Statement	2018-06-21 15:54:07 -04:00
W. Felix Handte	3caba150c6	Fix `dmsBtLow` Test	2018-06-21 15:53:40 -04:00
W. Felix Handte	5da9bbc38e	Conceivably Dedup ZSTD_noDict and ZSTD_dictMatchState _insertBt1 Impls By reverting to the bool extDict flag, we call ZSTD_insertBt1 with the same const args in both non-extDict dictModes.	2018-06-21 11:20:01 -04:00
Yann Collet	6de249c1c6	fixed: bug when counting nb of active threads when queueSize > 1 also : added a test in testpool.c verifying resizing is effective.	2018-06-20 18:28:49 -07:00
Yann Collet	6b48eb12c0	change control of threadLimit now limits maximum nb of active threads even when queueSize > 1.	2018-06-20 14:35:39 -07:00
W. Felix Handte	5d81f71e83	Consistency in Guarding DMS-Only Variable Initializations	2018-06-20 16:54:53 -04:00
W. Felix Handte	9c14eafe3d	Also Use `matchLow` for HC3 Match	2018-06-20 15:51:14 -04:00
W. Felix Handte	0a6cf7cd1d	Minor Changes	2018-06-20 15:27:23 -04:00
W. Felix Handte	ae1f3898a2	Remove Dead(!) HC3 DMS Lookup	2018-06-20 15:27:12 -04:00
Yann Collet	93702a7a62	Merge pull request #1198 from facebook/msdebug made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 12:26:31 -07:00
cyan4973	ae0b7ffa0a	made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 09:45:02 -07:00
Yann Collet	62469c9f41	fixed wrong size in pthread struct transfer	2018-06-19 20:14:03 -07:00
Yann Collet	166901dc72	reduced POOL_resize() restriction It's not necessary to ensure that no job is ongoing. The pool is only expanded, existing threads are preserved. In case of error, the only option is to return NULL and terminate the thread pool anyway.	2018-06-19 18:07:18 -07:00
Yann Collet	066fbbfe1c	make zstdmt resize its context when nbThreads change. Technically, it only expands. But when instructed to use less threads, the thread pool will limit nb of concurrent threads.	2018-06-19 17:28:56 -07:00
Yann Collet	4567c57199	finalized POOL_resize() POOL_ctx* POOL_resize(POOL_ctx* ctx, size_t numThreads) The function may fail, and returns a NULL pointer in this case.	2018-06-19 16:03:12 -07:00
Yann Collet	6768cf53fd	Merge pull request #1190 from terrelln/ldm-adjust Adjust advanced parameters to source size	2018-06-19 14:40:56 -07:00
W. Felix Handte	03c39c540b	Fix Incorrect Param	2018-06-19 15:36:33 -04:00
W. Felix Handte	de639502aa	Update Dict Attachment Cut-Offs	2018-06-19 15:36:13 -04:00
W. Felix Handte	f0a13bcd68	Make Sure Position 0 Gets Into the Tree	2018-06-19 15:10:06 -04:00
W. Felix Handte	87fe4788a3	Fix Compression Ratio Regression #1	2018-06-19 13:01:21 -04:00
W. Felix Handte	4bb79f9c55	Misc Changes	2018-06-19 13:01:21 -04:00
W. Felix Handte	2091f34e9e	Find Proper Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	64348a15f1	Misc Fixes	2018-06-19 13:01:21 -04:00
W. Felix Handte	ade8586ce6	Find `mls == 3` Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	ce743312e2	Fix Typo	2018-06-19 13:01:21 -04:00
W. Felix Handte	a075864756	Switch `!= ZSTD_extDict` to `== ZSTD_noDict`	2018-06-19 13:01:21 -04:00
W. Felix Handte	1e03377bde	Implement RepCode Check	2018-06-19 13:01:21 -04:00
W. Felix Handte	ccbf067973	Add _dictMatchState Functions	2018-06-19 13:01:21 -04:00
W. Felix Handte	d5d8240967	Convert `extDict` Flag to `dictMode` Enum	2018-06-19 13:01:21 -04:00
W. Felix Handte	93c3184d44	Attach Dicts when Using ZSTD_btopt and ZSTD_btultra	2018-06-19 13:01:21 -04:00
Yann Collet	1c714fda3f	introduced POOL_resize() not complete yet : finalize behavior in case of unfinished expansion	2018-06-18 20:46:39 -07:00
Nick Terrell	3841dbac84	Adjust advanced parameters to source size In the new advanced API, adjust the parameters even if they are explicitly set. This mainly applies to the `windowLog`, and accordingly the `hashLog` and `chainLog`, when the source size is known.	2018-06-18 15:49:31 -07:00
Yann Collet	e30f13bde0	Merge pull request #1185 from felixhandte/zstd-btlazy-in-place-dict ZSTD_btlazy2: Support Searching the Dictionary Context In-Place	2018-06-18 13:29:44 -07:00
Yann Collet	d8462ecba2	Merge branch 'dev' into huf_rename	2018-06-14 20:42:10 -04:00
Yann Collet	b7e5ebef2a	grouped X2 function together	2018-06-14 20:41:50 -04:00
Yann Collet	9698d2fb72	Merge pull request #1189 from facebook/hist histogram module	2018-06-14 20:39:52 -04:00
Yann Collet	6901c94cd6	avoid duplicate code comments when a function is decribed in hist.h, do not describe it again in hist.c to avoid future doc synchronization issues.	2018-06-14 19:47:05 -04:00
Yann Collet	f70f829ff5	Merge pull request #1187 from facebook/fix1186 fix dctx initialization within ZSTD_decompress in stack mode	2018-06-14 16:22:22 -04:00
Yann Collet	a71513bec6	Merge pull request #1184 from facebook/debug Grouped debug functions into debug.h	2018-06-14 16:21:53 -04:00
Yann Collet	1adf84ccb7	renamed all HUF_decompressX4() functions into X2 to underline they generate up to 2 symbols per decoding, in preparation for a future X3 variant.	2018-06-14 15:17:03 -04:00
Yann Collet	a09af5eb6b	renamed all HUF_decompressX2() functions into X1 to underline they generate one symbol per decoding operation. The new naming scheme will make it easier to introduce an X3 variant.	2018-06-14 15:08:43 -04:00
W. Felix Handte	0c654d22c8	Force Inline BtFindBestMatch	2018-06-14 14:54:39 -04:00
Yann Collet	7fee966f02	fix dctx initialization within ZSTD_decompress in stack mode when ZSTD_HEAPMODE=0 (which is not default). Also : added an associated test (test-fuzzer-stackmode) run on travis CI fix #1186	2018-06-14 10:22:24 -04:00
Yann Collet	fc682263d0	fixed g_debuglevel variable name in debug.h	2018-06-13 20:02:33 -04:00
Yann Collet	2d76defbfe	grouped all histogram functions into hist.c renamed functions with HIST_* prefix	2018-06-13 19:49:31 -04:00
W. Felix Handte	0551de4b5a	Search Dict for Matches	2018-06-13 16:06:28 -04:00
W. Felix Handte	ace9cfa950	Attach Dicts when Using ZSTD_btlazy2	2018-06-13 16:06:28 -04:00
Yann Collet	fa41bcc2c2	grouped debug functions into debug.h There were 2 competing set of debug functions within zstd_internal.h and bitstream.h. They were mostly duplicate, and required care to avoid messing with each other. There is now a single implementation, shared by both. Significant change : The macro variable ZSTD_DEBUG does no longer exist, it has been replaced by DEBUGLEVEL, which required modifying several source files.	2018-06-13 15:43:09 -04:00
W. Felix Handte	d53200a846	Fix Cast Warning	2018-06-13 14:58:36 -04:00
W. Felix Handte	b82063b266	Extend Dictionary Matches Backwards	2018-06-13 14:58:36 -04:00
W. Felix Handte	d53a04211c	Update Dictionary Attachment Cutoff Values Again	2018-06-13 14:58:36 -04:00
W. Felix Handte	2162aa9f18	Do Not Inline DMS Search Function	2018-06-13 14:58:36 -04:00
W. Felix Handte	338bede9b5	Also Implement Depth Repcode Checks	2018-06-13 14:58:36 -04:00
W. Felix Handte	555ab9f8cf	Apply Match Continuation Bug Fix	2018-06-13 14:58:36 -04:00
W. Felix Handte	c87dd2121d	Update Dictionary Attachment Cutoff Values	2018-06-13 14:58:36 -04:00
W. Felix Handte	6204b6d592	Check Dict Match State in ZSTD_HcFindBestMatch_generic	2018-06-13 14:58:36 -04:00
W. Felix Handte	211a61b69b	Focus on Non-BT Impls for the Moment	2018-06-13 14:58:36 -04:00
W. Felix Handte	2e93736a77	Remove Pre-Existing Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	3b82a23a35	Second Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	a2a24bebec	First Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	f74c2cd673	Disallow Too-Long Repcodes When Using an Attached Dict	2018-06-13 14:58:36 -04:00
W. Felix Handte	c14db94450	Rename `base` -> `prefixLowest`	2018-06-13 14:58:36 -04:00
W. Felix Handte	5d90708a0a	Go Back to Separate Intermediate Functions for Different Dict Modes	2018-06-13 14:58:36 -04:00
W. Felix Handte	f84fc63a43	Further Templatize Intermediate Functions on dictMode	2018-06-13 14:58:36 -04:00
W. Felix Handte	529d3a5acd	Convert Existing U32 extDict Vars to ZSTD_dictMode Enums	2018-06-13 14:58:36 -04:00
W. Felix Handte	33e2240fac	Attach Dict When Using ZSTD_lazy Strategies	2018-06-13 14:58:36 -04:00
W. Felix Handte	90cfc799e5	Add _dictMatchState Stubs for ZSTD_lazy Functions	2018-06-13 14:58:36 -04:00
W. Felix Handte	a85ecb32bd	Add dictMode Param to ZSTD_compressBlock_lazy_generic	2018-06-13 14:58:36 -04:00
Yann Collet	750ee87a92	Merge pull request #1175 from ryandesign/macos Fix name of macOS	2018-06-13 11:32:06 -04:00
Yann Collet	b2632bcf6c	Merge pull request #1174 from duc0/document_default_level Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-12 12:09:01 -07:00
Duc Ngo	869e2718f6	Line break	2018-06-11 10:02:15 -07:00
Duc Ngo	e8ef725e13	Address comments	2018-06-11 10:01:35 -07:00
Ryan Schmidt	b567ce9d68	Fix name of macOS	2018-06-09 14:31:17 -05:00
Duc Ngo	e34c000e44	Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-08 11:33:44 -07:00
Yann Collet	3050733042	Merge branch 'dev' into negLevels	2018-06-07 15:51:35 -07:00
Yann Collet	c2c47e24e0	support targetlen==0 with strategy==ZSTD_fast to mean "normal compression", targetlen >= 1 now means "disable huffman compression of literals"	2018-06-07 15:49:01 -07:00
Yann Collet	a57b4df85f	removed literalCompression directive in this version, literal compression is always disabled for ZSTD_fast strategy. Performance parity between ZSTD_compress_advanced() and ZSTD_compress_generic()	2018-06-07 15:24:12 -07:00
Yann Collet	8537bfd85c	fuzzer: make negative compression level fail result of ZSTD_compress_advanced() is different from ZSTD_compress_generic() when using negative compression levels because the disabling of huffman compression is not passed in parameters.	2018-06-07 15:12:13 -07:00
Yann Collet	8ef75547ef	Merge pull request #1165 from facebook/ctxSizeDown Dynamic context downsize	2018-06-07 14:44:32 -07:00
Yann Collet	e3c42c739b	clean ZSTD_compress() initialization The (pretty old) code inside ZSTD_compress() was making some pretty bold assumptions on what's inside a CCtx and how to init it. This is pretty fragile by design. CCtx content evolve. Knowledge of how to handle that should be concentrate in one place. A side effect of this strategy is that ZSTD_compress() wouldn't check for BMI2 capability, and is therefore missing out some potential speed opportunity. This patch makes ZSTD_compress() use the same initialization and release functions as the normal creator / destructor ones. Measured on my laptop, with a custom version of bench manually modified to use ZSTD_compress() (instead of the advanced API) : This patch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 312.2 MB/s , 723.8 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 226.2 MB/s , 649.8 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 169.4 MB/s , 636.7 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 136.7 MB/s , 619.2 MB/s dev branch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 291.7 MB/s , 727.5 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 216.2 MB/s , 655.7 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 162.2 MB/s , 633.1 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 130.6 MB/s , 618.6 MB/s	2018-06-07 14:05:25 -07:00
Yann Collet	b27c7389e3	Merge pull request #1164 from GeorgeLu97/CustomMacros Partial Compilation Macros	2018-06-06 16:47:42 -07:00
Yann Collet	24319975b6	bumped version number to v1.3.5	2018-06-06 15:51:55 -07:00
Yann Collet	f1ea383f45	context can be sized down even with constant parameters when parameters are "equivalent", the context is re-used in continue mode, hence needed workspace size is not recalculated. This incidentally also evades the size-down check and action. This patch intercepts the "continue mode" so that the size-down check and action is actually triggered.	2018-06-06 15:04:12 -07:00
Yann Collet	e5e17d009f	changed member name to workSpaceOversizedDuration	2018-06-06 15:00:27 -07:00
Yann Collet	f7392f3dc9	added test case	2018-06-05 14:53:28 -07:00
George Lu	11d5bfdaa9	Revert "Partial compilation test?" This reverts commit `b2496ab606`.	2018-06-05 13:55:36 -07:00
George Lu	b2496ab606	Partial compilation test?	2018-06-05 13:24:00 -07:00
Yann Collet	3d523c741b	added workSpaceTooLarge and workSpaceWasteful also : slightly increased speed of test fuzzer.16	2018-06-05 11:42:48 -07:00
George Lu	b3ef314830	Fix Typos	2018-06-04 17:19:06 -07:00
Yann Collet	357c648c3f	changed a few variable names to unify naming convention	2018-06-04 17:10:50 -07:00
George Lu	609d72b0ca	Added Deprecated Dependencies	2018-06-04 14:33:21 -07:00
George Lu	9437021d2f	Remove old file declaration	2018-06-04 13:32:41 -07:00
George Lu	6a617d70ed	Documentation	2018-06-04 09:56:37 -07:00
George Lu	65de25a463	Created Macros	2018-06-04 09:56:29 -07:00
Yann Collet	2108decb41	Fixed a nasty corruption bug recently introduce into the new dictionary mode. The bug could be reproduced with this command : ./zstreamtest -v --opaqueapi --no-big-tests -s4092 -t639 error was in function ZSTD_count_2segments() : the beginning of the 2nd segment corresponds to prefixStart and not the beginning of the current block (istart == src). This would result in comparing the wrong byte.	2018-06-01 18:54:34 -07:00
Yann Collet	143fc9ff6c	Merge pull request #1157 from facebook/decompressedSize minor : improved zstd.h API code comment	2018-06-01 10:28:17 -07:00
Yann Collet	7c33b48221	Merge pull request #1151 from felixhandte/zstd-dfast-in-place-dict-goto ZSTD_dfast: Support Searching the Dictionary Context In-Place (Alternate `goto` Implementation)	2018-05-31 17:37:09 -07:00
W. Felix Handte	48deab92de	Allow Different Dict Attachment Cut-Offs for Different Strategies	2018-05-31 17:37:44 -04:00
W. Felix Handte	f86796639e	Remove Incorrect and Extraneous Repcode Bounds Check	2018-05-31 17:02:29 -04:00
Yann Collet	9b979d0e33	minor : improved API code comment Extend guarantee that ZSTD_getFrameContentSize() will delivering the decompressed size to any single-pass compression function. Answer #1156	2018-05-31 11:12:18 -07:00
Yann Collet	809f2f9322	minor update of literal cost function just assert() there is no negative cost evaluation for literals	2018-05-29 15:34:50 -07:00
Yann Collet	463a0fe38b	simplified optimal parser removed "cached" structure. prices are now saved in the optimal table. Primarily done for simplification. Might improve speed by a little. But actually, and surprisingly, also improves ratio in some circumstances.	2018-05-29 14:07:25 -07:00
Yann Collet	bb6eaf6495	Merge pull request #1153 from facebook/dynThreshold changed dynamic fse threshold for offset	2018-05-26 08:43:45 -07:00
Yann Collet	e916c365a1	fixed minor visual warning	2018-05-25 20:43:09 -07:00
Yann Collet	a7fdceeccd	changed dynamic fse threshold for offset recent experienced showed that default distribution table for offset can get it wrong pretty quickly with the nb of symbols, while it remains a reasonable choice much longer for lengths symbols. Changed the formula, so that dynamic threshold is now 32 symbols for offsets. It remains at 64 symbols for lengths. Detection based on defaultNormLog	2018-05-25 17:41:16 -07:00
Yann Collet	4b3a36d5d8	Merge branch 'dev' into lowCompression	2018-05-25 15:45:03 -07:00
Yann Collet	5f177f1c53	btultra accepts blocks with poorer compression ratio zstd rejects blocks which do not compress by at least a certain amount. In which case, such block is simply emitted uncompressed (even if a little bit of compression could be achieved). This is better for decompression speed, hence for energy. The logic is controlled by ZSTD_minGain(). The rule is applied uniformly, at all compression levels. This change makes btultra accepts blocks with poor compression ratios. We presume that users of btultra mode prefers compression ratio over some decompress speed gains. The threshold for minimum gain is lowered for btultra from s>>6 (~1.5% minimum gain) to s>>7 (~0.8% minimum gain). This is a prudent change. Not sure if it's large enough.	2018-05-25 15:19:52 -07:00
Yann Collet	e2c0e3d437	slightly nudge choices towards less sequences also slightly improve some strange detrimental corner cases.	2018-05-25 14:52:21 -07:00
W. Felix Handte	5b292b5685	Check Long + 1 Matches in Both Prefix and Dict in Bothe Short Match Paths	2018-05-25 13:13:57 -04:00
W. Felix Handte	88b733b380	Interleave Prefix and Dict Searches	2018-05-25 13:13:57 -04:00
W. Felix Handte	1850025156	Refactor ZSTD_dfast to Use `goto`s	2018-05-25 13:13:57 -04:00
W. Felix Handte	43606f9c83	... When I Said "HashTable", I Meant "ChainTable"	2018-05-25 13:13:28 -04:00
W. Felix Handte	ec7efe88f5	Fix Off-By-One Error	2018-05-25 13:13:28 -04:00
W. Felix Handte	2bfe43267e	Disallow Too-Long Repcodes When Using an Attached Dict	2018-05-25 13:13:28 -04:00
W. Felix Handte	b97ad3f457	Port Changes Made to ZSTD_fast to ZSTD_dfast	2018-05-25 13:13:28 -04:00
W. Felix Handte	2313cca1b7	Implement Second Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	0998f10813	Implement First Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	50c5b2bb90	Find Dict Hash Table Matches	2018-05-25 13:13:28 -04:00
W. Felix Handte	7a25f7ef5b	Existing Repcode Check Only Applies to noDict Case	2018-05-25 13:13:28 -04:00
W. Felix Handte	8b241da4df	Properly Initialize Repcode Values	2018-05-25 13:13:28 -04:00
W. Felix Handte	7097a03749	Add Necessary Dict Variables	2018-05-25 13:13:28 -04:00
W. Felix Handte	aacbbf4f9a	Rename 'lowest' to 'localLowest' to Prepare to Introduce Dict Indices	2018-05-25 13:13:28 -04:00
W. Felix Handte	c10d1b4011	Skeleton for In-Place Impl for ZSTD_dfast	2018-05-25 13:13:28 -04:00
Yann Collet	f6ad59ab5c	Merge branch 'dev' into staticDictCost	2018-05-24 16:21:02 -07:00
Yann Collet	b5ef32fea7	Merge branch 'dev' into fracFse	2018-05-24 14:09:49 -07:00
Yann Collet	776128d16f	fix corner case when requiring cost of an FSE symbol ensure that, when frequency[symbol]==0, result is (tableLog + 1) bits with both upper-bit and fractional-bit estimates. Also : enable BIT_DEBUG in /tests	2018-05-24 13:59:11 -07:00
Yann Collet	08c5be5db3	Merge pull request #1117 from felixhandte/zstd-fast-in-place-dict ZSTD_fast: Support Searching the Dictionary Context In-Place	2018-05-23 19:32:25 -07:00
Nick Terrell	06b70179da	Work around bug in zstd decoder (#1147 ) Work around bug in zstd decoder Pull request #1144 exercised a new path in the zstd decoder that proved to be buggy. Avoid the extremely rare bug by emitting an uncompressed block.	2018-05-23 18:02:30 -07:00
Nick Terrell	f2d0924b87	Variable declarations	2018-05-23 14:58:58 -07:00
W. Felix Handte	d9c7e67125	Assert that Dict and Current Window are Adjacent in Index Space	2018-05-23 17:53:03 -04:00
W. Felix Handte	298d24fa57	Make loadedDictEnd an Index, not the Dict Len	2018-05-23 17:53:03 -04:00
W. Felix Handte	7ef85e0618	Fixes in re Comments	2018-05-23 17:53:03 -04:00
W. Felix Handte	582b7f85ed	Don't Attach Empty Dict Contents In weird corner cases, they produce unexpected results...	2018-05-23 17:53:03 -04:00
W. Felix Handte	9c92223468	Avoid Undefined Behavior in Match Ptr Calculation	2018-05-23 17:53:03 -04:00

... 3 4 5 6 7 ...

2717 Commits