Moinak Ghosh
507e7c75d3
Centralise data analysis routine for optimum performance and leverage.
...
Utilise buffer data analysis for preprocessing filters.
2014-11-06 22:23:33 +05:30
Moinak Ghosh
cc68550670
Add metadata stream flag for archive.
...
Change flag bit to not collide with checksum id.
Handle '-T' option properly.
2014-10-25 22:57:31 +05:30
Moinak Ghosh
e7081eb5a3
Git commit - rehash. Incorrect earlier commit.
...
Implement Separate metadata stream.
Fix blatant wrong check in Bzip2 compressor.
Implement E8E9 filter fallback in Dispack.
Improve dict buffer size checks.
Reduce thread count to control memory usage in archive mode.
2014-10-24 23:30:40 +05:30
Moinak Ghosh
2e5f2d8aab
Make DICT filter useful.
...
Improve data analysis in adaptive_compress.
2014-09-20 21:49:06 +05:30
Moinak Ghosh
071a9e2b26
Update,simplify analyzer function to indicate text data for Dict filter.
...
Fix archive header writing bug.
Strip ^M chars from dict filter files.
Include DICT preprocessing type.
Fix a bunch of bugs found by Xcode.
2014-09-20 12:49:00 +05:30
Moinak Ghosh
af39994a59
Working Wavpack filter for compressing WAV filies.
...
Improved error handling of filter routines.
Improved verbose logging.
2014-09-17 20:34:38 +05:30
Moinak Ghosh
10f40e1c6f
Part 1 changes to allow dual licensing to MPLV2.
...
Make external LGPL code/features disabled in MPLV2 variant.
Nuke some unwanted whitespace (cstyle).
2014-07-24 22:20:30 +05:30
Moinak Ghosh
63bef473cc
Working MAC OS X port.
...
Compatibility layer for semaphore handling.
2014-05-04 21:11:31 +05:30
Moinak Ghosh
935717373b
Capability to list offset and length of each block when deduplication for external use.
2014-03-30 17:35:21 +05:30
Moinak Ghosh
2f11e11717
Bump Version and update Changelog for 3.1 Release.
2014-03-02 18:46:03 +05:30
Moinak Ghosh
62568e9066
Basic capability to list contents of an archive without extracting to disk.
2014-01-12 20:38:20 +05:30
Moinak Ghosh
3ddaf6d45f
Bump version and update command help text.
2014-01-04 21:45:23 +05:30
Moinak Ghosh
ea345a902a
Overhaul documentation part #1
...
Detect and handle uncompressed PDF files using libbsc.
Force binary/text data detection for tar archives.
Get rid of unnecessary CLI option.
Add full pipeline mode check when archiving.
2013-12-30 23:24:37 +05:30
Moinak Ghosh
a741f34f78
Move MSDOS COM single-byte magic number checks to last in the list.
...
Move advanced options flag into context structure.
Include dtd files as text type.
2013-12-18 00:09:32 +05:30
Moinak Ghosh
a851bac247
Check harder with more strides in Delta2 for extreme compression levels.
2013-12-13 19:53:14 +05:30
Moinak Ghosh
fb25e53b4f
Add forked and optimized copy of LGPL version of Libbsc.
...
Strip out Sort Transform from Libbsc copy.
Reduce Libbsc memory use.
Avoid redundant adler32 of data block in Libbsc.
2013-11-30 22:13:33 +05:30
Moinak Ghosh
0192790c02
Add Dispack filter with auto-detection of x86 executables in archive mode.
...
More elaborate magic header based detection of 32-bit and 64-bit x86 binaries.
Always use fast-mode LZ4 in Adaptive modes.
2013-11-24 19:45:58 +05:30
Moinak Ghosh
1e2c3e479a
Optimize preprocessed compression and avoid a bunch of memory copies.
...
Fix a crash.
Add a few more file types.
More comments.
2013-11-22 20:44:26 +05:30
Moinak Ghosh
11584cab52
Add fast handling of totally incompressible data (like Jpegs) in adaptive modes.
...
Add function to indicate totally incompressible data when archiving.
Reformat if statements in some places to reduce branching.
2013-11-15 21:06:23 +05:30
Moinak Ghosh
cae9de9b2e
Leverage file type detection(archiver) to improve compression performance.
...
Use detected file/data type(archiver) for Adaptive compression modes.
Update type flags and add more extensions.
2013-11-08 23:50:28 +05:30
Moinak Ghosh
991482403b
Add extension based file type detection and setting segment data type.
...
Use Bob Jenkins Minimal Perfect Hash to check for known extensions.
Use semaphore signaling and direct buffer copy for extraction.
Miscellaneous fixes.
2013-11-07 21:48:54 +05:30
Moinak Ghosh
448890a014
Replace slow pipe with direct memory copy for archive extraction.
...
Miscellaneous corrections and tweaks.
2013-11-03 23:15:55 +05:30
Moinak Ghosh
7ed532133e
Avoid using pipe during archive creation. Use semaphores and direct memory copy.
2013-11-02 23:43:59 +05:30
Moinak Ghosh
a374ca5909
Use mmap to read from the pathlist file for performance.
2013-11-02 12:14:46 +05:30
Moinak Ghosh
dcccffd7fa
Archiving support using Libarchive: Fully functional archiving and extraction.
...
Functionality to sort pathnames based on file extension and size.
2013-11-01 23:15:40 +05:30
Moinak Ghosh
e09d8a485c
Archiving support using Libarchive: Working archive extraction.
2013-10-31 00:15:17 +05:30
Moinak Ghosh
8e4b774c8c
More changes for archiving.
...
Allow multiple filenames on command line when archiving.
Remove unneded small block writes with libarchive.
2013-10-27 20:36:48 +05:30
Moinak Ghosh
46b11def08
Archiving support using Libarchive: Work in progress changes #3 .
...
Make log_msg() add newline by default.
2013-10-24 00:16:04 +05:30
Moinak Ghosh
7f81869874
Archiving support using Libarchive: Work in progress changes.
...
Change all perror() calls to use logger.
Make the config script a little verbose.
2013-10-20 23:54:27 +05:30
Moinak Ghosh
cee8d88ded
Bump version for upcoming release.
2013-08-27 21:41:16 +05:30
Moinak Ghosh
413a2a2fb1
Update Changelog and bump version for 2.3 release.
2013-08-10 10:25:41 +05:30
Moinak Ghosh
fe18afbcf4
Use wrapper script to set paths when launching pcompress from build directory.
...
Use smaller max block size when doing global dedupe.
Fix init of executable name.
2013-08-07 22:03:52 +05:30
Moinak Ghosh
c859cf35d5
Make Pcompress functionality into a library - initial changes.
2013-06-02 20:54:33 +05:30
Moinak Ghosh
8db0bef184
Bump version and update Changelog for 2.2 release.
2013-05-28 21:45:19 +05:30
Moinak Ghosh
8b3761ee81
Update Changelog, docs and bump version for 2.1 release.
2013-05-09 18:53:11 +05:30
Moinak Ghosh
f05c7905b2
Bump version for release.
2013-04-26 18:47:52 +05:30
Moinak Ghosh
fbf4658635
Implement Global Deduplication.
2013-03-24 23:21:17 +05:30
Moinak Ghosh
b7fdeb08bc
Work in progress global dedupe changes.
2013-03-20 22:47:03 +05:30
Moinak Ghosh
f2806d4ffa
Work in progress global dedupe changes.
2013-03-19 20:13:44 +05:30
Moinak Ghosh
f8f23e5200
Major License text cleanup.
2013-03-07 20:26:48 +05:30
Moinak Ghosh
370e84f2be
Update Changelog and bump version for 1.4 release.
...
Add license text to Salsa20 files.
2013-03-06 17:15:03 +05:30
Moinak Ghosh
efe5232cdc
Add compatibility to decode old-format parallel hashes created with version 1.2.
...
Bump archive version to 7 as parallel hashes are now merkle style hashes.
2013-02-24 20:05:16 +05:30
Moinak Ghosh
6badbcaea7
Make global dedupe bits buildable and fix errors.
...
Rename Adaptive compression type constants to avoid conflict with global constants.
2013-02-17 21:05:40 +05:30
Moinak Ghosh
9983d79e62
Bump version and update changelog for 1.3.0 release.
...
Fix issue #3 .
2013-01-29 21:42:54 +05:30
Moinak Ghosh
2da0d0950b
Use BLAKE2 parallel version for single-chunk archives (whole file in one chunk).
...
Set decompression threads correctly for single-chunk archives.
2013-01-26 18:28:13 +05:30
Moinak Ghosh
26bb137257
Changes for generalized runtime SSE/AVX/XOP detection.
...
Multi instruction set XXhash build with runtime selection.
Extend CPUID code to detect more instruction sets.
Add options for BLAKE2 hash.
Move GCC builtins into utils header.
Bump file format version number due to extended digest flags.
Add descriptions to digest list.
2013-01-25 00:10:12 +05:30
Moinak Ghosh
16b1d9e7a3
Bump version, update Changelog and documentation for 1.2 release.
2013-01-03 23:40:21 +05:30
Moinak Ghosh
26a4f42506
Introduce strict compiler flags and fix scores of warnings/issues.
...
Avoid different optimization flags for Dedupe sources.
Fix liberal mixing of uint64_t and int64_t (should all be uint64_t).
Fix corner case crash when decompressing.
2012-12-27 23:06:48 +05:30
Moinak Ghosh
c2df666daf
Major improvement to Delta2 encoding algorithm.
2012-12-18 23:32:55 +05:30
Moinak Ghosh
224fb529e9
Get rid of size_t in places where 64-bitness is assumed.
2012-12-09 10:15:06 +05:30