Moinak Ghosh
3db5188445
Support for deduplication using 2KB block size.
2013-08-19 13:38:52 +05:30
Moinak Ghosh
58f3113558
Avoid unnecessary re-hashing of 64-bit keys of the segment index.
2013-08-17 22:08:55 +05:30
Moinak Ghosh
2a218e9da5
Fix Dedupe Mode initialization.
2013-07-12 18:21:49 +05:30
Moinak Ghosh
c0dd0102a5
A few minor fixes.
2013-06-14 22:25:01 +05:30
Moinak Ghosh
0a1e3b39ef
Correspond segment size to chunk size for Segmented Dedupe for better accuracy.
2013-05-15 22:20:45 +05:30
Moinak Ghosh
c6da2325e3
Allow SKEIN to be used as a Global Dedupe chunk lookup hash.
2013-05-04 15:59:29 +05:30
Moinak Ghosh
79a6e7f770
Capability to output data to stdout when compressing.
...
Always use segmented similarity bases dedupe when using -G option in pipe mode.
Standardize on average 8MB segment size for segmented dedupe.
Fix hashtable sizing.
Some miscellaneous cleanups.
Update README with details of new features.
2013-04-24 23:03:58 +05:30
Moinak Ghosh
5d6ffd969d
More tweaks to slightly improve segment dedupe efficiency.
...
Use on average 8MB segments for all cases.
Some minor cleanps.
2013-04-24 19:13:07 +05:30
Moinak Ghosh
d29f125ca7
Clean up temp cache dir handling.
...
Allow temp dir setting via specific env variable to point to fast devices like ramdisk,ssd.
2013-04-22 22:57:31 +05:30
Moinak Ghosh
c0b4aa0116
Many optimizations and changes to Segmented Global Dedupe.
...
Use chunk hash based similarity matching rather than content based.
Use sorting to order hash buffer rather than min-heap for better accuracy.
Use fast CRC64 for similarity hash for speed and lower memory requirements.
2013-04-21 18:11:16 +05:30
Moinak Ghosh
8ae571124d
Complete implementation for Segmented Global Deduplication.
2013-04-18 21:26:24 +05:30
Moinak Ghosh
50251107de
Work in progress changes for Segmented Global Deduplication.
2013-04-09 22:23:51 +05:30
Moinak Ghosh
c357452079
Implement global dedupe in pipe mode.
...
Update hash index calculations to use upto 75% memavail when file size is not known.
Use little-endian nonce format for Salsa20.
2013-03-29 15:18:25 +05:30
Moinak Ghosh
fbf4658635
Implement Global Deduplication.
2013-03-24 23:21:17 +05:30
Moinak Ghosh
876796be5c
Work in progress changes for global dedupe.
2013-03-21 22:00:38 +05:30
Moinak Ghosh
f2806d4ffa
Work in progress global dedupe changes.
2013-03-19 20:13:44 +05:30
Moinak Ghosh
f8f23e5200
Major License text cleanup.
2013-03-07 20:26:48 +05:30
Moinak Ghosh
6badbcaea7
Make global dedupe bits buildable and fix errors.
...
Rename Adaptive compression type constants to avoid conflict with global constants.
2013-02-17 21:05:40 +05:30