hanoidb

Author	SHA1	Message	Date
Kresten Krab Thorup	380a4f9cfc	Redo work load computation The simplistic approach has a race condition. This works for now, albeit still issuing too much work.	2012-04-30 22:44:21 +02:00
Kresten Krab Thorup	be507c0e13	Syntax error	2012-04-30 21:59:55 +02:00
Kresten Krab Thorup	0009e17d4f	Change delegate work computation We were delegating too much work. The original algorithm description said that for each insert, "1" unit of merge work has to be done at each level … implying that if nothing needs doing at a level, that "not done work" does not add to work done elsewhere. This fix gets us back to that situation (by always subtracting at least 2^TOP_LEVEL from the presented work amount), while maintaining the (beneficial) effect of chunking merge work at at anything but the last level. Effectively, this reduces the maximum amount of merge work done, also reducing our worst case latency. Now that we understand this, we can refactor the algorithm to delegate "DoneWork", because then each level can determine the total work, and see if any work is left "for me". That's next.	2012-04-30 21:38:53 +02:00
Kresten Krab Thorup	74686b1380	Implement merge hibernation for tail scan When scanning just one file (because all it's keys are after the ones in the other file), we also can need hibernation to save memory. Especially the bloom filters being built take a lot of mem.	2012-04-30 21:28:33 +02:00
Kresten Krab Thorup	6ce7101506	Correct step counts in merger Merge was progressing too fast. This corrects the progress house keeping in processing merge work.	2012-04-30 19:28:20 +02:00
Kresten Krab Thorup	d6b8491a3d	Make step code more explicit This change has no semantic effect, only makes the code easier to read	2012-04-30 19:27:13 +02:00
Kresten Krab Thorup	18c197d959	New config: {read\|write}_buffer_size These two parameters (defaulting to 512k) control the amount of erlang file buffer space to allocate for delayed_write and read_ahead when merging. This config parameter is per merge task of which there can be many for each open HanoiDB; and again multiplied by number of active vnodes in Riak. As such, this can config parameter is significant for the memory usage of a Riak with Hanoi, but setting it too low will kill the performance.	2012-04-30 00:06:42 +02:00
Kresten Krab Thorup	a6952cdb77	Rename _Variables to remove compiler warnings	2012-04-29 23:57:49 +02:00
Kresten Krab Thorup	e63df328ed	remove verbose info_msg	2012-04-29 18:43:38 +02:00
Kresten Krab Thorup	77a81499f9	Fix problem in merge hibernation The merge state includes an bloom reference, which needed to be properly serialized.	2012-04-29 01:32:02 +02:00
Kresten Krab Thorup	f0833de3fc	Set default pagesize to 8k Also reduce read ahead / delayed write parameters so we don't need too much memory in merge procs.	2012-04-29 00:33:15 +02:00
Kresten Krab Thorup	15fc05634a	Implement hibernation in merge processes Analysis seems to indicate that merge processes (from high-numbered levels) tend to be activated quite infrequent. Thus, we term-to-bin/gzip the merge process state, and invoke explicit gc before waiting for a {step, …} message again.	2012-04-29 00:32:50 +02:00
Kresten Krab Thorup	801817cf70	Move tree traversal to separate process Looks like we're generating a lot of garbage here. Moving this to a separate process lets us avoid a lot of garbage collection work, since we don't cache these parsed nodes anyway.	2012-04-28 22:40:39 +02:00
Kresten Krab Thorup	b53d6fc3c3	Add riak_core dependency For unit tests to run	2012-04-28 18:45:53 +02:00
Kresten Krab Thorup	4fbd7d17ed	Update README/TODO	2012-04-28 18:42:04 +02:00
Kresten Krab Thorup	682191ce06	Tree writing code was broken In some cases, inner nodes were not being emitted. This some times would cause queries (get / range_fold) to only include results in a right-most branch.	2012-04-28 18:35:35 +02:00
Kresten Krab Thorup	940fa823e2	integer index values fixed The code was assuming index values were binaries.	2012-04-28 18:34:03 +02:00
Kresten Krab Thorup	9d3542c4a0	Enable debug print upon opening a level Also clean up some variable names to be more descriptive / correct.	2012-04-28 18:32:50 +02:00
Kresten Krab Thorup	5c717b1ec3	Fix range_fold Range fold with from_key < first_key would always return an empty result.	2012-04-28 18:31:19 +02:00
Kresten Krab Thorup	be5db4e4be	Update readme on 2i and fast bucket listing	2012-04-27 12:09:19 +02:00
Kresten Krab Thorup	4e354c0379	Implement "fast" fold buckets function do repeated limit=1 range queries based on the sext encoding of {o, Bucket, Key}	2012-04-27 12:08:35 +02:00
Kresten Krab Thorup	eb63ce1d04	Fix two more unit tests	2012-04-27 10:23:52 +02:00
Kresten Krab Thorup	9a7e2131a1	Implement 2i Most code copied from eleveldb backend, except we can do more precise range folds with hanoi so no need to throw exceptions from fold functions.	2012-04-27 10:03:19 +02:00
Kresten Krab Thorup	89b04fe4fb	Add debug logging	2012-04-27 10:01:53 +02:00
Kresten Krab Thorup	f41aaa265e	Fix bug with fold termination	2012-04-27 10:00:36 +02:00
Kresten Krab Thorup	2d928fce73	Fix unit tests	2012-04-27 09:59:09 +02:00
Kresten Krab Thorup	6b47d8dd1e	Fix race condition When merge is completed, and inject-to-next-level is pending, there is still a B file, but no current merge_pid. In this case, don't try to do merge work at this level.	2012-04-27 09:47:21 +02:00
Kresten Krab Thorup	b07d16d292	Add hanoi:transact, and CRC checks for nursery.log This involves some cleanup/reorg of code in hanoi_util. Streaming trees and nursery now use the same cry checking code. Future: Keep the CRC-encoded binary around, and reuse it when writing trees. This will reduce cpu costs involved in re-computing those all the time.	2012-04-26 17:18:49 +02:00
Kresten Krab Thorup	67f1c46b7e	Code cleanup Clean up a little in hanoi_level, avoiding an extra message send when initiating incremental merge	2012-04-26 17:13:47 +02:00
Kresten Krab Thorup	eba7f820ef	Update README	2012-04-26 17:12:37 +02:00
Kresten Krab Thorup	ce898b5063	Don't do merge before inject	2012-04-26 08:17:48 +02:00
Kresten Krab Thorup	334d21971f	sample 20 min run (MacBook Air/SSD)	2012-04-26 01:51:08 +02:00
Kresten Krab Thorup	3269e6fa96	Add lager dependency	2012-04-26 01:08:38 +02:00
Kresten Krab Thorup	51250d04da	Initial work on hanoi:write	2012-04-26 01:02:56 +02:00
Kresten Krab Thorup	5e3cc8be51	Fix race condition with insert and bottom level	2012-04-26 01:01:34 +02:00
Kresten Krab Thorup	e147dd9d1e	Improve merge work computation	2012-04-26 01:00:01 +02:00
Kresten Krab Thorup	5d95070669	Add debug loggin	2012-04-26 00:58:16 +02:00
Kresten Krab Thorup	6c8492c7d0	Use plain_rpc	2012-04-26 00:56:59 +02:00
Kresten Krab Thorup	085f400bb8	Do incremental merge before inject	2012-04-26 00:51:04 +02:00
Kresten Krab Thorup	d58ab9ea32	Introduce plain_rpc Institutionalize the way hanoi_level handles RPC. This is embodied in a new module, which should be pushed to plain_fsm, but we'll keep it here for now.	2012-04-26 00:49:57 +02:00
Gregory Burd	db116447fe	Minor change.	2012-04-25 14:07:06 -04:00
Gregory Burd	e4d8615a99	Cleanup	2012-04-24 10:37:45 -04:00
Kresten Krab Thorup	86516d4b2d	Add some notes on file descriptors to DESIGN.md	2012-04-24 14:07:15 +02:00
Kresten Krab Thorup	84f7fcc75b	Update README a bit	2012-04-24 14:06:54 +02:00
Kresten Krab Thorup	a8a66a43a0	Add CRC32 data validation If a KV entry does not validate CRC, then we simply ignore it for now. TODO: decide how to notify about broken KVs.	2012-04-24 13:49:00 +02:00
Kresten Krab Thorup	472ba4551e	New visualizer Shows 0..9 for merge activity at each level Also prints disk size used and free	2012-04-24 01:43:53 +02:00
Kresten Krab Thorup	ff36e401b7	Incremental merge refactor, step #2 Now incremental merge has a new strategy. In stead of doing the same amount of merge work at all levels, we now compute the total merge work load, and do as much as possible on the first level, subtract work done, and delegate to the next level, etc. The effect of this is that we do more IO on fewer files, improving sequential-ness of the workload involved in the incremental merge.	2012-04-24 00:31:28 +02:00
Kresten Krab Thorup	9fb8a5e73f	Refactor step #1 , know current max_level This refactoring just adds the stat to the master gen_server of a Hanoi instance to know the current number of levels. Until now, we've only held a reference to the current top level.	2012-04-23 22:32:51 +02:00
Kresten Krab Thorup	755788ecfb	Fix for hanoi store recovery If we're opening a hanoi store configured with smaller nursery size than the default, then we need to make sure that we also open the small levels. Future feature is to actually squash the smaller levels.	2012-04-23 14:14:12 +02:00
Kresten Krab Thorup	8b725cceaa	Improve recovery This improves recovery two-fold: 1. make sure that we actually wait for initial merge to complete (issue incremental_merge(0)) 2. compute minimum required merge work for merge to establish invariant that there's room for a new nursery inject any time.	2012-04-23 13:53:42 +02:00

... 2 3 4 5 6 ...

364 commits