greg/machi

Author	SHA1	Message	Date
Scott Lystig Fritchie	8df7d58365	Add partition simulator support to fitness service	2015-09-11 16:45:29 +09:00
Scott Lystig Fritchie	c14b9ce50f	Minor cleanup, add more partitions to converge demo	2015-09-10 16:39:15 +09:00
Scott Lystig Fritchie	b7aa33c617	Yeah, nearly there. AP fails occasionally in multiple-asymmetric-partition sequence	2015-09-09 23:10:39 +09:00
Scott Lystig Fritchie	7af863d840	Add stubs of machi_fitness server	2015-09-08 16:13:07 +09:00
Scott Lystig Fritchie	4376ce9ec1	Remove all flap counting and inner projection stuff	2015-09-04 17:17:49 +09:00
Scott Lystig Fritchie	2e2f5f44c4	Another tweak to private_projections_are_stable()	2015-09-01 00:51:12 +09:00
Scott Lystig Fritchie	823b47bef3	Bugfix: convergence property for CP mode, again	2015-08-30 19:52:31 +09:00
Scott Lystig Fritchie	764708f3ef	Fix private_projections_are_stable() for long CP mode chains	2015-08-30 00:03:51 +09:00
Scott Lystig Fritchie	94394d3429	Bugfix: allow none proj to re-emerge from flapping (more) See comments added in this commit at A40. So far, I've been doing CP mode testing with a handful of (very useful) network partition combinations using: machi_chain_manager1_converge_demo:t(3, [{private_write_verbose,true}, {consistency_mode, cp_mode}, {witnesses, [a]}]). Next steps: * Expand number & types of partitions * Expand to chain lengths of 5 and beyond	2015-08-29 21:36:53 +09:00
Scott Lystig Fritchie	ee19a0856b	WIP: justincase	2015-08-29 19:59:46 +09:00
Scott Lystig Fritchie	dc5ae4047a	Bugfix: react_to_env_A30 inner->norm fix, make_zerf() none proj derp fix	2015-08-29 18:01:13 +09:00
Scott Lystig Fritchie	85eb3567a3	Bugfix: convergence property for CP mode	2015-08-29 15:57:23 +09:00
Scott Lystig Fritchie	403cb5b7a6	WIP: improvements, but now flapping inner epoch keeps increasing {sigh}	2015-08-28 21:13:54 +09:00
Scott Lystig Fritchie	3dfe5c2677	WIP: fix annotation history on disk	2015-08-28 18:37:11 +09:00
Scott Lystig Fritchie	12b74a52fd	WIP: pre-dinner paranoid checkin	2015-08-27 18:45:27 +09:00
Scott Lystig Fritchie	1c5a17b708	WIP: adjust throttle of flapping 'shut up'	2015-08-25 17:01:14 +09:00
Scott Lystig Fritchie	9a86453753	WIP: half-baked idea, stopping for the night (more) So, I'm 50% sure this is a good idea for CP mode: if there's a later public projection than P_current, then who knows what we might have missed. So, call make_zerf() to find out the absolute latest. Problem: flapping state appears to be lost, booo.	2015-08-24 21:54:30 +09:00
Scott Lystig Fritchie	2f82fe0487	WIP: cp_mode improvements	2015-08-24 19:04:26 +09:00
Scott Lystig Fritchie	f6e81e6cd0	Add damper check for flapping of inner projections, whee!	2015-08-23 20:01:44 +09:00
Scott Lystig Fritchie	2b2facaba2	Add more FLU choices to converge demo	2015-08-22 14:56:26 +09:00
Scott Lystig Fritchie	14fad2d704	End-to-end chain state checking is still broken (more) If we use verbose output from: machi_chain_manager1_converge_demo:t(3, [{private_write_verbose,true}, {consistency_mode, cp_mode}, {witnesses, [a]}]). And use: tail -f typescript_file | egrep --line-buffered 'SET|attempted|CONFIRM' ... then we can clearly see a chain safety violation when moving from epoch 81 -> 83. I need to add more smarts to the safety checking, both at the individual transition sanity check and at the converge_demo overall rolling sanity check. Key to output: CONFIRM by epoch {num} {csum} at {UPI} {Repairing} SET # of FLUs = 3 members [a,b,c]). CONFIRM by epoch 1 <<96,161,96,...>> at [a,b] [c] CONFIRM by epoch 5 <<134,243,175,...>> at [b,c] [] CONFIRM by epoch 7 <<207,93,225,...>> at [b,c] [] CONFIRM by epoch 47 <<60,142,248,...>> at [b,c] [] SET partitions = [{c,b},{c,a}] (1 of 2) at {22,3,34} CONFIRM by epoch 81 <<223,58,184,...>> at [a,b] [] SET partitions = [{b,c},{b,a}] (2 of 2) at {22,3,38} CONFIRM by epoch 83 <<33,208,224,...>> at [a,c] [] SET partitions = [] CONFIRM by epoch 85 <<173,179,149,...>> at [a,c] [b]	2015-08-13 22:16:28 +09:00
Scott Lystig Fritchie	e956c0b534	Fix (yet again) converge demo stable criteria	2015-08-13 21:26:07 +09:00
Scott Lystig Fritchie	eecf5479ed	Tweak stability criteria for converge demo	2015-08-13 16:18:33 +09:00
Scott Lystig Fritchie	30a5652299	WIP: refining stable success for machi_chain_manager1_converge_demo, even better	2015-08-07 15:06:23 +09:00
Scott Lystig Fritchie	c8ddce103e	WIP: refining stable success for machi_chain_manager1_converge_demo	2015-08-07 12:28:51 +09:00
Scott Lystig Fritchie	3ca0f4491d	WIP: always start chain manager with none projection	2015-08-06 19:24:14 +09:00
Scott Lystig Fritchie	52dc40e1fe	converge demo: converged iff all private projs are stable and all inner/outer	2015-07-21 14:19:08 +09:00
Scott Lystig Fritchie	19ce841471	Merge slf/chain-manager/cp-mode (fix conflicts)	2015-07-17 16:39:37 +09:00
Scott Lystig Fritchie	8d76cfe0db	Robust'ify the testing of projection stability	2015-07-10 21:04:34 +09:00
Scott Lystig Fritchie	2060b80830	Keep good refactorings from commit a8390ee2 Also, add more misc details to the 'react' breadcrumb trail. Also, save get(react) results into dbg2 whenever we write a private projection, very valuable for debugging. Also: cleanup PULSE code, add regression commands as option and controls with some new environment variables. These regression sequences were responsible for several fruitful debugging sessions, so we keep them for posterity and for their ability (with new seeds and PULSE) to find new interleavings.	2015-07-10 15:04:50 +09:00
Scott Lystig Fritchie	297d29c79b	Finish fixups to the chmgr state transition checking	2015-07-07 23:03:14 +09:00
Scott Lystig Fritchie	3aa3e00806	WIP: major fixups to the chmgr state transition checking (more below) So, the PULSE test is failing, which is good. However, I believe that the failures are all due to the model now being too strict. The model is now catching failures which are now benign, I think. {bummer_NOT_DISJOINT,{[a,b,b,c,d], [{a,not_in_this_epoch}, {b,not_in_this_epoch}, {c,"[{epoch,1546},{author,c},{upi,[c]},{repair,[b]},{down,[a,d]},{d,[{ps,[{a,c},{c,a},{a,d},{b,d},{c,d}]},{nodes_up,[b,c]}]},{d2,[]}]"}, {d,"[{epoch,1546},{author,d},{upi,[d]},{repair,[a,b]},{down,[c]},{d,[{ps,[{c,b},{d,c}]},{nodes_up,[a,b,d]}]},{d2,[]}]"}]}}}, In this and all other examples, the UPIs are disjoint but the repairs are not disjoint. I believe the model ought to be ignoring the repair list. {bummer_NOT_DISJOINT,{[a,a,b], [{a,"[{epoch,1174},{author,a},{upi,[a]},{repair,[]},{down,[b]},{d,[{ps,[{a,b},{b,a}]},{nodes_up,[a]}]},{d2,[]}]"}, {b,"[{epoch,1174},{author,b},{upi,[b]},{repair,[a]},{down,[]},{d,[{ps,[]},{nodes_up,[a,b]}]},{d2,[]}]"}]}}}, or {bummer_NOT_DISJOINT,{[c,c,e], [{a,not_in_this_epoch}, {b,not_in_this_epoch}, {c,"[{epoch,1388},{author,c},{upi,[c]},{repair,[]},{down,[a,b,d,e]},{d,[{ps,[{a,b},{a,c},{c,a},{a,d},{d,a},{e,a},{c,b},{b,e},{e,b},{c,d},{e,c},{e,d}]},{nodes_up,[c]}]},{d2,[]}]"}, {d,not_in_this_epoch}, {e,"[{epoch,1388},{author,e},{upi,[e]},{repair,[c]},{down,[a,b,d]},{d,[{ps,[{a,b},{b,a},{a,c},{c,a},{a,d},{d,a},{a,e},{e,a},{b,c},{c,b},{b,d},{b,e},{e,b},{c,d},{d,c},{d,e},{e,d}]},{nodes_up,[c,e]}]},{d2,[]}]"}]}}},	2015-07-07 22:11:19 +09:00
Scott Lystig Fritchie	c8ce99023e	WIP: model checking refactoring TODO	2015-07-07 18:32:04 +09:00
Scott Lystig Fritchie	d5f521f2bd	Various test updates	2015-07-07 15:02:29 +09:00
Scott Lystig Fritchie	471cde1f2c	WIP: debugging fmt shuffle	2015-07-06 16:11:14 +09:00
Scott Lystig Fritchie	9b3cd9056a	Un-TEST'ify testr_react_to_env() everywhere	2015-07-03 16:18:40 +09:00
Scott Lystig Fritchie	9cf77f4406	WIP: Refactoring and prototyping goop, broken test	2015-07-03 00:59:04 +09:00
Scott Lystig Fritchie	670bd2cafc	Add some flexibility to machi_chain_manager1_converge_demo:t/1 and t/2	2015-07-01 14:08:17 +09:00
Scott Lystig Fritchie	3ce3fb93b9	Use infinity timeout for sanity check	2015-06-17 12:42:53 +09:00
Scott Lystig Fritchie	b244a3b8e4	Reduce verbosity, try fix up convergence demo for chain len=4	2015-06-15 12:41:16 +09:00
Scott Lystig Fritchie	be62300b3b	Bug fixes: model and real bugs, thanks PULSE and converge_demo both!	2015-06-04 17:39:29 +09:00
Scott Lystig Fritchie	c1318d3bbb	WIP: wip wip a doowip	2015-06-02 22:13:15 +09:00
Scott Lystig Fritchie	deabe14d29	Un-proplist-ify the inner projection	2015-06-02 20:55:18 +09:00
Scott Lystig Fritchie	207be8729b	Un-proplist-ify the flapping_i info	2015-06-02 20:32:52 +09:00
Scott Lystig Fritchie	67019493aa	Round 1 of cleanup	2015-06-02 18:10:45 +09:00
Scott Lystig Fritchie	185c670b2f	WIP: refactoring machi_cr_client:append_chunk*	2015-05-18 19:06:06 +09:00
Scott Lystig Fritchie	ae1d038abe	Change default value of chmgr's use_partition_simulator to false	2015-05-08 13:40:44 +09:00
Scott Lystig Fritchie	238c8472cd	WIP: timeout comments	2015-05-07 18:52:01 +09:00
Scott Lystig Fritchie	02bc7fe0bc	WIP: Fix bug that flaps inside an inner projection, oops!	2015-04-14 18:23:00 +09:00
Scott Lystig Fritchie	9e587b3d11	WIP: crufty TODO & comment cleanup	2015-04-14 16:17:49 +09:00

57 commits