mentat

Author	SHA1	Message	Date
Nick Alexander	46c2a0801f	Add type checking and constraint checking to the transactor. (#663 , #532 , #679 ) This should address #663, by re-inserting type checking in the transactor stack after the entry point used by the term builder. Before this commit, we were using an SQLite UNIQUE index to assert that no `[e a]` pair, with `a` a cardinality one attribute, was asserted more than once. However, that's not in line with Datomic, which treats transaction inputs as a set and allows a single datom like `[e a v]` to appear multiple times. It's both awkward and not particularly efficient to look for _distinct_ repetitions in SQL, so we accept some runtime cost in order to check for repetitions in the transactor. This will allow us to address #532, which is really about whether we treat inputs as sets. A side benefit is that we can provide more helpful error messages when the transactor does detect that the input truly violates the cardinality constraints of the schema. This commit builds a trie while error checking and collecting final terms, which should be fairly efficient. It also allows a simpler expression of input-provided :db/txInstant datoms, which in turn uncovered a small issue with the transaction watcher, where-by the watcher would not see non-input-provided :db/txInstant datoms. This transition to Datomic-like input-as-set semantics allows us to address #532. Previously, two tempids that upserted to the same entid would produce duplicate datoms, and that would have been rejected by the transactor -- correctly, since we did not allow duplicate datoms under the input-as-list semantics. With input-as-set semantics, duplicate datoms are allowed; and that means that we must allow tempids to be equivalent, i.e., to resolve to the same tempid. To achieve this, we: - index the set of tempids - identify tempid indices that share an upsert - map tempids to a dense set of contiguous integer labels We use the well-known union-find algorithm, as implemented by petgraph, to efficiently manage the set of equivalent tempids. Along the way, I've fixed and added tests for two small errors in the transactor. First, don't drop datoms resolved by upsert (#679). Second, ensure that complex upserts are allocated. I don't know quite what happened here. The Clojure implementation correctly kept complex upserts that hadn't resolved as complex upserts (see `9a9dfb502a/src/common/datomish/transact.cljc (L436)`) and then allocated complex upserts if they didn't resolve (see `9a9dfb502a/src/common/datomish/transact.cljc (L509)`). Based on the code comments, I think the Rust implementation must have incorrectly tried to optimize by handling all complex upserts in at most a single generation of evolution, and that's just not correct. We're effectively implementing a topological sort, using very specific domain knowledge, and its not true that a node in a topological sort can be considered only once!	2018-05-14 15:22:45 -07:00
Nick Alexander	e5e37178af	Pre: Remove ancient Clojure code comments.	2018-05-14 15:22:45 -07:00
Richard Newman	3cba87c74b	Allow pull aliases to be non-namespaced. (#694 ) r=nalexander	2018-05-14 10:45:48 -07:00
Emily Toop	013629dec6	iOS and Android (Java) sdk framework (#643 ) Documents the FFI layer for Mentat, and provides transaction functionality via an EDN string. Creates two native libraries for iOS (Swift) and Android (Java) and fully tests the FFI for both platforms. Closes #619 #614 #611	2018-05-14 16:20:36 +01:00
Richard Newman	60cb5d2432	Pull improvements (#682 ) r=nalexander * Parse and handle aliased pull attributes. * Allow :db/id to be mentioned as a pull attribute. * Clean up comment. * Remove unused function.	2018-05-13 14:15:36 -07:00
Nick Alexander	4fde4fe0a6	Bustage fixes: compile on stable; avoid unused variable warning.	2018-05-11 10:22:57 -07:00
Richard Newman	3dc68bcd38	Combine NamespacedKeyword and Keyword. (#689 ) r=nalexander * Make properties on NamespacedKeyword/NamespacedSymbol private * Use only a single String for NamespacedKeyword/NamespacedSymbol * Review comments. * Remove unsafe code in namespaced_name. Benchmarking shows approximately zero change. * Allow the types of ns and name to differ when constructing a NamespacedName. * Make symbol namespaces optional. * Normalize names of keyword/symbol constructors. This will make the subsequent refactor much less painful. * Use expect not unwrap. * Merge Keyword and NamespacedKeyword.	2018-05-11 09:52:17 -07:00
Nick Alexander	c8f74fa41b	[edn] Round-trip instants. (#686 ) (#687 ) r=rnewman First, the parser had a small grouping bug where-by it wouldn't parse Z as timezone correctly. Second, we weren't printing instants in the format that we parse.	2018-05-11 02:11:04 -07:00
Thom	37a6f7be28	Use Cell instead of AtomicUsize in RcCounter. (#646 ) r=rnewman	2018-05-11 02:03:09 -07:00
Nick Alexander	9a4bd0de4f	Use rust-peg for tx parsing. r=rnewman	2018-05-10 10:32:27 -07:00
Nick Alexander	7a8c9d90c2	Post: Remove tx-parser crate entirely.	2018-05-10 10:24:05 -07:00
Nick Alexander	cbffe5e545	Use rust-peg for tx parsing. There are few reasons to do this: - it's difficult to add symbol interning to combine-based parsers like tx-parser -- literally every type changes to reflect the interner, and that means every convenience macro we've built needs to chagne. It's trivial to add interning to rust-peg-based parsers. - combine has rolled forward to 3.2, and I spent a similar amount of time investigating how to upgrade tx-parser (to take advantage of the new parser! macros in combine that I think are necessary for adapting to changing types) as I did just converting to rust-peg. - it's easy to improve the error messages in rust-peg, where-as I have tried twice to improve the nested error messages in combine and am stumped. - it's roughly 4x faster to parse strings directly as opposed to edn::ValueAndSpan, and it'll be even better when we intern directly.	2018-05-10 10:24:05 -07:00
Nick Alexander	e437944d94	Pre: Don't use tx-parser for destructuring map notation. This was always a choice, but we've outgrown it: now we want to accept value types that don't come from EDN and/or tx-parser.	2018-05-10 10:19:54 -07:00
Nick Alexander	4c4af46315	Add TransactableValue abstracting value places that can be transacted. This is a stepping stone to transacting entities that are not based on `edn::ValueAndSpan`. We need to turn some value places (general) into entity places (restricted), and those restrictions are captured in tx-parser right now. But for `TypedValue` value places, those restrictions are encoded in the type itself. This lays the track to accept other value types in value places, which is good for programmatic builder interfaces.	2018-05-10 10:19:54 -07:00
Emily Toop	e1e7cbaa44	Closes #634 - Fix variables in predicates (#635 ) r=rnewman We were forgetting to check for bound variables when resolving types other than ref types during inequality handling. This patch adds in the binding checks and `bails` if the bound variable is of the wrong type. #634	2018-05-09 16:24:12 +01:00
Richard Newman	e21156a754	Implement simple pull expressions (#638 ) r=nalexander * Refactor AttributeCache populator code for use from pull. * Pre: add to_value_rc to Cloned. * Pre: add From<StructuredMap> for Binding. * Pre: clarify Store::open_empty. * Pre: StructuredMap cleanup. * Pre: clean up a doc test. * Split projector crate. Pass schema to projector. * CLI support for printing bindings. * Add and use ConjoiningClauses::derive_types_from_find_spec. * Define pull types. * Implement pull on top of the attribute cache layer. * Add pull support to the projector. * Parse pull expressions. * Add simple pull support to connection objects. * Tests for pull. * Compile with Rust 1.25. The only choice involved in this commit is that of replacing the anonymous lifetime '_ with a named lifetime for the cache; since we're accepting a Known, which includes the cache in question, I think it's clear that we expect the function to apply to any given cache lifetime. * Review comments. * Bail on unnamed attribute. * Make assert_parse_failure_contains safe to use. * Rework query parser to report better errors for pull. * Test for mixed wildcard and simple attribute.	2018-05-04 12:56:00 -07:00
Nick Alexander	90465ae74a	Flip ValueRc to Arc in order to allow TypedValue in errors. (#677 ) (#678 ) r=rnewman @mmacedoeu did a good deal of work to show that Arc instead of Rc wasn't too difficult in #656, and @rnewman pushed the refactoring across the line in #659. However, we didn't flip the switch at that time. For #673, we'd like to include TypedValue instances in errors, and with error-chain (and failure) error types need to be 'Sync + 'Send, so we need Arc. This builds on #659 and should also finish #656.	2018-05-03 16:46:49 -07:00
Nick Alexander	1b66818ac9	Post: Fix CLI bustage.	2018-05-01 16:10:06 -07:00
Nick Alexander	9513012aa5	[tx] Fail transactions where complex upserts resolve to multiple entids. (#670 ) r=rnewman	2018-05-01 15:35:44 -07:00
Nick Alexander	2b82ffb2e5	[tx] Fail transactions where complex upserts resolve to multiple entids. (#670 ) This innocuous looking change (upserts_ev -> upserts_e -> resolved in all situations, rather than upserts_ev -> resolved in some situations) is a significant change in semantics and assumptions in the transactor. Witness the large comment being removed about the same tempid resolving in different generations! To support this change, we provide more holistic errors for conflicting upserts, which entails collecting some (relatively expensive) diagnostic data. I left in some debug logging, simply since it shouldn't hurt in general, and will likely be useful for the next bug we see in the transactor.	2018-05-01 15:34:44 -07:00
Nick Alexander	7960b4ccd2	Pre: Get ready to use log in mentat_db. We don't yet have a logging system for production use, but I'd like to start experimenting with log, which seems to be (close to) a Rust standard. We're already using it in mentat_cli.	2018-05-01 13:46:03 -07:00
Nick Alexander	d4a635f4e7	(tx) Replace :db/tx with (current-tx) transaction function and broaden support. (#664 ) r=rnewman	2018-04-26 19:33:16 -07:00
Nick Alexander	32ed56685e	(tx) Replace :db/tx with (transaction-tx) transaction function and broaden support. (#664 ) :db/tx (and Datomic's version, :datomic/tx) suffer from the same ambiguities that [a v] lookup references do -- determining the type of the result is context sensitive. (In this case, is :db/tx a reference to the current transaction ID, or is it a valid keyword?) This commit addresses the ambiguity by introducing a notion of a transaction functions, and provides a little scaffolding for adding more (should the need arise). I left the scaffolding in place rather than handling just (transaction-tx) because I started trying to implement (transaction-instant) as well, which is more difficult -- see the comments. It's worth noting that this approach generalizes more or less directly to ?input variables, since those can be eagerly bound like the implemented transaction function (transaction-tx).	2018-04-26 19:32:14 -07:00
Richard Newman	f979044ba1	Refactor value type boxing. (#659 ) r=nalexander * Pre: eliminate some occurrences of Rc, largely through the magic of Into. * Pre: introduce FromRc to convert between refcounted types. * Introduce ValueRc as an abstraction over Rc/Arc choice. * Move Cloned to core. * Move CString-creation methods to TypedValue. * Finish transition.	2018-04-25 14:23:27 -07:00
Richard Newman	a2e13f624c	Add 'Binding', a structured value type to return from queries. (#657 ) r=nalexander Bump to 0.7: breaking change.	2018-04-24 15:08:38 -07:00
Richard Newman	1818e0b98e	Split mentat_core TypedValue code into separate files for clarity.	2018-04-24 15:05:04 -07:00
Richard Newman	a74a2deffc	Introduce RelResult rather than Vec<Vec<TypedValue>>. (#639 ) r=nalexander * Pre: clean up core/src/lib.rs. * Pre: use indexmap 1.0 in db and query-projector. * Change rel results to be a RelResult instance, not a Vec<Vec<TypedValue>>. This avoids memory fragmentation and improves locality by using a single heap-allocated vector for all bindings, rather than a separate heap-allocated vector for each row. We hide this abstraction behind the `RelResult` type, which tracks the stride length (width) of each row. * Don't allocate temporary vectors when projecting RelResults.	2018-04-24 15:04:00 -07:00
Nick Alexander	0c31fc7875	(query) Implement tx-log API: (tx-ids ...) and (tx-data ...) functions. r=rnewman	2018-04-19 09:59:05 -07:00
Nick Alexander	c8da4be38f	(query) Implement tx-log API: (tx-ids ...) and (tx-data ...) functions. `tx-ids` allows to enumerate transaction IDs efficiently. `tx-data` allows to extract transaction log data efficiently. We might eventually allow to filter by impacted attribute sets as well.	2018-04-19 09:58:41 -07:00
Nick Alexander	e532614908	(query) Pre: Model columns that don't have type tags closer to Column.	2018-04-19 09:58:41 -07:00
Nick Alexander	36eca0bfb0	(chore) Pre: Use the same features of uuid throughout the project.	2018-04-19 09:58:41 -07:00
Richard Newman	1f1818448a	Begin adding worked examples. (#629 ) r=nalexander	2018-04-17 10:39:36 -07:00
Richard Newman	8ca657ec03	Simplify vocabulary migration test. We don't need to explicitly retract if the transactor will do it for us -- which it will if an attribute is cardinality-one.	2018-04-12 11:53:19 -07:00
Richard Newman	1509d16c3e	Fix (the ?foo) (#633 ) r=nalexander Don't group by ?var when processing (the ?var). This PR also finishes error generation in the projector.	2018-04-10 11:58:58 -07:00
Richard Newman	909b2a8be5	Refactoring: split up the projector crate. No other code changes.	2018-04-09 10:26:09 -07:00
Richard Newman	39f1d61175	Bump version to 0.6.2.	2018-04-09 09:47:49 -07:00
Richard Newman	3f8464e8ed	Implement vocabulary-driven schema upgrades. (#595 ) r=emily	2018-04-09 09:47:49 -07:00
Richard Newman	a5cda7c3e9	Allow passing a TermBuilder to be transacted by InProgress; add TermBuilder::is_empty. r=emily	2018-04-09 09:47:49 -07:00
Richard Newman	27dde378e0	Allow retraction of some schema attributes. (#379 ) r=nalexander	2018-04-09 09:47:49 -07:00
Emily Toop	175958e754	Address review comments @rnewman	2018-04-06 10:46:15 +01:00
Emily Toop	19ddf9c384	Spacing	2018-04-06 10:46:15 +01:00
Emily Toop	fa7dd2ceab	Add FFI for query building	2018-04-06 10:46:15 +01:00
Emily Toop	9741435026	Add helper functions for FFI. Many of these will go away as we expose the entity builder	2018-04-06 10:46:15 +01:00
Emily Toop	7382e3297d	Add QueryBuilder to make querying over FFI easier	2018-04-06 10:46:15 +01:00
Emily Toop	b3e27d86a9	Add converter functions from TypedValue to underlying type	2018-04-06 10:46:15 +01:00
Richard Newman	a57f7aff99	Add specialized tx-before and tx-after predicates. (#599 ) r=emily	2018-04-05 10:49:06 -07:00
Richard Newman	8607ecb745	Fix merge conflict.	2018-04-03 14:54:46 -07:00
Richard Newman	4d8e179a59	Expose component_attributes on Schema. (#623 ) r=nalexander Some parts of the query engine and transactor need to know whether an attribute is a component attribute, and sometimes want to do so in a generated SQL query. This is one way to do that.	2018-04-03 14:25:53 -07:00
Richard Newman	558906df4f	Fix: db/component should be db/isComponent. (#624 ) r=nalexander	2018-04-03 14:25:28 -07:00
Richard Newman	6c54e1d370	Support :db/noHistory for attributes. (#622 ) r=nalexander At this point we never discard history, but this completes the API support for doing so.	2018-04-03 14:23:46 -07:00

1 2 3 4 5 ...

819 commits